Get Started

What is AgentMark?

AgentMark is a comprehensive LLM observability and prompt engineering platform that enables cross-functional teams to collaborate on AI applications. Built for both technical and non-technical users, AgentMark provides intuitive tools for prompt creation, systematic testing, and production monitoring—all in one unified platform.

Core Platform Features

Prompt Management

Collaborative prompt engineering that empowers both technical and non-technical team members to create, test, and iterate on prompts together.
  • Visual Editor: Intuitive web interface for editing prompts—no coding required
  • Real-Time Testing: Run prompts instantly with live preview and immediate results
  • Team Collaboration: SMEs and developers work together with review workflows
  • Version History: Automatic tracking of all changes with side-by-side comparison
  • Multiple Output Types: Support for text, structured objects, images, and speech
  • Automatic Sync: Changes deploy seamlessly to your application
Learn more about Prompt Management →

Testing & Evaluation

Systematic validation and quality assurance for your prompts and agents.
  • Datasets: Bulk testing against collections of input/output pairs with version control
  • Evals: Evaluate your prompts with custom grading functions
  • Custom Metrics: Define numeric scores, boolean checks, classifications, and more
  • Annotations: Human-in-the-loop manual labeling and quality assessment
  • Experiment Tracking: Compare prompt versions and track performance over time
  • Regression Detection: Catch quality issues before they reach production
Learn more about Testing →
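To make the dataset-plus-evals workflow concrete, here is a minimal sketch of a custom grading function run over a collection of input/output pairs. The function names (`grade_contains`, `run_dataset`) and the stubbed model call are illustrative assumptions, not AgentMark's actual API:

```python
# Hypothetical sketch: a boolean grading function applied across a dataset.
# `grade_contains` and `run_dataset` are illustrative names, not AgentMark's API.

def grade_contains(expected: str, actual: str) -> dict:
    """Boolean check: does the model output include the expected answer?"""
    passed = expected.lower() in actual.lower()
    return {"metric": "contains_expected", "type": "boolean", "score": passed}

def run_dataset(dataset, generate):
    """Run each dataset input through the prompt and grade the output."""
    results = []
    for row in dataset:
        output = generate(row["input"])
        results.append(grade_contains(row["expected"], output))
    return results

# Usage with a stubbed model call standing in for a real prompt run:
dataset = [
    {"input": "Capital of France?", "expected": "Paris"},
    {"input": "2 + 2?", "expected": "4"},
]
results = run_dataset(
    dataset,
    lambda q: "The answer is Paris." if "France" in q else "4",
)
pass_rate = sum(r["score"] for r in results) / len(results)
```

Tracking a metric like `pass_rate` across prompt versions is what makes regression detection possible: a drop between versions flags a quality issue before it reaches production.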

Observability & Monitoring

Production monitoring built on OpenTelemetry standards for comprehensive visibility into your LLM applications.
  • Distributed Tracing: Track complete execution paths across inference spans and tool calls
  • Real-Time Metrics: Monitor latency, token usage, success rates, and streaming performance
  • Cost Tracking: Detailed spending analysis across models, users, and time periods
  • Sessions: Group related traces to understand multi-turn conversations and workflows
  • Alerts: Proactive notifications for cost thresholds, latency spikes, error rates, and quality issues
  • Performance Analysis: Identify bottlenecks and optimize response times with detailed telemetry
Learn more about Observability →
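Because the platform builds on OpenTelemetry, each model call is captured as a span carrying timing and usage attributes. The stdlib-only sketch below illustrates the idea; attribute names loosely follow OpenTelemetry's GenAI semantic conventions (e.g. `gen_ai.usage.input_tokens`), but a real integration would use the `opentelemetry-sdk` rather than this hand-rolled span:

```python
import time
from contextlib import contextmanager

# Illustrative stand-in for an OpenTelemetry inference span: records the
# request model, wall-clock latency, and token usage on a span dict.
spans = []

@contextmanager
def inference_span(name: str, model: str):
    span = {"name": name, "attributes": {"gen_ai.request.model": model}}
    start = time.perf_counter()
    try:
        yield span
    finally:
        # Latency is recorded when the span closes, even if the call raised.
        span["attributes"]["duration_ms"] = (time.perf_counter() - start) * 1000
        spans.append(span)

# Usage: wrap a model call and attach token counts (values are made up here).
with inference_span("llm.inference", model="gpt-4o") as span:
    span["attributes"]["gen_ai.usage.input_tokens"] = 42
    span["attributes"]["gen_ai.usage.output_tokens"] = 128
```

Attributes like these are what power the dashboards above: token counts roll up into cost tracking, and `duration_ms` feeds latency metrics and alerts.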

Configuration & Integration

Flexible platform configuration that adapts to your needs.
  • Model Management: Configure and manage multiple LLM providers and models
  • API Keys: Secure credential management for your LLM providers
  • Webhooks: Integrate with external systems for custom workflows
  • Alerts: Set up notifications for cost, latency, errors, and quality metrics
  • Team Management: Organizations, apps, and branch-based access control
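When consuming webhooks from any platform, the receiving service should verify that each payload really came from the sender. AgentMark's signing scheme isn't documented here, so the sketch below uses HMAC-SHA256 over the raw request body — a common webhook convention — with a hypothetical secret format:

```python
import hashlib
import hmac

# Hypothetical webhook verification sketch. HMAC-SHA256 over the raw body is a
# common convention; the secret format and event payload here are made up.

def verify_webhook(secret: bytes, body: bytes, signature_hex: str) -> bool:
    """Recompute the body's HMAC and compare it to the received signature."""
    expected = hmac.new(secret, body, hashlib.sha256).hexdigest()
    # compare_digest avoids leaking the signature via timing differences.
    return hmac.compare_digest(expected, signature_hex)

secret = b"whsec_example"  # shared secret (hypothetical format)
body = b'{"event": "alert.triggered", "metric": "latency_p95"}'

# The sender would compute this signature and put it in a request header:
sig = hmac.new(secret, body, hashlib.sha256).hexdigest()
ok = verify_webhook(secret, body, sig)
```

Verifying the signature before acting on an event keeps a custom workflow (e.g. paging on a cost-threshold alert) from being triggered by forged requests.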

Key Benefits

  • Cross-Functional Collaboration: Subject matter experts, product teams, and developers work together seamlessly in an intuitive interface that requires no coding experience.
  • Complete Visibility: Monitor production LLM usage with distributed tracing, cost tracking, performance metrics, and custom alerts.
  • Quality Assurance: Maintain prompt quality with systematic dataset testing, automated evaluations, and human annotations.
  • Version Control Built In: Every change is automatically tracked, with full audit history, comparison tools, and rollback capabilities.
  • Flexible Configuration: Use any LLM provider, manage your own API keys, and integrate with existing tools via webhooks.

Have Questions?

We’re here to help! Choose the best way to reach us: