# What's AgentMark?

> AgentMark helps teams build reliable AI agents. Manage prompts, trace executions, run evaluations, and deploy, locally or with AgentMark Cloud.

AgentMark helps you build reliable AI agents: write prompts as code, run them against any model, trace every execution, score the results, and monitor what ships to production. Prompts, evals, and datasets live in your codebase; traces start on your machine; nothing requires an account.

When you want it, **AgentMark Cloud** adds visual editing, rich trace exploration, team collaboration, and production monitoring.

## Jump to what you need

<CardGroup cols={2}>
  <Card title="Author and version prompts" icon="pen-line" href="/build/overview">
    Write prompts as `.prompt.mdx` files with TemplateDX, and get history and rollback on every change.
  </Card>

  <Card title="Test quality with evals and experiments" icon="flask-conical" href="/evaluate/overview">
    Score outputs with evals, run them over datasets, and compare prompt versions to catch regressions before they ship.
  </Card>

  <Card title="Trace and monitor production" icon="activity" href="/observe/overview">
    Capture a trace of every execution down to each span, group them into sessions, and alert on cost, latency, and errors.
  </Card>

  <Card title="Ship and promote across environments" icon="rocket" href="/deploy/deployment">
    Promote a version you validated in `staging` into `prod`, so a push to `dev` never silently changes production.
  </Card>
</CardGroup>

## Two ways to work

<CardGroup cols={2}>
  <Card title="Local" icon="terminal">
    Everything on your machine. Create prompts as files, run them via SDK or CLI, trace executions locally, run evaluations from your terminal. No account needed. No data leaves your environment.
  </Card>

  <Card title="Cloud" icon="cloud">
    Edit prompts in a browser, explore traces visually, share dashboards and annotations with your team, and get alerted in production, all on top of your local workflow. No setup beyond connecting a repo.
  </Card>
</CardGroup>

Most teams start local and add Cloud as they grow. Some stay local-only. Both are fully supported. See [pricing](/deploy/billing-and-usage) for Cloud tier details.

Cloud builds on the local workflow rather than replacing it. Your `.prompt.mdx` files, local traces, and `agentmark dev` work the same whether or not you connect Cloud.

**Team and Enterprise** tiers add SSO (SAML), custom roles, and app-level permissions. For data residency options, [contact the team](mailto:hello@agentmark.co).

## What you can do

### Build agents

Create prompts as `.prompt.mdx` files in your editor, or use the visual editor in the Dashboard. Both produce the same format, so you can switch between them freely.

* **TemplateDX syntax** with variables, expressions, logic, and reusable components
* **Multiple output types**: text, structured objects, images, and speech
* **Tools and function calling** for agentic workflows
* **Version control** built in: every change tracked with history and rollback

[Learn more about Build](/build/overview)

### Evaluate quality

Run evaluators from code or CLI to score outputs automatically. Use the Dashboard for human annotations and shared experiment results.

* **Datasets** for bulk testing against input/output pairs
* **Custom evaluators**: numeric scores, pass/fail, classifications, LLM-as-judge
* **Experiments** to compare prompt versions and track performance over time
* **Annotations** for human-in-the-loop scoring and labeling
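
To make the evaluator and experiment ideas above concrete, here is a minimal sketch in plain TypeScript. It does not use the AgentMark SDK: `DatasetItem`, `EvalResult`, `exactMatch`, and `meanScore` are hypothetical names invented for illustration, and the two "prompt versions" are stand-in functions rather than real model calls. It shows a pass/fail evaluator that also returns a numeric score, run over a small dataset to compare two versions.

```typescript
// Hypothetical shapes for illustration only; these are not AgentMark SDK types.
interface DatasetItem {
  input: string;
  expectedOutput: string;
}

interface EvalResult {
  score: number; // numeric score, 0 or 1 for this evaluator
  passed: boolean; // pass/fail verdict
}

// A custom evaluator: case-insensitive exact match after trimming whitespace.
function exactMatch(output: string, expected: string): EvalResult {
  const passed = output.trim().toLowerCase() === expected.trim().toLowerCase();
  return { score: passed ? 1 : 0, passed };
}

// Run one prompt version (modelled here as a plain function) over a dataset
// and return the mean score. An empty dataset scores 0.
function meanScore(
  run: (input: string) => string,
  dataset: DatasetItem[],
): number {
  if (dataset.length === 0) return 0;
  const total = dataset.reduce(
    (sum, item) => sum + exactMatch(run(item.input), item.expectedOutput).score,
    0,
  );
  return total / dataset.length;
}

// A tiny dataset of input/output pairs.
const dataset: DatasetItem[] = [
  { input: "2+2", expectedOutput: "4" },
  { input: "capital of France", expectedOutput: "Paris" },
];

// Stand-ins for two prompt versions; a real run would call a model instead.
const versionA = (input: string): string => (input === "2+2" ? "4" : "unknown");
const versionB = (input: string): string => (input === "2+2" ? "4" : " paris ");

const scoreA = meanScore(versionA, dataset);
const scoreB = meanScore(versionB, dataset);

// An experiment flags a regression when the new version scores lower.
const regression = scoreB < scoreA;
console.log({ scoreA, scoreB, regression });
```

The same pattern extends to classification or LLM-as-judge evaluators by swapping out the body of `exactMatch`; see the Evaluate docs for the evaluator interface AgentMark actually expects.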

[Learn more about Evaluate](/evaluate/overview)

### Observe in production

Add the SDK to your app and it captures a trace of every execution automatically, with no manual logging. View traces in your terminal locally, or open the Dashboard to search, filter, chart, and set alerts on them.

* **Distributed tracing** built on OpenTelemetry, tracking inference spans, tool calls, and streaming
* **Sessions** to group related traces across multi-turn conversations
* **Cost and token tracking** across models and time periods
* **Alerts** for latency spikes, cost thresholds, error rates, and quality drops
* **[REST API](/api-reference/overview)** for programmatic access to traces, scores, and metrics
* **[`agentmark-mcp` MCP server](/coding-agents/gateway-mcp)** exposes the gateway as MCP tools. It works with both the local dev server and Cloud, and is what your IDE agent (Claude Code, Cursor) uses to query AgentMark headlessly
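
As a rough illustration of how sessions, cost tracking, and latency alerts relate, the sketch below groups trace records by session and flags slow sessions. It is plain TypeScript with no AgentMark or OpenTelemetry dependency: the `TraceRecord` shape, its field names, and both helper functions are assumptions made for this example, not the real trace schema or API.

```typescript
// Hypothetical trace record for illustration; not the AgentMark trace schema.
interface TraceRecord {
  traceId: string;
  sessionId: string;
  latencyMs: number;
  costUsd: number;
  totalTokens: number;
}

interface SessionSummary {
  traceCount: number;
  totalCostUsd: number;
  totalTokens: number;
  maxLatencyMs: number;
}

// Group traces by session and accumulate cost, tokens, and worst-case latency.
function summarizeSessions(traces: TraceRecord[]): Map<string, SessionSummary> {
  const sessions = new Map<string, SessionSummary>();
  for (const trace of traces) {
    const summary = sessions.get(trace.sessionId) ?? {
      traceCount: 0,
      totalCostUsd: 0,
      totalTokens: 0,
      maxLatencyMs: 0,
    };
    summary.traceCount += 1;
    summary.totalCostUsd += trace.costUsd;
    summary.totalTokens += trace.totalTokens;
    summary.maxLatencyMs = Math.max(summary.maxLatencyMs, trace.latencyMs);
    sessions.set(trace.sessionId, summary);
  }
  return sessions;
}

// Return the IDs of sessions whose slowest trace exceeded the threshold.
function sessionsOverLatency(
  sessions: Map<string, SessionSummary>,
  thresholdMs: number,
): string[] {
  const slow: string[] = [];
  sessions.forEach((summary, sessionId) => {
    if (summary.maxLatencyMs > thresholdMs) slow.push(sessionId);
  });
  return slow;
}

// Made-up sample data: two turns in one conversation, one turn in another.
const traces: TraceRecord[] = [
  { traceId: "t1", sessionId: "s1", latencyMs: 400, costUsd: 0.25, totalTokens: 100 },
  { traceId: "t2", sessionId: "s1", latencyMs: 900, costUsd: 0.5, totalTokens: 300 },
  { traceId: "t3", sessionId: "s2", latencyMs: 2500, costUsd: 0.125, totalTokens: 50 },
];

const sessions = summarizeSessions(traces);
const slowSessions = sessionsOverLatency(sessions, 2000);
console.log(sessions.get("s1"), slowSessions);
```

In practice the SDK and Dashboard do this aggregation for you; the sketch is only meant to show what "group traces into sessions and alert on thresholds" means in data terms.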

[Learn more about Observe](/observe/overview)

### Integrate with your stack

AgentMark works with the tools you already use.

* **TypeScript**: Vercel AI SDK, Claude Agent SDK, Mastra
* **Python**: Pydantic AI, Claude Agent SDK
* **Any framework** by wiring up [your own SDK](/getting-started/client-setup#connect-your-sdk) and OpenTelemetry

[Learn more about Integrations](/integrations/overview)

## Get started

<CardGroup cols={2}>
  <Card title="Quickstart" icon="play" href="/getting-started/quickstart">
    Create your first prompt and see traces in under 5 minutes
  </Card>

  <Card title="Core concepts" icon="table-list" href="/introduction/core-concepts">
    Organizations, apps, branches, and how they fit together
  </Card>

  <Card title="API reference" icon="code" href="/api-reference/overview">
    Query traces, scores, and metrics via the REST API
  </Card>

  <Card title="CLI reference" icon="terminal" href="/reference/cli-commands">
    Manage prompts, run evals, and query the API from your terminal
  </Card>
</CardGroup>

<div className="mt-8 rounded-lg bg-blue-50 p-6 dark:bg-blue-900/30">
  <h3 className="font-semibold mb-3">Have questions?</h3>
  <p className="mb-4">Reach out any time:</p>

  <ul>
    <li>
      Email the team at <a href="mailto:hello@agentmark.co" className="text-blue-600 hover:text-blue-800 dark:text-blue-400 dark:hover:text-blue-200">hello@agentmark.co</a> for support
    </li>

    <li>
      Schedule an <a href="https://cal.com/ryan-randall/enterprise" className="text-blue-600 hover:text-blue-800 dark:text-blue-400 dark:hover:text-blue-200">Enterprise Demo</a> to learn about AgentMark's business solutions
    </li>
  </ul>
</div>
