> ## Documentation Index
>
> Fetch the complete documentation index at: https://docs.agentmark.co/llms.txt
> Use this file to discover all available pages before exploring further.

# What's AgentMark?

> AgentMark helps teams build reliable AI agents. Manage prompts, trace executions, run evaluations, and deploy, locally or with AgentMark Cloud.

AgentMark helps you build reliable AI agents: write prompts as code, run them against any model, trace every execution, score the results, and monitor what ships to production. Prompts, evals, and datasets live in your codebase; traces start on your machine; nothing requires an account. When you want it, **AgentMark Cloud** adds visual editing, rich trace exploration, team collaboration, and production monitoring.

## Jump to what you need

* Write prompts as `.prompt.mdx` files with TemplateDX, and get history and rollback on every change.
* Score outputs with evals, run them over datasets, and compare prompt versions to catch regressions before they ship.
* Capture a trace of every execution down to each span, group them into sessions, and alert on cost, latency, and errors.
* Promote a version you validated in `staging` into `prod`, so a push to `dev` never silently changes production.

## Two ways to work

* **Local:** Everything on your machine. Create prompts as files, run them via SDK or CLI, trace executions locally, run evaluations from your terminal. No account needed. No data leaves your environment.
* **AgentMark Cloud:** Edit prompts in a browser, explore traces visually, share dashboards and annotations with your team, and get alerted in production, all on top of your local workflow. No setup beyond connecting a repo.

Most teams start local and add Cloud as they grow. Some stay local-only. Both are fully supported. See [pricing](/deploy/billing-and-usage) for Cloud tier details.

Cloud builds on the local workflow rather than replacing it. Your `.prompt.mdx` files, local traces, and `agentmark dev` work the same whether or not you connect Cloud.
**Team and Enterprise** tiers add SSO (SAML), custom roles, and app-level permissions. For data residency options, [contact the team](mailto:hello@agentmark.co).

## What you can do

### Build agents

Create prompts as `.prompt.mdx` files in your editor, or use the visual editor in the Dashboard. Both produce the same format, so you can switch between them freely.

* **TemplateDX syntax** with variables, expressions, logic, and reusable components
* **Multiple output types**: text, structured objects, images, and speech
* **Tools and function calling** for agentic workflows
* **Version control** built in: every change tracked with history and rollback

[Learn more about Build](/build/overview)

### Evaluate quality

Run evaluators from code or CLI to score outputs automatically. Use the Dashboard for human annotations and shared experiment results.

* **Datasets** for bulk testing against input/output pairs
* **Custom evaluators**: numeric scores, pass/fail, classifications, LLM-as-judge
* **Experiments** to compare prompt versions and track performance over time
* **Annotations** for human-in-the-loop scoring and labeling

[Learn more about Evaluate](/evaluate/overview)

### Observe in production

Add the SDK to your app and it captures a trace of every execution automatically, no manual logging. View traces in your terminal locally, or open the Dashboard to search, filter, chart, and set alerts on them.

* **Distributed tracing** built on OpenTelemetry, tracking inference spans, tool calls, and streaming
* **Sessions** to group related traces across multi-turn conversations
* **Cost and token tracking** across models and time periods
* **Alerts** for latency spikes, cost thresholds, error rates, and quality drops
* **[REST API](/api-reference/overview)** for programmatic access to traces, scores, and metrics
* **[`agentmark-mcp` MCP server](/coding-agents/gateway-mcp)** exposes the gateway as MCP tools. It works with both the local dev server and Cloud, and is what your IDE agent (Claude Code, Cursor) uses to query AgentMark headlessly.

[Learn more about Observe](/observe/overview)

### Integrate with your stack

AgentMark works with the tools you already use.

* **TypeScript**: Vercel AI SDK, Claude Agent SDK, Mastra
* **Python**: Pydantic AI, Claude Agent SDK
* **Any framework** by wiring up [your own SDK](/getting-started/client-setup#connect-your-sdk) and OpenTelemetry

[Learn more about Integrations](/integrations/overview)

## Get started

* Create your first prompt and see traces in under 5 minutes
* Organizations, apps, branches, and how they fit together
* Query traces, scores, and metrics via REST API
* Manage prompts, run evals, and query the API from your terminal
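
To make the evaluation loop from "Evaluate quality" concrete (a dataset of input/output pairs, an evaluator that scores each output, and an experiment that compares two prompt versions), here is a minimal sketch in plain Python. It does not use the AgentMark SDK or CLI. Every name in it (`Example`, `exact_match`, `token_overlap`, `run_experiment`, `regressed`, and the two stand-in prompt functions) is hypothetical and exists only for illustration; a real run would call a model where the stand-ins return canned transformations.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Example:
    """One dataset row: an input and the output expected for it."""
    input: str
    expected: str


# Hypothetical evaluator shape: (actual output, expected output) -> score in [0, 1].
Evaluator = Callable[[str, str], float]


def exact_match(actual: str, expected: str) -> float:
    """Pass/fail evaluator: 1.0 when the trimmed strings are equal, else 0.0."""
    return 1.0 if actual.strip() == expected.strip() else 0.0


def token_overlap(actual: str, expected: str) -> float:
    """Numeric evaluator: fraction of expected tokens that appear in the output."""
    expected_tokens = set(expected.lower().split())
    if not expected_tokens:
        return 1.0
    actual_tokens = set(actual.lower().split())
    return len(expected_tokens & actual_tokens) / len(expected_tokens)


def run_experiment(
    prompt_fn: Callable[[str], str],
    dataset: List[Example],
    evaluator: Evaluator,
) -> float:
    """Mean evaluator score for one prompt version over the whole dataset."""
    scores = [evaluator(prompt_fn(ex.input), ex.expected) for ex in dataset]
    return sum(scores) / len(scores)


def regressed(baseline: float, candidate: float, tolerance: float = 0.0) -> bool:
    """True when the candidate scores worse than the baseline beyond the tolerance."""
    return candidate < baseline - tolerance


# Stand-ins for two prompt versions (assumption: a real version would call a model).
def prompt_v1(text: str) -> str:
    return text.upper()


def prompt_v2(text: str) -> str:
    # Deliberately wrong on longer inputs so the comparison has something to catch.
    return text.upper() if len(text) < 6 else text[::-1]


DATASET = [Example("hello", "HELLO"), Example("agentmark", "AGENTMARK")]

baseline_score = run_experiment(prompt_v1, DATASET, exact_match)
candidate_score = run_experiment(prompt_v2, DATASET, exact_match)

if regressed(baseline_score, candidate_score):
    print("candidate regressed:", baseline_score, "->", candidate_score)
```

Keeping the evaluator a plain function of `(actual, expected)` means pass/fail and numeric scores share one experiment loop, and a regression check reduces to comparing two averages.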

## Have questions?

Reach out any time:

* Email the team at [hello@agentmark.co](mailto:hello@agentmark.co) for support
* Schedule an Enterprise Demo to learn about AgentMark's business solutions