Skip to main content
Generation types define what kind of output your prompt produces. AgentMark supports four types, each suited to different use cases:
  • Text: natural language responses for chatbots, content generation, and analysis
  • Object: structured JSON data with schema validation for APIs and data extraction
  • Image: visual content from models like DALL-E 3
  • Speech: spoken audio for voice applications and text-to-speech

Choosing the right type

TypeBest forOutput formatExample use cases
TextConversational AI, content writingStringChatbots, summarization, Q&A
ObjectStructured data extractionJSON with schemaForm parsing, data normalization, API responses
ImageVisual content creationImage fileMarketing assets, illustrations, prototypes
SpeechVoice applicationsAudio filePodcasts, audiobooks, voice assistants
You declare each type with its frontmatter config key (text_config, object_config, image_config, or speech_config) and load it with the matching client method (loadTextPrompt, loadObjectPrompt, loadImagePrompt, loadSpeechPrompt). The guides below cover each type’s configuration options and SDK usage.

Detailed guides

Text generation

Natural language responses with conversation history

Object generation

Structured JSON with schema validation

Image generation

Visual content with DALL-E and similar models

Speech generation

Audio synthesis with voice customization

Have questions?

Reach out any time: