- Text: natural language responses for chatbots, content generation, and analysis
- Object: structured JSON data with schema validation for APIs and data extraction
- Image: visual content from models like DALL-E 3
- Speech: spoken audio for voice applications and text-to-speech
Choosing the right type
| Type | Best for | Output format | Example use cases |
|---|---|---|---|
| Text | Conversational AI, content writing | String | Chatbots, summarization, Q&A |
| Object | Structured data extraction | JSON with schema | Form parsing, data normalization, API responses |
| Image | Visual content creation | Image file | Marketing assets, illustrations, prototypes |
| Speech | Voice applications | Audio file | Podcasts, audiobooks, voice assistants |
Each prompt declares its generation type with one config key (text_config, object_config, image_config, or speech_config), and you load it with the matching client method (loadTextPrompt, loadObjectPrompt, loadImagePrompt, or loadSpeechPrompt). The guides below cover each type’s configuration options and SDK usage.
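The pairing between config keys and loader methods can be sketched as a small lookup. This is an illustration only, not the SDK itself: the four config keys and four method names come from the text above, while `LOADER_BY_CONFIG_KEY`, `loaderFor`, the sample prompt object, and the rule that a prompt carries exactly one of these keys are assumptions made for the example.

```typescript
// Config keys and loader method names are taken from the text above.
// The lookup table, types, and helper below are hypothetical.
const LOADER_BY_CONFIG_KEY = {
  text_config: "loadTextPrompt",
  object_config: "loadObjectPrompt",
  image_config: "loadImagePrompt",
  speech_config: "loadSpeechPrompt",
} as const;

type ConfigKey = keyof typeof LOADER_BY_CONFIG_KEY;
type LoaderName = (typeof LOADER_BY_CONFIG_KEY)[ConfigKey];

// Returns the name of the loader method that matches a parsed prompt config.
// Assumption: a prompt carries exactly one generation-type key, so zero or
// multiple matches are treated as an error.
function loaderFor(promptConfig: Record<string, unknown>): LoaderName {
  const keys = (Object.keys(LOADER_BY_CONFIG_KEY) as ConfigKey[]).filter(
    (key) => key in promptConfig
  );
  if (keys.length !== 1) {
    throw new Error(
      `Expected exactly one generation config key, found ${keys.length}`
    );
  }
  return LOADER_BY_CONFIG_KEY[keys[0]];
}

// Hypothetical parsed prompt; the fields inside text_config are placeholders.
console.log(loaderFor({ name: "summary", text_config: { model_name: "example-model" } }));
```

A lookup like this is mainly useful as a sanity check in tooling: it fails early when a prompt file mixes two config keys or omits one, before any client method is called.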
Detailed guides
- Text generation: natural language responses with conversation history
- Object generation: structured JSON with schema validation
- Image generation: visual content with DALL-E and similar models
- Speech generation: audio synthesis with voice customization