

Image Generation Agent
Model
Available Actions
Each successful request consumes credits as outlined below.
- generate_budget_image: 8 cr
- generate_image_0_5k: 10 cr
- generate_image_1k: 15 cr
- generate_image_2k: 25 cr
- generate_image_4k: 40 cr
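As a rough budgeting aid, the credit costs listed above can be turned into a small estimator. This is only a sketch: the action names and per-request costs come from the list above, while `estimate_credits` is a hypothetical helper and not part of AgentPMT. It assumes every planned request succeeds, since only successful requests consume credits.

```python
# Credits per successful request, copied from the Available Actions list.
CREDITS_PER_ACTION = {
    "generate_budget_image": 8,
    "generate_image_0_5k": 10,
    "generate_image_1k": 15,
    "generate_image_2k": 25,
    "generate_image_4k": 40,
}


def estimate_credits(planned_requests: dict[str, int]) -> int:
    """Total credits for a plan of {action_name: successful_request_count}.

    Hypothetical budgeting helper; it is not an AgentPMT API.
    """
    total = 0
    for action, count in planned_requests.items():
        if action not in CREDITS_PER_ACTION:
            raise KeyError(f"unknown action: {action}")
        if count < 0:
            raise ValueError("request counts must be non-negative")
        total += CREDITS_PER_ACTION[action] * count
    return total


if __name__ == "__main__":
    # Ten budget drafts for ideation, then two 4K finals: 10*8 + 2*40 = 160.
    print(estimate_credits({"generate_budget_image": 10, "generate_image_4k": 2}))
```

Drafting at the budget tier and rendering only the chosen concepts at 4K keeps most of the spend on images that will actually ship.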
Details
AI image generator powered by Google Gemini 3 Flash Image, the "Nano Banana" model family. Create photorealistic product photography, marketing creative, social graphics, app icons, concept art, hero images, and brand assets from a single text prompt, or edit an existing image by passing up to four reference photos for style, subject, or scene guidance. Choose your output tier: a low-cost budget draft for ideation, or crisp 0.5K, 1K, 2K, and 4K final renders for print, ads, e-commerce, and presentation use. Supports 14 aspect ratios, including 1:1, 16:9, 9:16, 21:9, 4:5, and ultra-wide 8:1 banner formats. Every generated image is auto-saved to AgentPMT File Manager and returned with a 7-day signed download URL plus its file_id, width, height, MIME type, and size in bytes, ready to drop into chat, hand off to another tool, or pull into a workflow. Built for designers, marketers, e-commerce sellers, content creators, and AI agents that need on-demand visuals without leaving the conversation.
Use Cases
AI image generation, Nano Banana image creation, Google Gemini image API, text-to-image, image editing with reference photos, product photography mockups, hero banner generation, social media graphics for Instagram and TikTok and LinkedIn, e-commerce product visuals, concept art, app and product icon design, marketing campaign creative, ad creative generation, brand asset production, style transfer with reference images, photoshoot replacement, background swapping, ultra-wide 21:9 banner generation, 4K poster rendering, multi-aspect-ratio variants for omnichannel campaigns, AI-generated stock imagery, illustration generation, storyboard panels, presentation graphics, content marketing visuals, blog header images, YouTube thumbnails, podcast cover art, book cover generation, packaging mockups, real estate listing renders, automated visual content pipelines for AI agents
Dynamic MCP Setup
Connect once through AgentPMT Dynamic MCP, then use approved tools from the same agent connection.
30 Second Setup
STDIO connector for Claude Code, Codex, Cursor, Zed, and other clients that require STDIO or custom connections.
npm install -g @agentpmt/mcp-router
agentpmt-setup
Hosted Streamable HTTPS
MCP endpoint for browser-based apps like ChatGPT, Claude, Grok, or any time you want a streamable connection with no local install.
https://api.agentpmt.com/mcp
Config Example
Use the hosted endpoint directly in clients that support remote MCP. Store your Bearer token in the client config or secret field.
{
  "mcpServers": {
    "agentpmt": {
      "type": "streamable-http",
      "url": "https://api.agentpmt.com/mcp",
      "headers": {
        "Authorization": "Bearer <AGENTPMT_BEARER_TOKEN>",
        "x-instance-metadata": "{\"client\":\"generic-mcp\",\"platform\":\"remote\"}"
      }
    }
  }
}
Need client videos, organization controls, audit details, and the full feature overview?
More About Dynamic MCP
About this Product
AI image generation powered by Nano Banana (Google Gemini 3 Flash Image)
Turn a text prompt into polished, photorealistic visuals in seconds. The Image Generation Agent runs on Google's Gemini 3 Flash Image model — the "Nano Banana" family — to create marketing creative, product photography, social graphics, icons, concept art, and brand assets, or to edit an existing image with reference photos. Generate by hand or wire it into your agents and workflows for on-demand visuals without leaving the conversation.
What you can create
- Photorealistic product photography and e-commerce mockups.
- Marketing and ad creative, hero banners, and campaign visuals.
- Social graphics for Instagram, TikTok, LinkedIn, and YouTube thumbnails.
- App and product icons, concept art, illustrations, and storyboard panels.
- Brand assets, packaging mockups, blog headers, and presentation graphics.
Generate or edit
Text-to-image
Describe what you want and get a finished image — composition, lighting, and style follow your prompt.
Reference-image editing
Attach up to four reference photos to guide subject, style, or scene. Keep the same product across new backgrounds, restyle a shot, swap a setting, or carry a consistent look through an entire campaign — the subject stays locked from one render to the next.
Pick your resolution
- Budget draft — fast, low-cost ideation and previews.
- 0.5K & 1K — efficient standard finals for web and social.
- 2K — crisp social, presentation, and product assets.
- 4K — highest-resolution renders for print, ads, and large-format use.
14 aspect ratios, including ultra-wide banners
Render square, portrait, landscape, and cinematic formats — 1:1, 16:9, 9:16, 21:9, 4:5, and ultra-wide 4:1 and 8:1 banners — so one prompt can fuel an omnichannel campaign.
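When a target placement is given in pixels, one way to choose a ratio is to pick the supported shape closest to it. The sketch below is an assumption-laden illustration: it covers only the seven ratios named in the text (the remaining seven of the 14 are not listed there), and `closest_named_ratio` is a hypothetical helper, not a parameter or function of the tool.

```python
import math
from fractions import Fraction

# Only the ratios named in the text; the full list of 14 is not given there.
NAMED_RATIOS = ["1:1", "16:9", "9:16", "21:9", "4:5", "4:1", "8:1"]


def ratio_value(label: str) -> Fraction:
    """Convert a label such as '16:9' into an exact width/height fraction."""
    width, height = label.split(":")
    return Fraction(int(width), int(height))


def closest_named_ratio(width: int, height: int) -> str:
    """Return the named ratio whose shape is closest to width x height.

    Distance is measured on a log scale so that a shape twice as wide and
    a shape half as wide count as equally far away.
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    target = Fraction(width, height)
    return min(
        NAMED_RATIOS,
        key=lambda label: abs(math.log(float(ratio_value(label) / target))),
    )


if __name__ == "__main__":
    print(closest_named_ratio(1920, 1080))  # a 16:9 frame
    print(closest_named_ratio(1080, 1350))  # a 4:5 portrait frame
```

Running the same prompt once per selected ratio is one way to produce the per-channel variants described above.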
Ready for your workflow
Every image is auto-saved to AgentPMT File Manager and returned with a download link plus its file_id, width, height, MIME type, and size — ready to drop into chat, hand off to another tool, or pull into an automated pipeline.
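For pipelines that consume this metadata, it can help to validate it into a typed record before passing it on. The following is a minimal sketch under stated assumptions: the text names the fields (download link, file_id, width, height, MIME type, size), but the exact JSON key names other than `file_id` are guesses, and the sample values are made up for illustration.

```python
from dataclasses import dataclass

# Key names other than file_id are assumptions; check the tool schema for the real ones.
REQUIRED_KEYS = ("file_id", "download_url", "width", "height", "mime_type", "size_bytes")


@dataclass(frozen=True)
class GeneratedImage:
    file_id: str
    download_url: str
    width: int
    height: int
    mime_type: str
    size_bytes: int


def parse_generated_image(payload: dict) -> GeneratedImage:
    """Validate a result payload and convert it into a GeneratedImage."""
    missing = [key for key in REQUIRED_KEYS if key not in payload]
    if missing:
        raise ValueError(f"missing fields: {', '.join(missing)}")
    return GeneratedImage(
        file_id=str(payload["file_id"]),
        download_url=str(payload["download_url"]),
        width=int(payload["width"]),
        height=int(payload["height"]),
        mime_type=str(payload["mime_type"]),
        size_bytes=int(payload["size_bytes"]),
    )


if __name__ == "__main__":
    # Placeholder values only; not real output from the tool.
    sample = {
        "file_id": "example-file-id",
        "download_url": "https://example.com/signed-download",
        "width": 1024,
        "height": 1024,
        "mime_type": "image/png",
        "size_bytes": 123456,
    }
    print(parse_generated_image(sample))
```

Failing fast on a missing field keeps a downstream tool from receiving a half-populated record.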
Who it's for
Designers, marketers, e-commerce sellers, content creators, and AI agents that need on-demand, high-quality visuals at any resolution.
Frequently Asked Questions
How do I connect this tool to an external agent?
You can install the local MCP server by opening a terminal and running:
Install commands
npm install -g @agentpmt/mcp-router
agentpmt-setup
This connects you to local agents such as Claude Code, Windsurf, Grok Build, and Cursor.
Alternatively, you can connect to the hosted version with this config block; no installation is required:
Hosted MCP config
{
  "mcpServers": {
    "agentpmt": {
      "type": "streamable-http",
      "url": "https://api.agentpmt.com/mcp",
      "headers": {
        "Authorization": "Bearer <AGENTPMT_BEARER_TOKEN>",
        "x-instance-metadata": "{\"client\":\"generic-mcp\",\"platform\":\"remote\"}"
      }
    }
  }
}
View MCP Connection Instructions for more details.
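To avoid pasting the Bearer token into a file by hand, the same config can be generated from an environment variable. This is a sketch, not an official setup script: the environment variable name `AGENTPMT_BEARER_TOKEN` is an assumption borrowed from the placeholder in the config, and `build_hosted_config` is a hypothetical helper. It also shows why the `x-instance-metadata` value has escaped quotes: the header value is itself a JSON document stored as a string.

```python
import json
import os


def build_hosted_config(token: str) -> dict:
    """Build the hosted MCP config block shown above for a given token."""
    metadata = {"client": "generic-mcp", "platform": "remote"}
    return {
        "mcpServers": {
            "agentpmt": {
                "type": "streamable-http",
                "url": "https://api.agentpmt.com/mcp",
                "headers": {
                    "Authorization": f"Bearer {token}",
                    # The header value is a compact JSON string, so it is
                    # serialized once here and escaped again on output.
                    "x-instance-metadata": json.dumps(metadata, separators=(",", ":")),
                },
            }
        }
    }


if __name__ == "__main__":
    # Assumed variable name; use whatever your secret store provides.
    token = os.environ.get("AGENTPMT_BEARER_TOKEN")
    if not token:
        raise SystemExit("Set AGENTPMT_BEARER_TOKEN before generating the config.")
    print(json.dumps(build_hosted_config(token), indent=2))
```

Where the client offers a dedicated secret field, storing the token there is preferable to writing it into a config file.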
How does an external agent use this tool?
After the external agent is connected to an Agent Group that can use this tool, paste this prompt into the agent:
Agent prompt
Use the AgentPMT-Tool-Search-and-Execution tool. First call action 'get_instructions' so you know how to use the tool search interface. Then call action 'get_schema' with tool_id 6a054f5c90a57115271c1316 ("Image Generation Agent"). After reading the schema and any returned instructions, tell me what this tool can do, what inputs it needs, and what you need from me before running it. Do not call action 'call_tool' until I confirm the request and provide the required parameters.
The agent should fetch the tool schema first, collect the required parameters for your request, and then call the tool through AgentPMT.
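Under the hood, an MCP client expresses each of these steps as a JSON-RPC `tools/call` request. The sketch below builds the first two requests without sending them. The `tools/call` envelope follows the MCP specification, and the tool name and tool_id come from the prompt above, but the argument keys `action` and `tool_id` are assumptions inferred from that prompt; the authoritative shape is whatever `get_instructions` and `get_schema` return.

```python
import json

SEARCH_TOOL = "AgentPMT-Tool-Search-and-Execution"
IMAGE_TOOL_ID = "6a054f5c90a57115271c1316"  # "Image Generation Agent"


def tools_call(request_id: int, arguments: dict) -> dict:
    """Wrap tool arguments in an MCP JSON-RPC tools/call request."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": SEARCH_TOOL, "arguments": arguments},
    }


def discovery_requests() -> list[dict]:
    """Requests for the two read-only steps: instructions, then schema.

    The argument keys are assumed from the agent prompt, not from a schema.
    """
    steps = [
        {"action": "get_instructions"},
        {"action": "get_schema", "tool_id": IMAGE_TOOL_ID},
    ]
    return [tools_call(i, args) for i, args in enumerate(steps, start=1)]


if __name__ == "__main__":
    for request in discovery_requests():
        print(json.dumps(request, indent=2))
```

The `call_tool` step is deliberately left out: its parameters depend on the returned schema, and the prompt asks the agent to wait for confirmation before making that call.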
Can I edit an existing image or keep a product consistent across scenes?
Yes. Attach up to four reference images to guide subject, style, or scene. Reference-image editing keeps the same subject across new backgrounds, so you can place the same product in different settings or carry one look through an entire campaign.
What aspect ratios are supported?
14 aspect ratios, including 1:1, 16:9, 9:16, 21:9, and 4:5, plus ultra-wide 4:1 and 8:1 banner formats — so a single prompt can produce variants for an omnichannel campaign.
What is the Image Generation Agent?
An AI image generator powered by Nano Banana (Google Gemini 3 Flash Image). Create photorealistic product photos, marketing creative, social graphics, icons, concept art, and brand assets from a text prompt, or edit an existing image with reference photos — directly in chat, agents, and workflows.
What resolutions can I generate?
Choose the tier that fits the job: a low-cost budget draft for ideation, efficient 0.5K and 1K finals, crisp 2K for social and product assets, and 4K for print, ads, and large-format use.
Where do my images go after they're generated?
Every image is auto-saved to AgentPMT File Manager and returned with a download link, file_id, width, height, MIME type, and size — ready to drop into chat, hand to another tool, or use in an automated workflow.
Who is it built for?
Designers, marketers, e-commerce sellers, content creators, and AI agents that need on-demand, high-quality visuals at any resolution without leaving the conversation.










