AgentPMT
Image Generation Agent

Available Actions

Each successful request consumes credits as outlined below.

  1. generate_budget_image: 8 cr
  2. generate_image_0_5k: 10 cr
  3. generate_image_1k: 15 cr
  4. generate_image_2k: 25 cr
  5. generate_image_4k: 40 cr
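
As a rough budgeting aid, the per-action prices above can be kept in a lookup table. The sketch below is only an illustration: the `estimate_credits` helper is hypothetical and not part of the tool, and it assumes every planned call succeeds, since only successful requests consume credits.

```python
# Credit prices per successful request, copied from the Available Actions list.
CREDITS_PER_ACTION = {
    "generate_budget_image": 8,
    "generate_image_0_5k": 10,
    "generate_image_1k": 15,
    "generate_image_2k": 25,
    "generate_image_4k": 40,
}


def estimate_credits(planned_calls: dict[str, int]) -> int:
    """Return total credits for a plan of {action_name: successful_call_count}."""
    total = 0
    for action, call_count in planned_calls.items():
        if action not in CREDITS_PER_ACTION:
            raise KeyError(f"unknown action: {action}")
        if call_count < 0:
            raise ValueError("call counts must be non-negative")
        total += CREDITS_PER_ACTION[action] * call_count
    return total


if __name__ == "__main__":
    # Ten budget drafts for ideation, then two 4K finals: 10 * 8 + 2 * 40.
    plan = {"generate_budget_image": 10, "generate_image_4k": 2}
    print(estimate_credits(plan))  # prints 160
```

Drafting at the budget tier and rendering only the chosen concepts at 2K or 4K keeps most of the spend on images that will actually ship.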

Details

AI image generator powered by Google Gemini 3 Flash Image, the "Nano Banana" model family. Create photorealistic product photography, marketing creative, social graphics, app icons, concept art, hero images, and brand assets from a single text prompt, or edit an existing image by passing up to four reference photos for style, subject, or scene guidance. Choose your output tier: a low-cost budget draft for ideation, or crisp 0.5K, 1K, 2K, and 4K final renders for print, ads, e-commerce, and presentation use. Supports 14 aspect ratios, including 1:1, 16:9, 9:16, 21:9, 4:5, and ultra-wide 8:1 banner formats. Every generated image is auto-saved to AgentPMT File Manager and returned with a 7-day signed download URL, file_id, width, height, MIME type, and size in bytes, ready to drop into chat, hand off to another tool, or pull into a workflow. Built for designers, marketers, e-commerce sellers, content creators, and AI agents that need on-demand visuals without leaving the conversation.

Use Cases

AI image generation, Nano Banana image creation, Google Gemini image API, text-to-image, image editing with reference photos, product photography mockups, hero banner generation, social media graphics for Instagram and TikTok and LinkedIn, e-commerce product visuals, concept art, app and product icon design, marketing campaign creative, ad creative generation, brand asset production, style transfer with reference images, photoshoot replacement, background swapping, ultra-wide 21:9 banner generation, 4K poster rendering, multi-aspect-ratio variants for omnichannel campaigns, AI-generated stock imagery, illustration generation, storyboard panels, presentation graphics, content marketing visuals, blog header images, YouTube thumbnails, podcast cover art, book cover generation, packaging mockups, real estate listing renders, automated visual content pipelines for AI agents

Dynamic MCP Setup

Connect once through AgentPMT Dynamic MCP, then use approved tools from the same agent connection.

30 Second Setup

STDIO connector for Claude Code, Codex, Cursor, Zed, and other clients that require STDIO or custom connections.

npm install -g @agentpmt/mcp-router
agentpmt-setup

Hosted Streamable HTTPS

MCP endpoint for browser-based apps like ChatGPT, Claude, and Grok, or for any time you want a streamable connection with no local install.

https://api.agentpmt.com/mcp

Config Example

Use the hosted endpoint directly in clients that support remote MCP. Store your Bearer token in the client config or secret field.

{
  "mcpServers": {
    "agentpmt": {
      "type": "streamable-http",
      "url": "https://api.agentpmt.com/mcp",
      "headers": {
        "Authorization": "Bearer <AGENTPMT_BEARER_TOKEN>",
        "x-instance-metadata": "{\"client\":\"generic-mcp\",\"platform\":\"remote\"}"
      }
    }
  }
}
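
To avoid pasting the token into a file by hand, a small script can fill the placeholder from an environment variable. This is a sketch under assumptions: exporting the token as an environment variable named `AGENTPMT_BEARER_TOKEN` and the `build_config` helper are both hypothetical, and where the resulting JSON belongs depends on your client.

```python
import json
import os


def build_config(token: str) -> dict:
    """Return the hosted-endpoint MCP config from the example, with the token filled in."""
    if not token:
        raise ValueError("a non-empty Bearer token is required")
    return {
        "mcpServers": {
            "agentpmt": {
                "type": "streamable-http",
                "url": "https://api.agentpmt.com/mcp",
                "headers": {
                    "Authorization": f"Bearer {token}",
                    # This header value is itself a compact JSON string.
                    "x-instance-metadata": json.dumps(
                        {"client": "generic-mcp", "platform": "remote"},
                        separators=(",", ":"),
                    ),
                },
            }
        }
    }


if __name__ == "__main__":
    # Assumption: the token was exported as AGENTPMT_BEARER_TOKEN in the shell.
    token = os.environ.get("AGENTPMT_BEARER_TOKEN", "")
    if token:
        print(json.dumps(build_config(token), indent=2))
    else:
        print("Set AGENTPMT_BEARER_TOKEN before generating the config.")
```

Generating the file this way keeps the token out of version control; if your client offers a dedicated secret field, that is usually the better place for it.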

About this Product

AI image generation powered by Nano Banana (Google Gemini 3 Flash Image)

Turn a text prompt into polished, photorealistic visuals in seconds. The Image Generation Agent runs on Google's Gemini 3 Flash Image model — the "Nano Banana" family — to create marketing creative, product photography, social graphics, icons, concept art, and brand assets, or to edit an existing image with reference photos. Generate by hand or wire it into your agents and workflows for on-demand visuals without leaving the conversation.

What you can create

  1. Photorealistic product photography and e-commerce mockups.
  2. Marketing and ad creative, hero banners, and campaign visuals.
  3. Social graphics for Instagram, TikTok, LinkedIn, and YouTube thumbnails.
  4. App and product icons, concept art, illustrations, and storyboard panels.
  5. Brand assets, packaging mockups, blog headers, and presentation graphics.

Generate or edit

Text-to-image

Describe what you want and get a finished image — composition, lighting, and style follow your prompt.

Reference-image editing

Attach up to four reference photos to guide subject, style, or scene. Keep the same product across new backgrounds, restyle a shot, swap a setting, or carry a consistent look through an entire campaign — the subject stays locked from one render to the next.

Pick your resolution

  1. Budget draft — fast, low-cost ideation and previews.
  2. 0.5K & 1K — efficient standard finals for web and social.
  3. 2K — crisp social, presentation, and product assets.
  4. 4K — highest-resolution renders for print, ads, and large-format use.

14 aspect ratios, including ultra-wide banners

Render square, portrait, landscape, and cinematic formats — 1:1, 16:9, 9:16, 21:9, 4:5, and ultra-wide 4:1 and 8:1 banners — so one prompt can fuel an omnichannel campaign.
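
When planning variants for several channels, it can help to turn a ratio string into an exact value and an orientation. The sketch below covers only the ratios named on this page (the full list of 14 is not given here), and `parse_ratio` and `orientation` are hypothetical helpers rather than part of the tool.

```python
from fractions import Fraction

# Only the ratios named on this page; the other supported ratios are not listed here.
NAMED_RATIOS = ["1:1", "16:9", "9:16", "21:9", "4:5", "4:1", "8:1"]


def parse_ratio(ratio: str) -> Fraction:
    """Parse a 'W:H' string into an exact width/height fraction."""
    width, height = ratio.split(":")
    return Fraction(int(width), int(height))


def orientation(ratio: str) -> str:
    """Classify a 'W:H' ratio as square, landscape, or portrait."""
    value = parse_ratio(ratio)
    if value == 1:
        return "square"
    return "landscape" if value > 1 else "portrait"


if __name__ == "__main__":
    for name in NAMED_RATIOS:
        print(f"{name}: {orientation(name)}")
```

Grouping requests by orientation is one way to decide which channels can share a single render and which need their own.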

Ready for your workflow

Every image is auto-saved to AgentPMT File Manager and returned with a download link plus its file_id, width, height, MIME type, and size — ready to drop into chat, hand off to another tool, or pull into an automated pipeline.
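
For pipelines that pass the result on to another step, the returned metadata can be held in a small typed record. This is a minimal sketch, not the tool's actual response format: `file_id`, `width`, and `height` are named on this page, while the keys `download_url`, `mime_type`, and `size_bytes` and the sample payload are assumptions for illustration. The 7-day lifetime of the signed download URL comes from the product details; the tool schema is the authority on the real field names.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone

# The product details state that signed download URLs last 7 days.
SIGNED_URL_LIFETIME = timedelta(days=7)


@dataclass(frozen=True)
class GeneratedImage:
    """Metadata for one generated image (key names partly assumed, see above)."""

    file_id: str
    download_url: str
    width: int
    height: int
    mime_type: str
    size_bytes: int

    @classmethod
    def from_response(cls, payload: dict) -> "GeneratedImage":
        """Build a record from a response dict, failing loudly on missing keys."""
        return cls(
            file_id=str(payload["file_id"]),
            download_url=str(payload["download_url"]),
            width=int(payload["width"]),
            height=int(payload["height"]),
            mime_type=str(payload["mime_type"]),
            size_bytes=int(payload["size_bytes"]),
        )


def url_expires_at(created_at: datetime) -> datetime:
    """Return when a signed URL issued at created_at stops working."""
    return created_at + SIGNED_URL_LIFETIME


if __name__ == "__main__":
    # Hypothetical payload with placeholder values, for illustration only.
    sample = {
        "file_id": "example-file-id",
        "download_url": "https://example.com/signed/placeholder.png",
        "width": 1024,
        "height": 1024,
        "mime_type": "image/png",
        "size_bytes": 123456,
    }
    image = GeneratedImage.from_response(sample)
    print(image.file_id, image.width, image.height)
    print(url_expires_at(datetime.now(timezone.utc)).isoformat())
```

Because the link expires, a long-running workflow should store the `file_id` and fetch a fresh link when needed rather than caching the URL.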

Who it's for

Designers, marketers, e-commerce sellers, content creators, and AI agents that need on-demand, high-quality visuals at any resolution.

Frequently Asked Questions

How do I connect this tool to an external agent?

Install commands

npm install -g @agentpmt/mcp-router
agentpmt-setup

Hosted MCP config

{
  "mcpServers": {
    "agentpmt": {
      "type": "streamable-http",
      "url": "https://api.agentpmt.com/mcp",
      "headers": {
        "Authorization": "Bearer <AGENTPMT_BEARER_TOKEN>",
        "x-instance-metadata": "{\"client\":\"generic-mcp\",\"platform\":\"remote\"}"
      }
    }
  }
}

How does an external agent use this tool?

Agent prompt

Use the AgentPMT-Tool-Search-and-Execution tool. First call action 'get_instructions' so you know how to use the tool search interface. Then call action 'get_schema' with tool_id 6a054f5c90a57115271c1316 ("Image Generation Agent"). After reading the schema and any returned instructions, tell me what this tool can do, what inputs it needs, and what you need from me before running it. Do not call action 'call_tool' until I confirm the request and provide the required parameters.
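
For agents driven by code rather than a chat prompt, the same discovery sequence can be sketched as MCP `tools/call` requests. The JSON-RPC envelope follows the MCP specification, and the tool name, action names, and tool_id come from the prompt above. The argument keys `action` and `tool_id` are assumptions; the response to `get_instructions` is the authority on the real argument shape. The sketch only builds the payloads and does not send them.

```python
import json
from itertools import count

TOOL_NAME = "AgentPMT-Tool-Search-and-Execution"
IMAGE_AGENT_TOOL_ID = "6a054f5c90a57115271c1316"

# JSON-RPC requests need unique ids within a session.
_request_ids = count(1)


def tools_call(arguments: dict) -> dict:
    """Build an MCP JSON-RPC 2.0 'tools/call' request for the search tool."""
    return {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": TOOL_NAME, "arguments": arguments},
    }


# Steps 1 and 2: learn the interface, then fetch the image agent's schema.
# Assumption: arguments are passed as {"action": ..., "tool_id": ...}.
discovery_requests = [
    tools_call({"action": "get_instructions"}),
    tools_call({"action": "get_schema", "tool_id": IMAGE_AGENT_TOOL_ID}),
]

# Step 3 ('call_tool') is deliberately not built here: as in the prompt,
# it should wait until the user confirms the request and its parameters.

if __name__ == "__main__":
    for request in discovery_requests:
        print(json.dumps(request, indent=2))
```

Keeping discovery separate from execution mirrors the prompt: nothing that consumes credits runs until the schema has been read and the request confirmed.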

Can I edit an existing image or keep a product consistent across scenes?

What aspect ratios are supported?

What is the Image Generation Agent?

What resolutions can I generate?

Where do my images go after they're generated?

Who is it built for?

Workflows Using This Tool

Workflow
Saves ~3 hr
Turn a topic or a content-calendar spreadsheet into a publish-ready, fact-checked blog article written in a natural human voice. This AI blog writing workflow picks the next due topic from your Google Sheet (or takes one directly), researches it across live news and authoritative web sources, builds a sourced fact sheet and SEO outline, then drafts the full long-form article with a human-style writing agent that writes only from verified facts. Every draft runs through an automated writing quality check that catches robotic, banned AI phrases and rewrites them until the copy passes. A custom hero image is generated to match the story, the finished article is assembled into a formatted Google Doc with a sources section, the run is logged back to your content calendar, and the doc link lands in your inbox. Ideal for content marketing teams, SEO agencies, founders, newsletters, and solo bloggers who want an AI blog post generator and content automation pipeline that delivers consistent, on-brand, long-form SEO content without the research grind or the telltale AI voice.
