AgentPMT
Image Generation Agent

Image Generation Agent

Model

Available ActionsEach successful request consumes credits as outlined below.

generate_budget_image8crgenerate_image_0_5k10crgenerate_image_1k15crgenerate_image_2k25crgenerate_image_4k40cr

Details

AI image generator powered by Google Gemini 3 Flash Image and "Nano Banana". Create photorealistic product photography, marketing creative, social graphics, app icons, concept art, hero images, and brand assets from a single text prompt — or edit an existing image by passing up to four reference photos for style, subject, or scene guidance. Choose your output tier: a low-cost budget draft for ideation, or crisp 0.5K, 1K, 2K, and 4K final renders for print, ads, e-commerce, and presentation use. Supports 14 aspect ratios including 1:1, 16:9, 9:16, 21:9, 4:5, and ultra-wide 8:1 banner formats. Every generated image is auto-saved to AgentPMT File Manager with a 7-day signed download URL, file_id, width, height, MIME type, and size bytes — ready to drop into chat, hand off to another tool, or pull into a workflow. Built for designers, marketers, e-commerce sellers, content creators, and AI agents that need on-demand visuals without leaving the conversation.

Use Cases

AI image generation, Nano Banana image creation, Google Gemini image API, text-to-image, image editing with reference photos, product photography mockups, hero banner generation, social media graphics for Instagram and TikTok and LinkedIn, e-commerce product visuals, concept art, app and product icon design, marketing campaign creative, ad creative generation, brand asset production, style transfer with reference images, photoshoot replacement, background swapping, ultra-wide 21:9 banner generation, 4K poster rendering, multi-aspect-ratio variants for omnichannel campaigns, AI-generated stock imagery, illustration generation, storyboard panels, presentation graphics, content marketing visuals, blog header images, YouTube thumbnails, podcast cover art, book cover generation, packaging mockups, real estate listing renders, automated visual content pipelines for AI agents

Actions(5)

generate_budget_image8cr5 params(1 required)

Create or edit a lower-cost image from a prompt and optional reference images. Use for drafts, previews, and standard 1024px-class outputs.

Create or edit a lower-cost image from a prompt and optional reference images. Use for drafts, previews, and standard 1024px-class outputs.

promptrequiredstring

Image generation or edit instruction, 3 to 4000 characters.

aspect_ratiostring

Desired output aspect ratio. Default is 1:1.

Values:
1:12:33:23:44:34:55:49:1616:921:9
reference_imagesarray

Optional reference images for edits or style/subject guidance. Maximum 4.

Array of: object
filenamestring

Optional output filename base. Extension is inferred from generated image MIME type.

expiration_daysinteger

File Manager expiration in days, from 1 to 7. Default is 7.

generate_image_0_5k10cr5 params(1 required)

Create or edit a high-efficiency 0.5K image from a prompt and optional reference images.

Create or edit a high-efficiency 0.5K image from a prompt and optional reference images.

promptrequiredstring

Image generation or edit instruction, 3 to 4000 characters.

aspect_ratiostring

Desired output aspect ratio. Default is 1:1.

Values:
1:11:41:82:33:23:44:14:34:55:48:19:1616:921:9
reference_imagesarray

Optional reference images for edits or style/subject guidance. Maximum 4.

Array of: object
filenamestring

Optional output filename base. Extension is inferred from generated image MIME type.

expiration_daysinteger

File Manager expiration in days, from 1 to 7. Default is 7.

generate_image_1k15cr5 params(1 required)

Create or edit a 1K image from a prompt and optional reference images.

Create or edit a 1K image from a prompt and optional reference images.

promptrequiredstring

Image generation or edit instruction, 3 to 4000 characters.

aspect_ratiostring

Desired output aspect ratio. Default is 1:1.

Values:
1:11:41:82:33:23:44:14:34:55:48:19:1616:921:9
reference_imagesarray

Optional reference images for edits or style/subject guidance. Maximum 4.

Array of: object
filenamestring

Optional output filename base. Extension is inferred from generated image MIME type.

expiration_daysinteger

File Manager expiration in days, from 1 to 7. Default is 7.

generate_image_2k25cr5 params(1 required)

Create or edit a 2K image from a prompt and optional reference images.

Create or edit a 2K image from a prompt and optional reference images.

promptrequiredstring

Image generation or edit instruction, 3 to 4000 characters.

aspect_ratiostring

Desired output aspect ratio. Default is 1:1.

Values:
1:11:41:82:33:23:44:14:34:55:48:19:1616:921:9
reference_imagesarray

Optional reference images for edits or style/subject guidance. Maximum 4.

Array of: object
filenamestring

Optional output filename base. Extension is inferred from generated image MIME type.

expiration_daysinteger

File Manager expiration in days, from 1 to 7. Default is 7.

generate_image_4k40cr5 params(1 required)

Create or edit a 4K image from a prompt and optional reference images.

Create or edit a 4K image from a prompt and optional reference images.

promptrequiredstring

Image generation or edit instruction, 3 to 4000 characters.

aspect_ratiostring

Desired output aspect ratio. Default is 1:1.

Values:
1:11:41:82:33:23:44:14:34:55:48:19:1616:921:9
reference_imagesarray

Optional reference images for edits or style/subject guidance. Maximum 4.

Array of: object
filenamestring

Optional output filename base. Extension is inferred from generated image MIME type.

expiration_daysinteger

File Manager expiration in days, from 1 to 7. Default is 7.

About this Product

AI image generation powered by Nano Banana (Google Gemini 3 Flash Image)

Turn a text prompt into polished, photorealistic visuals in seconds. The Image Generation Agent runs on Google's Gemini 3 Flash Image model — the "Nano Banana" family — to create marketing creative, product photography, social graphics, icons, concept art, and brand assets, or to edit an existing image with reference photos. Generate by hand or wire it into your agents and workflows for on-demand visuals without leaving the conversation.

What you can create

  1. Photorealistic product photography and e-commerce mockups.
  2. Marketing and ad creative, hero banners, and campaign visuals.
  3. Social graphics for Instagram, TikTok, LinkedIn, and YouTube thumbnails.
  4. App and product icons, concept art, illustrations, and storyboard panels.
  5. Brand assets, packaging mockups, blog headers, and presentation graphics.

Generate or edit

Text-to-image

Describe what you want and get a finished image — composition, lighting, and style follow your prompt.

Reference-image editing

Attach up to four reference photos to guide subject, style, or scene. Keep the same product across new backgrounds, restyle a shot, swap a setting, or carry a consistent look through an entire campaign — the subject stays locked from one render to the next.

Pick your resolution

  1. Budget draft — fast, low-cost ideation and previews.
  2. 0.5K & 1K — efficient standard finals for web and social.
  3. 2K — crisp social, presentation, and product assets.
  4. 4K — highest-resolution renders for print, ads, and large-format use.

14 aspect ratios, including ultra-wide banners

Render square, portrait, landscape, and cinematic formats — 1:1, 16:9, 9:16, 21:9, 4:5, and ultra-wide 4:1 and 8:1 banners — so one prompt can fuel an omnichannel campaign.

Ready for your workflow

Every image is auto-saved to AgentPMT File Manager and returned with a download link plus its file_id, width, height, MIME type, and size — ready to drop into chat, hand off to another tool, or pull into an automated pipeline.

Who it's for

Designers, marketers, e-commerce sellers, content creators, and AI agents that need on-demand, high-quality visuals at any resolution.

Frequently Asked Questions

Can I edit an existing image or keep a product consistent across scenes?

Yes. Attach up to four reference images to guide subject, style, or scene. Reference-image editing keeps the same subject across new backgrounds, so you can place the same product in different settings or carry one look through an entire campaign.

What aspect ratios are supported?

14 aspect ratios, including 1:1, 16:9, 9:16, 21:9, and 4:5, plus ultra-wide 4:1 and 8:1 banner formats — so a single prompt can produce variants for an omnichannel campaign.

What is the Image Generation Agent?

An AI image generator powered by Nano Banana (Google Gemini 3 Flash Image). Create photorealistic product photos, marketing creative, social graphics, icons, concept art, and brand assets from a text prompt, or edit an existing image with reference photos — directly in chat, agents, and workflows.

What resolutions can I generate?

Choose the tier that fits the job: a low-cost budget draft for ideation, efficient 0.5K and 1K finals, crisp 2K for social and product assets, and 4K for print, ads, and large-format use.

Where do my images go after they're generated?

Every image is auto-saved to AgentPMT File Manager and returned with a download link, file_id, width, height, MIME type, and size — ready to drop into chat, hand to another tool, or use in an automated workflow.

Who is it built for?

Designers, marketers, e-commerce sellers, content creators, and AI agents that need on-demand, high-quality visuals at any resolution without leaving the conversation.

Looking for help integrating AI into your business? Set up a free consultation.