Skip to main content

Image Generator Node

The Image Generator node is the primary image creation engine in Flow Studio. You feed it a prompt and optional reference assets, choose a model and style, and it produces high-quality AI-generated images at up to 4K resolution.

What It Does

This node takes text prompts (and optionally reference images) as input and generates images using AI models. You control the artistic style, aspect ratio, output resolution, and camera angle. When connected to a Prompt Generator node in Multiply mode, it can produce a batch of images -- one per prompt variation.

Configuration

ParameterDescriptionOptions / Range
ModelThe AI model used for generationSee models table below
StyleArtistic direction applied to the outputauto, realistic, artistic, anime, 3d
Aspect RatioWidth-to-height ratio of the generated image1:1, 16:9, 9:16, 4:3, 3:4
SizeOutput resolution tier1K, 2K, 4K
Camera AnglePerspective control for the generated sceneConfigurable per generation
Multi-Angle ModeGenerates the same scene from multiple camera angles in one executionOn / Off

Available Models

ModelCredits/ImageNotes
Z-Image5Fastest. Text-to-image only, custom resolution (512-2048px)
Grok Imagine22Creative/expressive style (xAI via fal.ai)
Nano Banana (Gemini 2.5 Flash)40Fast, creative generation
SeeDream 4.540High quality, auto-ratio, custom resolution, 2K+
Recraft V380Vector illustrations & icons
Nano Banana Pro (Gemini 3 Pro) 1K/2K150High quality, up to 14 reference images
GPT Image 210–410OpenAI. Quality tiers (low / medium / high), up to 16 reference images, best-in-class text rendering (Latin + CJK)
Nano Banana Pro (Gemini 3 Pro) 4K280Maximum quality 4K output

GPT Image 2

GPT Image 2 introduces per-output quality tiers that directly affect cost and detail level. The quality selector defaults to high.

QualityUse CaseCredits Range
LowDrafts, quick iterations10–60 cr
MediumStandard production50–200 cr
High (default)Final assets, hero images220–410 cr

The exact cost depends on the chosen size. Larger sizes and higher quality tiers cost more.

Aspect ratios supported: 1:1, 3:2, 2:3, 16:9, 9:16, 4:3, 3:4, plus custom resolution (max edge 3840px, between 655K and 8.3MP, dimensions multiples of 16, aspect ratio ≤ 3:1).

Reference images: Up to 16 input images for character consistency, style transfer, or composition.

Text rendering: GPT Image 2 produces sharper, more legible text in Latin and CJK scripts than other models — a strong choice for posters, packaging, and infographics.

Edit mode and aspect ratio: When you supply a reference image to GPT Image 2, the model respects the aspect ratio you've configured on the node — it does not auto-size to the source image. Set the ratio explicitly (or use one of the custom resolutions above) to control the output dimensions.

Batch quality editing: Select multiple GPT Image 2 nodes on the canvas and the floating toolbar exposes a quality selector. Switching it updates the quality tier (low / medium / high) for every selected node at once.

Available Styles

StyleDescription
autoThe model automatically selects the best style for the prompt
realisticPhotorealistic rendering with natural lighting and detail
artisticPainterly, illustrative, or stylized output
animeJapanese animation aesthetic
3dThree-dimensional rendered appearance

Aspect Ratios

RatioCommon Use Case
1:1Social media posts, profile images, product shots
16:9Landscape banners, video thumbnails, desktop wallpapers
9:16Vertical stories, mobile wallpapers, portrait banners
4:3Presentations, traditional photo prints
3:4Portrait-oriented prints, book covers

Usage

  1. Drag an Image Generator node onto the canvas.
  2. Connect a Prompt Generator to the prompt input and optionally connect an Asset Node as a reference image.
  3. Select your preferred Model, Style, Aspect Ratio, and Size.
  4. Optionally configure Camera Angle or enable Multi-Angle Mode.
  5. Execute the node to generate your image(s).
tip

For product photography workflows, use realistic style with a 4:3 or 1:1 ratio and connect a product asset as a reference to maintain visual consistency.