AI models

Every renderer you need — wired into one direction workspace.

Veyra is model-forward by design: pick the right engine for the shot, from Veo 3.1 and Kling 3 Pro to Nano Banana 2 and OmniHuman 1.5 — then stay in the timeline instead of re-learning a new app per vendor.

Request access How this shows up in features

Veo 3.1Nano Banana 2Kling 3 ProKling 2.5 Turbo StdKling 2.5 Turbo ProKling 2 MasterKling 1 StdOmniHuman 1.5Hailuo 02 StdPixVerse v6Seedance 1 ProLTX 2.3 FastLTX 2 ProPika 2.2MAGI-1Imagen 4 UltraImagen 4 falFLUX 2 ProSeedream 4.5Ideogram v2Kling LipSyncGemini (planner default)Demucs — vocal separationFaster-Whisper — lyrics transcriptionFLUX ProFLUX SchnellGemini — audio/lyric analysisGemini 3.1 Flash Image (preview) — sub-routesGPT (planner option)Imagen 4Imagen 4 FastNano BananaNano Banana ProOmniHumanVeo 2Veo 3Veo 3 FastVeo 3.1 FastVeo 3.1 Lite

Built for “latest and greatest”

When new preview tiers land (Veo 3.1, Nano Banana 2, Kling 3, OmniHuman 1.5…), Veyra is structured so they can show up in-product without you rebuilding a pipeline.

Not just video — a full production stack

Stills, motion, performance capture, analysis, and transcription: one creative space with credits and exports that map to how MVs are actually made.

Full in-app model catalogue

The list below is what Veyra can route to today. Exact defaults and per-environment overrides follow your deployment configuration (API keys, env model picks, and preview availability from providers).

Stills + look dev

Every image model from the in-app stills picker, including Google-native, Imagen, FLUX, and the FAL text-to-image rows — IDs below match production.

Nano Banana

gemini-2.5-flash-image

Nano Banana 2

Preview

gemini-3.1-flash-image-preview

Nano Banana Pro

gemini-3-pro-image-preview

Imagen 4

imagen-4.0-generate-001

Imagen 4 Fast

imagen-4.0-fast-generate-001

Imagen 4 Ultra

imagen-4.0-ultra-generate-001

FLUX Schnell

fal-ai/flux/schnell

FLUX Pro

fal-ai/flux-pro

FLUX 2 Pro

fal-ai/flux-2-pro

Imagen 4 fal

Preview

fal-ai/imagen4/preview

Imagen 4 (preview) on the FAL text-to-image path — `shortLabel` matches the in-app picker.

Seedream 4.5

fal-ai/bytedance/seedream/v4.5/text-to-image

Ideogram v2

fal-ai/ideogram/v2

Video (Veo + FAL i2v)

Full Veo list plus the complete FAL / partner i2v catalogue: Hailuo, Kling, OmniHuman, LTX, Pika, and every Kling tier exposed in the app. Each row is the same `fal-ai/...` id you’ll see in exports and config.

Veo 3.1

Preview

veo-3.1-generate-preview

Google Veo 3.1 (preview) — up to 4K where supported in-app.

Veo 3.1 Fast

veo-3.1-fast-generate-preview

Veo 3.1 Lite

veo-3.1-lite-generate-preview

Veo 3

veo-3.0-generate-001

Veo 3 Fast

veo-3.0-fast-generate-001

Veo 2

Default

veo-2.0-generate-001

Hailuo 02 Std

fal-ai/minimax/hailuo-02/standard/image-to-video

PixVerse v6

fal-ai/pixverse/v6/image-to-video

Seedance 1 Pro

fal-ai/bytedance/seedance/v1/pro/image-to-video

LTX 2 Pro

fal-ai/ltx-2/image-to-video

LTX 2.3 Fast

fal-ai/ltx-2.3/image-to-video/fast

Pika 2.2

fal-ai/pika/v2.2/image-to-video

MAGI-1

fal-ai/magi/image-to-video

OmniHuman 1.5

New

fal-ai/bytedance/omnihuman/v1.5

Image + audio — performance and lip sync oriented.

OmniHuman

fal-ai/bytedance/omnihuman

Image + audio — performance / dialogue.

Kling 3 Pro

New

fal-ai/kling-video/v3/pro/image-to-video

Flagship Kling i2v (broad duration range).

Kling 2.5 Turbo Std

fal-ai/kling-video/v2.5-turbo/standard/image-to-video

Kling 2.5 Turbo Pro

fal-ai/kling-video/v2.5-turbo/pro/image-to-video

Kling 2 Master

fal-ai/kling-video/v2/master/image-to-video

Kling 1 Std

fal-ai/kling-video/v1/standard/image-to-video

Lip sync

Align mouth performance to audio when the cut calls for it.

Kling LipSync

fal-ai/kling-video/lipsync/audio-to-video

Kling — audio-to-video lip sync (FAL) for finishing closeups.

Planning + chat intelligence

Switch the brain behind the planner — Gemini (default) or GPT — same workspace, your choice per session where enabled.

Gemini (planner default)

gemini-2.5-pro

Multi-step planning + structured JSON, with a flash fallback. Override via `GEMINI_PLANNER_MODEL`.

GPT (planner option)

gpt-4.1

Gemini 3.1 Flash Image (preview) — sub-routes

Preview

gemini-3.1-flash-image-preview

Selected planner micro-flows use this preview image model (e.g. some environment/location tools).

Audio + lyrics intelligence

Analyse the track, transcribe lyrics with word timing, and split stems when you need vocal-forward alignment.

Gemini — audio/lyric analysis

Default `gemini-2.5-flash`, fallback `gemini-2.5-flash-lite` — overridable via `GEMINI_AUDIO_ANALYSIS_MODEL` and fallback env.

Faster-Whisper — lyrics transcription

On-device (beat service) transcription with selectable weights: tiny…large-v2, large-v3, plus .en variants where applicable.

Demucs — vocal separation

High-quality stem split before word-level lyrics when you want vocals isolated.