veo 3.1

Google's 4K AI Video Generator with Ingredients-to-Video & Native Audio

Key Features of Veo 3.1

Discover what makes Veo 3.1 stand out

4K Output

720p / 1080p / 4K Resolution

Veo 3.1 is one of the few AI video models offering true 4K output. Choose 720p for speed, 1080p for balance, or 4K for broadcast-quality footage at 24 fps.

Ingredients to Video

Multi-Image Reference Generation

Feed up to 3 reference images to guide character, object, or scene appearance. Veo 3.1 maintains visual consistency with your references throughout the entire clip — Google calls it 'Ingredients to Video.'

Native Audio

Rich Synchronized Audio Generation

Veo 3.1 generates native audio — natural conversations, ambient sound, and synchronized effects — directly with the video. No post-production audio needed.

Scene Extension

Extend Clips into Longer Sequences

Generate a clip and extend it seamlessly — Veo 3.1 maintains visual continuity across extensions, enabling longer narratives up to a minute or more by chaining segments.

How to Use Veo 3.1

Add Reference Images or Keyframes

Upload up to 3 reference images for Ingredients-to-Video mode, or set start/end frames for precise keyframing. Both modes maintain visual consistency throughout.

Write Your Prompt & Set Resolution

Describe your scene in natural language. Choose 720p, 1080p, or 4K resolution and 16:9 (landscape) or 9:16 (portrait) aspect ratio.

Generate in Up to 4K

Click Generate to create an 8-second video with native audio. Export in up to 4K resolution for professional production or social media.

FAQ About Veo 3.1

We've answered the most frequently asked questions

Veo 3.1 is Google's upgraded video model with true 4K output, enhanced 1080p quality, native vertical video support, richer audio generation, and the Ingredients-to-Video feature for multi-image reference.

Veo 3.1 outputs at 720p, 1080p, or 4K resolution. Each generation produces an 8-second clip at 24 fps in 16:9 or 9:16 format.

Ingredients to Video lets you upload up to 3 reference images as 'ingredients' — the model uses them to guide character appearance, object details, or scene composition while maintaining consistency.

Yes. Veo 3.1 generates rich native audio synchronized with the video — including natural conversations, ambient sounds, and sound effects — without needing a separate audio model.

Yes. Veo 3.1 supports scene extension — you can extend generated clips while maintaining visual continuity, chaining segments to create longer narratives.

Explore

Veo 3.1 delivers up to 4K resolution with Ingredients-to-Video reference, native audio, and scene extension — the most versatile video AI from Google DeepMind.

Try Veo 3.1 Now

veo 3.1

Key Features of Veo 3.1

720p / 1080p / 4K Resolution

Multi-Image Reference Generation

Rich Synchronized Audio Generation

Extend Clips into Longer Sequences

How to Use Veo 3.1

Add Reference Images or Keyframes

Write Your Prompt & Set Resolution

Generate in Up to 4K

FAQ About Veo 3.1

What is Veo 3.1 and how is it different from Veo 3?

What resolution and duration does Veo 3.1 support?

What is 'Ingredients to Video' in Veo 3.1?

Does Veo 3.1 generate audio?

Can I make videos longer than 8 seconds with Veo 3.1?

More AI Models

Seedance 2.0 Fast

Seedance 2.0

Kling 3.0

Kling 3.0 Omni

Veo 3.1 Fast

Seedance 1.5 Pro

Google's 4K Video AI. Your Creative Vision.