Native audio drafts
Generate a short clip where motion and the generated sound bed can be judged together.
AI Video Model
Open AuraTuner with Gemini Omni Video when the first question is whether a prompt-led clip can carry generated audio and a short storyboard. Start with 720p and a narrow 6-second brief before trying longer or higher-resolution runs.
Starter setup
Mode: Text to video
Length: 4, 6, 8, or 10 seconds
Audio: Generated audio enabled
Resolution: 720p starter
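If you track test runs outside the Studio, the starter setup above can be written down as a small settings record. The following is a minimal sketch only: the class, field names, and helper are hypothetical and are not an AuraTuner or Gemini Omni Video API. The allowed values are the lengths, resolutions, and modes this page mentions.

```python
from dataclasses import dataclass

# Values mentioned on this page. All names below are hypothetical and only
# illustrate the starter setup; they are not a real AuraTuner API.
ALLOWED_MODES = ("text-to-video", "video-to-video")
ALLOWED_LENGTHS = (4, 6, 8, 10)          # seconds
ALLOWED_RESOLUTIONS = ("720p", "1080p", "4K")


@dataclass(frozen=True)
class StarterSettings:
    """Defaults mirror the starter setup: prompt-only, 6 s, audio on, 720p."""
    mode: str = "text-to-video"
    length_seconds: int = 6
    audio_enabled: bool = True
    resolution: str = "720p"

    def problems(self) -> list[str]:
        """Return human-readable problems; an empty list means the settings fit."""
        found = []
        if self.mode not in ALLOWED_MODES:
            found.append(f"unknown mode: {self.mode}")
        if self.length_seconds not in ALLOWED_LENGTHS:
            found.append(f"length must be one of {ALLOWED_LENGTHS} seconds")
        if self.resolution not in ALLOWED_RESOLUTIONS:
            found.append(f"unknown resolution: {self.resolution}")
        if not self.audio_enabled:
            found.append("this path is meant to be judged with generated audio on")
        return found


if __name__ == "__main__":
    print(StarterSettings().problems())
    print(StarterSettings(length_seconds=7).problems())
```

Keeping the defaults at 6 seconds and 720p means a record created with no arguments matches the first run this page recommends.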
These are the practical jobs this setup helps you finish faster.
Generate a short clip where motion and the generated sound bed can be judged together.
Start from text when the idea, timing, and audio direction are still being explored.
Switch to video-to-video when an existing clip should guide the next pass.
Use this path before spending credits on variants or higher-quality runs.
Name the ambient bed, product sound, or narration feel in the prompt.
Use 4 or 6 seconds for the first timing check before longer runs.
Use 720p or 1080p to test direction, then move to 4K after the idea works.
Short answer
Use Gemini Omni Video when you need a short prompt-led video with generated audio in the same first pass. Keep the prompt narrow so you can judge audio fit, subject stability, and timing before moving to longer or higher-resolution settings.
Use this model when these jobs match your first run.
Short clips where ambient audio, timing, or spoken-style motion needs an early check.
Video-to-video experiments when an existing source clip should guide the result.
4 to 10 second tests with one subject, one camera move, and one audio job.
Keep the first generation narrow and easy to grade.
Keep the clip long enough to hear timing and short enough to catch drift.
Ask for ambient bed, product sound, or simple narration feel.
Do not ask for a full ad, cutdowns, and scene changes at once.
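The narrow first brief described above (one subject, one camera move, one audio job) can be sketched as a small prompt helper. This is an illustrative sketch under stated assumptions: the function names, the prompt layout, and the list of over-scoped terms are hypothetical, and nothing here is a format that Gemini Omni Video requires.

```python
# Hypothetical helper for drafting a narrow first prompt. The sentence layout
# and the term list are assumptions for illustration, not a required format.
OVERSCOPED_TERMS = ("full ad", "cutdown", "scene change")


def build_first_prompt(subject: str, camera_move: str, audio_job: str) -> str:
    """Combine exactly one subject, one camera move, and one audio job."""
    parts = [subject.strip(), camera_move.strip(), audio_job.strip()]
    if not all(parts):
        raise ValueError("subject, camera_move, and audio_job must all be non-empty")
    return f"{parts[0]}. Camera: {parts[1]}. Audio: {parts[2]}."


def overscoped_terms(prompt: str) -> list[str]:
    """Return any terms suggesting the prompt asks for too much at once."""
    lowered = prompt.lower()
    return [term for term in OVERSCOPED_TERMS if term in lowered]


if __name__ == "__main__":
    prompt = build_first_prompt(
        "A ceramic mug on a wooden table",
        "slow push-in",
        "quiet cafe ambient bed",
    )
    print(prompt)
    print(overscoped_terms(prompt))
```

Naming the audio job as its own field makes it harder to forget, which matters because this path is meant to be judged with sound and motion together.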
Avoid wasting credits by checking these constraints.
This is an audio-on video generation path, not a tool for syncing an uploaded audio track.
Use 720p or 1080p before testing 4K.
Video-to-video has a different cost profile from prompt-only runs.
Gemini Omni Video generates audio with the clip. AuraTuner runs it with audio enabled, so the first test prompt should say what kind of sound or timing you want to judge.
It does not sync video to an uploaded audio track. Treat this as generated audio with video, not audio-to-video.
This page's starter run is a 6-second 720p prompt-only generation, which costs 150 credits in AuraTuner. A 4-second 720p run costs fewer credits if you switch the duration in Studio.
Use video-to-video when an existing clip should guide the remix. Use text-to-video when the scene is still being discovered.
Use Gemini Omni Video for the generated clip and audio direction, not as a replacement for your editor. Keep final cuts, captions, music edits, and publishing review in your normal editor.