AISVIT / AI Video / Text to Video
Runway Gen-4.5 — Text to Video
Text to Video with Runway Gen-4.5 in AISVIT. Turn prompts into high-quality AI videos with advanced text-to-video models. Generate ad concepts, story scenes, product visuals, and social clips in minutes.
About this model
AI video model for premium 5 or 10 second cinematic clips with strong prompt adherence, image-guided starts, and realistic motion.
When is this model useful?
Runway Gen-4.5 is a strong fit when the clip needs to feel polished, directed, and visually expensive rather than like a rough prototype.
Best fit tasks
- Premium text-to-video ads, product films, fashion or lifestyle shots, atmospheric scenes, and concept trailers.
- Image-to-video animation when you already have a strong still frame, such as a product photo, poster, portrait, storyboard keyframe, or concept render.
- Camera-led shots where motion quality matters, including push-ins, reveal shots, orbit moves, dramatic close-ups, and stylized cinematic movement.
- Previsualization and creative pitching when a team needs a near-finished visual reference before live production or post-production.
Main advantages
- It is known for strong prompt adherence, so camera direction, motion cues, lighting, and scene intent tend to come through more clearly than in many fast models.
- Materials, textures, motion, and physical behavior usually look more believable, which helps with premium-looking brand and editorial content.
- It supports both text-only generation and image-guided generation, so you can start from an idea or from a locked first frame.
- The output often feels more cinematic and visually consistent frame to frame than cheaper models optimized mainly for speed.
Limitations to know
- In AISVIT, clips are limited to 5 or 10 seconds and the output is silent.
- The model works best when one generation is treated as one scene. Prompts that try to force too many sequential events into one clip may become unstable.
- When you upload a start image, that frame strongly controls composition, style, and subject placement, so contradictory prompt instructions may need extra retries.
- Like other video models, it can still struggle with exact text rendering, very complex interactions, perfect object permanence, and flawless cause-and-effect logic.
How to use this model
The easiest workflow is to write one clear scene description first, then add a start image or seed only if you need tighter control.
Simple workflow
- Write the prompt in plain language: what is in the scene, what happens, what the environment looks like, how the camera moves, and what mood the shot should have.
- If you are generating from text only, describe both the visual scene and the motion. If you upload a start image, focus the prompt more on motion, camera movement, and what should change over time.
- Choose 5 seconds for faster testing and cheaper iteration, or 10 seconds when the action needs more time to develop naturally.
- Select the frame shape you need: 16:9 for websites and YouTube, 9:16 for vertical social platforms, 1:1 for square feeds, or 4:3, 3:4, and 21:9 for more specific layouts.
- Upload one start image when you want the shot to begin from a specific product frame, portrait, illustration, or composition.
Supported inputs
- Required: a text prompt.
- Optional: one start image for image-guided generation.
- Aspect ratio choices in this integration: 16:9, 9:16, 4:3, 3:4, 1:1, and 21:9.
- Duration choices in this integration: 5 or 10 seconds.
- In the AISVIT upload flow, standard image formats such as JPG, PNG, and WEBP are the safest choice.
What you get
- A generated MP4 video file.
- 720p output at 24 frames per second in the current Runway Gen-4.5 workflow.
- A 5 or 10 second clip.
- Silent output in this integration, so voiceover, music, or sound design is added later if needed.
Other workflows for this model
More Text to Video models
AISVIT pricing details
- Fixed rate: 12 credits per second of video
- 5 seconds = 60 credits
- 10 seconds = 120 credits