Vercel AI Gateway adds Kling video models with multishot and audio generation

Kling Video Models Now Available

Kling video models are now integrated into Vercel's AI Gateway, making state-of-the-art video generation accessible through a unified API. This expansion includes the latest Kling 3.0 models alongside earlier versions, giving developers multiple options for performance and capability trade-offs.

Key Capabilities

Kling models are optimized for several video generation scenarios:

Image-to-Video: Strong animation of still images into video clips with realistic motion and physics
Multishot Narratives: Kling 3.0 can generate multi-scene videos from a single narrative prompt with intelligent shot composition
High-Quality Output: Supports up to 1080p generation in pro mode
Audio Generation: Creates synchronized sound effects and ambient audio alongside video
Precise Control: First and last frame control allows developers to specify exact start and end frames for scene transitions

Getting Started

Video generation is available for Pro and Enterprise plan users and paid AI Gateway customers. Developers have two main integration paths:

AI SDK 6: Use the generateVideo method for programmatic video generation with full control over model parameters and provider-specific options
AI Gateway Playground: A no-code interface for experimenting with video models, comparing providers, tweaking prompts, and downloading results directly from the browser

Available Models

The service offers six Kling model variants across different generations and use cases:

Model	Type	Highlights
Kling v3.0 Text-to-Video	Latest generation	Multishot support, highest quality
Kling v3.0 Image-to-Video	Latest generation	Multiple frame control with v3 quality
Kling v2.6 Text-to-Video	Production-ready	Audio generation support
Kling v2.6 Image-to-Video	Production-ready	Image reference with audio
Kling v2.5 Turbo Text-to-Video	Speed-optimized	Faster generation times
Kling v2.5 Turbo Image-to-Video	Speed-optimized	Faster generation for images

Usage Examples

The AI SDK provides straightforward integration for common workflows. Simple text-to-video generation requires only a prompt and basic parameters like aspect ratio and duration. Advanced multishot narratives can be created by structuring prompts as multi-scene stories and enabling Kling-specific options like intelligent shot composition and synchronized audio. For precise scene transitions, developers can provide both start and end frame images to control exactly how videos begin and conclude.

Kling Video Models Now Available

Key Capabilities

Getting Started

Available Models

Usage Examples

Products

Tags

Published

Source

Related News