Vercel AI Gateway adds Kling video models with multishot and audio generation
Vercel AI SDK · Vercel · feature, api, integration, model · vercel.com

Kling Video Models Now Available

Kling video models are now available through Vercel's AI Gateway, which exposes video generation through a unified API. The release includes the latest Kling 3.0 models alongside the earlier v2.6 and v2.5 Turbo versions, so developers can trade off output quality, features, and generation speed.

Key Capabilities

Kling models are optimized for several video generation scenarios:

  • Image-to-Video: Animates still images into video clips with realistic motion and physics
  • Multishot Narratives: Kling 3.0 can generate multi-scene videos from a single narrative prompt with intelligent shot composition
  • High-Quality Output: Supports up to 1080p generation in pro mode
  • Audio Generation: Creates synchronized sound effects and ambient audio alongside video
  • Precise Control: First and last frame control allows developers to specify exact start and end frames for scene transitions

Getting Started

Video generation is available for Pro and Enterprise plan users and paid AI Gateway customers. Developers have two main integration paths:

  • AI SDK 6: Use the generateVideo function for programmatic video generation with full control over model parameters and provider-specific options
  • AI Gateway Playground: A no-code interface for experimenting with video models, comparing providers, tweaking prompts, and downloading results directly from the browser

Available Models

The service offers six Kling model variants across different generations and use cases:

Model                           | Type              | Highlights
Kling v3.0 Text-to-Video        | Latest generation | Multishot support, highest quality
Kling v3.0 Image-to-Video       | Latest generation | Multiple frame control with v3 quality
Kling v2.6 Text-to-Video        | Production-ready  | Audio generation support
Kling v2.6 Image-to-Video       | Production-ready  | Image reference with audio
Kling v2.5 Turbo Text-to-Video  | Speed-optimized   | Faster generation times
Kling v2.5 Turbo Image-to-Video | Speed-optimized   | Faster generation for images
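
For applications that choose a variant at runtime, the table above could be captured as typed data. The following is a minimal sketch only: the type names, field names, and the findVariant helper are hypothetical, and no gateway model IDs appear because the source does not list them.

```typescript
// Hypothetical catalog of the six variants listed in the table above.
// Type names, field names, and tier labels are illustrative assumptions.
type InputKind = "text-to-video" | "image-to-video";
type Tier = "latest" | "production-ready" | "speed-optimized";

interface KlingVariant {
  family: string; // model generation, e.g. "Kling v3.0"
  input: InputKind; // what the model animates: a text prompt or a still image
  tier: Tier; // positioning taken from the "Type" column
  highlights: string; // text of the "Highlights" column
}

const KLING_VARIANTS: readonly KlingVariant[] = [
  { family: "Kling v3.0", input: "text-to-video", tier: "latest", highlights: "Multishot support, highest quality" },
  { family: "Kling v3.0", input: "image-to-video", tier: "latest", highlights: "Multiple frame control with v3 quality" },
  { family: "Kling v2.6", input: "text-to-video", tier: "production-ready", highlights: "Audio generation support" },
  { family: "Kling v2.6", input: "image-to-video", tier: "production-ready", highlights: "Image reference with audio" },
  { family: "Kling v2.5 Turbo", input: "text-to-video", tier: "speed-optimized", highlights: "Faster generation times" },
  { family: "Kling v2.5 Turbo", input: "image-to-video", tier: "speed-optimized", highlights: "Faster generation for images" },
];

// Return the variant matching an input kind and tier, or undefined if none matches.
function findVariant(input: InputKind, tier: Tier): KlingVariant | undefined {
  return KLING_VARIANTS.find((v) => v.input === input && v.tier === tier);
}

console.log(findVariant("image-to-video", "speed-optimized")?.family);
```

A lookup like this keeps the quality-versus-speed decision in one place; mapping each entry to an actual gateway model ID is left to the application.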

Usage Examples

The AI SDK provides straightforward integration for three common workflows:

  • Simple text-to-video: Requires only a prompt and basic parameters such as aspect ratio and duration
  • Multishot narratives: Structure the prompt as a multi-scene story and enable Kling-specific options such as intelligent shot composition and synchronized audio
  • Precise scene transitions: Provide both start and end frame images to control exactly how the video begins and ends
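
The three workflows described above might be sketched as small request builders. This is an illustration under stated assumptions, not the AI SDK's confirmed API: the VideoRequest shape, every field name, the providerOptions keys (multiShot, audio), and the placeholder model string are all hypothetical, and the actual call to generateVideo is omitted because it needs credentials and network access.

```typescript
// Hypothetical request shape. Field names are assumptions for illustration,
// not the confirmed parameters of the AI SDK's generateVideo function.
interface VideoRequest {
  model: string;
  prompt: string;
  aspectRatio?: string;
  durationSeconds?: number;
  startFrame?: string; // path or URL of the first-frame image
  endFrame?: string; // path or URL of the last-frame image
  providerOptions?: { kling: Record<string, string | boolean> };
}

// Simple text-to-video: a prompt plus basic parameters.
function textToVideo(model: string, prompt: string, aspectRatio = "16:9", durationSeconds = 5): VideoRequest {
  if (durationSeconds <= 0) throw new RangeError("durationSeconds must be positive");
  return { model, prompt, aspectRatio, durationSeconds };
}

// Multishot narrative: number the scenes inside one prompt and switch on
// Kling-specific options (option names here are hypothetical).
function multishot(model: string, scenes: string[], withAudio: boolean): VideoRequest {
  if (scenes.length === 0) throw new RangeError("at least one scene is required");
  const prompt = scenes.map((scene, i) => `Scene ${i + 1}: ${scene}`).join("\n");
  return { model, prompt, providerOptions: { kling: { multiShot: true, audio: withAudio } } };
}

// First and last frame control: supply both images to pin the start and end.
function frameToFrame(model: string, prompt: string, startFrame: string, endFrame: string): VideoRequest {
  return { model, prompt, startFrame, endFrame };
}

// "kling-model-id-placeholder" stands in for a real gateway model ID.
const story = multishot(
  "kling-model-id-placeholder",
  ["A fox wakes at dawn", "The fox crosses a river"],
  true,
);
console.log(story.prompt);
```

Each builder returns a plain object, so a request can be inspected or logged before it is mapped onto whatever parameters generateVideo actually accepts.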