Skip to main content

Overview

Kling V3 is the recommended default video model. Made by Kuaishou, it offers the best balance of quality, flexibility, and cost. It supports continuous duration from 3-15 seconds (integer only) and both text-to-video and image-to-video generation.
Model IDTierCreditsDuration
kling-v3Pro1503-15s
kling-v3-standardStandard1003-15s
Both variants support first-frame and last-frame keyframe control via end_image_url.

Quick start

import { createVarg } from "vargai/ai"

const varg = createVarg({ apiKey: process.env.VARG_API_KEY! })

const result = await varg.videoModel("kling-v3").generate({
  prompt: "cinematic shot of ocean waves crashing on rocky cliffs at golden hour",
  duration: 5,
  aspectRatio: "16:9",
})

console.log(result.video.url)

Capabilities

Text-to-video

const result = await varg.videoModel("kling-v3").generate({
  prompt: "a cat leaping gracefully from one rooftop to another, city skyline behind",
  duration: 8,
})

Image-to-video

Animate a still image with a motion prompt:
const result = await varg.videoModel("kling-v3").generate({
  prompt: "slow camera push-in, the subject blinks and smiles",
  imageUrl: "https://example.com/portrait.jpg",
  duration: 5,
})

Keyframe control (first + last frame)

Provide two images to control the start and end frames of the video. The model interpolates the motion between them.
cURL
curl -X POST https://api.varg.ai/v1/video \
  -H "Authorization: Bearer $VARG_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "kling-v3",
    "prompt": "smooth transition from day to night",
    "duration": 5,
    "files": [
      { "url": "https://example.com/day-scene.jpg" },
      { "url": "https://example.com/night-scene.jpg" }
    ]
  }'
The first file becomes image_url (start frame) and the second becomes end_image_url (end frame).

Parameters

prompt
string
required
Text description of the video to generate. For image-to-video, describe the desired motion.
duration
number
default:"5"
Video duration in seconds. Integer values from 3 to 15. No decimals — 5.5 will fail.
aspect_ratio
string
default:"16:9"
Output aspect ratio. Common values: 16:9, 9:16, 1:1.
files
array
Array of { url: string } objects. One image for image-to-video. Two images for first+last frame keyframe control.
Duration must be an integer. kling-v3 accepts 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, or 15. Decimal values like 5.5 or out-of-range values will fail.

Pricing

ModelCreditsUSD
kling-v3150$1.50
kling-v3-standard100$1.00
Standard tier is 33% cheaper with slightly lower quality. Good for drafts and iteration.

Tips

  • Image-to-video is significantly better than text-to-video. Always generate a reference image first when possible.
  • Keyframe control is powerful for transitions — provide a start and end frame, and Kling interpolates the motion between them.
  • Keep prompts concise and motion-focused. Describe what moves and how, not static scene details (those belong in the reference image).
  • Standard tier (kling-v3-standard) is recommended for iteration and testing. Switch to pro for final renders.
  • Flexible duration is Kling V3’s advantage over other models that only support 5 or 10 seconds.

Seedance 2

ByteDance’s premium model. Excellent quality, supports video editing.

Sora 2

OpenAI’s video model with remix capability.

Wan 2.5

Budget alternative at 80 credits. Good motion quality.

Kling Legacy

Previous Kling versions (V2.6, V2.5, V2.1, V2).