VEED Fabric

Overview

VEED Fabric 1.0 generates talking-head videos from a single portrait image and an audio file. It is the simplest lipsync pipeline: no video input is needed, just an image and audio.

Model ID	Input	Speed	Credits	~Cost
`veed-fabric-1.0`	Image + audio	~30-50s	100	$1.00

Quick start

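import { createVarg } from "vargai/ai"

// Create a client; the API key is read from the VARG_API_KEY environment variable.
const varg = createVarg({ apiKey: process.env.VARG_API_KEY! })

// Generate a talking-head video from one portrait image and one audio file.
const result = await varg.videoModel("veed-fabric-1.0").generate({
  imageUrl: "https://example.com/portrait.jpg",
  audioUrl: "https://example.com/speech.mp3",
})

// URL of the generated video.
console.log(result.video.url)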

Parameters

files (array, required)

Exactly two files: one portrait image and one speech audio file. The gateway detects each file's type from its extension.
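Because file types are detected by extension, it can help to check locally that a pair of URLs really contains one image and one audio file before sending it. The sketch below is a hypothetical client-side helper, not part of the vargai SDK, and the extension lists are assumptions: the formats the gateway actually accepts are not listed on this page.

```typescript
// Hypothetical pre-flight check; not part of the vargai SDK.
type FileKind = "image" | "audio" | "unknown";

// Assumed extension lists; the gateway's real supported formats may differ.
const IMAGE_EXTENSIONS = new Set(["jpg", "jpeg", "png", "webp"]);
const AUDIO_EXTENSIONS = new Set(["mp3", "wav", "m4a"]);

function classifyByExtension(url: string): FileKind {
  // Drop any query string or fragment before reading the extension.
  const path = url.split(/[?#]/)[0];
  const lastSegment = path.slice(path.lastIndexOf("/") + 1);
  const dot = lastSegment.lastIndexOf(".");
  if (dot === -1) return "unknown";
  const ext = lastSegment.slice(dot + 1).toLowerCase();
  if (IMAGE_EXTENSIONS.has(ext)) return "image";
  if (AUDIO_EXTENSIONS.has(ext)) return "audio";
  return "unknown";
}

// True only when the list holds exactly one image and one audio file, in any order.
function hasOneImageAndOneAudio(files: string[]): boolean {
  if (files.length !== 2) return false;
  const kinds = files.map(classifyByExtension).sort();
  return kinds[0] === "audio" && kinds[1] === "image";
}
```

A check like this catches URLs with a missing or unexpected extension early, which is where extension-based detection is most likely to go wrong.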

provider_options (object)

resolution: "480p" or "720p" (defaults to "720p").

VEED Fabric doesn’t use a prompt or duration parameter. The video length matches the audio duration.
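To make the parameter list concrete, here is a minimal sketch of how a request body could be assembled. Only `files`, `provider_options`, and `resolution` come from the parameters above; the `model` field name, the use of plain URL strings inside `files`, and the `buildFabricRequest` helper itself are assumptions for illustration, not the documented wire format.

```typescript
// Hypothetical request shape. `model` and string URLs in `files` are assumptions.
type Resolution = "480p" | "720p";

interface FabricRequest {
  model: string;
  files: string[];
  provider_options: { resolution: Resolution };
}

function buildFabricRequest(
  imageUrl: string,
  audioUrl: string,
  resolution: Resolution = "720p", // documented default
): FabricRequest {
  // No prompt or duration field: the video length follows the audio.
  return {
    model: "veed-fabric-1.0",
    files: [imageUrl, audioUrl],
    provider_options: { resolution },
  };
}

const exampleRequest = buildFabricRequest(
  "https://example.com/portrait.jpg",
  "https://example.com/speech.mp3",
);
console.log(JSON.stringify(exampleRequest, null, 2));
```

Typing `resolution` as a union of the two documented values lets the compiler reject anything else before a request is sent.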

Pricing

Model	Credits	USD
`veed-fabric-1.0`	100	$1.00
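The table implies a rate of 100 credits per dollar, or $0.01 per credit. The sketch below estimates the cost of a batch under the assumption that each generation is charged a flat 100 credits regardless of audio length or resolution, which this page does not state explicitly. `estimateCost` is a hypothetical helper.

```typescript
// Values from the pricing table; flat per-generation billing is an assumption.
const CREDITS_PER_VIDEO = 100;
const CREDITS_PER_USD = 100; // 100 credits = $1.00

interface CostEstimate {
  videos: number;
  credits: number;
  usd: number;
}

function estimateCost(videos: number): CostEstimate {
  if (!Number.isInteger(videos) || videos < 0) {
    throw new RangeError("videos must be a non-negative integer");
  }
  const credits = videos * CREDITS_PER_VIDEO;
  // Dividing whole credits by 100 avoids accumulating 0.01 rounding error.
  return { videos, credits, usd: credits / CREDITS_PER_USD };
}

console.log(estimateCost(25)); // 25 videos: 2500 credits, $25
```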

Tips

Simplest pipeline for talking heads. Image + audio = talking video. No intermediate video step needed.
Use with ElevenLabs speech: generate narration with eleven_v3, then animate it with VEED Fabric.
720p recommended for quality. 480p is faster but noticeably lower resolution.
Portrait images work best. Use a front-facing, well-lit image with a neutral expression for the cleanest results.
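The speech-then-animate tip can be sketched as a two-step chain. Both steps are passed in as functions because the speech-generation call is not documented on this page; `narrateAndAnimate`, `SynthesizeSpeech`, and `AnimatePortrait` are hypothetical names, and each step is assumed to resolve to a URL.

```typescript
// Hypothetical step signatures; each is assumed to resolve to a URL.
type SynthesizeSpeech = (text: string) => Promise<string>;
type AnimatePortrait = (imageUrl: string, audioUrl: string) => Promise<string>;

// Runs the two steps in order: text to audio, then image + audio to video.
async function narrateAndAnimate(
  text: string,
  imageUrl: string,
  synthesize: SynthesizeSpeech,
  animate: AnimatePortrait,
): Promise<string> {
  const audioUrl = await synthesize(text);
  return animate(imageUrl, audioUrl);
}
```

In practice, `animate` could wrap the quick-start call to `veed-fabric-1.0` and return `result.video.url`, while `synthesize` would wrap whichever speech call produces the narration audio.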

Related models

Sync V2 Pro

Higher quality lipsync. Requires video input instead of image.

OmniHuman

Full-body animation from image + audio.

ElevenLabs

Generate the speech audio to pair with VEED Fabric.
