Overview
VEED Fabric 1.0 generates talking head videos from a single portrait image and an audio file. It’s the simplest lipsync pipeline — no video input needed, just image + audio.| Model ID | Input | Speed | Credits | ~Cost |
|---|---|---|---|---|
veed-fabric-1.0 | Image + audio | ~30-50s | 100 | $1.00 |
Quick start
Parameters
Two files: one image (portrait) and one audio (speech). The gateway auto-detects file types by extension.
resolution — "480p" or "720p" (defaults to "720p").VEED Fabric doesn’t use a prompt or duration parameter. The video length matches the audio duration.
Pricing
| Model | Credits | USD |
|---|---|---|
veed-fabric-1.0 | 100 | $1.00 |
Tips
- Simplest pipeline for talking heads. Image + audio = talking video. No intermediate video step needed.
- Use with ElevenLabs speech — generate narration with
eleven_v3, then animate with VEED Fabric. - 720p recommended for quality. 480p is faster but noticeably lower resolution.
- Portrait images work best. Front-facing, well-lit, neutral expression for cleanest results.
Related models
Sync V2 Pro
Higher quality lipsync. Requires video input instead of image.
OmniHuman
Full-body animation from image + audio.
ElevenLabs
Generate the speech audio to pair with VEED Fabric.