For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
DashboardAPI PricingGet an API key
  • Guides
    • Introduction
    • Getting started
    • Use with AI agents
    • Examples
    • Authentication
    • Handling async tasks
    • File uploads
    • File hydration
    • Embedding videos
    • Errors
    • Rate limits
    • Libraries & SDKs
  • REST API Reference
    • Overview
    • Workflows
        • POSTGenerate image
        • POSTGenerate video clip
        • POSTText to speech
        • POSTGenerate sound effect
        • POSTGenerate avatar clip
        • POSTVectorize image
        • POSTRemove background from an image
        • POSTRemove background from a video
        • POSTUpscale an image
        • POSTUpscale a video
        • POSTCancel tool execution
        • GETGet tool execution info
        • GETList files
        • POSTSearch files
        • GETGet file
        • POSTCreate file upload
        • POSTHydrate file
        • POSTArchive file
        • POSTEnable public preview
        • POSTDisable public preview
        • GETList avatar presenters
        • GETList TTS voices
        • GETList webhooks
        • POSTCreate webhook
        • DELDelete webhook
  • Webhook events
    • Overview
    • Changelog
LogoLogo
DashboardAPI PricingGet an API key
On this page
  • Base URL
  • What you can build
  • Conventions
  • Timestamps
  • Next steps
Guides

Introduction

Programmatically generate images, videos, voiceovers, sound effects, and avatar clips.
Was this page helpful?

Getting started

Install an SDK and generate your first video clip in under a minute.
Next
Built with

The VideoGen API lets you integrate AI media generation into your product, pipeline, or workflow. Generate assets with a single API call and retrieve them when they’re ready.

Base URL

https://api.videogen.io

What you can build

CapabilityEndpoint
Generate images (from text or image)POST /v1/tools/generate-image
Generate video clips (from text, image, or video)POST /v1/tools/generate-video-clip
Convert text to speech with 100+ voicesPOST /v1/tools/text-to-speech
Generate sound effects from a promptPOST /v1/tools/generate-sound-effect
Create avatar videos with a presenterPOST /v1/tools/generate-avatar
Vectorize imagesPOST /v1/tools/vectorize-image
Remove image backgroundsPOST /v1/tools/remove-image-background
Remove video backgroundsPOST /v1/tools/remove-video-background
Upscale imagesPOST /v1/tools/upscale-image
Upscale videoPOST /v1/tools/upscale-video

Conventions

Timestamps

Every numeric timestamp field in the API (expiresAt, occurredAt, createdAt, and any future additions) is an integer representing seconds since the Unix epoch (UTC, no milliseconds). For example 1745409600 corresponds to 2025-04-23T12:00:00Z.

Next steps

Getting started

Install an SDK and make your first request.

Authentication

Create an API key and authenticate requests.

Handling async tasks

Learn how to retrieve results via polling or webhooks.

Libraries & SDKs

TypeScript and Python client libraries.