Generate text from video
This quickstart guide provides a simplified introduction to generating text from video using the TwelveLabs Video Understanding Platform. It includes:
- A basic working example
- Minimal implementation details
- Core parameters for common use cases
For a comprehensive guide, see the Generate text from video section.
Prerequisites
-
To use the platform, you need an API key:
-
Ensure the TwelveLabs SDK is installed on your computer:
-
The videos you wish to use must meet the following requirements:
- Video resolution: Must be at least 360x360 and must not exceed 3840x2160.
- Aspect ratio: Must be one of 1:1, 4:3, 4:5, 5:4, 16:9, or 9:16.
- Video and audio formats: Your video files must be encoded in the video and audio formats listed on the FFmpeg Formats Documentation page. For videos in other formats, contact us at support@twelvelabs.io.
- Duration: Must be between 4 seconds and 60 minutes (3600s). In a future release, the maximum duration will be 2 hours (7,200 seconds).
- File size: Must not exceed 2 GB.
If you require different options, contact us at support@twelvelabs.io.
Starter code
You can copy and paste the code below to generate text from video. Replace the placeholders surrounded by <>
with your values.