Introduction

The TwelveLabs Video Understanding Platform identifies objects, actions, speech, and text in your video content. Access the following core capabilities through simple APIs:

  • Search: Find specific moments in your videos using text or image queries. Search across visual content, spoken words, on-screen text, and audio.
  • Analyze: Generate tailored text outputs such as tables of contents, action items, memos, reports, and comprehensive analyses based on your prompts. Extract structured, timestamped segments from your videos by defining custom segment types and fields.
  • Embed: Create multimodal embeddings for semantic search, recommendations, classification, and other downstream AI tasks.
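To make the Embed capability more concrete, here is a minimal sketch of how embeddings can drive semantic search once you have them. It does not call the TwelveLabs API. The vectors, the segment records, and the helper names (cosine_similarity, rank_segments) are hypothetical and chosen only for illustration. Real embeddings returned by the platform have far more dimensions.

```python
import math

def cosine_similarity(a, b):
    """Return the cosine similarity of two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as matching nothing
    return dot / (norm_a * norm_b)

def rank_segments(query_embedding, segments):
    """Sort segments from most to least similar to the query embedding."""
    return sorted(
        segments,
        key=lambda seg: cosine_similarity(query_embedding, seg["embedding"]),
        reverse=True,
    )

# Hypothetical 3-dimensional embeddings for three video segments.
# These values are made up; they are not output from any real model.
segments = [
    {"video_id": "vid_a", "start": 0.0, "end": 6.0, "embedding": [0.9, 0.1, 0.0]},
    {"video_id": "vid_a", "start": 6.0, "end": 12.0, "embedding": [0.1, 0.8, 0.2]},
    {"video_id": "vid_b", "start": 0.0, "end": 6.0, "embedding": [0.0, 0.2, 0.9]},
]

# Hypothetical embedding of a text query, assumed to be in the same vector space.
query_embedding = [0.2, 0.9, 0.1]

ranked = rank_segments(query_embedding, segments)
best = ranked[0]
print(f"Best match: {best['video_id']} from {best['start']}s to {best['end']}s")
```

The same ranking step underlies recommendations (rank by similarity to a video the user liked) and can feed a classifier. In practice a vector database usually replaces the in-memory sort shown here.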

Most popular

TwelveLabs models