Voxel51 - Semantic video search plugin
Voxel51 - Semantic video search plugin
Voxel51 - Semantic video search plugin

Summary: The Semantic Video Search plugin integrates Voxel FiftyOne, an open-source tool for building and enhancing machine learning datasets, with the TwelveLabs Video Understanding Platform, enabling you to perform semantic searches across multiple modalities.
Description: The plugin allows you to accurately identify movements, actions, objects, people, sounds, on-screen text, and speech. For example, this feature is helpful in scenarios where you need to quickly locate and analyze specific scenes based on actions or spoken words, significantly improving your efficiency in categorizing and analyzing video data.
Code explanation: Our blog post, Search Your Videos Semantically with TwelveLabs and FiftyOne Plugin, walks you through the steps required to create this plugin from scratch.
GitHub: Semantic Video Search
The integration with the TwelveLabs Video Understanding Platform is comprised of three distinct steps:
The plugin invokes the POST method of the /indexes endpoint to create an index and enable the Marengo video understanding engine with the engine options that the user has selected:
The plugin invokes the POST method of the /tasks endpoint. Then, it monitors the indexing process using the GET method of the /tasks/{task_id} endpoint:
The plugin invokes the POST method of the /search endpoint to search across the sources of information that the user has selected:
After reading this page, you have several options: