Media and entertainment

The applications on this page demonstrate the capabilities of the TwelveLabs Video Understanding Platform in transforming how you interact with and consume digital content.

Ghool

Summary: Ghool is a trailer creation application that uses the TwelveLabs Video Understanding Platform to search, retrieve, and concatenate video clips based on specific prompts and quality metrics.

Description: The application uses both the Marengo and Pegasus video understanding models to process video content and create optimized video sequences with minimal user intervention. It retrieves relevant video clips based on user prompts. It then processes these videos by concatenating them and evaluating the transitions between different videos based on specific quality metrics, such as smoothness.

Ghool was created by Fazil Onuralp Adic and Oscar Chen and was one of the winners at the Cerebral Beach Hacks – LA Tech Week 2024 Kickoff Hackathon.

GitHub repo: cerebral_ghool

Integration with TwelveLabs

Ghool integrates with the TwelveLabs Video Understanding Platform to search, retrieve, and analyze video content based on specific prompts. The integration enables automated evaluation of video transitions and quality assessment without manual intervention.

The find_video function uses the TwelveLabs Python SDK to perform text queries against a specified index. It filters results by confidence level and evaluates each clip’s quality against user-defined criteria.

Python
def find_video(prompt, wanted, quality):
    page = client.search.query(index_id="66f1cde8163dbc55ba3bb220", query_text=prompt, options=["visual"])
    video_vec = []
    i = 0
    for clip in page.data:
        if clip.confidence == "high" and i < wanted + 4:
            i += 1
            video_dict = {"id": clip.video_id, "start": clip.start, "end": clip.end}
            video_quality = get_comment(quality, video_dict)
            # Cast the generated score to an integer so the sort below is
            # numeric rather than lexicographic (get_comment returns a string).
            video_dict["quality"] = int(video_quality)
            video_vec.append(video_dict)
    video_vec = sorted(video_vec, key=lambda x: x["quality"], reverse=True)[:wanted]
    return video_vec
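
For example, a call like the following (the prompt and quality criterion are illustrative) would return the three highest-scoring clips:

Python
# Hypothetical usage: retrieve the three clips that best match the prompt,
# ranked by a generated "scariness" score.
scary_clips = find_video("a monster emerging from the dark", wanted=3, quality="scariness")
for clip in scary_clips:
    print(clip["id"], clip["start"], clip["end"], clip["quality"])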

The get_video function retrieves the HLS streaming URL for a video by its ID and uses MoviePy to extract the relevant segment.

Python
def get_video(video_info, save_file=None, duration=9999):
    url = f"https://api.twelvelabs.io/v1.2/indexes/66f1cde8163dbc55ba3bb220/videos/{video_info['id']}"
    response = requests.get(url, headers=headers)
    video_url = response.json()["hls"]["video_url"]
    start = video_info["start"]
    end = video_info["end"]
    if duration < video_info["end"] - video_info["start"]:
        end = video_info["start"] + duration
    clip = VideoFileClip(video_url).subclip(start, end)
    # Additional handling for saving or previewing
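
The omitted handling might, for example, write the segment to disk with MoviePy when a file name is given (a sketch, not the application's actual code):

Python
    # Hypothetical continuation of get_video: save the extracted segment
    # if requested, otherwise hand the clip back for previewing.
    if save_file is not None:
        clip.write_videofile(save_file)
    return clip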

The get_comment function uses the Generate API to evaluate video clips, producing numeric quality scores for criteria like “scariness” or “smoothness of transitions.”

Python
def get_comment(prompt, video, headers=headers):
    url = "https://api.twelvelabs.io/v1.2/generate"

    payload = {
        "temperature": 0.7,
        "prompt": f"only output a number from 0-100 evaluating the following clip on these measures: {prompt}",
        "stream": False,
        "video_id": video["id"]
    }

    response = requests.post(url, json=payload, headers=headers)

    return response.json()["data"]
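
Because the endpoint returns the generated text as a string, callers that sort or compare scores should convert it to a number first. A hypothetical call (the video ID is a placeholder):

Python
# Hypothetical usage: score a single clip against one criterion.
score = int(get_comment("scariness", {"id": "<VIDEO_ID>"}))
print(f"Scariness: {score}/100")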

The compare_transition function creates and evaluates transitions between video clips by:

  • Concatenating pairs of videos
  • Uploading the concatenated videos to TwelveLabs
  • Analyzing transition quality using the get_comment function described above
  • Ranking the transitions based on smoothness

Python
def compare_transition(videos1, videos2, transition_name, wanted=2):
    combined_videos = []

    # Iterate over all combinations of videos1 and videos2
    for i, video1 in enumerate(videos1):
        for j, video2 in enumerate(videos2):
            concatenate_videos([video1, video2], f"{transition_name}{i}{j}.mp4")
            task = client_mine.task.create(index_id=upload_id, file=f"{transition_name}{i}{j}.mp4", language="en")
            task.wait_for_done(sleep_interval=10, callback=on_task_update)
            # Cast to int so the sort below ranks transitions numerically.
            quality = int(get_comment("smoothness of the transition between the two stitched videos", {"id": task.video_id}, headers_mine))
            video_props = {"name": f"{transition_name}{i}{j}.mp4", "id": task.video_id, "quality": quality}
            combined_videos.append(video_props)

    combined_videos = sorted(combined_videos, key=lambda x: x["quality"], reverse=True)[:wanted]

    return combined_videos
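
Putting the pieces together, a trailer-building flow might chain these helpers roughly as follows (a sketch; the prompts are illustrative, and the exact clip format that compare_transition expects depends on the repository's concatenate_videos helper):

Python
# Hypothetical flow: search for two sets of clips, then rank the
# transitions between them by smoothness.
openings = find_video("a quiet suburban street at dusk", wanted=2, quality="scariness")
reveals = find_video("a monster emerging from the dark", wanted=2, quality="scariness")
best_transitions = compare_transition(openings, reveals, "opening_to_reveal")
print(best_transitions[0]["name"])  # the smoothest stitched pair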

AI Sports Recap

Summary: AI Sports Recap is a Streamlit-based application that generates video highlights and textual summaries of sports press conferences from YouTube video links. The application utilizes the Pegasus video understanding engine, GPT-4o, and Docker to provide an efficient and user-friendly experience.

Description: The application streamlines the process of extracting essential information from sports press conferences. Users can input a YouTube video link and a specific query, and the application will generate relevant video highlights and a concise textual summary. This application is particularly useful for sports enthusiasts, journalists, and analysts who need to extract and share important information from lengthy press conferences quickly.

The application was developed by Prateek Chhikara, Omkar Masur, and Tanmay Rode, and won second place at the Multimodal AI Media & Entertainment Hackathon.

GitHub: sports-highlights

Integration with TwelveLabs

The code below initializes the TwelveLabs Python SDK and creates a new index with the Marengo and Pegasus video understanding engines enabled:

client = TwelveLabs(api_key=api_key)

engines = [
    {
        "name": "marengo2.6",
        "options": ["visual", "conversation", "text_in_video", "logo"]
    },
    {
        "name": "pegasus1",
        "options": ["visual", "conversation"]
    }
]

index = client.index.create(
    name="tlabs2",
    engines=engines,
    addons=["thumbnail"]  # Optional
)
print(f"A new index has been created: id={index.id} name={index.name} engines={index.engines}")

The upload_video function uploads a YouTube video to the platform by creating an external provider task:

def upload_video(index_id, video_url, transcription_url=None):
    print("INSIDE UPLOAD VIDEO FUNCTION")
    task = client.task.external_provider(
        index_id=index_id,
        url=video_url
    )

    print(f"Task id={task.id}")

    return task.id
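
A hypothetical call, reusing the index created above and a placeholder URL:

# Hypothetical usage: index a press conference directly from YouTube.
task_id = upload_video(index.id, "https://www.youtube.com/watch?v=<VIDEO>")
print(f"Upload task started: {task_id}")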

The get_transcript function generates the transcript for a specified video. For each segment, the function returns the text, start times, and end times in separate lists:

def get_transcript(index_id, video_id):
    print("INSIDE GET TRANSCRIPT FUNCTION")
    transcriptions = client.index.video.transcription(
        index_id=index_id,
        id=video_id
    )

    transcription_list = []
    start_points = []
    end_points = []

    for transcription in transcriptions:
        print(
            f"value={transcription.value} start={transcription.start} end={transcription.end}"
        )

        transcription_list.append(transcription.value)
        start_points.append(transcription.start)
        end_points.append(transcription.end)

    return transcription_list, start_points, end_points
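
Once the upload task completes, the transcript can be retrieved and printed segment by segment (a sketch; the video ID is a placeholder for the one returned by the finished task):

# Hypothetical usage: print the transcript as "start-end: text" lines.
texts, starts, ends = get_transcript(index.id, "<VIDEO_ID>")
for text, start, end in zip(texts, starts, ends):
    print(f"{start:.1f}-{end:.1f}: {text}")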

ThirteenLabs Smart AI Editor

Summary: ThirteenLabs Smart AI Editor uses artificial intelligence to process videos based on user prompts. It creates highlight reels, generates descriptions, and dubs in multiple languages.

Description: The application utilizes AI models from TwelveLabs, Gemini, and ElevenLabs to deliver high-quality video editing and transcription services. It offers a user-friendly interface built with Gradio.

The application was developed by Dylan Ler and won first place at the Multimodal AI Media & Entertainment Hackathon.

Integration with TwelveLabs

The get_transcript function retrieves the transcript of a video using the TwelveLabs Python SDK. It returns the text, start time, and end time for each segment as a list of dictionaries.

def get_transcript(video_file_name, video_id_input, which_index):
    video_id = get_video_id(video_file_name)
    if video_id is None or video_id_input != "":
        video_id = video_id_input
    client = TwelveLabs(api_key="YOUR_API_KEY")
    transcriptions = client.index.video.transcription(index_id="INDEX_ID", id=f"{video_id}")
    output = []
    for transcription in transcriptions:
        output.append({"transcription": transcription.value, "start_time": transcription.start, "end_time": transcription.end})
    return output
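
A hypothetical invocation (the file name and index argument are placeholders):

# Hypothetical usage: fetch segments for a previously indexed video.
segments = get_transcript("match_recap.mp4", "", "default")
for segment in segments:
    print(segment["start_time"], segment["end_time"], segment["transcription"])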

Cactus

Summary: Cactus is a content generation application that uses the TwelveLabs Video Understanding Platform to automatically transform long-form YouTube videos into engaging short-form reels. The application democratizes content creation, making it accessible and affordable for all creators, regardless of their resources.

Description: Cactus addresses the challenge of time-consuming video editing by bridging the gap between long-form and short-form content creation. The platform analyzes video content, identifies the most engaging moments, and compiles them into optimized highlight reels tailored for various social media platforms.

Cactus offers several key features and benefits for content creators:

  • Saves time by automating the editing process, allowing creators to focus on content creation.
  • Reduces costs by minimizing the need for professional editing services.
  • Enhances reach by enabling the quick production of more content.
  • Expands audiences by making content available across multiple platforms.

The application was developed by Saurabh Ghanekar, Noah Bergren, Christopher Kinoshita, and Shrutika Nikola.

GitHub: Cactus

Integration with TwelveLabs

The generate_segment_itinerary function invokes the POST method of the /generate endpoint to segment a video and create an itinerary based on activities and locations shown in the video. The function returns the segmented itinerary if successful or logs any errors encountered.

async def generate_segment_itinerary(video_id: str) -> str:
    url = f"{BASE_TWELVE_URL}/generate"
    payload = {
        "prompt": "Given the following video, segment the videos and provide corresponding timestamps based on the different activities and places that the subject does and visits so that the segmented videos can later be used to build an itinerary.\nMake the response concise & precise.",
        "video_id": f"{video_id}",
        "temperature": 0.4,
    }
    headers = {
        "accept": "application/json",
        "x-api-key": TWELVE_LABS_API_KEY,
        "Content-Type": "application/json",
    }

    logging.info("Generating segmented itinerary")
    async with aiohttp.ClientSession() as session:
        async with session.post(url, json=payload, headers=headers) as response:
            if response.status == 200:
                try:
                    result = await response.text()
                    segmented_itinerary = json.loads(result).get("data")
                    return segmented_itinerary
                except Exception as e:
                    logging.exception(e)
            else:
                logging.info(response.status)
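
Because the function is a coroutine, it can be driven with asyncio (a sketch; the video ID is a placeholder, and the module-level constants above are assumed to be defined):

import asyncio

# Hypothetical usage: segment an indexed travel vlog into an itinerary.
itinerary = asyncio.run(generate_segment_itinerary("<VIDEO_ID>"))
print(itinerary)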

Hello Garfield

Summary: The “Hello Garfield” application provides an immersive virtual reality experience combining traditional movie theaters with cutting-edge technology. It features a personalized AI concierge, themed environments, and interactive elements to enhance the movie-watching experience.

Description: The application transforms how you engage with movies in a virtual space. Upon entering the virtual theater, you are greeted by an AI concierge. This concierge offers personalized movie recommendations based on your preferences and viewing history.

Key features include:

  • Video Q&A chatbot: The application uses the Generate API to allow you to ask questions about the movies you’re watching.
  • Immersive VR/MR environment: A realistic virtual movie theater with a large screen, created using VR/MR development platforms such as Unity and Unreal Engine.
  • AI concierge: A chatbot that provides personalized movie suggestions and enhances the user experience through friendly interaction.
  • Enhanced viewing experience: The concierge suggests themed snacks, recipes, and merchandise related to the chosen movie, creating a more immersive and enjoyable viewing experience.
  • AR filters: You can “try on” costumes from your favorite films and decorate your virtual spaces using augmented reality technology.
  • Community interaction: A shared virtual theater space that allows you to connect with other film enthusiasts, fostering a sense of community.

The application was developed by Lauren Descher, Dulce Baerga, and Catherine Rhee.

GitHub: aila-hack


Integration with TwelveLabs

The code below handles different types of requests:

  1. It checks the type of request and prepares appropriate data for each.
  2. For each type of request, it constructs a data object with a specific video ID and other relevant parameters.
  3. It sends a POST request to the /gist or /summarize endpoints.

if (event.request === "gist") {
    // GIST REQUESTED: the video ID corresponds to the Garfield trailer
    data = {
        "video_id": "666581dbd22b3a3c97bf1d57",
        "types": [
            "title",
            "hashtag",
            "topic"
        ]
    };
    response = await fetch(baseUrl + "/gist", {
        method: "POST",
        headers: { "x-api-key": apiKey, "Content-Type": "application/json" },
        body: JSON.stringify(data)
    });
}
else if (event.request === "summary") {
    // SUMMARY REQUESTED
    data = {
        "video_id": "666581dbd22b3a3c97bf1d57",
        "type": "summary"
    };
    response = await fetch(baseUrl + "/summarize", {
        method: "POST",
        headers: { "x-api-key": apiKey, "Content-Type": "application/json" },
        body: JSON.stringify(data)
    });
}
else if (event.request === "chapters") {
    data = {
        "video_id": "666581dbd22b3a3c97bf1d57",
        "type": "chapter"
    };
    response = await fetch(baseUrl + "/summarize", {
        method: "POST",
        headers: { "x-api-key": apiKey, "Content-Type": "application/json" },
        body: JSON.stringify(data)
    });
}
else if (event.request === "highlights") {
    data = {
        "video_id": "666581dbd22b3a3c97bf1d57",
        "type": "highlight",
        "prompt": "tell me about food\n"
    };
    response = await fetch(baseUrl + "/summarize", {
        method: "POST",
        headers: { "x-api-key": apiKey, "Content-Type": "application/json" },
        body: JSON.stringify(data)
    });
}

Sports Recap

Summary: Sports Recap is a Next.js-based application that generates video highlights and summaries from sports press conferences.

Description: The application is designed to transform lengthy sports press conferences into concise, engaging highlight reels. The application utilizes the Generate API to create relevant highlights based on user-specified criteria.

The application was developed by Daniel Jacobs, Yurko Turskiy, Suxu Li, and Melissa Regan.

GitHub: hackathone-challenge-3

Integration with TwelveLabs

The function below extracts data from the incoming request and invokes the POST method of the /summarize endpoint, passing the unique identifier of a video and a prompt to generate highlights:

import { NextRequest, NextResponse } from "next/server";

export async function POST(req: NextRequest) {
  const { projectId, videoId, prompt } = await req.json();
  const baseUrl = "https://api.twelvelabs.io/v1.2";
  const apiKey = process.env.TWELVELABS_API as string;
  const data = {
    prompt: prompt,
    video_id: videoId,
    type: "highlight",
  };

  // Send request
  const response = await fetch(baseUrl + "/summarize", {
    method: "POST",
    headers: { "x-api-key": apiKey, "Content-Type": "application/json" },
    body: JSON.stringify(data),
  });

  // Relay the generated highlights back to the client
  return NextResponse.json(await response.json());
}

The function below extracts the unique identifier of a video from the incoming request and invokes the POST method of the /summarize endpoint. It passes the video ID along with a predefined prompt to generate a summary that identifies the main character and lists key topics discussed:

import { NextRequest, NextResponse } from "next/server";

export async function POST(req: NextRequest) {
  const { videoId } = await req.json();

  // Variables
  const baseUrl = "https://api.twelvelabs.io/v1.2";
  const apiKey = process.env.TWELVELABS_API as string;
  const data = {
    video_id: videoId,
    type: "summary",
    prompt:
      "Specify the name of the main character of the video. Generate bullet list of key topics the main character is talking about",
  };

  // Send request
  const response = await fetch(baseUrl + "/summarize", {
    method: "POST",
    headers: { "x-api-key": apiKey, "Content-Type": "application/json" },
    body: JSON.stringify(data),
  });

  // Relay the generated summary back to the client
  return NextResponse.json(await response.json());
}