This guide shows how you can use the Analyze API to perform open-ended analysis on video content, generating tailored text outputs based on your prompts. This feature provides more customization options than the summarization feature. It supports generating various content types based on your prompts, including, but not limited to, tables of content, action items, memos, reports, and comprehensive analyses.

The platform provides two distinct methods for retrieving the results of the open-ended analysis:

Streaming responses

Streaming responses deliver text fragments in real-time as they are generated, enabling immediate processing and feedback. This method is the default behavior of the platform and is ideal for applications requiring incremental updates.

Response format: A stream of JSON objects in NDJSON format, with three event types:
- stream_start: Marks the beginning of the stream.
- text_generation: Delivers a fragment of the generated text.
- stream_end: Signals the end of the stream.
Response handling:
- Iterate over the stream to process text fragments as they arrive.
Advantages:
- Real-time processing of partial results.
- Reduced perceived latency.
Use case: Live transcription, real-time analysis, or applications needing instant updates.

Non-streaming responses

Non-streaming responses deliver the complete generated text in a single response, simplifying processing when the full result is needed.

Response format: A single string containing the full generated text.
Response handling:
- Access the complete text directly from the response.
Advantages:
- Simplicity in handling the full result.
- Immediate access to the entire text.
Use case: Generating reports, summaries, or any scenario where the whole text is required at once.

This guide provides a complete example. For a simplified introduction with just the essentials, see the Analyze videos quickstart guide.

Prerequisites

To use the platform, you need an API key:

1
If you don’t have an account, sign up for a free account.
2
Go to the API Keys page.
3
Select the Copy icon next to your key.

Depending on the programming language you are using, install the TwelveLabs SDK by entering one of the following commands:

$ pip install twelvelabs

Your video files must meet the format requirements.

Complete example

This complete example shows how to create an index, upload a video, and perform open-ended analysis to generate text based on the content of your video. Ensure you replace the placeholders surrounded by <> with your values.

Streaming responses

Non-streaming responses

1 from twelvelabs import TwelveLabs
2 from twelvelabs.indexes import IndexesCreateRequestModelsItem
3 from twelvelabs.tasks import TasksRetrieveResponse
4 
5 # 1. Initialize the client
6 # An index is a container for organizing your video content
7 client = TwelveLabs(api_key="<YOUR_API_KEY>")
8 
9 # 2. Create an index
10 index = client.indexes.create(
11     index_name="<YOUR_INDEX_NAME>",
12     models=[
13         IndexesCreateRequestModelsItem(
14             model_name="pegasus1.2", model_options=["visual", "audio"]
15         )
16     ]
17 )
18 print(f"Created index: id={index.id}")
19 
20 # 3. Upload a video
21 task = client.tasks.create(
22     index_id=index.id,
23     video_url="<YOUR_VIDEO_URL>"
24     # Or for a local file: video_file=open("<PATH_TO_VIDEO_FILE>", "rb")
25     )
26 print(f"Created task: id={task.id}")
27 
28 # 4. Monitor the indexing process
29 def on_task_update(task: TasksRetrieveResponse):
30     print(f"  Status={task.status}")
31 
32 task = client.tasks.wait_for_done(
33     sleep_interval=5, task_id=task.id, callback=on_task_update)
34 if task.status != "ready":
35     raise RuntimeError(f"Indexing failed with status {task.status}")
36 print(
37     f"Upload complete. The unique identifier of your video is {task.video_id}.")
38 
39 # 5. Perform open-ended analysis
40 text_stream = client.analyze_stream(
41     video_id=task.video_id,
42     prompt="<YOUR_PROMPT>",
43     # temperature=0.2
44 )
45 
46 # 6. Process the results
47 for text in text_stream:
48     if text.event_type == "text_generation":
49         print(text.text)

Step-by-step guide

Python

Node.js

Import the SDK and initialize the client

Create a client instance to interact with the TwelveLabs Video Understanding Platform.
Function call: You call the constructor of the TwelveLabs class.
Parameters:

api_key: The API key to authenticate your requests to the platform.

Return value: An object of type TwelveLabs configured for making API calls.

Create an index

Indexes store and organize your video data, allowing you to group related videos. Create one before uploading videos.
Function call: You call the indexes.create function.

Parameters:

index_name: The name of the index.
models: An array specifying your model configuration.

See the Indexes page for more details on creating an index and specifying the model configuration.

Return value: An object containing, among other information, a field named id representing the unique identifier of the newly created index.

Upload videos

To perform any downstream tasks, you must first upload your videos, and the platform must finish indexing them.
Function call: You call the tasks.create function.
Parameters:

index_id: The unique identifier of your index.
video_url or video_file:
- video_url: The publicly accessible URL of your video file (string)
- video_file: An opened file object in binary read mode. Use open(path, 'rb') to open your local file

Return value: An object of type TasksCreateResponse that you can use to track the status of your video upload and indexing process. This object contains, among other information, the following fields:

id: The unique identifier of your video indexing task.
video_id: The unique identifier of your video.

Monitor the indexing process

The platform requires some time to index videos. Check the status of the video indexing task until it’s completed.
Function call: You call the tasks.wait_for_done function.
Parameters:

sleep_interval: The time interval, in seconds, between successive status checks. In this example, the method checks the status every five seconds.
task_id: The unique identifier of your video indexing task.
callback: A callback function that the SDK executes each time it checks the status.

Return value: An object of type TasksRetrieveResponse containing, among other information, a field named status representing the status of your task. Wait until the value of this field is ready.

Perform open-ended analysis

Streaming responses

Non-streaming responses

Function call: You call the analyze_stream method.
Parameters:

video_id: The unique identifier of the video for which you want to generate text.
prompt: A string that guides the model on the desired format or content. The maximum length of a prompt is 2,000 tokens.
(Optional) temperature: A number that controls the randomness of the text. A higher value generates more creative text, while a lower value produces more deterministic text.

Return value: An object that handles streaming HTTP responses and provides an iterator interface allowing you to process text fragments as they arrive. The maximum length of the response is 4,096 tokens.

Note

If you encounter timeout errors, increase the timeout parameter when you initialize the TwelveLabs client. The default timeout is 60 seconds, which may not be sufficient for complex prompts, especially with non-streaming responses..

Process the results

Streaming responses

Non-streaming responses

Use a loop to iterate over the stream. Inside the loop, handle each text fragment as it arrives.