This page shows examples of using the Marengo and Pegasus video understanding engines. The screenshots in the sections below are from the Playground, but the same principles apply when you invoke the API programmatically.

Marengo

This section contains examples of using the Marengo video understanding engine.

Steve Jobs introducing the iPhone

In the example screenshot below, the query was "How did Steve Jobs introduce the iPhone?". The Marengo video understanding engine used information found in the visual and conversation modalities to perform the following tasks:

  • Visual recognition of a famous person (Steve Jobs)
  • Joint speech and visual recognition to semantically search for the moment when Steve Jobs introduced the iPhone. Semantic search matches on the intended meaning of the query rather than its literal words, so the platform identified the matching video fragments even if Steve Jobs didn't explicitly say the words in the query.

To see this example in the Playground, ensure you're logged in, and then open this URL in your browser.
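
When you invoke the API programmatically instead of using the Playground, a search like this one comes down to sending a query together with the modalities the engine should consider. The following is a minimal sketch of how such a request body could be assembled. The field names `index_id`, `query`, and `search_options`, the modality identifiers, and the `build_search_request` helper are assumptions made for illustration; check the API reference for the exact names before relying on them.

```python
from typing import Dict, List

# Modality names as they appear on this page. The exact identifiers
# the API expects are an assumption.
KNOWN_MODALITIES = {"visual", "conversation", "logo"}


def build_search_request(
    index_id: str, query: str, modalities: List[str]
) -> Dict[str, object]:
    """Assemble a JSON-serializable body for a semantic search call.

    Hypothetical helper: it only builds the payload and performs
    no network request.
    """
    if not query.strip():
        raise ValueError("query must not be empty")
    unknown = [m for m in modalities if m not in KNOWN_MODALITIES]
    if unknown:
        raise ValueError(f"unknown modalities: {unknown}")
    return {
        "index_id": index_id,                # assumed field name
        "query": query,                      # natural-language query
        "search_options": list(modalities),  # assumed field name
    }


# The Steve Jobs example relies on the visual and conversation modalities.
request = build_search_request(
    index_id="<YOUR_INDEX_ID>",
    query="How did Steve Jobs introduce the iPhone?",
    modalities=["visual", "conversation"],
)
```

The same helper would cover a logo-based search by passing `["visual", "logo"]` as the modalities.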

Polar bear holding a Coca-Cola bottle

In the example screenshot below, the query was "Polar bear holding a Coca-Cola bottle". The Marengo video understanding engine used information found in the visual and logo modalities to perform the following tasks:

  • Recognition of a cartoon character (polar bear)
  • Identification of an object (bottle)
  • Detection of a specific brand logo (Coca-Cola)
  • Identification of an action (polar bear holding a bottle)

To see this example in the Playground, ensure you're logged in, and then open this URL in your browser.

Pegasus

This section contains examples of using the Pegasus video understanding engine.

Summarizing educational videos

In the example screenshot below, the platform has summarized an educational video using predefined templates without any customization:

To see this example in the Playground, ensure you're logged in, and then open this URL in your browser.
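
Programmatically, summarizing with a predefined template and no customization would amount to naming the video and the kind of summary, without supplying a prompt. The sketch below illustrates that idea. The field names `video_id`, `type`, and `prompt`, the value `"summary"`, and the `build_summary_request` helper are assumptions for illustration rather than a confirmed description of the API.

```python
from typing import Dict, Optional


def build_summary_request(
    video_id: str,
    summary_type: str = "summary",
    prompt: Optional[str] = None,
) -> Dict[str, str]:
    """Assemble a request body for summarizing a video.

    Hypothetical helper: field names and the default type value
    are assumptions, and no network request is made.
    """
    body = {"video_id": video_id, "type": summary_type}
    # Leaving the prompt out corresponds to using the predefined
    # template without any customization.
    if prompt is not None:
        body["prompt"] = prompt
    return body


# No prompt: rely entirely on the predefined template.
default_request = build_summary_request("<YOUR_VIDEO_ID>")
```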

Generating captions for social media

In the example screenshot below, the prompt instructs the platform to generate a caption for a social media post:

To see this example in the Playground, ensure you're logged in, and then open this URL in your browser.

Writing police reports

In the example screenshot below, the prompt instructs the platform to write a police report using a specific template for a video showing a robbery:
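
If you send a prompt like this through the API rather than the Playground, it can help to build the template into the prompt text explicitly. The following is a minimal sketch under stated assumptions: the section names are hypothetical placeholders and not the template used in the screenshot, and the `video_id` and `prompt` field names, as well as both helper functions, are assumptions for illustration.

```python
from typing import Dict, List


def build_report_prompt(report_kind: str, sections: List[str]) -> str:
    """Turn a list of section names into a templated prompt."""
    lines = [
        f"Write a {report_kind} based on this video. "
        "Use exactly the following sections, in this order:"
    ]
    # Number the sections so the expected structure is unambiguous.
    lines += [f"{i}. {name}" for i, name in enumerate(sections, start=1)]
    lines.append(
        "If the video does not show the information needed for a "
        "section, write 'Not observed'."
    )
    return "\n".join(lines)


def build_generate_request(video_id: str, prompt: str) -> Dict[str, str]:
    """Assemble a request body (assumed field names, no network call)."""
    return {"video_id": video_id, "prompt": prompt}


# Hypothetical section names, chosen only to illustrate the idea.
prompt = build_report_prompt(
    "police report",
    ["Incident summary", "Suspect description", "Timeline of events"],
)
request = build_generate_request("<YOUR_VIDEO_ID>", prompt)
```

Spelling out the sections and a fallback for missing information is one way to keep the generated text close to the template; the wording that works best may differ for your videos.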