The leading multimodal AI model for video search & curation

Learn more about our groundbreaking AI models, developed in-house, with industry-leading security and rightsholder protections.

We’re moving beyond simple object and face recognition to unlock full machine perception of video content.

Our models at a glance

Vulpus

Our pioneering multimodal AI system, designed to analyze videos through visuals, audio, and dialogue while understanding the passing of time.

By processing and integrating data from each of these modalities and utilizing an advanced search and discovery engine, Vulpus offers exceptional search capabilities across extensive video libraries without the need for labels or metadata.

Vulpus sets a new standard for video content retrieval and analysis, making it easier and more efficient to index, curate and manage libraries of any size.

Key info

Current version: 1.2
First released: September 20, 2022
Modalities: Visual, audio, dialogue
Availability: Imaginario app, API

Capabilities

  • Scalable processing and analysis of all video content (any length / any file size)
  • Selective analysis by modality (e.g. analyze only visuals / only audio)
  • Customizable clip retrieval (choose number of results, length of clips)
  • Near-instant result retrieval after initial indexing
  • Natural language multimodal search (combine object and audio search, e.g. “a busy restaurant with piano music playing”)
  • Customizable face search

Cetus

Our second multimodal AI system, specifically developed to increase explainability and provide a modern replacement for video labels and metadata.

Utilizing the analyses of Vulpus, Cetus provides descriptions of video content at a scene level, taking into account the action, characters, speech and context.

Key info

Current version: 1.0
First released: September 12, 2024
Modalities: Visual, audio, dialogue
Availability: Imaginario app, API

Capabilities

  • Scalable processing and analysis of all video content (any length / any file size)
  • Natural-language descriptions of any scene, taking into account visuals, audio and dialogue
  • Instant description generation
  • Percentage confidence score based on search term

Get started with our free-forever Starter tier

Enjoy 30 minutes of free AI transcription and auto-captions every month, plus much more. No credit card required.