The leading multimodal AI for video search & curation

Learn more about our groundbreaking AI models, developed in-house, with industry-leading security and rightsholder protections.

We’re moving beyond simple object and face recognition, and unlocking full machine perception of video content.

Book a Demo

Our models at a glance

Vulpus

Our pioneering multimodal AI system, designed to analyze videos through visuals, audio, and dialogue while understanding the passing of time.

By processing and integrating data from each of these modalities and utilizing an advanced search and discovery engine, Vulpus offers exceptional search capabilities across extensive video libraries without the need for labels or metadata.

Vulpus sets a new standard for video content retrieval and analysis, making it easier and more efficient to index, curate and manage libraries or any size.

Key info

Current version	1.2
First released	September 20, 2022
Modalities	Visual, audio, dialogue
Availability	Imaginario app, API

Capabilities

Scalable processing and analysis of all video content (any length / any file size)
Selective analysis by modality (e.g. analyse only visuals / only audio)
Customizable clip retrieval (choose number of results, length of clips)

Near-instant result retrieval after initial indexing
Natural language multimodal search (combine object and audio search, e.g. “a busy restaurant with piano music playing”)
Customizable face search

Selected use cases

Replacing or complementing traditional or AI tagging and labelling systems
Locating B-roll, dialogue or SFX
Assembling highlight reels

See it in action

Cetus

Our second multimodal AI system, specifically developed to increase explainability and provide a modern complement for video labels and metadata.

Utilizing the analyses of Vulpus, Cetus provides descriptions of video content at a scene level, taking into account the action, characters, speech and context.

Key info

Current version	2.0
First released	September 12, 2025
Modalities	Visual, audio, dialogue, faces, temproal understanding, fine-tuning for private taxonomies and schemas
Availability	Imaginario app, API

Capabilities

Scalable processing and analysis of all video content (any length / any file size)
Natural-language descriptions of any scene, taking into account visuals, audio and diaglogue