The leading multimodal AI model for video search & curation
Learn more about our groundbreaking AI models, developed in-house, with industry-leading security and rightsholder protections.
We’re moving beyond simple object and face recognition, and unlocking full machine perception of video content.
Our models at a glance
Vulpus
Our pioneering multimodal AI system, designed to analyze videos through visuals, audio, and dialogue while understanding the passing of time.
By processing and integrating data from each of these modalities and utilizing an advanced search and discovery engine, Vulpus offers exceptional search capabilities across extensive video libraries without the need for labels or metadata.
Vulpus sets a new standard for video content retrieval and analysis, making it easier and more efficient to index, curate and manage libraries or any size.
Key info
Current version | 1.2 |
First released | September 20, 2022 |
Modalities | Visual, audio, dialogue |
Availability | Imaginario app, API |
Capabilities
- Scalable processing and analysis of all video content (any length / any file size)
- Selective analysis by modality (e.g. analyse only visuals / only audio)
- Customizable clip retrieval (choose number of results, length of clips)
- Near-instant result retrieval after initial indexing
- Natural language multimodal search (combine object and audio search, e.g. “a busy restaurant with piano music playing”)
- Customizable face search
Selected use cases
- Video library search and curation
- Video repurposing and clip creation
- Transcription and chapterization
- Replacing traditional or AI tagging and labelling systems
- Locating B-roll, dialogue or SFX
- Assembling highlight reels
See it in action
Cetus
Our second multimodal AI system, specifically developed to increase explainability and provide a modern replacement for video labels and metadata.
Utilizing the analyses of Vulpus, Cetus provides descriptions of video content at a scene level, taking into account the action, characters, speech and context.
Key info
Current version | 1.0 |
First released | September 12, 2024 |
Modalities | Visual, audio, dialogue |
Availability | Imaginario app, API |
Capabilities
- Scalable processing and analysis of all video content (any length / any file size)
- Natural-language descriptions of any scene, taking into account visuals, audio and diaglogue
- Instant description generation
- Percentage confidence score based on search term
Get started with our free-forever Starter tier
Enjoy 30 minutes of free AI transcription and auto-captions every month, plus much more. No credit card required.