Features
A standalone desktop application with mobile companion apps. 12 specialized AI models work together to understand your media in depth - without a single byte leaving your device.
Overview
Whether you're editing a documentary, managing a production archive, or sharing family photos - Colibri is a complete media management platform that searches across all your media types at once and keeps everything on your own hardware. See who it's built for →
Scenes, faces, objects, OCR, transcription, translation & summaries - helping you prepare edits, log footage, and process large archives faster.
One search across photos, videos, audio files and documents. Describe what you need in natural language and find it in seconds, whether it's a specific interview clip, a photo from a particular shoot, or a document mentioned in a recording.
A knowledge graph that links people, places and topics across your entire media archive into a navigable network - perfect for research and investigative work.
Share collections across your team or family at ease. Everyone can browse, search and work with shared media - from production crews syncing footage to families sharing phone photos. Synchronization stays fully local, no third-party cloud required.
Backups, synchronization and quick media exchange. Colibri separates original material from lightweight previews - originals stay safely archived while previews keep everything browsable and searchable, even on devices with limited storage.
Available on macOS, Windows, Linux, iOS and Android. Your media archive syncs seamlessly across all your devices - search and browse on your phone while originals live on your workstation.
Entire collections can be encrypted at rest. All data transfer and synchronization is end-to-end encrypted. Your originals stay untouched and verifiable - in times of AI-generated fakes, knowing your source material is authentic and has not been accessed by unauthorized parties is more important than ever. Colibri uses no generative AI - it analyzes and understands your media, but never generates, alters, or fabricates content. And your data is never used for training any AI models.
How It Works
Drag and drop your files. Colibri works with media where it already lives - no copying required.
12 specialized models run locally in the background - describing scenes, transcribing speech, recognizing faces.
Instant results in natural language. No internet required. No API costs. No limits. Ever.
AI Models
Instead of one massive general-purpose model, Colibri uses many small, efficient specialist models - each optimized for a specific task.
Detects and describes what's in photos and videos - scenes, objects, moods and context.
Reads text in images (OCR), generates tags and summaries automatically.
Converts spoken word to text in over 100 languages and translates between them - faster than real-time.
Find any media file by describing what you need in your own words - instant results.
Identifies and clusters faces across your entire archive so you can find every photo of a specific person instantly.
Sound classification, quality scoring, speaker diarization - the modular architecture makes it easy to add new capabilities over time.
Many small, efficient models instead of one big all-rounder - that's how AI becomes possible on regular hardware.
The Application
Get Started
Apply for early beta access and be among the first to experience local AI media management.