Your data is hierarchical.Your model should be too.

hyper3-clip is an image-text embedding model built for hierarchy, variants, and compositional visual queries, ranking the exact visual match general models often miss.

Request Eval View model

See precision retrieval under pressure.

Start with operational region search where the target is only part of the image, then move into logo, catalog, fashion, geospatial, and safety workspaces powered by HyperView.

Precision Region Search

Compare model accuracy when searching for a small region within a complex scene.

Target Scene

Target Area

Target Region

A grey couch with pillows on it.

hyper3-CLIP

OpenAI-CLIP

#20

hyper3-CLIPRank: #1

A grey couch with pillows on it.

Target

Brown chair or love seat facing the tv and shelves.

a beige side chair with a brown pillow

The couch on the bottom left corner

A tan chair between two green tables.

OpenAI-CLIPRank: #20

Brown chair or love seat facing the tv and shelves.

The couch on the bottom left corner

Chair closest to curtains

A tan chair between two green tables.

A person in brown pants sitting on a bench next to a child.

Why Geometry Matters

Standard embeddings crush hierarchy.

Embedding models like OpenAI CLIP project hierarchical data onto flat Euclidean vectors. Because volume in flat spaces grows only polynomially ($r^d$), leaf categories crowd together. This triggers severe retrieval failures where sibling and cousin variants bleed into each other.

Flat Euclidean Space

Polynomial Growth

Leaf nodes crowd tightly near the bottom. Sibling details overlap and cause retrieval confusion.

Hyperbolic Space

Exponential Growth

Poincaré Disk naturally expands outward. Branch paths separate cleanly, maintaining distance.

Model Evidence

The hyper³labs Stack

We combine specialized vision-language models with an open-source inspection workbench, then test them against concrete retrieval failure modes by industry.

hyper3-CLIPMultimodal Model

A specialized vision-language model trained with hierarchy-aware geometry to reduce category and sibling bleed in dense visual catalogs.

HyperViewInspection Workbench

An open-source viewer for mapping embedding projections, tracing nearest neighbors, and identifying the failure modes behind bad retrieval results.

Industry / Dataset	Benchmark	hyper3-CLIP	OpenAI-CLIP	Readout
Ecommerce Catalog Retrieval – Amazon Berkeley Objects
Retail catalogs500 product images, 20 product types	Product-type mAP	0.582	0.552	+3.05 pts
Retail catalogs50 parsed catalog departments	Department mAP	0.264	0.212	+5.20 pts
Retail catalogsParent category retrieves diverse children	Child coverage@50	0.780	0.655	+12.50 pts
Fashion Product Search – DeepFashion In-Shop
Apparel retailSame-item product image retrieval	mAP	0.407	0.240	+16.7 pts
Apparel retailSame-item first-result recovery	Recall@1	0.595	0.375	+22.0 pts
Apparel retailSpecific typed product search	Hit@10	0.572	0.550	+2.2 pts
General Visual Hierarchy – COCO Objects
Object search5,000 COCO val images, 80 categories	Category mAP	0.554	0.532	+2.22 pts
Object search12 object supercategories	Supercategory mAP	0.536	0.516	+2.08 pts
Object searchCoverage of child types under broad labels	Child coverage@100	0.887	0.951	CLIP +6.40 pts

Research

Our Research

Understand how our frontier specialized models are trained from scratch, and explore our latest preprints and technical deep-dives.

Collab with us ↗

arXiv:2604.09690v1 [cs.CV] 12 Apr 2026

arXiv

Are We Recognizing the Jaguar or Its Background? A Diagnostic Framework for Jaguar Re-ID

M. Mahmood & A. Rueda-Toicen

Abstract—Standard deep vision embeddings are heavily biased by background shortcuts. We introduce a diagnostic framework that evaluates background biases in wildlife monitoring datasets, showcasing substantial accuracy drops in non-aligned environments...

Proceedings of Biodiversity Computer Vision (BCV) 2026

April 12, 2026

Are We Recognizing the Jaguar or Its Background? A Diagnostic Framework for Jaguar Re-ID

The geometry mistake — Euclidean vs hyperbolic embedding spaces

BlogApr 10, 2024

The Geometry Mistake Behind Modern Embedding Models

Why mainstream ML's reliance on Euclidean manifolds is a mistake, and how hyperbolic spaces can efficiently encode hierarchical data with fewer dimensions.

read post →

Evaluation

Request a hyper3-clip eval.

Send a representative retrieval slice. We compare hyper3-clip with your current baseline and return metrics, ranked examples, and a pilot recommendation.

Send an eval set

Provide sample images, queries, expected matches, and your current baseline.

Compare embeddings

We compare hyper3-clip with your baseline and inspect the missed exact matches.

Get the readout

Receive metrics, ranked examples, and a clear recommendation on whether the model is a fit.

Request Eval