Your data is hierarchical.Your model should be too.

hyper3-clip is an image-text embedding model built for hierarchy, variants, and compositional visual queries, ranking the exact visual match general models often miss.

See precision retrieval under pressure.

Start with operational region search where the target is only part of the image, then move into logo, catalog, fashion, geospatial, and safety workspaces powered by HyperView.

Precision Region Search

Compare model accuracy when searching for a small region within a complex scene.

Target Scene
Workspace / Facilities target scene
Target Area
A grey couch with pillows on it.
Target Region

A grey couch with pillows on it.

hyper3-CLIP
#1
OpenAI-CLIP
#20
hyper3-CLIPRank: #1
A grey couch with pillows on it.#1

A grey couch with pillows on it.

Target
Brown chair or love seat facing the tv and shelves.#2

Brown chair or love seat facing the tv and shelves.

a beige side chair with a brown pillow#3

a beige side chair with a brown pillow

The couch on the bottom left corner#4

The couch on the bottom left corner

A tan chair between two green tables.#5

A tan chair between two green tables.

OpenAI-CLIPRank: #20
Brown chair or love seat facing the tv and shelves.#1

Brown chair or love seat facing the tv and shelves.

The couch on the bottom left corner#2

The couch on the bottom left corner

Chair closest to curtains#3

Chair closest to curtains

A tan chair between two green tables.#4

A tan chair between two green tables.

A person in brown pants sitting on a bench next to a child.#5

A person in brown pants sitting on a bench next to a child.

Why Geometry Matters

Standard embeddings crush hierarchy.

Embedding models like OpenAI CLIP project hierarchical data onto flat Euclidean vectors. Because volume in flat spaces grows only polynomially ($r^d$), leaf categories crowd together. This triggers severe retrieval failures where sibling and cousin variants bleed into each other.

Flat Euclidean Space

Polynomial Growth

Leaf nodes crowd tightly near the bottom. Sibling details overlap and cause retrieval confusion.

Overcrowding

Hyperbolic Space

Exponential Growth

Poincaré Disk naturally expands outward. Branch paths separate cleanly, maintaining distance.

Separation
Model Evidence

The hyper³labs Stack

We combine specialized vision-language models with an open-source inspection workbench, then test them against concrete retrieval failure modes by industry.

hyper3-CLIPMultimodal Model

A specialized vision-language model trained with hierarchy-aware geometry to reduce category and sibling bleed in dense visual catalogs.

HyperViewInspection Workbench

An open-source viewer for mapping embedding projections, tracing nearest neighbors, and identifying the failure modes behind bad retrieval results.

Industry / DatasetBenchmarkhyper3-CLIPOpenAI-CLIPReadout
Ecommerce Catalog Retrieval – Amazon Berkeley Objects
Retail catalogs500 product images, 20 product typesProduct-type mAP0.5820.552+3.05 pts
Retail catalogs50 parsed catalog departmentsDepartment mAP0.2640.212+5.20 pts
Retail catalogsParent category retrieves diverse childrenChild coverage@500.7800.655+12.50 pts
Fashion Product Search – DeepFashion In-Shop
Apparel retailSame-item product image retrievalmAP0.4070.240+16.7 pts
Apparel retailSame-item first-result recoveryRecall@10.5950.375+22.0 pts
Apparel retailSpecific typed product searchHit@100.5720.550+2.2 pts
General Visual Hierarchy – COCO Objects
Object search5,000 COCO val images, 80 categoriesCategory mAP0.5540.532+2.22 pts
Object search12 object supercategoriesSupercategory mAP0.5360.516+2.08 pts
Object searchCoverage of child types under broad labelsChild coverage@1000.8870.951CLIP +6.40 pts
Research

Our Research

Understand how our frontier specialized models are trained from scratch, and explore our latest preprints and technical deep-dives.

arXiv:2604.09690v1 [cs.CV] 12 Apr 2026
arXiv
Are We Recognizing the Jaguar or Its Background? A Diagnostic Framework for Jaguar Re-ID
M. Mahmood & A. Rueda-Toicen
Abstract—Standard deep vision embeddings are heavily biased by background shortcuts. We introduce a diagnostic framework that evaluates background biases in wildlife monitoring datasets, showcasing substantial accuracy drops in non-aligned environments...
Proceedings of Biodiversity Computer Vision (BCV) 2026
The geometry mistake — Euclidean vs hyperbolic embedding spaces
Blog

The Geometry Mistake Behind Modern Embedding Models

Why mainstream ML's reliance on Euclidean manifolds is a mistake, and how hyperbolic spaces can efficiently encode hierarchical data with fewer dimensions.

read post →
Evaluation

Request a hyper3-clip eval.

Send a representative retrieval slice. We compare hyper3-clip with your current baseline and return metrics, ranked examples, and a pilot recommendation.

01

Send an eval set

Provide sample images, queries, expected matches, and your current baseline.

02

Compare embeddings

We compare hyper3-clip with your baseline and inspect the missed exact matches.

03

Get the readout

Receive metrics, ranked examples, and a clear recommendation on whether the model is a fit.

Request Eval