Advanced

Unlike Ahnlich DB, which is concerned with similarity algorithms and indexing, Ahnlich AI focuses on embedding generation. The service introduces model-aware stores, where you define the embedding models used for both data insertion (indexing) and querying. This abstraction lets developers work directly with raw inputs (text or images) while the AI proxy handles embedding generation.

Supported Models

Ahnlich AI includes several pre-trained models that can be configured depending on your workload. These cover both text embeddings and image embeddings:

Model Name	String Name	Type	Max Input	Description
ALL_MINI_LM_L6_V2	all-minilm-l6-v2	Text	256 tokens	Lightweight sentence transformer. Fast and memory-efficient, ideal for semantic similarity in applications like FAQ search or chatbots.
ALL_MINI_LM_L12_V2	all-minilm-l12-v2	Text	256 tokens	Larger variant of MiniLM. Higher accuracy for nuanced text similarity tasks, but with increased compute requirements.
BGE_BASE_EN_V15	bge-base-en-v1.5	Text	512 tokens	Base version of the BGE (English v1.5) model. Balanced performance and speed, suitable for production-scale applications.
BGE_LARGE_EN_V15	bge-large-en-v1.5	Text	512 tokens	High-accuracy embedding model for semantic search and retrieval. Best choice when precision is more important than latency.
RESNET50	resnet-50	Image	224x224 px	Convolutional Neural Network (CNN) for extracting embeddings from images. Useful for content-based image retrieval and clustering.
CLIP_VIT_B32_IMAGE	clip-vit-b32-image	Image	224x224 px	Vision Transformer encoder from the CLIP model. Produces embeddings aligned with its paired text encoder for multimodal tasks.
CLIP_VIT_B32_TEXT	clip-vit-b32-text	Text	77 tokens	Text encoder from CLIP. Designed to map textual inputs into the same space as CLIP image embeddings for text-to-image or image-to-text search.

Supported Input Types

Input Type	Description
RAW_STRING	Accepts natural text (sentences, paragraphs). Transformed into embeddings via a selected text-based model.
IMAGE	Accepts image files as input. Converted into embeddings via a selected image-based model (e.g., ResNet or CLIP).

Example – Creating a Model-Aware Store

CREATESTORE my_store QUERYMODEL all-minilm-l6-v2 INDEXMODEL all-minilm-l6-v2

index_model - defines how inserted data is embedded before being stored in Ahnlich DB.
query_model - defines how queries are embedded at search time.
Both models must output embeddings of the same dimensionality to ensure compatibility.

Choosing the Right Model

Model	Best Use Case
MiniLM (L6/L12)	Fast, efficient semantic similarity (FAQs, chatbots).
BGE (Base/Large)	High semantic accuracy for production-scale applications.
ResNet50	Image-to-image similarity and clustering.
CLIP (Text+Image)	Multimodal retrieval (text-to-image / image-to-text search).

Supported Models​

Supported Input Types​

Example – Creating a Model-Aware Store​

Choosing the Right Model​

Supported Models

Supported Input Types

Example – Creating a Model-Aware Store

Choosing the Right Model