We’re launching an embedding API as part of the JigsawStack AI SDK. We see an opportunity here for RAG applications, especially ones that span a wide range of document types, including images, audio, and multiple languages.
Most embedding models, even the ones provided by OpenAI, support only one modality: text. Language support is another key issue: most models specialize in English or a small set of languages, which is a huge pain point for multilingual applications. Using a separate embedding model for each modality or language doesn’t work either, because each model produces its own vector space, so you can’t retrieve across languages and modalities.
So, to solve this, we built a model that is multimodal the way most LLMs are today, covering PDF, image, audio, and text, with support for over 80 languages.
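To make the cross-modal retrieval point concrete, here is a minimal sketch in Python (not the JigsawStack SDK). The `embed()` call in the comments is a hypothetical placeholder for the Alpha endpoint; the rest is plain cosine-similarity ranking, which only works across modalities and languages because every query and document lands in one shared embedding space.

```python
# A minimal sketch (not the actual JigsawStack API) of why a shared vector
# space matters: once text, image, and audio embeddings live in one space,
# a single cosine-similarity search retrieves across modalities and languages.
import numpy as np

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine similarity between two embedding vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def search(query_vec: np.ndarray, corpus: dict[str, np.ndarray], top_k: int = 3):
    """Rank every document (text, PDF page, image, audio clip) against the query."""
    scored = [(doc_id, cosine_similarity(query_vec, vec)) for doc_id, vec in corpus.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)[:top_k]

# Hypothetical usage: embed() stands in for the Alpha API. Because all
# embeddings come from the same multimodal model, a German text query can
# surface an English PDF page or an image.
# corpus = {"report.pdf": embed(pdf_bytes), "chart.png": embed(image_bytes)}
# results = search(embed("Quartalsbericht Umsatz"), corpus)
```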
Unlike many JigsawStack services, an embedding model is hard to test with just our small team. Embedding models are used across a wide range of industries and document types: legal contracts, Chinese documents, medical notes, general search, and the list goes on. We need to test this in the wild, in production applications with real documents. The model is in Alpha, and we need your help to get it to Beta and then production. While it is in Alpha, it is still hosted on the same infrastructure as every other JigsawStack service, with the same security standards and document management policy, keeping everything private and deletable.
Technical specs:
Drop me a DM here or email me at [email protected] to get the API. We’re happy to provide the embedding model free of charge to anyone who helps test it in Alpha, plus an additional 4 months of the full JigsawStack suite free once the model is in production.