The best open source AI on demand in a sovereign cloud
Discover the best open source alternatives to ChatGPT, Gemini, Midjourney or Claude for processing sensitive data in full compliance with European and Swiss law.
Large language models (LLM)
The best open source alternatives to ChatGPT, Gemini and Microsoft Copilot for interacting, analysing and generating content with AI.
Qwen/Qwen3.5-122B-A10B-FP8
The most powerful
Beta
- Designed for complex tasks that require a broad context and greater precision in logical reasoning.
- An architecture optimized for faster inference and reduced power consumption, freeing up significant computational resources.
- Trained on millions of agents and tasks of increasing complexity to ensure robust adaptability in the real world.
Modality: Image-Text to Text
Max. input tokens: 200’000
Languages: 100+ languages
Function call: Yes
Template category: chat_large
Apertus-70B-Instruct-2509
The most ethical
Beta
- Ideal for multilingual services, government agencies and R&D teams looking for a reliable, adaptable model
- Data and methods documented for unprecedented transparency
- Compliant with the AI Act and respectful of privacy and intellectual property
- A 70B version with performance on a par with current market leaders
Modality: Text to Text
Max. input tokens: 65’536
Languages: 100+ languages
Function call: No
Template category: chat_medium
google/gemma-4-31B-it
The perfect balance
Beta
- The perfect balance between responsiveness and power, designed to excel at logical reasoning, in-depth document analysis, and the generation of reliable code.
- Leverages cutting-edge architecture to provide a nuanced understanding of broad contexts and complex instructions.
- Ideal for advanced chatbots and enterprise workflows that require high flexibility without compromising on processing speed.
Modality: Text-to-Text (optimized for education)
Max. input tokens: 100’000
Languages: 140+ languages
Function call: Yes (native and optimized)
Template category: chat_medium
moonshotai/Kimi-K2.6
The most powerful for vibe coding
Beta
- Native multimodal: converts text, images or mockups into fully functional code.
- Designed for large-scale development: includes an extended context window of up to 256k tokens to manage complex projects
- Optimised for vibe coding: a fast, fluid and creative experience designed for developers and product designers
- Compatible with agent-based workflows: automates analysis, code generation, and end-to-end execution
Modality: Image-Text to Text
Max. input tokens: 256’000
Languages: Multilingual
Function call: Yes
Template category: code
mistralai/Ministral-3-14B-Instruct-2512
The most versatile
Beta
- Optimized for fast and cost-effective deployment, ideal for chatbots, document analysis, and specialized tasks.
- Offers performance comparable to the Mistral Small 3.2 24B with minimal resources.
- Capable of analyzing images and providing information based on visual content, in addition to text.
Modality: Image-Text to Text
Max. input tokens: 100’000
Languages: EN, ES, FR, DE, IT...
Function call: Yes
Template category: chat_small
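The input limits listed for the five language models can guide model choice before a request is sent. The sketch below is illustrative only: the limits are copied from the cards above, while the 4-characters-per-token ratio and the helper names `estimate_tokens` and `models_that_fit` are assumptions made for this example, not part of any official API. Real token counts depend on each model's tokenizer and on the language of the text.

```python
# Max. input tokens per model, copied from the cards above.
MAX_INPUT_TOKENS = {
    "Qwen/Qwen3.5-122B-A10B-FP8": 200_000,
    "Apertus-70B-Instruct-2509": 65_536,
    "google/gemma-4-31B-it": 100_000,
    "moonshotai/Kimi-K2.6": 256_000,
    "mistralai/Ministral-3-14B-Instruct-2512": 100_000,
}

# Assumption: roughly 4 characters per token. This is a coarse heuristic only.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Rough token estimate (ceiling of characters / 4)."""
    return -(-len(text) // CHARS_PER_TOKEN)


def models_that_fit(text: str, reserve: int = 0) -> list[str]:
    """Models whose input limit covers the estimate plus a safety reserve,
    ordered from the smallest sufficient limit to the largest."""
    needed = estimate_tokens(text) + reserve
    fitting = (name for name, limit in MAX_INPUT_TOKENS.items() if limit >= needed)
    return sorted(fitting, key=MAX_INPUT_TOKENS.get)
```

Passing a `reserve` leaves headroom for system prompts or tool definitions, which also count against the input limit.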
Re-ranking models
The best open-source models for optimizing the relevance of your search results. Refine your document rankings, improve the accuracy of your RAG systems, and ensure smarter, more context-aware information retrieval.
BAAI/bge-reranker-v2-m3
The most versatile
- A state-of-the-art multilingual model capable of processing short queries, paragraphs, and long documents of up to 8192 tokens simultaneously
- Combines lexical (keywords) and semantic (meaning) analysis for unmatched ranking accuracy on complex corpora
- The ideal solution for enterprise search engines and RAG applications that require a deep understanding of context
Modality: Text to Text
Max. input tokens: 8192
Languages: 100+ languages
Function call: No
Type: Reranking
Qwen/Qwen3-Reranker-0.6B
The most efficient
- Ultra-lightweight architecture (0.6 billion parameters) designed for ultra-low-latency inference and minimal power consumption
- Maintains high relevance accuracy even with a context window expanded up to 32768 tokens
- Ideal for real-time data streams, autonomous agents, and large-scale deployments
Modality: Text to Text
Max. input tokens: 32768
Languages: 100+ languages
Function call: No
Type: Reranking
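In a typical RAG pipeline, a reranker scores each candidate document against the query, and the application keeps only the best-scoring ones. The sketch below shows that last step only. The function name `rerank`, the sample documents and the scores are all made up for illustration; in practice the scores would come from one of the reranking models above.

```python
from typing import Sequence


def rerank(
    documents: Sequence[str], scores: Sequence[float], top_k: int = 3
) -> list[tuple[str, float]]:
    """Pair each document with its relevance score and keep the top_k best."""
    if len(documents) != len(scores):
        raise ValueError("documents and scores must have the same length")
    ranked = sorted(zip(documents, scores), key=lambda pair: pair[1], reverse=True)
    return ranked[:top_k]


# Hypothetical first-stage search results with made-up reranker scores.
candidates = ["Invoice template", "Refund policy", "Office opening hours"]
scores = [0.12, 0.91, 0.05]
top = rerank(candidates, scores, top_k=2)
```

Only the documents in `top` would then be passed to the language model, which keeps the prompt short and focused.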
Embedding models
The best open-source embedding models to transform your data into intelligent vectors. Improve search accuracy, personalise recommendations, simplify data analysis, explore semantic links and easily classify text.
Bge Multilingual Gemma2
The highest quality
- The most powerful open-source embedding model on the market
- The benchmark for semantic search and retrieval-augmented generation (RAG) tasks
- Ideal for advanced use of embedding vectors in a variety of use cases
- Outstanding performance, whatever language the text is in (100+ languages)
Max. input tokens: 8192
Parameters: 9.2 B
Dimensions: 3584
Languages: EN, ES, FR, DE, IT...
Type: Text
All MiniLM L12 v2
The best value for money
- This model is the result of community work based on a model published by Microsoft.
- Excellent value for money, perfect for prototyping and simple tasks with limited resources
- Great performance for relatively simple tasks, whatever language the text is in
- Extreme speed for indexing huge databases or real-time processing
- High energy efficiency to reduce environmental impact
Max. input tokens: 512
Parameters: 33 M
Dimensions: 384
Languages: EN, ES, FR, DE, IT...
Type: Text
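Semantic search with embedding vectors usually comes down to comparing a query vector with stored document vectors, most often by cosine similarity. The sketch below illustrates the idea with 3-dimensional toy vectors; real vectors from the models above would have 384 or 3584 dimensions. The function names and sample data are assumptions made for this example.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors of equal length."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same number of dimensions")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def most_similar(query: Sequence[float], index: dict[str, Sequence[float]]) -> str:
    """Return the key of the stored vector closest to the query."""
    return max(index, key=lambda key: cosine_similarity(query, index[key]))


# Toy index with made-up vectors standing in for real embeddings.
toy_index = {
    "billing": [0.9, 0.1, 0.0],
    "shipping": [0.1, 0.9, 0.2],
}
best = most_similar([0.8, 0.2, 0.1], toy_index)
```

Because both vectors must have the same number of dimensions, an index built with one embedding model cannot be queried with vectors from the other.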
Voice recognition
The best open source AI for transcribing audio files into text or generating realistic human voices.
Whisper V3
For complex transcriptions
- Model trained on over 1 million hours of data
- Transcription errors reduced by up to 20% compared with Whisper V2
- Better handling of accents, background noise and complex speech (e.g., calls or videoconferences)
- Enhanced multilingual support and translation of transcriptions into languages other than English
Maximum file size: 25 MB
Formats supported: mp3, mp4, aac, wav, flac, ogg, opus, wma, m4a
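The file size limit and the nine supported formats can be checked on the client side before an audio file is uploaded. The sketch below is one possible pre-flight check. The function name `check_audio` is hypothetical, and reading "25 MB" as 25,000,000 bytes is an assumption chosen as the more conservative interpretation.

```python
from pathlib import Path

# Assumption: "25 MB" is read as decimal megabytes (the stricter reading).
MAX_FILE_BYTES = 25 * 1000 * 1000

# Formats listed on the Whisper V3 card above.
SUPPORTED_FORMATS = {"mp3", "mp4", "aac", "wav", "flac", "ogg", "opus", "wma", "m4a"}


def check_audio(filename: str, size_bytes: int) -> list[str]:
    """Return a list of problems; an empty list means the file looks acceptable."""
    problems = []
    extension = Path(filename).suffix.lower().lstrip(".")
    if extension not in SUPPORTED_FORMATS:
        problems.append(f"unsupported format: {extension or 'none'}")
    if size_bytes > MAX_FILE_BYTES:
        problems.append(f"file too large: {size_bytes} bytes (limit {MAX_FILE_BYTES})")
    return problems
```

A longer recording that exceeds the limit would need to be split or re-encoded at a lower bitrate before transcription.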
Image generation and processing
The best open source alternatives to Midjourney, Microsoft Copilot Designer and Gemini for generating, merging or interpreting images.
Photomaker V2
Ideal for generating images
- The best combination of quality and speed in generative AI image creation
- Fast generation of photo-realistic images in 1, 2, 4 or 8 steps based on prompts
- Operates by distillation, which increases energy efficiency and ensures excellent quality
- Optimised for English, with limited knowledge of other languages (FR, DE, ES, IT, etc.)
Max. input tokens: 77
Max. image output: 5
Languages: EN
Maximum resolution: 1024x1024, 1792x1024, 1024x1792
Flux schnell
Ideal for modifying and merging portraits of people
- Create photos in multiple styles from one or more profile photos
- Powerful and flexible: recontextualisation, colourisation, age and gender change, mix of identities, etc.
Max. input tokens: 77
Max. image input: 6
Max. image output: 5
Languages: EN
Maximum resolution: 1024x1024, 1792x1024, 1024x1792
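The image limits listed on the cards above (three resolutions, up to 5 output images, up to 6 input images) lend themselves to a simple request check. The sketch below is a hypothetical helper, not part of any official client: the name `check_image_request` and its parameters are assumptions for illustration. The 77-token prompt limit is not checked here because that would need the model's own tokenizer.

```python
# Limits copied from the image model cards above.
ALLOWED_RESOLUTIONS = {(1024, 1024), (1792, 1024), (1024, 1792)}
MAX_IMAGES_OUT = 5
MAX_IMAGES_IN = 6


def check_image_request(
    width: int, height: int, n_outputs: int, n_inputs: int = 0
) -> list[str]:
    """Return a list of problems; an empty list means the request fits the limits."""
    problems = []
    if (width, height) not in ALLOWED_RESOLUTIONS:
        problems.append(f"unsupported resolution: {width}x{height}")
    if not 1 <= n_outputs <= MAX_IMAGES_OUT:
        problems.append(f"n_outputs must be between 1 and {MAX_IMAGES_OUT}")
    if not 0 <= n_inputs <= MAX_IMAGES_IN:
        problems.append(f"n_inputs must be between 0 and {MAX_IMAGES_IN}")
    return problems
```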


