The best open source AI on demand in a sovereign cloud

Discover the best open source alternatives to ChatGPT, Gemini, Midjourney or Claude for processing sensitive data in full compliance with European and Swiss law.

LLM↓

Embedding↓

Audio↓

Image↓

Large language models (LLM)

The best open source alternatives to ChatGPT, Gemini and Microsoft Copilot for interacting, analysing and generating content with AI.

Qwen/Qwen3.5-122B-A10B-FP8

The most efficient

Get started for free

Consult the API documentation

●
Designed for complex tasks that require a broad context and greater precision in logical reasoning.
●
An optimised architecture for faster inference and reduced power consumption, freeing up significant computational resources.
●
Trained on millions of agents and tasks of increasing complexity to ensure robust adaptability in the real world.

Modality

Image-Text to Text

Max. input tokens

200’000

Languages

100+ languages

Function call

Yes

Template category

chat_large

Get started for free

Consult the API documentation

●
Designed for complex tasks that require a broad context and greater precision in logical reasoning.
●
An optimised architecture for faster inference and reduced power consumption, freeing up significant computational resources.
●
Trained on millions of agents and tasks of increasing complexity to ensure robust adaptability in the real world.

Modality

Image-Text to Text

Max. input tokens

200’000

Languages

100+ languages

Function call

Yes

Template category

chat_large

Apertus-70B-Instruct-2509

The most ethical

Beta

Get started for free

Consult the API documentation

●
Ideal for multilingual services, government agencies and R&D teams looking for a reliable, adaptable model
●
Data and methods documented for unprecedented transparency
●
Compliant with the AI Act and respectful of privacy and intellectual property
●
A 70B version with performance on a par with current market leaders

Modality

Text to Text

Max. input tokens

65’536

Languages

100+ languages

Function call

Template category

chat_medium

Get started for free

Consult the API documentation

●
Ideal for multilingual services, government agencies and R&D teams looking for a reliable, adaptable model
●
Data and methods documented for unprecedented transparency
●
Compliant with the AI Act and respectful of privacy and intellectual property
●
A 70B version with performance on a par with current market leaders

Modality

Text to Text

Max. input tokens

65’536

Languages

100+ languages

Function call

Template category

chat_medium

google/gemma-4-31B-it

The perfect balance

Get started for free

Consult the API documentation

●
The perfect balance between responsiveness and power, designed to excel at logical reasoning, in-depth document analysis and the generation of reliable code.
●
Leverages cutting-edge architecture to provide a nuanced understanding of broad contexts and complex instructions.
●
Ideal for advanced chatbots and enterprise workflows that require high flexibility without compromising on processing speed.

Modality

Text-to-Text (optimised for learning)

Max. input tokens

100’000

Languages

140+ languages

Function call

Yes (native and optimised)

Template category

chat_medium

Get started for free

Consult the API documentation

●
The perfect balance between responsiveness and power, designed to excel at logical reasoning, in-depth document analysis and the generation of reliable code.
●
Leverages cutting-edge architecture to provide a nuanced understanding of broad contexts and complex instructions.
●
Ideal for advanced chatbots and enterprise workflows that require high flexibility without compromising on processing speed.

Modality

Text-to-Text (optimised for learning)

Max. input tokens

100’000

Languages

140+ languages

Function call

Yes (native and optimised)

Template category

chat_medium

moonshotai/Kimi-K2.6

The most powerful for vibe coding

Beta

Get started for free

Consult the API documentation

●
Native multimodal: converts text, images or mockups into fully functional code.
●
Designed for large-scale development: includes an extended context window of up to 256k tokens to manage complex projects
●
Optimised for vibe coding: a fast, fluid and creative experience designed for developers and product designers
●
Compatible with agent-based workflows: automates analysis, code generation, and end-to-end execution

Modality

Image-Text to Text

Max. input tokens

256’000

Languages

Multilingual

Function call

Yes

Template category

code

Get started for free

Consult the API documentation

●
Native multimodal: converts text, images or mockups into fully functional code.
●
Designed for large-scale development: includes an extended context window of up to 256k tokens to manage complex projects
●
Optimised for vibe coding: a fast, fluid and creative experience designed for developers and product designers
●
Compatible with agent-based workflows: automates analysis, code generation, and end-to-end execution

Modality

Image-Text to Text

Max. input tokens

256’000

Languages

Multilingual

Function call

Yes

Template category

code

mistralai/Ministral-3-14B-Instruct-2512

The most versatile

Beta

Get started for free

Consult the API documentation

●
Optimised for fast and cost-effective deployment, ideal for chatbots, document analysis and specialised tasks.
●
Offers performance comparable to Mistral Small 3.2 24B with minimal resources.
●
Capable of analysing images and providing information based on visual content, in addition to text.

Modality

Image-Text to Text

Max. input tokens

100’000

Languages

EN, ES, FR, DE, IT...

Function call

Yes

Template category

chat_small

Get started for free

Consult the API documentation

●
Optimised for fast and cost-effective deployment, ideal for chatbots, document analysis and specialised tasks.
●
Offers performance comparable to Mistral Small 3.2 24B with minimal resources.
●
Capable of analysing images and providing information based on visual content, in addition to text.

Modality

Image-Text to Text

Max. input tokens

100’000

Languages

EN, ES, FR, DE, IT...

Function call

Yes

Template category

chat_small

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8

The most efficient architecture

Beta

Get started for free

Consult the API documentation

●
An innovative hybrid architecture that delivers the power of a large model with the speed and cost of a small model.
●
Excels at logical reasoning, summarising complex data and generating structured code thanks to training on high-quality technical datasets.
●
Ideal for large-scale deployments that require a balance between cutting-edge performance and infrastructure cost control.

Modality

Text to Text

Max. input tokens

1’000’000

Languages

EN, ES, FR, DE, IT, JP

Function call

Yes

Template category

chat_medium

Get started for free

Consult the API documentation

●
An innovative hybrid architecture that delivers the power of a large model with the speed and cost of a small model.
●
Excels at logical reasoning, summarising complex data and generating structured code thanks to training on high-quality technical datasets.
●
Ideal for large-scale deployments that require a balance between cutting-edge performance and infrastructure cost control.

Modality

Text to Text

Max. input tokens

1’000’000

Languages

EN, ES, FR, DE, IT, JP

Function call

Yes

Template category

chat_medium

mistralai/Mistral-Small-4-119B-2603

Most effective for learning and reasoning

Get started for free

Consult the API documentation

●
A versatile model capable of switching easily between general instruction and complex reasoning.
●
Designed for advanced autonomous workflows and business applications that require maximum reliability and consistency.
●
It far outperforms Mistral Small 3 in terms of latency and query throughput.

Modality

Image-Text to Text

Max. input tokens

256’000

Languages

Multilingual

Function call

Yes

Template category

chat_large

Get started for free

Consult the API documentation

●
A versatile model capable of switching easily between general instruction and complex reasoning.
●
Designed for advanced autonomous workflows and business applications that require maximum reliability and consistency.
●
It far outperforms Mistral Small 3 in terms of latency and query throughput.

Modality

Image-Text to Text

Max. input tokens

256’000

Languages

Multilingual

Function call

Yes

Template category

chat_large

Qwen/Qwen3.5-397B-A17B-FP8

The most powerful

Beta

Get started for free

Consult the API documentation

●
Cutting-edge MoE architecture designed for highly complex tasks, offering unmatched precision in scientific reasoning, multi-step planning and tool execution.
●
Take advantage of a significantly expanded global knowledge base to better master broad general knowledge and the generation of complex code.
●
Outperforms previous models in rigorous intelligence benchmarks.

Modality

Image-Text to Text

Max. input tokens

200’000

Languages

100+ languages

Function call

Yes

Template category

chat_large

Get started for free

Consult the API documentation

●
Cutting-edge MoE architecture designed for highly complex tasks, offering unmatched precision in scientific reasoning, multi-step planning and tool execution.
●
Take advantage of a significantly expanded global knowledge base to better master broad general knowledge and the generation of complex code.
●
Outperforms previous models in rigorous intelligence benchmarks.

Modality

Image-Text to Text

Max. input tokens

200’000

Languages

100+ languages

Function call

Yes

Template category

chat_large

Re-ranking models

The best compatible open-source alternatives for optimising the relevance of your search results. Refine your document rankings, improve the accuracy of your RAG systems and ensure smarter, more context-aware information retrieval.

BAAI/bge-reranker-v2-m3

The most versatile

Beta

Get started for free

Consult the API documentation

●
A state-of-the-art multilingual model capable of processing short queries, paragraphs and long documents of up to 8192 tokens simultaneously
●
Combines lexical (keywords) and semantic (meaning) analysis for unrivalled ranking accuracy on complex data sets
●
The ideal solution for enterprise search engines and RAG applications that require a deep understanding of context

Modality

Text to Text

Max. input tokens

8192

Languages

100+ languages

Function call

Type

re-rank

Get started for free

Consult the API documentation

●
A state-of-the-art multilingual model capable of processing short queries, paragraphs and long documents of up to 8192 tokens simultaneously
●
Combines lexical (keywords) and semantic (meaning) analysis for unrivalled ranking accuracy on complex data sets
●
The ideal solution for enterprise search engines and RAG applications that require a deep understanding of context

Modality

Text to Text

Max. input tokens

8192

Languages

100+ languages

Function call

Type

re-rank

Qwen/Qwen3-Reranker-0.6B

The most efficient

Beta

Get started for free

Consult the API documentation

●
Ultra-lightweight architecture (0.6 billion parameters) designed for ultra-low-latency inference and minimal power consumption
●
Maintains high relevance accuracy, even with a context window expanded up to 32768 tokens
●
Ideal for real-time data feeds, autonomous agents and large-scale deployments

Modality

Text to Text

Max. input tokens

32768

Languages

100+ languages

Function call

Type

re-rank

Get started for free

Consult the API documentation

●
Ultra-lightweight architecture (0.6 billion parameters) designed for ultra-low-latency inference and minimal power consumption
●
Maintains high relevance accuracy, even with a context window expanded up to 32768 tokens
●
Ideal for real-time data feeds, autonomous agents and large-scale deployments

Modality

Text to Text

Max. input tokens

32768

Languages

100+ languages

Function call

Type

re-rank

Embedding models

The best open-source embedding models to transform your data into intelligent vectors. Improve search accuracy, personalise recommendations, simplify data analysis, explore semantic links and easily classify text.

Bge Multilingual Gemma2

The highest quality

Get started for free

Consult the API documentation

●
The most powerful open-source embedding model on the market
●
The benchmark for semantic search and augmented search (RAG) tasks
●
Ideal for advanced use of embedding vectors in a variety of use cases
●
Outstanding performance, whatever language the text is in (100+ languages)

Max. input tokens

8192

Parameters

9.2 B

Dimensions

3584

Languages

EN, ES, FR, DE, IT...

Type

Text

Get started for free

Consult the API documentation

●
The most powerful open-source embedding model on the market
●
The benchmark for semantic search and augmented search (RAG) tasks
●
Ideal for advanced use of embedding vectors in a variety of use cases
●
Outstanding performance, whatever language the text is in (100+ languages)

Max. input tokens

8192

Parameters

9.2 B

Dimensions

3584

Languages

EN, ES, FR, DE, IT...

Type

Text

All MiniLM L12 v2

The best value for money

Get started for free

Consult the API documentation

●
This model is the result of community work based on a model published by Microsoft.
●
Excellent value for money, perfect for prototyping and simple tasks with limited resources
●
Great performance for relatively simple tasks, whatever language the text is in
●
Extreme speed for indexing huge databases or real-time processing
●
High energy efficiency to reduce environmental impact

Max. input tokens

512

Parameters

33 M

Dimensions

384

Languages

EN, ES, FR, DE, IT...

Type

Text

Get started for free

Consult the API documentation

●
This model is the result of community work based on a model published by Microsoft.
●
Excellent value for money, perfect for prototyping and simple tasks with limited resources
●
Great performance for relatively simple tasks, whatever language the text is in
●
Extreme speed for indexing huge databases or real-time processing
●
High energy efficiency to reduce environmental impact

Max. input tokens

512

Parameters

33 M

Dimensions

384

Languages

EN, ES, FR, DE, IT...

Type

Text

Voice recognition

The best open source AI for transcribing audio files into text or generating realistic human voices.

Whisper V3

For complex transcriptions

Get started for free

Consult the API documentation

●
Model trained on over 1 million hours of data
●
Transcription errors reduced by up to 20% compared with Whisper V2
●
Better handling of accents, background noise and complex speech (e.g., calls or videoconferences)
●
Enhanced multilingual support and translation of transcriptions into languages other than English

Maximum file size

25 MB

Formats supported

mp3, mp4, aac, wav, flac, ogg, opus, wma, m4a

Get started for free

Consult the API documentation

●
Model trained on over 1 million hours of data
●
Transcription errors reduced by up to 20% compared with Whisper V2
●
Better handling of accents, background noise and complex speech (e.g., calls or videoconferences)
●
Enhanced multilingual support and translation of transcriptions into languages other than English

Maximum file size

25 MB

Formats supported

mp3, mp4, aac, wav, flac, ogg, opus, wma, m4a

Image generation and processing

The best open source alternatives to Midjourney, Microsoft Copilot Designer and Gemini for generating, merging or interpreting images.

Photomaker V2

Ideal for generating images

Get started for free

Consult the API documentation

●
The best combination of quality and speed in generative AI image creation
●
Fast generation of photo-realistic images in 1, 2, 4 or 8 steps based on prompts
●
Operates by distillation, which increases energy efficiency and ensures excellent quality
●
Optimised for English, with limited knowledge of other languages (FR, DE, ES, IT, etc.)

Max. input tokens

Max. image output

Languages

Maximum resolution

1024x1024, 1792x1024, 1024x1792

Get started for free

Consult the API documentation

●
The best combination of quality and speed in generative AI image creation
●
Fast generation of photo-realistic images in 1, 2, 4 or 8 steps based on prompts
●
Operates by distillation, which increases energy efficiency and ensures excellent quality
●
Optimised for English, with limited knowledge of other languages (FR, DE, ES, IT, etc.)

Max. input tokens

Max. image output

Languages

Maximum resolution

1024x1024, 1792x1024, 1024x1792

Flux schnell

Ideal for modifying and merging portraits of people

Get started for free

Consult the API documentation

●
Create photos in multiple styles from one or more profile photos
●
Powerful and flexible: recontextualisation, colourisation, age and gender change, mix of identities, etc.

Max. input tokens

Max. image input

Max. image output

Languages

Maximum resolution

1024x1024, 1792x1024, 1024x1792

Get started for free

Consult the API documentation

●
Create photos in multiple styles from one or more profile photos
●
Powerful and flexible: recontextualisation, colourisation, age and gender change, mix of identities, etc.

Max. input tokens

Max. image input

Max. image output

Languages

Maximum resolution

1024x1024, 1792x1024, 1024x1792