The best open source AI on demand in a sovereign cloud

Discover the best open source alternatives to ChatGPT, Gemini, Midjourney or Claude for processing sensitive data in full compliance with European and Swiss law.

Large language models (LLMs)

The best open source alternatives to ChatGPT, Gemini and Microsoft Copilot for interacting, analysing and generating content with AI.

Qwen/Qwen3.5-122B-A10B-FP8

The most powerful

Beta

  • Designed for complex tasks that require a broad context and greater precision in logical reasoning.

  • An architecture optimized for faster inference and reduced power consumption, freeing up significant computational resources.

  • Trained on millions of agents and tasks of increasing complexity to ensure robust adaptability in the real world.

Modality

Image-Text to Text

Max. input tokens

200’000

Languages

100+ languages

Function call

Yes

Template category

chat_large

Apertus-70B-Instruct-2509

The most ethical

Beta

  • Ideal for multilingual services, government agencies and R&D teams looking for a reliable, adaptable model

  • Data and methods documented for unprecedented transparency

  • Compliant with the AI Act and respectful of privacy and intellectual property

  • A 70B version with performance on a par with current market leaders

Modality

Text to Text

Max. input tokens

65’536

Languages

100+ languages

Function call

No

Template category

chat_medium

google/gemma-4-31B-it

The perfect balance

Beta

  • The perfect balance between responsiveness and power, designed to excel at logical reasoning, in-depth document analysis, and the generation of reliable code.

  • Leverages cutting-edge architecture to provide a nuanced understanding of broad contexts and complex instructions.

  • Ideal for advanced chatbots and enterprise workflows that require high flexibility without compromising on processing speed.

Modality

Text-to-Text (optimized for education)

Max. input tokens

100’000

Languages

140+ languages

Function call

Yes (native and optimized)

Template category

chat_medium

moonshotai/Kimi-K2.6

The most powerful for vibe coding

Beta

  • Natively multimodal: converts text, images or mockups into fully functional code.

  • Designed for large-scale development: includes an extended context window of up to 256k tokens to manage complex projects

  • Optimised for vibe coding: a fast, fluid and creative experience designed for developers and product designers

  • Compatible with agent-based workflows: automates analysis, code generation, and end-to-end execution

Modality

Image-Text to Text

Max. input tokens

256’000

Languages

Multilingual

Function call

Yes

Template category

code

mistralai/Ministral-3-14B-Instruct-2512

The most versatile

Beta

  • Optimized for fast and cost-effective deployment, ideal for chatbots, document analysis, and specialized tasks.

  • Offers performance comparable to the Mistral Small 3.2 24B with minimal resources.

  • Capable of analyzing images and providing information based on visual content, in addition to text.

Modality

Image-Text to Text

Max. input tokens

100’000

Languages

EN, ES, FR, DE, IT...

Function call

Yes

Template category

chat_small
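
The "Max. input tokens" values listed for each model can drive a simple pre-flight check before a prompt is sent. The sketch below is illustrative only: the dictionary copies the limits shown on this page, while the helper names and the rule of thumb of roughly four characters per token are assumptions. Real token counts depend on each model's tokenizer.

```python
# Input limits copied from the model cards on this page (in tokens).
MAX_INPUT_TOKENS = {
    "Qwen/Qwen3.5-122B-A10B-FP8": 200_000,
    "Apertus-70B-Instruct-2509": 65_536,
    "google/gemma-4-31B-it": 100_000,
    "moonshotai/Kimi-K2.6": 256_000,
    "mistralai/Ministral-3-14B-Instruct-2512": 100_000,
}


def estimate_tokens(text: str, chars_per_token: float = 4.0) -> int:
    """Rough token estimate (assumption: about 4 characters per token)."""
    return int(len(text) / chars_per_token) + 1


def models_that_fit(prompt: str, reserve: int = 0) -> list[str]:
    """Return the models whose input limit covers the prompt plus a reserve,
    ordered from the smallest sufficient limit to the largest."""
    needed = estimate_tokens(prompt) + reserve
    fitting = [
        (limit, name)
        for name, limit in MAX_INPUT_TOKENS.items()
        if limit >= needed
    ]
    return [name for _, name in sorted(fitting)]
```

Calling `models_that_fit(prompt, reserve=2_000)` keeps some headroom for system instructions or tool definitions; the size of that reserve is a design choice, not a documented requirement.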

Re-ranking models

The best open-source re-ranking models for optimizing the relevance of your search results. Refine your document rankings, improve the accuracy of your RAG systems, and ensure smarter, more context-aware information retrieval.

BAAI/bge-reranker-v2-m3

The most versatile

  • A state-of-the-art multilingual model capable of processing short queries, paragraphs, and long documents of up to 8192 tokens simultaneously

  • Combines lexical (keywords) and semantic (meaning) analysis for unmatched ranking accuracy on complex corpora

  • The ideal solution for enterprise search engines and RAG applications that require a deep understanding of context

Modality

Text to Text

Max. input tokens

8192

Languages

100+ languages

Function call

No

Type

to rank

Qwen/Qwen3-Reranker-0.6B

The most efficient

  • Ultra-lightweight architecture (0.6 billion parameters) designed for ultra-low-latency inference and minimal power consumption

  • Maintains high relevance accuracy even with a context window expanded up to 32768 tokens

  • Ideal for real-time data streams, autonomous agents, and large-scale deployments

Modality

Text to Text

Max. input tokens

32768

Languages

100+ languages

Function call

No

Type

to rank
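
To make the re-ranking step of a RAG pipeline concrete, here is a minimal sketch of how candidate documents might be reordered once each one has a relevance score. The `overlap_score` function is only a placeholder based on shared words so that the example runs on its own; in practice a re-ranking model such as those above would supply the score. The function names and the `top_k` parameter are assumptions made for illustration.

```python
def overlap_score(query: str, document: str) -> float:
    """Placeholder scorer: share of query words that appear in the document.
    A real deployment would ask a re-ranking model for this score instead."""
    query_words = set(query.lower().split())
    doc_words = set(document.lower().split())
    if not query_words:
        return 0.0
    return len(query_words & doc_words) / len(query_words)


def rerank(query, documents, score=overlap_score, top_k=3):
    """Score every candidate against the query and keep the top_k best,
    returned as (score, document) pairs from most to least relevant."""
    scored = [(score(query, doc), doc) for doc in documents]
    # The sort is stable, so candidates with equal scores keep their order.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]


candidates = [
    "swiss data centers host the cloud",
    "a recipe for cheese fondue",
    "cloud hosting in swiss data centers is sovereign",
]
best = rerank("swiss cloud hosting", candidates, top_k=2)
```

Only the documents kept in `best` would then be passed to the language model as context, which is how re-ranking reduces noise in a RAG prompt.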

Embedding models

The best open-source embedding models to transform your data into intelligent vectors. Improve search accuracy, personalise recommendations, simplify data analysis, explore semantic links and easily classify text.

Bge Multilingual Gemma2

The highest quality

  • The most powerful open-source embedding model on the market

  • The benchmark for semantic search and retrieval-augmented generation (RAG) tasks

  • Ideal for advanced use of embedding vectors in a variety of use cases

  • Outstanding performance, whatever language the text is in (100+ languages)

Max. input tokens

8192

Parameters

9.2 B

Dimensions

3584

Languages

EN, ES, FR, DE, IT...

Type

Text

All MiniLM L12 v2

The best value for money

  • This model is the result of community work based on a model published by Microsoft.

  • Excellent value for money, perfect for prototyping and simple tasks with limited resources

  • Great performance for relatively simple tasks, whatever language the text is in

  • Extreme speed for indexing huge databases or real-time processing

  • High energy efficiency to reduce environmental impact

Max. input tokens

512

Parameters

33 M

Dimensions

384

Languages

EN, ES, FR, DE, IT...

Type

Text
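
Semantic search with embeddings usually comes down to comparing vectors. The sketch below shows cosine similarity on toy 4-dimensional vectors; real vectors from the models above would have 384 or 3584 dimensions, and the vector values, labels and helper names here are invented purely for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors (1.0 means same direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # convention chosen here for zero vectors
    return dot / (norm_a * norm_b)


def most_similar(query_vector, index):
    """Return the key of the indexed vector closest to the query vector."""
    return max(index, key=lambda name: cosine_similarity(query_vector, index[name]))


# Toy index: in practice each vector would come from an embedding model.
toy_index = {
    "invoice": [1.0, 0.0, 0.0, 0.0],
    "holiday": [0.0, 1.0, 0.0, 0.0],
}
closest = most_similar([0.9, 0.1, 0.0, 0.0], toy_index)
```

The number of dimensions mainly affects storage and comparison cost: a 3584-dimension vector takes roughly nine times more space than a 384-dimension one.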

Voice recognition

The best open source AI for transcribing audio files into text or generating realistic human voices.

Whisper V3

For complex transcriptions

  • Model trained on over 1 million hours of data

  • Transcription errors reduced by up to 20% compared with Whisper V2

  • Better handling of accents, background noise and complex speech (e.g., calls or videoconferences)

  • Enhanced multilingual support and translation of transcriptions into languages other than English

Maximum file size

25 MB

Formats supported

mp3, mp4, aac, wav, flac, ogg, opus, wma, m4a
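
The size and format limits above are easy to check before uploading a recording. This is a minimal sketch under stated assumptions: the helper name `check_audio` is hypothetical, the format is inferred from the file extension only, and 1 MB is taken as 1024 × 1024 bytes, which the page does not specify.

```python
from pathlib import Path

# Limits copied from the Whisper V3 card on this page.
MAX_FILE_BYTES = 25 * 1024 * 1024  # assumption: 1 MB = 1024 * 1024 bytes
SUPPORTED_FORMATS = {"mp3", "mp4", "aac", "wav", "flac", "ogg", "opus", "wma", "m4a"}


def check_audio(filename: str, size_bytes: int) -> list[str]:
    """Return a list of problems; an empty list means the file looks acceptable."""
    problems = []
    extension = Path(filename).suffix.lower().lstrip(".")
    if extension not in SUPPORTED_FORMATS:
        problems.append(f"unsupported format: {extension or 'none'}")
    if size_bytes > MAX_FILE_BYTES:
        problems.append("file larger than 25 MB")
    return problems
```

For a file on disk, `check_audio(path, os.path.getsize(path))` would supply the size; longer recordings would need to be split or compressed before transcription.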

Image generation and processing

The best open source alternatives to Midjourney, Microsoft Copilot Designer and Gemini for generating, merging or interpreting images.

Flux schnell

Ideal for generating images

  • The best combination of quality and speed in generative AI image creation

  • Fast generation of photo-realistic images in 1, 2, 4 or 8 steps based on prompts

  • Built through distillation, which increases energy efficiency while preserving excellent quality

  • Optimised for English, with limited knowledge of other languages (FR, DE, ES, IT, etc.)

Max. input tokens

77

Max. image output

5

Languages

EN

Maximum resolution

1024x1024, 1792x1024, 1024x1792

Photomaker V2

Ideal for modifying and merging portraits of people

  • Create photos in multiple styles from one or more profile photos

  • Powerful and flexible: recontextualisation, colourisation, age and gender change, mix of identities, etc.

Max. input tokens

77

Max. image input

6

Max. image output

5

Languages

EN

Maximum resolution

1024x1024, 1792x1024, 1024x1792