Mistral / Transparency - Langbase · Serverless AI Developer Platform

Instructed Models

Mixtral released Mixtral 8x7B Instruct alongside Mixtral 8x7B. This model has been optimized through supervised fine-tuning and direct preference optimization (DPO) for careful instruction following. On MT-Bench, it reaches a score of 8.30, making it the best open-source model, with a performance comparable to GPT-3.5.

Note: Mixtral can be gracefully prompted to ban some outputs for constructing applications that require a strong level of moderation, as exemplified here. Proper preference tuning can also serve this purpose. Bear in mind that without such a prompt, the model will follow whatever instructions are given.

Deploy Mixtral with an Open-Source Deployment Stack

To enable the community to run Mixtral with a fully open-source stack, they have submitted changes to the vLLM project, which integrates Megablocks CUDA kernels for efficient inference.

Langbase

Model Card

Instructed Models

Deploy Mixtral with an Open-Source Deployment Stack

Meta data

Context

Prompt Cost

Completion Cost

⌘Langbase

Model Card

Instructed Models

Deploy Mixtral with an Open-Source Deployment Stack

Meta data

Context

Prompt Cost

Completion Cost

Langbase