Contact Support
    Mistral/Mixtral-8x7B
    License

    Model Card

    Instructed Models

    Mixtral released Mixtral 8x7B Instruct alongside Mixtral 8x7B. This model has been optimized through supervised fine-tuning and direct preference optimization (DPO) for careful instruction following. On MT-Bench, it reaches a score of 8.30, making it the best open-source model, with a performance comparable to GPT-3.5.

    Note: Mixtral can be gracefully prompted to ban some outputs for constructing applications that require a strong level of moderation, as exemplified here. Proper preference tuning can also serve this purpose. Bear in mind that without such a prompt, the model will follow whatever instructions are given.

    Deploy Mixtral with an Open-Source Deployment Stack

    To enable the community to run Mixtral with a fully open-source stack, they have submitted changes to the vLLM project, which integrates Megablocks CUDA kernels for efficient inference.

    Meta data

    32,768 tokens
    $0.6 per million
    $0.6 per million
    Create an agent Pipe