NGC Catalog
CLASSIC
Welcome Guest
Models
Mistral-7B Chat Int4

Mistral-7B Chat Int4

For downloads and more information, please view on a desktop device.
Logo for Mistral-7B Chat Int4
Description
The Mistral-7B-Instruct-v0.1 Large Language Model (LLM) is a instruct fine-tuned version of the Mistral-7B-v0.1 generative text model using a variety of publicly available conversation datasets.
Publisher
Mistral AI
Latest Version
1.2
Modified
March 6, 2025
Size
3.93 GB

Model Overview

Description:

The Mistral-7B-Instruct-v0.1 Large Language Model (LLM) is a instruct fine-tuned version of the Mistral-7B-v0.1 generative text model using a variety of publicly available conversation datasets. Mistral-7B is released under the Apache 2.0 license

This instruction model is a transformer model with the following architecture choices:

  • Grouped-Query Attention
  • Sliding-Window Attention
  • Byte-fallback BPE tokenizer

Terms of use:

By accessing this model, you are agreeing to the Mistral 7B Terms and Conditions of the License, Terms of Service.

References(s):

  • Mistral 7B Instruct Model Card on Hugging Face
  • Mistral 7B paper
  • Mistral 7B blogpost

Model Architecture:

Architecture Type: Transformer

Network Architecture: Mistral-7B

Input:

Input Format: Text

Input Parameters: None

Output:

Output Format: Text

Output Parameters: None

Software Integration:

Supported Hardware Platform(s): RTX 4090, Ada GPUs

Supported Operating System(s): Windows

Inference:

TRT-LLM Inference Engine
Windows Setup with TRT-LLM

Test Hardware:

RTX 4090