NGC | Catalog
CatalogModelsLlaMa2-7B Chat Int4

LlaMa2-7B Chat Int4

Logo for LlaMa2-7B Chat Int4
Description
LlaMa 2 is a large language AI model capable of generating text and code in response to prompts.
Publisher
Meta
Latest Version
1.1
Modified
January 4, 2024
Size
11.63 GB

Model Overview

Description:

Llama 2 is a large language AI model comprising a collection of models capable of generating text and code in response to prompts.

Terms of use

By accessing this model, you are agreeing to the LLama 2 terms and conditions of the license, acceptable use policy and Meta’s privacy policy

References(s):

Model Architecture:

Architecture Type: Transformer
Network Architecture: Llama 2
Model version: N/A

Input:

Input Format: Text
Input Parameters: Temperature, TopP
Other Properties Related to Output: None

Output:

Output Format: Text
Output Parameters: Max output tokens
Other Properties Related to Output: None

Software Integration:

Runtime(s): N/A
Supported Hardware Platform(s): RTX 4090, Supported Operating System(s): Windows

Training & Finetuning:

Dataset:

Llama 2 was pretrained on 2 trillion tokens of data from publicly available sources. The fine-tuning data includes publicly available instruction datasets, as well as over one million new human-annotated examples. Training Data

Inference:

TRT-LLM Inference Engine
Windows Setup with TRT-LLM

Test Hardware:

RTX 4090