LlaMa 2 is a large language AI model capable of generating text and code in response to prompts.
Model Overview
Description:
Llama 2 is a large language AI model comprising a collection of models capable of generating text and code in response to prompts.
Terms of use
By accessing this model, you are agreeing to the LLama 2 terms and conditions of the license, acceptable use policy and Meta’s privacy policy
References(s):
-
Meta's Llama 2 webpage
-
Meta's Llama 2 Model Card webpage
Model Architecture:
Architecture Type: Transformer
Network Architecture: Llama 2
Model version: N/A
Input:
Input Format: Text
Input Parameters: Temperature, TopP
Other Properties Related to Output: None
Output:
Output Format: Text
Output Parameters: Max output tokens
Other Properties Related to Output: None
Software Integration:
Runtime(s): N/A
Supported Hardware Platform(s): RTX 4090
Supported Operating System(s): Windows
Training & Finetuning:
Dataset:
Llama 2 was pretrained on 2 trillion tokens of data from publicly available sources. The fine-tuning data includes publicly available instruction datasets, as well as over one million new human-annotated examples. Training Data
Inference:
TRT-LLM Inference Engine
Windows Setup with TRT-LLM
Test Hardware:
RTX 4090