NGC | Catalog

Llama 2

Description
Llama 2 is a large language AI model capable of generating text and code in response to prompts.
Curator
Meta
Modified
November 28, 2023
Containers
Sorry, your browser does not support inline SVG.
Helm Charts
Sorry, your browser does not support inline SVG.
Models
Sorry, your browser does not support inline SVG.
Resources
Sorry, your browser does not support inline SVG.

Model Overview

Description:

Llama 2 is a large language AI model comprising a collection of models capable of generating text and code in response to prompts.

Terms of use

By accessing this model, you are agreeing to the LLama 2 terms and conditions of the license, acceptable use policy and Meta’s privacy policy

References(s):

Model Architecture:

Architecture Type: Transformer
Network Architecture: Llama 2
Model version: N/A

Input:

Input Format: Text
Input Parameters: Temperature, TopP
Other Properties Related to Output: None

Output:

Output Format: Text
Output Parameters: Max output tokens
Other Properties Related to Output: None

Software Integration:

Runtime(s): N/A
Supported Hardware Platform(s): Hopper, Ampere/Turing
Supported Operating System(s): Linux

Training & Finetuning:

Dataset:

Link:

Properties (Quantity, Dataset Descriptions, Sensor(s)):

  • Falcon RefinedWeb is a massive English web dataset containing 500-650GT depending on the used tokenizer
  • The Stack dataset is a collection of source code in over 300 programming languages
  • Wikipedia is 20.54GB of online encyclopedia written and maintained by a community of volunteers
  • The Pile: ArXiv is 56.21GB of ArXiv papers contents
  • The Pile: Books3 is 100.96GB dataset of books derived from a copy of the contents of the Bibliotik private tracker made available by Shawn Presser
  • StackExchange is an anonymized dump of all user-contributed content on the Stack Exchange network where each site is formatted as a separate archive consisting of zipped XML files

Dataset License: Free for research and commercial use.

Inference:

Engine: Triton
Test Hardware: Other