NGC Catalog
CLASSIC
Welcome Guest
Models
Llama3-8B Instruct Int4

Llama3-8B Instruct Int4

For downloads and more information, please view on a desktop device.
Logo for Llama3-8B Instruct Int4
Features
Description
Built with Meta Llama 3 - Meta Llama 3 family of large language models (LLMs) is a collection of pretrained and instruction tuned generative text models in 8B and 70B sizes.
Publisher
Meta
Latest Version
1.0
Modified
November 27, 2024
Size
5.42 GB

Model Overview

Description:

Built with Meta Llama 3 - The Meta Llama 3 family of large language models (LLMs), a collection of pretrained and instruction tuned generative text models in 8 and 70B sizes. The Llama 3 instruction tuned models are optimized for dialogue use cases and outperform many of the available open source chat models on common industry benchmarks. Llama 3 is an auto-regressive language model that uses an optimized transformer architecture.

Terms of use

By accessing this model, you are agreeing to the LLama 3 terms and conditions of the license, acceptable use policy, and Meta’s privacy policy

References(s):

  • Meta Llama 3 Model Card on Hugging Face
  • Meta Llama 3 blogpost

Model Architecture:

Architecture Type: Transformer
Network Architecture: Llama 3
Model version: N/A

Input:

Input Format: Text
Input Parameters: Temperature, TopP

Output:

Output Format: Text and code Output Parameters: Max output tokens

Software Integration:

Runtime(s): N/A
Supported Hardware Platform(s): RTX 4090
Supported Operating System(s): Windows

Inference:

TRT-LLM Inference Engine
Windows Setup with TRT-LLM

Test Hardware:

RTX 4090