CodeGemma-7B-IT-INT4-RTX

Google

Resource

Google

CodeGemma-7B-IT-INT4-RTX

CodeGemma is a collection of lightweight open code models built on top of Gemma. CodeGemma models are text-to-text decoder-only models. This is a 7 billion parameter instruction-tuned varient for code chat and instruction.

Model Overview

Description:

Terms of use:

Use of this model is governed by the NVIDIA AI Foundation Models Community License. ADDITIONAL INFORMATION: Gemma Terms of Use and Google Prohibited Use Policy.

References(s):

CodeGemma Model Card

Model Architecture:

Architecture Type: Transformer

Input:

Input Format: Text

Input Parameters: None

Output:

Output Format: Text

Output Parameters: None

Software Integration:

Supported Hardware Platform(s): RTX 4090

Supported Operating System(s): Windows

Inference:

TRT-LLM Inference Engine
Windows Setup with TRT-LLM

Test Hardware:

RTX 4090

Publisher

Google

Latest Version1.0

UpdatedApril 9, 2024 UTC

Compressed Size6.22 GB

Labels