CodeGemma is a collection of lightweight open code models built on top of Gemma. CodeGemma models are text-to-text decoder-only models. This is a 7 billion parameter instruction-tuned varient for code chat and instruction.
Use of this model is governed by the NVIDIA AI Foundation Models Community License. ADDITIONAL INFORMATION: Gemma Terms of Use and Google Prohibited Use Policy.
Architecture Type: Transformer
Input Format: Text
Input Parameters: None
Output Format: Text
Output Parameters: None
Supported Hardware Platform(s): RTX 4090
Supported Operating System(s): Windows
TRT-LLM Inference Engine
Windows Setup with TRT-LLM
RTX 4090