Gemma-2B is a 2.5B parameter model from Gemma family of models from Google. It has been instruction-tuned so it can respond to prompts in a conversation manner. Nvidia has converted original Gemma weights and format into weight and format that can be consumed by Tensorrt-LLM.
By accessing this model, you are agreeing to Gemma Terms of Use, Gemma Prohibited Use Policy .
Input Format: Text
Input Parameters: None
Output Format: Text
Output Parameters: None
Supported Hardware Platform(s): RTX 4090
Supported Operating System(s): Windows
RTX 4090