NGC | Catalog
CatalogModelsChatGLM3-6B Chat Int4

ChatGLM3-6B Chat Int4

Logo for ChatGLM3-6B Chat Int4
Description
ChatGLM3-6B is the latest open-source model in the ChatGLM series. ChatGLM3-6B introduces the following features (1) More Powerful Base Model (2) More Comprehensive Function Support (3) More Comprehensive Open-source Series.
Publisher
Knowledge Engineering Group (KEG) & Data Mining at Tsinghua University
Latest Version
1.0
Modified
April 11, 2024
Size
3.12 GB

Model Overview

Description:

ChatGLM3-6B is the latest open-source model in the ChatGLM series. While retaining many excellent features such as smooth dialogue and low deployment threshold from the previous two generations, ChatGLM3-6B introduces the following features. (1) More Powerful Base Model (2) More Comprehensive Function Support (3) More Comprehensive Open-source Series.

ChatGLM3-6B is released under the Apache 2.0 license.

Terms of use:

By accessing this model, you are agreeing to the ChatGLM3-6B Terms and Conditions of the Model License.

References(s):

Model Architecture:

Architecture Type: Transformer

Input:

Input Format: Text

Input Parameters: None

Output:

Output Format: Text

Output Parameters: None

Software Integration:

Supported Hardware Platform(s): RTX 4090, Ada GPUs

Supported Operating System(s): Windows

Inference:

TRT-LLM Inference Engine
Windows Setup with TRT-LLM

Test Hardware:

RTX 4090