NGC | Catalog
CatalogModelsRIVA Text Classification

RIVA Text Classification

Logo for RIVA Text Classification
Description
Domain classification of the query for weather chat bot.
Publisher
NVIDIA
Latest Version
deployable_v1.0
Modified
April 4, 2023
Size
420.37 MB

TextClassification Model Card

Model Overview

Text classification model is useful for text classification problems such as sentiment analysis or domain detection for dialogue systems. Provided model here is trained to classify the given query into 1 of 4 domains described below to use it as an initial step in the interactive weather chat bot, which was presented in GTC 2020 keynote.

Intended Use

Text Classification Model can be used for domain classification as the first step in the dialogue systems, to route query according to the appropriate domain. This classification is a task specific according to the domains and examples provided in the training data. Usually in practical settings you need to take this model (pretrained Bert model) and train it on you own dataset.

Model Architecture

Our text classification model uses a pretrained BERT model (or other BERT-like models) followed by a classification layer on the output of the first token ([CLS]).

Training Data

We used a proprietary data set that was collected via Mechanical Turk to describe large variety of queries that fall in one of the next 4 domains:

  • weather (all weather related queries that triggered call to weather API)
  • meteorology (questions about meteorology topic that went for IR+QA route)
  • personality (questions about personality that went to chit chat route)
  • nomatch (all other queries that does not fall to any of other categories)

Evaluation

Training dataset included 2150 example of queries divided for 4 domains described above. We got around 95% domain classification accuracy for this data.

How to Use This Model

These model checkpoints are intended to be used with Nvidia Riva.

Note: The model is encrypted and will only operate with the model load key tlt-encode.

References

License

By downloading and using the models and resources packaged with Nvidia Riva, you would be accepting the terms of the Riva license

Suggested reading

  • Deploy your model for Production using Riva. Learn more about Riva framework

Ethical AI

NVIDIA’s platforms and application frameworks enable developers to build a wide array of AI applications. Consider potential algorithmic bias when choosing or creating the models being deployed. Work with the model’s developer to ensure that it meets the requirements for the relevant industry and use case; that the necessary instruction and documentation are provided to understand error rates, confidence intervals, and results; and that the model is being used under the conditions and in the manner intended.