NGC | Catalog
CatalogModelsSIM checkpoint (TensorFlow2, prebatch4096)

SIM checkpoint (TensorFlow2, prebatch4096)

Logo for SIM checkpoint (TensorFlow2, prebatch4096)
Description
SIM TensorFlow2 checkpoint trained on Amazon Books 2014 Dataset prebatched with size of 4096
Publisher
NVIDIA Deep Learning Examples
Latest Version
22.01.0_fp32
Modified
April 4, 2023
Size
243.8 MB

Model Overview

Search-based Interest Model (SIM) is a system for predicting user behavior given sequences of previous interactions.

Model Architecture

SIM model consists of two components: General Search Unit (GSU) and the Exact Search Unit (ESU). The goal of the former is to filter down possibly long historical user behavior sequence to a shorter and relevant sequence. On the other hand, ESU utilizes the most recent user behaviors for a candidate item, for example, estimate click-through rate for a candidate ad. Both parts are trained jointly using data on past user behaviors.

A model architecture diagram is presented below.


Figure 1. The architecture of the model.

Embeddings in model architecture diagram are obtained by passing each feature from the dataset through the Embedding Layer. Item features from target item, short behavior history and long behavior history share embedding tables.


Figure 2. Embedding of input features.

Training

This model was trained using script available on NGC and in GitHub repo.

Dataset

The following datasets were used to train this model:

  • Amazon Books 2014 - Amazon Books is a category-wise subset of Amazon Product Data. It contains books reviews and metadata from Amazon, including 22.5 million reviews spanning from May 1996 to July 2014.

Performance

Performance numbers for this model are available in NGC.

References

License

This model was trained using open-source software available in Deep Learning Examples repository. For terms of use, please refer to the license of the script and the datasets the model was derived from.