NGC | Catalog
CatalogModelsSIM checkpoint (TensorFlow2, prebatch4096)

SIM checkpoint (TensorFlow2, prebatch4096)

For downloads and more information, please view on a desktop device.
Logo for SIM checkpoint (TensorFlow2, prebatch4096)

Description

SIM TensorFlow2 checkpoint trained on Amazon Books 2014 Dataset prebatched with size of 4096

Publisher

NVIDIA Deep Learning Examples

Use Case

Recommender

Framework

TensorFlow2

Latest Version

22.01.0_fp32

Modified

December 6, 2022

Size

243.8 MB

Model Overview

Search-based Interest Model (SIM) is a system for predicting user behavior given sequences of previous interactions.

Model Architecture

SIM model consists of two components: General Search Unit (GSU) and the Exact Search Unit (ESU). The goal of the former is to filter down possibly long historical user behavior sequence to a shorter and relevant sequence. On the other hand, ESU utilizes the most recent user behaviors for a candidate item, for example, estimate click-through rate for a candidate ad. Both parts are trained jointly using data on past user behaviors.

A model architecture diagram is presented below.


Figure 1. The architecture of the model.

Embeddings in model architecture diagram are obtained by passing each feature from the dataset through the Embedding Layer. Item features from target item, short behavior history and long behavior history share embedding tables.


Figure 2. Embedding of input features.

Training

This model was trained using script available on NGC and in GitHub repo.

Dataset

The following datasets were used to train this model:

  • Amazon Books 2014 - Amazon Books is a category-wise subset of Amazon Product Data. It contains books reviews and metadata from Amazon, including 22.5 million reviews spanning from May 1996 to July 2014.

Performance

Performance numbers for this model are available in NGC.

References

License

This model was trained using open-source software available in Deep Learning Examples repository. For terms of use, please refer to the license of the script and the datasets the model was derived from.