NGC | Catalog
Welcome Guest
CatalogContainersDotscience Runner

Dotscience Runner

For pull tags and more information, please view on a desktop device.
Logo for Dotscience Runner

Description

Dotscience is a platform for collaborative and reproducible machine learning. It enables tracking of all model variables, inputs, data and code versions throughout model development and deployment.

Publisher

Dotscience

Latest Tag

latest

Modified

September 24, 2020

Compressed Size

16.34 MB

Multinode Support

No

Multi-Arch Support

No

Dotscience

What is Dotscience?

Dotscience allows teams to put their machine learning development and deployment into a robust model management framework. Within the framework, models and all accompanying files and datasets are versioned. The provenance of all files is auto-recorded throughout training and into deployment. Additionally, metadata about any runs of a model is auto-recorded. This record allows teams and organizations to reliably keep track of their work, share insights about model behavior, reducing the need to re-execute costly training runs, or reverse-engineer models.

Dotscience tracks the variables of models run in training and optimization, such as their hyperparameter combinations and corresponding accuracy metrics. This record provides a run history of your team's work, so that team members can easily compare the performance of models using different code, data versions, hyperparameter combinations and execution environments, such as hardware accelerator.

The run history can also be visualized on the Dotscience web interface to give insights into model behavior: for instance, to show the effect of a hyperparameter choice on accuracy metrics. This helps teams decide the next experiment to try. The provenance graph of each file used can also be viewed. Models can be deployed via the web interface to production environments.

Using Dotscience

To use Dotscience, you need to install the Dotscience container image on your chosen runner, and annotate your model code with the Dotscience Python library. Any additional files, including training data, required by the model can be added to the Dotscience web interface to place them under version control.

Model specification

Models can be defined and trained in Jupyter notebooks, accessed via the Dotscience web interface. They may be GPU enabled. Runners can be any cloud VM or local machine.

Running Dotscience

Requirements

  • A runner (any cloud machine) running either Ubuntu 16.04+ or CentOS 7, with Docker installed.

  • If you want to run on your runner's GPUs, then you will need to have installed nvidia container runtime. For installation instructions, see the NVIDIA documentation.

Instructions

  1. Email sales@dotscience.com to receive a temporary Dotscience account, giving you 7 days access. Log in to cloud.dotscience.net with your credentials.
  2. Navigate to the Runners page of the Dotscience web interface. Click Add new to add a new runner. Give the new runner a name and, optionally, a description. On the new runner page, extract the runner TOKEN from the code snippet shown.
  3. If you want to run a GPU-accelerated model, tick the box on the runner page marked GPU runner. #TODO
  4. SSH into your runner. Store the extracted TOKEN as an environment variable there, named TOKEN:
$ export TOKEN="<your token>"  # replace <your token> with your copied value
  1. Run the following:
docker run --name dotscience-runner -d -e TOKEN=$TOKEN \
    --restart always -v /var/run/docker.sock:/var/run/docker.sock \
    -v dotscience-task-spool:/spool \
    nvcr.io/nvidia/dotmesh/dotscience-runner:latest #TODO update image location

Note that the dotscience-runner container will boot up a couple more Docker containers on your runner.

  1. Create a Dotscience project in the Projects view. Upload data files you will need, then Launch Jupyter to open a Jupyter lab instance using your runner as the backing compute. You can use the terminal on Jupyterlab to import more data, libraries, and other files.

  2. Annotate your model code with the Dotscience Python library. See documentation and examples here: github.com/dotmesh-io/dotscience-python

  3. Visualize your model metrics and provenance graph in the Dotscience web interface. Access collaboration features, including Github-style fork and merge, and version control for massive datasets.

Documentation

https://docs.dotscience.com/

Community

Join the Dotscience community Slack here.

Licensing

Dotscience is commercial software for enterprises. Access to Dotscience is provided here as a time-limited trial. Contact sales@dotscience.com to discuss signing up for an enterprise pilot.