H2O Driverless AI is an artificial intelligence (AI) platform that automates some of the most difficult data science and machine learning workflows such as feature engineering, model validation, model tuning, model selection and model deployment. It aims to achieve highest predictive accuracy, comparable to expert data scientists, but in much shorter time thanks to end-to-end automation. Driverless AI also offers automatic visualizations and machine learning interpretability (MLI). Especially in regulated industries, model transparency and explanation are just as important as predictive performance.
Driverless AI enables users of all backgrounds to draw the most value from their data. It was also specifically designed to take advantage of graphical processing units (GPUs), including multi-GPU workstations and servers such as the NVIDIA DGX-1 for order-of-magnitude faster training.
High-level capabilities:
Client/server application for rapid experimentation and deployment of state-of-the-art supervised machine learning models
Automatically creates machine learning modeling pipelines for highest predictive accuracy
Automatically creates stand-alone scoring pipeline for in-process scoring or client/server scoring via http or tcp protocols.
Python API or GUI
Multi-GPU and multi-CPU support for powerful workstations and NVidia DGX supercomputers
Machine Learning model interpretation module with global and local model interpretation Automatic Visualization module
Problem types supported:
Regression (continuous target variable, for age, income, house price, loss prediction)
Binary classification (0/1 or “N”/”Y”, for fraud prediction, churn prediction, failure prediction, etc.)
Multinomial classification (0/1/2/3 or “A”/”B”/”C”/”D” for categorical target variables, for prediction of membership type, next-action, product recommendation, etc.)
Data types supported:
Tabular structured data, rows are observations, columns are fields/features/variables i.i.d. (identically and independently distributed) data
Numeric, categorical and textual fields
Missing values are allowed
Time-series data with a single time-series (time flows across the entire dataset, not per block of data)
Data types NOT supported:
Image/video/audio
Grouped time-series (e.g., sales per item per store per hour, all in one file)
File formats supported:
Plain text formats of columnar data (.csv, .tsv, .txt)
Compressed archives (.zip, .gz)
Driverless AI is a commercial product. Please contact sales@h2o.ai for more information about an enterprise license.
To obtain a trial license key, please visit https://www.h2o.ai/try-driverless-ai/ and signup to get your 21-day trial key by email.
Notes: These installation steps assume that you have a license key for Driverless AI. Either use a trial key from the instructions above, or, for information on how to purchase a commercial license key for Driverless AI, contact sales@h2o.ai. Driverless AI is only supported using NVIDIA Docker. NVIDIA Docker is pre-installed on NVIDIA DGX systems and can be manually installed on other systems. Open a Terminal window, and ssh to the machine that will run Driverless AI. Once you are logged in, perform the following steps.
Step 1. Pull the Driverless AI docker image: In the Tags section, locate the container image release that you want to run. In the Actions column, hover over the white space to display the Docker command icon. Click the icon to display the docker pull command. Copy the docker pull command and click Close. Open a command prompt and paste the pull command. The pulling of the container image begins. Ensure the pull completes successfully before proceeding to the next step.
Step 2. Set up the data and log directories on the host machine:
Set up the data and log directories on the host machine
mkdir data
mkdir log
mkdir tmp
mkdir license
Step 3. At this point, you can copy data into the data directory on the host machine. The data will be visible inside the docker container.
Step 4. Start the Driverless AI docker image:
#Start the Driverless AI docker image
nvidia-docker run \
--rm \
-u `id -u`:`id -g` \
-p 12345:12345 \
-p 54321:54321 \
-p 8888:8888 \
-v `pwd`/data:/data \
-v `pwd`/log:/log \
-v `pwd`/license:/license \
-v `pwd`/tmp:/tmp \
nvcr.io/nvidia_partners/h2oai-driverless:latest
Step 5. (Optional) To view the logs, open another Terminal window and ssh to the host machine running Driverless AI. Once you are logged in, run the following:
tail -f log/YYYYMMDD-HHMMSS/h2oai.log
Step 6. Connect to Driverless AI with the Chrome browser:
http://Your-Driverless-AI-Host-Machine:12345
# Documentation and Suggested Webinars
[User Documentation](https://www.h2o.ai/wp-content/uploads/2017/09/driverlessai/index.html)
[Webinars](https://webinar.com/channel/4a90aa11b48f4a5d8823ec924e7bd8cf)
# DriverlessAI Support
You can email us at <support@h2o.ai>, or you can try posting your question to the [community](https://stackoverflow.com/questions/tagged/driverless-ai).