NVIDIA
NVIDIA
Posebusters DiffDock Pre-processing Test Data
Resource
NVIDIA
NVIDIA
Posebusters DiffDock Pre-processing Test Data

Sample dataset prepared by NVIDIA BioNeMo team from PoseBusters benchmark set.

DiffDock Dataset

This is a dataset generated following the preparation steps for PoseBusters benchmark set used for training diffdock score and confidence model. The PoseBusters Benchmark set is a new set of 428 carefully-selected publicly-available crystal complexes from the PDB. It is a diverse set of recent high-quality protein-ligand complexes which contain drug-like molecules. It only contains complexes released since 2021. A subset of 50 complexes from this database is used to create and train/validation/test datasets for training test of diffdock as presented in Posebusters DiffDock Processed Sample Data Resource. And 2 samples are selected for use in the data preprocessing test.

How to use the dataset?

You can use BioNeMo Framework to run DiffDock Score/Confidence model training using this dataset.

License

This dataset is being re-distributed under the same license as PoseBusters benchmark set (Creative Commons Attribution 4.0 International (CC BY 4.0) License)

Publisher
NVIDIA
NVIDIA
Latest Version1.1
UpdatedJanuary 18, 2024 UTC
Compressed Size14.75 MB

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.