## Model Overview

Mask R-CNN is a convolution-based neural network for object instance segmentation. This implementation provides 1.3x faster training while maintaining target accuracy.

## Model Architecture

 
Mask R-CNN builds on Faster R-CNN by adding a mask head for instance segmentation.

The architecture consists of the following components:
- ResNet-50 (R-50) backbone with Feature Pyramid Network (FPN)
- Region Proposal Network (RPN) head
- RoI Align
- Bounding box and classification head
- Mask head
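
To make the RoI Align step in the list above more concrete, here is a minimal pure-Python sketch for a single-channel feature map. It is an illustration under simplifying assumptions, not the code used by this implementation: the function names `bilinear` and `roi_align`, the `out_size` and `samples` parameters, and the convention that pixel values sit at integer coordinates are all choices made for this example. The idea it shows is that each output bin averages bilinearly interpolated samples taken at real-valued positions, so the RoI coordinates are never rounded to the pixel grid.

```python
import math


def bilinear(feature, y, x):
    """Bilinearly interpolate a 2D feature map (list of rows) at real-valued (y, x)."""
    h, w = len(feature), len(feature[0])
    # Clamp the sample point to the valid extent of the map.
    y = min(max(y, 0.0), h - 1.0)
    x = min(max(x, 0.0), w - 1.0)
    y0, x0 = int(math.floor(y)), int(math.floor(x))
    y1, x1 = min(y0 + 1, h - 1), min(x0 + 1, w - 1)
    dy, dx = y - y0, x - x0
    top = feature[y0][x0] * (1 - dx) + feature[y0][x1] * dx
    bottom = feature[y1][x0] * (1 - dx) + feature[y1][x1] * dx
    return top * (1 - dy) + bottom * dy


def roi_align(feature, roi, out_size=2, samples=2):
    """Pool roi = (x1, y1, x2, y2), given in feature-map coordinates,
    into an out_size x out_size grid without rounding the coordinates."""
    x1, y1, x2, y2 = roi
    bin_w = (x2 - x1) / out_size
    bin_h = (y2 - y1) / out_size
    pooled = []
    for i in range(out_size):
        row = []
        for j in range(out_size):
            total = 0.0
            # Average a regular samples x samples grid of points inside the bin.
            for sy in range(samples):
                for sx in range(samples):
                    y = y1 + (i + (sy + 0.5) / samples) * bin_h
                    x = x1 + (j + (sx + 0.5) / samples) * bin_w
                    total += bilinear(feature, y, x)
            row.append(total / (samples * samples))
        pooled.append(row)
    return pooled


# Toy 8x8 feature map whose value equals the column index.
ramp = [[float(x) for x in range(8)] for _ in range(8)]
pooled = roi_align(ramp, (1.0, 1.0, 5.0, 5.0))
print(pooled)  # [[2.0, 4.0], [2.0, 4.0]]
```

On this ramp each pooled value equals the horizontal centre of its bin, because bilinear interpolation reproduces a linear function exactly. In the full model the same pooling would be applied per channel to FPN feature maps before the box and mask heads.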
 

    
## Training

This model was trained using the script available on [NGC](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/dle/resources/maskrcnn_pyt) and in the [GitHub repo](https://github.com/NVIDIA/DeepLearningExamples/tree/master/PyTorch/Segmentation/MaskRCNN).

## Dataset

The following datasets were used to train this model:
- [COCO 2017](https://cocodataset.org/#download) - Dataset for large-scale object detection, segmentation and captioning.
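
COCO instance annotations are distributed as JSON with `images`, `categories`, and `annotations` lists, where each annotation carries a `bbox` in `[x, y, width, height]` form and a `segmentation`. The sketch below shows one way to group instances by image and convert boxes to corner format. The sample dictionary is hand-written for illustration and does not contain real dataset entries, and the helper names `xywh_to_xyxy` and `instances_by_image` are hypothetical, not part of the training script.

```python
from collections import defaultdict

# Hand-written sample in the COCO instances layout (illustrative only).
coco = {
    "images": [{"id": 1, "file_name": "example.jpg", "width": 640, "height": 480}],
    "categories": [{"id": 1, "name": "person"}, {"id": 18, "name": "dog"}],
    "annotations": [
        {"id": 10, "image_id": 1, "category_id": 1, "iscrowd": 0, "area": 16000.0,
         "bbox": [100.0, 50.0, 80.0, 200.0],
         "segmentation": [[100, 50, 180, 50, 180, 250, 100, 250]]},
        {"id": 11, "image_id": 1, "category_id": 18, "iscrowd": 0, "area": 6000.0,
         "bbox": [300.0, 200.0, 100.0, 60.0],
         "segmentation": [[300, 200, 400, 200, 400, 260, 300, 260]]},
    ],
}


def xywh_to_xyxy(bbox):
    """Convert a COCO [x, y, width, height] box to [x1, y1, x2, y2]."""
    x, y, w, h = bbox
    return [x, y, x + w, y + h]


def instances_by_image(dataset):
    """Group non-crowd annotations by image id, with category names resolved."""
    names = {c["id"]: c["name"] for c in dataset["categories"]}
    grouped = defaultdict(list)
    for ann in dataset["annotations"]:
        if ann["iscrowd"]:
            continue  # this sketch skips crowd regions for simplicity
        grouped[ann["image_id"]].append({
            "label": names[ann["category_id"]],
            "box": xywh_to_xyxy(ann["bbox"]),
            "polygons": ann["segmentation"],
        })
    return dict(grouped)


instances = instances_by_image(coco)
for inst in instances[1]:
    print(inst["label"], inst["box"])
```

Each grouped entry holds the box that a bounding box head would regress and the polygons from which a mask target could be rasterized.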


## Performance

Performance numbers for this model are available in [NGC](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/dle/resources/maskrcnn_pyt/performance).

## References
- [Original paper](https://arxiv.org/abs/1703.06870)
- [NVIDIA model implementation in NGC](https://catalog.ngc.nvidia.com/orgs/nvidia/teams/dle/resources/maskrcnn_pyt)
- [NVIDIA model implementation on GitHub](https://github.com/NVIDIA/DeepLearningExamples/tree/master/PyTorch/Segmentation/MaskRCNN)

## License

This model was trained using open-source software available in the [Deep Learning Examples](https://github.com/NVIDIA/DeepLearningExamples/tree/master/PyTorch/Segmentation/MaskRCNN) repository.
For terms of use, please refer to the license of the script and the datasets the model was derived from.

maskrcnn__pyt_ckpt

MaskRCNN PyTorch checkpoint trained with AMP
