NVIDIA Deep Learning Examples for Tensor Cores

Introduction

This repository provides state-of-the-art deep learning examples that are easy to train and deploy, achieving the best reproducible accuracy and performance with the NVIDIA CUDA-X software stack running on NVIDIA Volta, Turing, and Ampere GPUs.

NVIDIA GPU Cloud (NGC) Container Registry

These examples, along with our NVIDIA deep learning software stack, are provided in Docker containers, updated monthly, on the NGC container registry (https://ngc.nvidia.com). These containers include:

The latest NVIDIA examples from this repository
The latest NVIDIA contributions shared upstream to the respective framework
The latest NVIDIA Deep Learning software libraries, such as cuDNN, NCCL, and cuBLAS, all of which have been through a rigorous monthly quality assurance process to ensure that they provide the best possible performance
Monthly release notes for each of the NVIDIA optimized containers

Computer Vision

Models	Framework	A100	AMP	Multi-GPU	Multi-Node	TRT	ONNX	Triton	TF-TRT	NB
ResNet-50	PyTorch	Yes	Yes	Yes	-	-	-	-	-	-
ResNeXt101	PyTorch	Yes	Yes	Yes	-	-	-	-	-	-
SEResNeXt101	PyTorch	Yes	Yes	Yes	-	-	-	-	-	-
Mask R-CNN	PyTorch	Yes	Yes	Yes	-	-	-	-	-	Yes
SSD	PyTorch	Yes	Yes	Yes	-	-	-	-	-	Yes
ResNet-50	TensorFlow	Yes	Yes	Yes	-	-	-	-	-	-
ResNeXt101	TensorFlow	Yes	Yes	Yes	-	-	-	-	-	-
SEResNeXt101	TensorFlow	Yes	Yes	Yes	-	-	-	-	-	-
Mask R-CNN	TensorFlow	Yes	Yes	Yes	-	-	-	-	-	-
SSD	TensorFlow	Yes	Yes	Yes	-	-	-	-	-	Yes
U-Net Ind	TensorFlow	Yes	Yes	Yes	-	Yes	-	-	Yes	Yes
U-Net Med	TensorFlow	Yes	Yes	Yes	-	Yes	-	-	Yes	-
U-Net 3D	TensorFlow	Yes	Yes	Yes	-	Yes	-	-	Yes	-
V-Net Med	TensorFlow	Yes	Yes	Yes	-	Yes	Yes	-	Yes	-
U-Net Med	TensorFlow2	Yes	Yes	Yes	-	Yes	-	-	Yes	-
Mask R-CNN	TensorFlow2	Yes	Yes	Yes	-	-	-	-	-	-
ResNet-50	MXNet	-	Yes	Yes	-	-	-	-	-	-

Natural Language Processing

Models	Framework	A100	AMP	Multi-GPU	Multi-Node	TRT	ONNX	Triton	TF-TRT	NB
BERT	PyTorch	Yes	Yes	Yes	Yes	-	-	Yes	-	-
TransformerXL	PyTorch	Yes	Yes	Yes	Yes	-	-	-	-	-
GNMT	PyTorch	Yes	Yes	Yes	-	-	-	-	-	-
Transformer	PyTorch	Yes	Yes	Yes	-	-	-	-	-	-
ELECTRA	TensorFlow2	Yes	Yes	Yes	-	-	-	-	-	-
BERT	TensorFlow	Yes	Yes	Yes	Yes	Yes	-	Yes	-	Yes
BioBert	TensorFlow	Yes	Yes	Yes	-	-	-	-	-	Yes
TransformerXL	TensorFlow	Yes	Yes	Yes	-	-	-	-	-	-
GNMT	TensorFlow	Yes	Yes	Yes	-	-	-	-	-	-
Faster Transformer	TensorFlow	-	-	-	-	Yes	-	-	-	-

Recommender Systems

Models	Framework	A100	AMP	Multi-GPU	Multi-Node	TRT	ONNX	Triton	TF-TRT	NB
DLRM	PyTorch	Yes	Yes	Yes	-	-	Yes	Yes	-	Yes
NCF	PyTorch	Yes	Yes	Yes	-	-	-	-	-	-
Wide&Deep	TensorFlow	Yes	Yes	Yes	-	-	-	-	-	-
NCF	TensorFlow	Yes	Yes	Yes	-	-	-	-	-	-
VAE-CF	TensorFlow	Yes	Yes	Yes	-	-	-	-	-	-

Speech to Text

Models	Framework	A100	AMP	Multi-GPU	Multi-Node	TRT	ONNX	Triton	TF-TRT	NB
Jasper	PyTorch	Yes	Yes	Yes	-	Yes	Yes	Yes	-	Yes
Hidden Markov Model	Kaldi	-	-	Yes	-	-	-	Yes	-	-

Text to Speech

Models	Framework	A100	AMP	Multi-GPU	Multi-Node	TRT	ONNX	Triton	TF-TRT	NB
FastPitch	PyTorch	Yes	Yes	Yes	-	-	-	-	-	-
Tacotron 2 and WaveGlow	PyTorch	Yes	Yes	Yes	-	Yes	Yes	Yes	-	-
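
The support matrices above are plain tabular data, so they can be filtered programmatically when looking for models with a given capability. The following is a minimal sketch, not part of this repository: the `MATRIX` constant and the `models_with` helper are hypothetical names introduced for illustration, and the rows are a small sample copied from the tables above, assumed to be tab-separated.

```python
import csv
import io

# Hypothetical sample: a few rows copied from the support matrices above,
# written with explicit tab separators. Not an official data file.
MATRIX = """\
Models\tFramework\tA100\tAMP\tMulti-GPU\tMulti-Node\tTRT\tONNX\tTriton\tTF-TRT\tNB
ResNet-50\tPyTorch\tYes\tYes\tYes\t-\t-\t-\t-\t-\t-
BERT\tPyTorch\tYes\tYes\tYes\tYes\t-\t-\tYes\t-\t-
DLRM\tPyTorch\tYes\tYes\tYes\t-\t-\tYes\tYes\t-\tYes
Jasper\tPyTorch\tYes\tYes\tYes\t-\tYes\tYes\tYes\t-\tYes
Hidden Markov Model\tKaldi\t-\t-\tYes\t-\t-\t-\tYes\t-\t-
"""


def models_with(feature, matrix=MATRIX):
    """Return (model, framework) pairs whose `feature` column reads 'Yes'.

    `feature` must be one of the column headers, e.g. "Triton" or "ONNX".
    """
    reader = csv.DictReader(io.StringIO(matrix), delimiter="\t")
    return [
        (row["Models"], row["Framework"])
        for row in reader
        if row[feature] == "Yes"
    ]


if __name__ == "__main__":
    # List the sampled models that are marked as supporting Triton.
    for model, framework in models_with("Triton"):
        print(f"{model} ({framework})")
```

Calling `models_with("ONNX")` on this sample returns the DLRM and Jasper entries, matching the rows marked `Yes` in the ONNX column.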

NVIDIA support

In each of the network READMEs, we indicate the level of support that will be provided. Support ranges from ongoing updates and improvements to a point-in-time release for thought leadership.

Feedback / Contributions

We're posting these examples on GitHub to better support the community, facilitate feedback, and collect and implement contributions through GitHub Issues and pull requests. We welcome all contributions!

Known issues

In each of the network READMEs, we indicate any known issues and encourage the community to provide feedback.
