Sen descrición

kkudrynski 0d15a95c8f [DLRM/PyT] Readme fixes %!s(int64=5) %!d(string=hai) anos
.github 30425620be Update issue templates %!s(int64=6) %!d(string=hai) anos
CUDA-Optimized e0fcb62e83 Add FastSpeech in CUDA-Optimized %!s(int64=5) %!d(string=hai) anos
FasterTransformer 1aa6813450 [FT] 1. Fix the bug of TensorRT plugin of FasterTransformer encoder. (#640) %!s(int64=5) %!d(string=hai) anos
Kaldi 3419c93192 Fixing config file header %!s(int64=6) %!d(string=hai) anos
MxNet e470c2150a Updating RN50/MxNet %!s(int64=6) %!d(string=hai) anos
PyTorch 0d15a95c8f [DLRM/PyT] Readme fixes %!s(int64=5) %!d(string=hai) anos
TensorFlow 7c0afee460 Merge pull request #648 from swethmandava/master %!s(int64=5) %!d(string=hai) anos
TensorFlow2 fb40734b31 Remove autobench scripts (#647) %!s(int64=5) %!d(string=hai) anos
.gitignore 0663b67c1a Updating models %!s(int64=6) %!d(string=hai) anos
.gitmodules 36ad5fe657 Update .gitmodules %!s(int64=5) %!d(string=hai) anos
README.md 63a2a2e431 Update README.md %!s(int64=5) %!d(string=hai) anos
hubconf.py ff16b6c649 removing torchhub access through master %!s(int64=6) %!d(string=hai) anos

README.md

NVIDIA Deep Learning Examples for Tensor Cores

Introduction

This repository provides State-of-the-Art Deep Learning examples that are easy to train and deploy, achieving the best reproducible accuracy and performance with NVIDIA CUDA-X software stack running on NVIDIA Volta, Turing and Ampere GPUs.

NVIDIA GPU Cloud (NGC) Container Registry

These examples, along with our NVIDIA deep learning software stack, are provided in a monthly updated Docker container on the NGC container registry (https://ngc.nvidia.com). These containers include:

  • The latest NVIDIA examples from this repository
  • The latest NVIDIA contributions shared upstream to the respective framework
  • The latest NVIDIA Deep Learning software libraries, such as cuDNN, NCCL, cuBLAS, etc. which have all been through a rigorous monthly quality assurance process to ensure that they provide the best possible performance
  • Monthly release notes for each of the NVIDIA optimized containers

Computer Vision

Models Framework A100 AMP Multi-GPU Multi-Node TRT ONNX Triton TF-TRT NB
ResNet-50 PyTorch Yes Yes Yes - - - - - -
ResNeXt101 PyTorch Yes Yes Yes - - - - - -
SEResNeXt101 PyTorch Yes Yes Yes - - - - - -
Mask R-CNN PyTorch Yes Yes Yes - - - - - Yes
SSD PyTorch Yes Yes Yes - - - - - Yes
ResNet-50 TensorFlow Yes Yes Yes - - - - - -
ResNeXt101 TensorFlow Yes Yes Yes - - - - - -
SEResNeXt101 TensorFlow Yes Yes Yes - - - - - -
Mask R-CNN TensorFlow Yes Yes Yes - - - - - -
SSD TensorFlow Yes Yes Yes - - - - - Yes
U-Net Ind TensorFlow Yes Yes Yes - Yes - - Yes Yes
U-Net Med TensorFlow Yes Yes Yes - Yes - - Yes -
U-Net 3D TensorFlow Yes Yes Yes - Yes - - Yes -
V-Net Med TensorFlow Yes Yes Yes - Yes Yes - Yes -
U-Net Med TensorFlow2 Yes Yes Yes - Yes - - Yes -
Mask R-CNN TensorFlow2 Yes Yes Yes - - - - - -
ResNet-50 MXNet - Yes Yes - - - - - -

Natural Language Processing

Models Framework A100 AMP Multi-GPU Multi-Node TRT ONNX Triton TF-TRT NB
BERT PyTorch Yes Yes Yes Yes - - Yes - -
TransformerXL PyTorch Yes Yes Yes Yes - - - - -
GNMT PyTorch Yes Yes Yes - - - - - -
Transformer PyTorch Yes Yes Yes - - - - - -
ELECTRA TensorFlow2 Yes Yes Yes - - - - - -
BERT TensorFlow Yes Yes Yes Yes Yes - Yes - Yes
BioBert TensorFlow Yes Yes Yes - - - - - Yes
TransformerXL TensorFlow Yes Yes Yes - - - - - -
GNMT TensorFlow Yes Yes Yes - - - - - -
Faster Transformer Tensorflow - - - - Yes - - - -

Recommender Systems

Models Framework A100 AMP Multi-GPU Multi-Node TRT ONNX Triton TF-TRT NB
DLRM PyTorch Yes Yes Yes - - Yes Yes - Yes
NCF PyTorch Yes Yes Yes - - - - - -
Wide&Deep TensorFlow Yes Yes Yes - - - - - -
NCF TensorFlow Yes Yes Yes - - - - - -
VAE-CF TensorFlow Yes Yes Yes - - - - - -

Speech to Text

Models Framework A100 AMP Multi-GPU Multi-Node TRT ONNX Triton TF-TRT NB
Jasper PyTorch Yes Yes Yes - Yes Yes Yes - Yes
Hidden Markov Model Kaldi - - Yes - - - Yes - -

Text to Speech

| Models | Framework | A100 | AMP | Multi-GPU | Multi-Node | TRT | ONNX | Triton | TF-TRT | NB | | ------------- | ------------- | ------------- | ------------- | ------------- | ------------- |------------- |------------- |------------- |------------- |------------- | | FastPitch | PyTorch | Yes | Yes | Yes | - | - | - | - | - | - | | Tacotron 2 and WaveGlow | PyTorch | Yes | Yes | Yes | - | Yes | Yes | Yes | - | - |

NVIDIA support

In each of the network READMEs, we indicate the level of support that will be provided. The range is from ongoing updates and improvements to a point-in-time release for thought leadership.

Feedback / Contributions

We're posting these examples on GitHub to better support the community, facilitate feedback, as well as collect and implement contributions using GitHub Issues and pull requests. We welcome all contributions!

Known issues

In each of the network READMEs, we indicate any known issues and encourage the community to provide feedback.