The Megatron-Turing Natural Language Generation model (MT-NLG) from Microsoft and NVIDIA has 530 billion parameters, roughly three times as many as its closest competitor at release. MT-NLG, the successor to Turing NLG 17B and Megatron-LM, performs strongly on completion prediction, reading comprehension, commonsense reasoning, and word sense disambiguation. The model was trained with mixed precision on the Selene supercomputer, which is built on the NVIDIA DGX SuperPOD architecture and contains 560 DGX A100 servers. HDR InfiniBand connects the servers, while NVLink and NVSwitch connect the GPUs within each server.

Target users:

  1. Researchers
  2. NLP Developers
  3. Data Scientists
  4. Tech Companies
  5. Linguists
  6. Educational Institutions
  7. Content Creators
  8. AI Consultants

