NeMo-Megatron-GPT-20B

Disclaimer

This is beta software containing preliminary data which is incomplete and may be inaccurate. If you experience errors with the tool or discover inaccurate information, please use the “report” feature in the model details to report inaccuracies or for site issues please contact us.
Download JSONDownload YAML

Class III - Open Model

Class III - Open Model Class III - Open Model Not met Not met

Missing components

  • Model architecture
  • Model parameters (Final)
  • Model card
  • Data card
  • Technical report
  • Evaluation results

Invalid components

  • Model parameters (Final)
  • Model card
  • Technical report
  • Evaluation results

Class II - Open Tooling

Class II - Open Tooling Class II - Open Tooling Not met Not met

Included components

  • Training code
  • Inference code
  • Evaluation code

Missing components

  • Model architecture
  • Model parameters (Final)
  • Evaluation data
  • Model card
  • Data card
  • Technical report
  • Evaluation results

Invalid components

  • Training code
  • Inference code
  • Evaluation code
  • Model parameters (Final)
  • Evaluation data
  • Model card
  • Technical report
  • Evaluation results

Class I - Open Science

Class I - Open Science Class I - Open Science Not met Not met

Included components

  • Data preprocessing code
  • Training code
  • Inference code
  • Evaluation code

Missing components

  • Model architecture
  • Model parameters (Final)
  • Model parameters (Intermediate)
  • Datasets
  • Evaluation data
  • Model card
  • Data card
  • Technical report
  • Research paper
  • Evaluation results

Invalid components

  • Data preprocessing code
  • Training code
  • Inference code
  • Evaluation code
  • Model parameters (Final)
  • Datasets
  • Evaluation data
  • Model card
  • Technical report
  • Research paper
  • Evaluation results
Version/Parameters
20B
Organization
NVIDIA
Status
Approved
Architecture
Decoder-only
Base model
NeMo-Megatron-GPT-20B
Last updated