GPT-NeoX: Large-Scale Autoregressive Language Modeling in PyTorch

About this listen

This episode describes EleutherAI's GPT-NeoX library, an open-source framework for training large-scale autoregressive language models on GPUs, built on the Megatron and DeepSpeed libraries. It highlights the library's features, including distributed training, support for a range of hardware and systems, and recent architectural innovations. It also covers practical guidance on setup, configuration, data preparation, training, inference, and evaluation, along with details on pretrained models such as GPT-NeoX-20B and Pythia. Finally, it explains how to export models to Hugging Face and monitor experiments, and notes the library's adoption in research and industry.


Source:

https://github.com/EleutherAI/gpt-neox

