AWS Neuron is the SDK used to run deep learning workloads on AWS Inferentia and AWS Trainium based instances. It supports customers in their end-to-end ML development lifecycle to build new models, train and optimize these models, and then deploy them for production. To learn about the model architectures currently supported on Inf1, Inf2, Trn1 and Trn1n instances, please see :ref:`model_samples_tutorials`. To learn about upcoming capabilities, please view the :ref:`neuron_roadmap`.
AWS Neuron includes a deep learning compiler, runtime, and tools that are natively integrated into TensorFlow, PyTorch and Apache MXNet. The EC2 Trn1/Trn1n instances are optimized for the highest performance and best price-performance training in AWS. The EC2 Inf2 instances are designed for high-performance deep learning inference applications. With Neuron, customers can quickly start using Inf/Trn instances through services like Amazon Sagemaker, Amazon Elastic Container Service (ECS), Amazon Elastic Kubernetes Service (EKS), AWS Batch, and AWS Parallel Cluster.
Check :ref:`announcements-main` and check :ref:`latest-neuron-release` for latest release.
.. grid:: 2 .. card:: Get Started with Neuron :link: neuron-quickstart :link-type: ref .. card:: Neuron Quick Links :link: docs-quick-links :link-type: ref
.. toctree:: :maxdepth: 1 :caption: Overview Quick Links </general/quick-start/docs-quicklinks> Get Started with Neuron </general/quick-start/index> Samples and Tutorials </general/models/index> Performance </general/benchmarks/index> What’s New </release-notes/index> Announcements </general/announcements/index>
.. toctree:: :maxdepth: 1 :caption: ML Frameworks PyTorch Neuron </frameworks/torch/index> JAX Neuron </frameworks/jax/index> TensorFlow Neuron </frameworks/tensorflow/index> Apache MXNet </frameworks/mxnet-neuron/index>
.. toctree:: :maxdepth: 1 :caption: NeuronX Distributed (NxD) NxD Training (Beta) </libraries/nxd-training/index> NxD Core </libraries/neuronx-distributed/index>
.. toctree:: :maxdepth: 1 :caption: Additional Libraries Transformers Neuron </libraries/transformers-neuronx/index> AWS Neuron Reference for NeMo Megatron </libraries/nemo-megatron/index>
.. toctree:: :maxdepth: 1 :caption: Developer Flows Neuron DLAMI </dlami/index> Neuron Containers </containers/index> Workload Orchestration </general/devflows/index> Setup Guide </general/setup/index>
.. toctree:: :maxdepth: 1 :caption: Runtime & Tools Neuron Runtime </neuron-runtime/index> Neuron Tools </tools/index> Neuron Calculator </general/calculator/neuron-calculator>
.. toctree:: :maxdepth: 1 :caption: Compiler Neuron Compiler </compiler/index> Neuron Kernel Interface (Beta) <general/nki/index> Neuron C++ Custom Operators </neuron-customops/index>
.. toctree:: :maxdepth: 1 :caption: Learning Neuron Architecture </general/arch/index> Features </general/arch/neuron-features/index> Application Notes </general/appnotes/index> FAQ </general/faq> Troubleshooting </general/troubleshooting>
.. toctree:: :maxdepth: 1 :caption: About Neuron /release-notes/release Roadmap </general/roadmap-readme> Support </general/support>