This directory contains examples for deploying the FlanT5 family of models on SageMaker Inference. The contents are organized as follows:
- FlanT5-XXL Real-Time Inference: Large Model Inference (LMI) Deployment
- To-Do:
  - TGI
  - Asynchronous Inference
  - Inferentia2