
Deploy error for Llama-3.2-vision-11B: "Sharded is not supported for AutoModel" #2571

Open

xuan1905 opened this issue Sep 26, 2024 · 4 comments

@xuan1905

System Info

Hi Team,
When deploying the model on AWS with huggingface-pytorch-tgi-inference:2.3.0-tgi2.2.0, I get the error above ("Sharded is not supported for AutoModel").
Could you tell me when TGI will provide a new image? Is there any way to work around the issue in the meantime?

Information

  • Docker
  • The CLI directly

Tasks

  • An officially supported command
  • My own modifications

Reproduction

Run the image huggingface-pytorch-tgi-inference:2.3.0-tgi2.2.0 on SageMaker and deploy Llama 3.2 Vision 11B.
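
For concreteness, a minimal sketch of the failing deployment via the SageMaker Python SDK. The role ARN, ECR region, model ID, and instance type here are illustrative assumptions, not taken from the original report:

```python
# Sketch of the failing deployment (role ARN, region, model ID, and
# instance type are placeholders / assumptions).
from sagemaker.huggingface import HuggingFaceModel

model = HuggingFaceModel(
    role="arn:aws:iam::<account-id>:role/<sagemaker-execution-role>",  # placeholder
    # DLC bundling TGI 2.2.0 -- the version that fails on mllama models
    image_uri=(
        "763104351884.dkr.ecr.us-east-1.amazonaws.com/"
        "huggingface-pytorch-tgi-inference:2.3.0-tgi2.2.0"
    ),
    env={
        "HF_MODEL_ID": "meta-llama/Llama-3.2-11B-Vision-Instruct",  # assumed checkpoint
        "SM_NUM_GPUS": "4",        # >1 GPU makes TGI shard, which triggers the error
        "HF_TOKEN": "<hf-token>",  # the model is gated on the Hub
    },
)

# Container startup fails with: "Sharded is not supported for AutoModel"
predictor = model.deploy(
    initial_instance_count=1,
    instance_type="ml.g5.12xlarge",  # placeholder multi-GPU instance
)
```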

Expected behavior

TGI deploys the Llama 3.2 model successfully.

@dossjjx commented Sep 27, 2024

Same issue here with the 90B model. Number of shards: 4.

@xuan1905 (author) commented Oct 5, 2024

Is there any update?

@renambot commented Oct 7, 2024

TGI v2.3.1 now works with Llama 3.2 Vision (mllama models).
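
If it helps, a sketch of pointing the SageMaker SDK at a newer Hugging Face LLM DLC once one ships with TGI ≥ 2.3.1. The version string "2.3.1" is an assumption about what AWS will publish; check the available versions in your region first:

```python
# Sketch: resolve a TGI >= 2.3.1 image once AWS publishes the DLC
# (the "2.3.1" version string is an assumption -- verify availability).
from sagemaker.huggingface import HuggingFaceModel, get_huggingface_llm_image_uri

image_uri = get_huggingface_llm_image_uri("huggingface", version="2.3.1")

model = HuggingFaceModel(
    role="arn:aws:iam::<account-id>:role/<sagemaker-execution-role>",  # placeholder
    image_uri=image_uri,
    env={
        "HF_MODEL_ID": "meta-llama/Llama-3.2-11B-Vision-Instruct",  # assumed checkpoint
        "SM_NUM_GPUS": "4",
        "HF_TOKEN": "<hf-token>",
    },
)
predictor = model.deploy(
    initial_instance_count=1,
    instance_type="ml.g5.12xlarge",  # placeholder multi-GPU instance
)
```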

@xuan1905 (author) commented Oct 8, 2024

Great, thanks. Is it available in the AWS Deep Learning Container images?
