Support Apple Neural Engine (ANE) Transformers #13

LeiHao0 · 2023-03-25T00:57:22Z

I noticed that Apple provides ANE Transformers, a Transformer implementation optimized for the Apple Neural Engine.

In their own words:

"... M1 or newer chip to achieve up to 10 times faster and 14 times lower peak memory ..."

Does that mean running the 30B or 65B models would become possible on MacBooks with little memory?

Here are a few links:
https://github.com/apple/ml-ane-transformers
https://machinelearning.apple.com/research/neural-engine-transformers

As this project is the top LLaMA implementation that leverages the Apple GPU, would it be possible to support the ANE too?

philipturner · 2023-05-27T12:32:43Z

I'm not sure that would provide much of a speedup for current LLM architectures, which are memory-bound. It might instead be useful for Stable Diffusion (which is compute-bound) or MegaByte transformers.
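
To make the memory-bound argument concrete, here is a rough back-of-the-envelope sketch in Python. It assumes a simple roofline-style model of batch-size-1 decoding: every weight is read from memory once per generated token, and each parameter costs about 2 FLOPs per token. The function names and the hardware numbers (200 GB/s bandwidth, 10 TFLOPS compute) are illustrative placeholders chosen for this example, not measurements of any particular Mac or of the ANE.

```python
def decode_times(n_params, bytes_per_param, mem_bandwidth, peak_flops):
    """Return (memory_time, compute_time) in seconds per generated token.

    Assumptions (simplified, batch size 1):
      - all weights are streamed from memory once per token
      - roughly 2 FLOPs (one multiply, one add) per parameter per token
    """
    weight_bytes = n_params * bytes_per_param
    memory_time = weight_bytes / mem_bandwidth
    compute_time = 2 * n_params / peak_flops
    return memory_time, compute_time


def token_time(n_params, bytes_per_param, mem_bandwidth, peak_flops):
    """Roofline estimate: the slower of the two limits sets the time per token."""
    return max(decode_times(n_params, bytes_per_param, mem_bandwidth, peak_flops))


if __name__ == "__main__":
    # Hypothetical numbers: a 7B-parameter model in 16-bit weights.
    n_params = 7e9
    bytes_per_param = 2
    mem_bandwidth = 200e9   # bytes/s, placeholder
    peak_flops = 10e12      # FLOP/s, placeholder

    mem_t, cmp_t = decode_times(n_params, bytes_per_param, mem_bandwidth, peak_flops)
    print(f"memory-limited time per token:  {mem_t * 1e3:.1f} ms")
    print(f"compute-limited time per token: {cmp_t * 1e3:.1f} ms")

    # Making the compute unit 10x faster does not change the estimate
    # as long as the memory term dominates.
    before = token_time(n_params, bytes_per_param, mem_bandwidth, peak_flops)
    after = token_time(n_params, bytes_per_param, mem_bandwidth, 10 * peak_flops)
    print(f"speedup from 10x more compute: {before / after:.2f}x")
```

Under these assumptions the memory term is far larger than the compute term, so a faster compute unit alone leaves the per-token time unchanged. A compute-bound workload would show the opposite pattern, which is why the ANE looks more promising there.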
