
Does this work with Llama 3? #10

Closed
thomasgauthier opened this issue Apr 19, 2024 · 3 comments
Comments

@thomasgauthier

Question in the title.

Big thanks for making this accessible to the community!

@ChenxinAn-fdu
Contributor

ChenxinAn-fdu commented Apr 20, 2024

Thank you for bringing up this issue! We are actively testing DCA with Llama 3 8B/70B; the results will be posted next week!

@ChenxinAn-fdu
Contributor

Hi! The language modeling results for Llama 3 8B/70B have been updated, and DCA is still very effective: ChunkLlama3 shows a clear perplexity (PPL) improvement over ChunkLlama2.

Needle-in-a-haystack, few-shot, and zero-shot results are expected within two days; those evaluations are running a bit slowly.
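
For anyone who wants to try this while the remaining benchmarks finish, here is a minimal sketch of applying the training-free DCA patch to Llama 3. The module name `chunkllama_attn_replace`, the function `replace_with_chunkllama`, and the `pretraining_length` argument are assumptions based on this repo's README pattern, so please verify them against the current code before running:

```python
# Minimal sketch: patching Llama 3 8B with Dual Chunk Attention (DCA).
# ASSUMPTION: the module/function names below follow this repo's README
# (chunkllama_attn_replace.replace_with_chunkllama); verify before use.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

from chunkllama_attn_replace import replace_with_chunkllama  # assumed repo module

# Patch the attention implementation BEFORE loading the model.
# Llama 3 was pretrained with an 8K context, hence pretraining_length=8192.
replace_with_chunkllama(pretraining_length=8192)

model_id = "meta-llama/Meta-Llama-3-8B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

# The patched model can now attend beyond its 8K pretraining window.
prompt = "Summarize the following document:\n" + "..."  # long input goes here
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(output[0][inputs.input_ids.shape[1]:], skip_special_tokens=True))
```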

@ChenxinAn-fdu
Contributor

All results for ChunkLlama3 8B/70B have been updated (see the results).

Generally, ChunkLlama3-8B achieves 100% retrieval accuracy across all document depths.
Results on real-world tasks show that ChunkLlama3-70B achieves performance on par with GPT-4 (2023/06/13) and Llama2 Long 70B!
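
A quick way to sanity-check the retrieval claim on your own hardware is a single needle-in-a-haystack probe: bury one fact at a chosen depth in filler text and ask the patched model to retrieve it. This is a simplified sketch, not the paper's evaluation harness; `model` and `tokenizer` are assumed to be the DCA-patched objects from the snippet above, and the needle/filler strings are made up for illustration:

```python
# Simplified needle-in-a-haystack probe (not the official evaluation harness).
# ASSUMPTION: `model` and `tokenizer` are the DCA-patched objects from above.
def niah_probe(model, tokenizer, depth: float = 0.5, total_tokens: int = 16000) -> str:
    needle = "The secret passcode is 7421."
    filler = "Grass is green and the sky is blue. "  # arbitrary haystack text
    # Build a haystack roughly `total_tokens` tokens long.
    repeats = total_tokens // len(tokenizer(filler).input_ids)
    haystack = filler * repeats
    # Insert the needle at the requested relative depth (0.0 = start, 1.0 = end).
    cut = int(len(haystack) * depth)
    document = haystack[:cut] + " " + needle + " " + haystack[cut:]
    prompt = (
        f"{document}\n\n"
        "Based only on the document above, what is the secret passcode?"
    )
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    out = model.generate(**inputs, max_new_tokens=20, do_sample=False)
    return tokenizer.decode(out[0][inputs.input_ids.shape[1]:], skip_special_tokens=True)

# If retrieval holds, the answer should contain "7421" at every depth.
print(niah_probe(model, tokenizer, depth=0.25))
```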
