13B V2 model planned? #79

tmostak · 2023-07-17T15:53:05Z

Thank you for all your work on this project, it's really great to have a fully OSS Llama backbone.

I was excited to see the V2 version of the models with the original Llama tokenizer, and found that using the 7B model, performance (measured by perplexity) was indeed improved over the V1 model.

Are there plans to train a V2 version of the 13B model? If so, any idea of an ETA for that?

imoneoi · 2023-07-18T03:34:01Z

Also excited to see V2 13B! Better with coding + 8192 native context length

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

13B V2 model planned? #79

13B V2 model planned? #79

tmostak commented Jul 17, 2023 •

edited

Loading

imoneoi commented Jul 18, 2023

13B V2 model planned? #79

13B V2 model planned? #79

Comments

tmostak commented Jul 17, 2023 • edited Loading

imoneoi commented Jul 18, 2023

tmostak commented Jul 17, 2023 •

edited

Loading