Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

13B V2 model planned? #79

Open
tmostak opened this issue Jul 17, 2023 · 1 comment
Open

13B V2 model planned? #79

tmostak opened this issue Jul 17, 2023 · 1 comment

Comments

@tmostak
Copy link

tmostak commented Jul 17, 2023

Thank you for all your work on this project, it's really great to have a fully OSS Llama backbone.

I was excited to see the V2 version of the models with the original Llama tokenizer, and found that using the 7B model, performance (measured by perplexity) was indeed improved over the V1 model.

Are there plans to train a V2 version of the 13B model? If so, any idea of an ETA for that?

@imoneoi
Copy link

imoneoi commented Jul 18, 2023

Also excited to see V2 13B! Better with coding + 8192 native context length

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants