Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

2.7B Evaluations #1

Open
sdtblck opened this issue Jan 23, 2023 · 2 comments
Open

2.7B Evaluations #1

sdtblck opened this issue Jan 23, 2023 · 2 comments

Comments

@sdtblck
Copy link

sdtblck commented Jan 23, 2023

Hi, great work!

Very excited to try out the models.

Curious if you have more detailed evaluation for the 2.7B model, as I can't find this in the H3 paper

@DanFu09
Copy link
Contributor

DanFu09 commented Jan 23, 2023

Thanks for your interest! We plan to update the arxiv with the full evaluations soon.

For now, we have the PPL of the 2.7B model against GPT-Neo-2.7B on the Pile:

Model Pile PPL
GPT-Neo-2.7B 5.7
H3 + 3 attn (2.7B) 5.4

We'll be updating with evaluations of everything else soon (after this week's ICML deadline).

@DanFu09
Copy link
Contributor

DanFu09 commented Mar 6, 2023

This is updated in the arXiv now: https://arxiv.org/abs/2212.14052

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants