🏆 FineWeb2-C Leaderboard

Helping build better language models by rating educational quality of texts across different languages!

Contribute to the dataset: here

Check out the current version of the dataset: here

See this blog post for more information.

Total annotations submitted: 58,125

Languages with annotations: 122

Total contributors: 452

Language Leaderboard
User Leaderboard