Multilingual Web datasets
AI & ML interests
Open Source Language Models for Europe
Recent Activity
View all activity
Organization Card
Occiglot is an ongoing open research project for multilingual language models.
If you want to train a model for your own language or are working on evaluations, please contact us or join our Discord server. We are actively seeking collaborations!
models 10
occiglot/occiglot-7b-es-en-instruct
Text Generation • 7B • Updated • 172 • 2
occiglot/occiglot-7b-eu5
Text Generation • 7B • Updated • 33 • 27
occiglot/occiglot-7b-de-en-instruct
Text Generation • 7B • Updated • 321 • • 24
occiglot/occiglot-7b-eu5-instruct
Text Generation • 7B • Updated • 48 • 10
occiglot/occiglot-7b-it-en-instruct
Text Generation • 7B • Updated • 2.45k • • 5
occiglot/occiglot-7b-fr-en-instruct
Text Generation • 7B • Updated • 39 • 3
occiglot/occiglot-7b-it-en
Text Generation • 7B • Updated • 20 • 5
occiglot/occiglot-7b-fr-en
Text Generation • 7B • Updated • 296 • 3
occiglot/occiglot-7b-de-en
Text Generation • 7B • Updated • 394 • 7
occiglot/occiglot-7b-es-en
Text Generation • 7B • Updated • 307 • 4
datasets 6
occiglot/arcX
Viewer • Updated • 26.4k • 310
occiglot/hellaswagX
Viewer • Updated • 240k • 140
occiglot/euro-llm-leaderboard-requests
Updated • 46 • 2
occiglot/occiglot-fineweb-v1.0
Updated • 1.79k • 3
occiglot/occiglot-fineweb-v0.5
Viewer • Updated • 226M • 13 • 15
occiglot/tokenizer-wiki-bench
Viewer • Updated • 84.4M • 86.9k • 6