Multilingual Web datasets
AI & ML interests
Open Source Language Models for Europe
Organization Card
Occiglot is an ongoing open research project for multilingual language models.
If you want to train a model for your own language or are working on evaluations, please contact us or join our Discord server. We are actively seeking collaborations!
Models (10)
occiglot/occiglot-7b-es-en-instruct • Text Generation • 7B • 56 • 2
occiglot/occiglot-7b-eu5 • Text Generation • 7B • 87 • 27
occiglot/occiglot-7b-de-en-instruct • Text Generation • 7B • 518 • 24
occiglot/occiglot-7b-eu5-instruct • Text Generation • 7B • 123 • 10
occiglot/occiglot-7b-it-en-instruct • Text Generation • 7B • 1.46k • 5
occiglot/occiglot-7b-fr-en-instruct • Text Generation • 7B • 29 • 3
occiglot/occiglot-7b-it-en • Text Generation • 7B • 27 • 5
occiglot/occiglot-7b-fr-en • Text Generation • 7B • 52 • 3
occiglot/occiglot-7b-de-en • Text Generation • 7B • 46 • 7
occiglot/occiglot-7b-es-en • Text Generation • 7B • 25 • 4
Datasets (6)
occiglot/arcX • Viewer • 26.4k • 306
occiglot/hellaswagX • Viewer • 240k • 415
occiglot/euro-llm-leaderboard-requests • 1.97k • 2
occiglot/occiglot-fineweb-v1.0 • 34.6k • 3
occiglot/occiglot-fineweb-v0.5 • Viewer • 226M • 418 • 15
occiglot/tokenizer-wiki-bench • Viewer • 84.4M • 21.3k • 6