Aletheia-ng/pidgin-corpus-synth
Viewer
•
Updated
•
6.86k
•
48
Aletheia-ng/nigerian-pidgin-corpus-synth
Aletheia-ng/pretrain_data10
Viewer
•
Updated
•
40.9M
•
44
Aletheia-ng/low_resource_languages_pretrain_data4
Viewer
•
Updated
•
469M
•
914
Aletheia-ng/pretrain_data11
Aletheia-ng/pretrain_data9
Viewer
•
Updated
•
79.1M
•
63
Aletheia-ng/pretrain_data5
Viewer
•
Updated
•
9.43M
•
190
Aletheia-ng/pretrain_data4
Viewer
•
Updated
•
124M
•
263
Aletheia-ng/pretrain_data7
Viewer
•
Updated
•
13M
•
49
Aletheia-ng/pretrain_data3
Viewer
•
Updated
•
143M
•
568
Viewer
•
Updated
•
136
•
53
Aletheia-ng/pretrain_data
Viewer
•
Updated
•
109M
•
360
Aletheia-ng/pretrain_data2
Viewer
•
Updated
•
18.2M
•
180
Aletheia-ng/low_resource_languages_pretrain
Viewer
•
Updated
•
202M
•
1.65k
•
1
Aletheia-ng/masakhaner_eval
Aletheia-ng/noisy_dataset
Viewer
•
Updated
•
84k
•
77
Viewer
•
Updated
•
84k
•
71
Aletheia-ng/personal_finance_v0.2
Viewer
•
Updated
•
56.6k
•
30
•
1
Aletheia-ng/bloomberg-news-articles-pretraining-dataset
Viewer
•
Updated
•
437k
•
60
•
5
Aletheia-ng/ChatML-aya_dataset
Viewer
•
Updated
•
202k
•
17
Aletheia-ng/yo_wiki_processed
Viewer
•
Updated
•
43.5k
•
16
Viewer
•
Updated
•
270k
•
24
Viewer
•
Updated
•
4.4k
•
14
Viewer
•
Updated
•
43.5k
•
16
Viewer
•
Updated
•
288
•
31
Viewer
•
Updated
•
1.01k
•
116
Viewer
•
Updated
•
3.67k
•
212