Viewer
• Updated
• 174M • 21
Viewer
• Updated
• 200M • 20
Aletheia-ng/pidgin-corpus-synth
Viewer
• Updated
• 57.1k • 30
Aletheia-ng/yoruba-corpus-synth
Viewer
• Updated
• 20.2k • 24
Aletheia-ng/nigerian-pidgin-corpus-synth
Aletheia-ng/pretrain_data10
Viewer
• Updated
• 40.9M • 27
Aletheia-ng/low_resource_languages_pretrain_data4
Viewer
• Updated
• 469M • 114
Aletheia-ng/pretrain_data11
Aletheia-ng/pretrain_data9
Viewer
• Updated
• 79.1M • 40
Aletheia-ng/pretrain_data5
Viewer
• Updated
• 9.43M • 37
Aletheia-ng/pretrain_data4
Viewer
• Updated
• 124M • 109
Aletheia-ng/pretrain_data7
Viewer
• Updated
• 13M • 8
Aletheia-ng/pretrain_data3
Viewer
• Updated
• 143M • 75
Viewer
• Updated
• 136 • 9
Aletheia-ng/pretrain_data
Viewer
• Updated
• 109M • 43
Aletheia-ng/pretrain_data2
Viewer
• Updated
• 18.2M • 24
Aletheia-ng/low_resource_languages_pretrain
Viewer
• Updated
• 202M • 228
• 1
Aletheia-ng/masakhaner_eval
Aletheia-ng/noisy_dataset
Viewer
• Updated
• 84k • 8
Viewer
• Updated
• 84k • 6
Aletheia-ng/personal_finance_v0.2
Viewer
• Updated
• 56.6k • 8
• 1
Aletheia-ng/bloomberg-news-articles-pretraining-dataset
Viewer
• Updated
• 437k • 14
• 5
Aletheia-ng/ChatML-aya_dataset
Viewer
• Updated
• 202k • 9
Aletheia-ng/yo_wiki_processed
Viewer
• Updated
• 43.5k • 6
Viewer
• Updated
• 270k • 7
Viewer
• Updated
• 4.4k • 7
Viewer
• Updated
• 43.5k • 5
Viewer
• Updated
• 288 • 5