KenCORPUS: Kenyan Languages Corpus Lacuna Project Performance Review Workshop
Content Builder
Data cleaning
a. Digitizing handwritten text
b. Proofreading the digitized text
Translation
a. Dholuo to Kiswahili
b. Luhya (Lumarachi, Luloogoli, Lubukusu) to Kiswahili
Transcription (Kiswahili)
a. Reading out sentences
b. Listening to voice data and typing it out
Part-of-Speech (POS) Tagging
a. Dholuo text
b. Luhya (Lumarachi, Luloogoli, Lubukusu)