Python, VLM, Data Engineering, Multimodal AI, African Languages
Afri-Aya Vision-Language Dataset
June 2024 – August 2024
Led the creation of a large-scale vision-language dataset representing African cultural contexts across 14 languages. The project involved building automated data collection and captioning pipelines using modern multimodal models, designing annotation tools, and coordinating human review to ensure data quality.

Key Highlights
Created dataset covering 14 African languages
Built automated data collection and captioning pipelines
Designed annotation tools for human validation
Coordinated review process ensuring data quality
Released dataset for public research use
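
The flow from automated captioning to human review could be sketched as follows. This is a minimal illustration under stated assumptions, not the project's actual code: `CaptionRecord`, `ReviewStatus`, `build_records`, and `apply_review` are hypothetical names, and the `captioner` argument stands in for whichever multimodal model generates the captions.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Callable, Dict, Iterable, List


class ReviewStatus(Enum):
    PENDING = "pending"
    APPROVED = "approved"
    REJECTED = "rejected"


@dataclass
class CaptionRecord:
    # Hypothetical schema: one image, one language, one caption.
    image_uri: str
    language: str
    caption: str
    status: ReviewStatus = ReviewStatus.PENDING


def build_records(
    image_uris: Iterable[str],
    language: str,
    captioner: Callable[[str], str],
) -> List[CaptionRecord]:
    """Run the (assumed) captioning function over each image URI."""
    return [CaptionRecord(uri, language, captioner(uri)) for uri in image_uris]


def apply_review(
    records: List[CaptionRecord],
    decisions: Dict[str, bool],
) -> List[CaptionRecord]:
    """Record reviewer decisions and return only the approved records.

    Records with no decision stay PENDING and are not returned.
    """
    for record in records:
        if record.image_uri in decisions:
            approved = decisions[record.image_uri]
            record.status = (
                ReviewStatus.APPROVED if approved else ReviewStatus.REJECTED
            )
    return [r for r in records if r.status is ReviewStatus.APPROVED]
```

In this sketch only records that a reviewer has explicitly approved are returned, so unreviewed captions never reach a release by default.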
Tech Stack
Data Collection
Python, Scrapy, BeautifulSoup
ML/AI
CLIP, BLIP-2, GPT-4V
Infrastructure
AWS S3, PostgreSQL, FastAPI
Tools
Label Studio, DVC, Pandas