PythonVLMData EngineeringMultimodal AIAfrican Languages

Afri-Aya Vision-Language Dataset

June 2024 – August 2024

Led the creation of a large-scale vision-language dataset representing African cultural contexts across 14 languages. The project involved building automated data collection and captioning pipelines using modern multimodal models, designing annotation tools, and coordinating human review to ensure data quality.

Afri-Aya Vision-Language Dataset

Key Highlights

Created dataset covering 14 African languages

Built automated data collection and captioning pipelines

Designed annotation tools for human validation

Coordinated review process ensuring data quality

Released dataset for public research use

Tech Stack

Data Collection

PythonScrapyBeautifulSoup

ML/AI

CLIPBLIP-2GPT-4V

Infrastructure

AWS S3PostgreSQLFastAPI

Tools

Label StudioDVCPandas

Interested in this project?

Let's discuss how we can collaborate or learn more about the implementation details.