Using AI to Translate Data Science Content into Arabic

Local Project Giza, Egypt

Coordinated by the Lead of Egypt

Status: Completed

Project Duration: 08 Nov 2021 - 14 Dec 2021

Open Source resources available from this project

Partnership.

Egypt FWD (https://egfwd.com/)

Nmthgiat (Domain experts in translating data science content into Arabic) (https://nmthgiat.com/contact/) (https://twitter.com/nmthgiat)

Project goals.

  • Collect data about the Arabic resources currently available for explaining data science. Then choose one or a few under-represented data science topics to translate into Arabic.
  • Apply Neural Machine Translation to translate data science blogs, articles, and lecture notes in the chosen under-represented topics from English to Arabic.
  • Collect parallel corpora of text in the chosen field that has been translated from English to Arabic by an expert human translator. Use these corpora to further improve the model's performance (e.g., by fine-tuning a pre-trained model).
  • Create a website to host the translated articles.
    (Ideally, it would have Wikipedia-like features that let users improve machine-translated articles; the corrected articles could then be used as input for re-training the Neural Machine Translation model.)
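
As an illustration of the parallel-corpus goal above, the following is a minimal sketch of how expert-translated (English, Arabic) sentence pairs might be cleaned before fine-tuning. It is not the project's actual pipeline. The function names, the `max_ratio` threshold, the sample sentences, and the `{"translation": {"en": ..., "ar": ...}}` record shape are all assumptions made for the example.

```python
import json
import re
import unicodedata

# Matches any character in the basic Arabic Unicode block (U+0600 to U+06FF).
ARABIC_CHAR = re.compile(r"[\u0600-\u06FF]")


def normalize(text):
    """Apply Unicode NFC normalization and collapse runs of whitespace."""
    return " ".join(unicodedata.normalize("NFC", text).split())


def filter_pairs(pairs, max_ratio=3.0):
    """Keep plausible (English, Arabic) pairs and drop exact duplicates.

    max_ratio is a hypothetical threshold on the word-count ratio between
    the two sides; very lopsided pairs are often misaligned.
    """
    seen = set()
    kept = []
    for en, ar in pairs:
        en, ar = normalize(en), normalize(ar)
        if not en or not ARABIC_CHAR.search(ar):
            continue  # empty source, or target contains no Arabic script
        n_en, n_ar = len(en.split()), len(ar.split())
        if max(n_en, n_ar) / min(n_en, n_ar) > max_ratio:
            continue  # lengths too different to be a likely translation
        if (en, ar) in seen:
            continue  # duplicate after normalization
        seen.add((en, ar))
        kept.append((en, ar))
    return kept


def to_jsonl(pairs):
    """Serialize pairs as JSON Lines, one record per sentence pair."""
    return "\n".join(
        json.dumps({"translation": {"en": en, "ar": ar}}, ensure_ascii=False)
        for en, ar in pairs
    )


# Made-up sample data, for illustration only.
sample = [
    ("Data science is fun.", "علم البيانات ممتع."),
    ("Data  science is fun.", "علم البيانات ممتع."),  # duplicate once whitespace is cleaned
    ("A decision tree splits the data.", "A decision tree splits the data."),  # untranslated target
    ("Yes.", "نعم، هذه جملة عربية طويلة جدا مقارنة بالأصل الإنجليزي"),  # lopsided lengths
]
clean = filter_pairs(sample)
print(to_jsonl(clean))
```

Corrections submitted by website users could pass through the same filter before being added to the training data. A real pipeline would likely need stronger checks, such as language identification and sentence alignment.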

Project plan.

    Share project on: