1,300+ Nouns

100,000+ Entries

1,000+ Verbs

45,000+ Sentences

Mission

Building advanced IT solutions that are well adapted to the Moroccan context inevitably requires understanding the Moroccan dialect. Darija (the Moroccan dialect) should therefore be an active player in the field of Natural Language Processing (NLP).


However, step 0 in any serious NLP work on Darija is translating its vocabulary into English, the most widely used and best documented language in the field.


This open source project aims to be a reference for addressing this issue by providing the largest Darija-English translation dataset. We hope the Moroccan IT community will contribute, so that together we can build a foundation for any future NLP application that benefits Moroccan people.


Characteristics

Open source


The project is hosted on GitHub under the Creative Commons Attribution-NonCommercial 4.0 (CC BY-NC 4.0) license.

Check the license

Large


With more than 100K entries, DODa is arguably the largest Darija dataset, and we welcome your contributions to make it even larger.

By/For Moroccans


Join us and let's work together to build the foundation for any future NLP application that may help solve Moroccan problems.


Why contribute to DODa?

  • Strengthen your understanding of the project, and gain insights for potential applications
  • Boost your portfolio with a real-world project
  • Give back to the community

Quotes

“I think, fundamentally, open source does tend to be more stable software. It’s the right way to do things.”

- Linus Torvalds

“The paradigm shift of the ImageNet thinking is that while a lot of people are paying attention to models, let's pay attention to data. Data will redefine how we think about models.”

- Fei-Fei Li

“3awno lfari9.” (Darija for “Help the team.”)

- Random Moroccan asking for contributions

Main contributors (so far...)

Hamza
Aissam

Wanna contribute but don't know how?

We made a tutorial just for you! Click the link below for a step-by-step guide to using Git and pull requests.