Skip to content

GitHub Org's stars Twitter Follow Hugging Face

ARBML is a group of researchers working on democratizing Arabic NLP research and deveopment:

  • 🙋‍♀️ All about Arabic NLP and ML, open source for the win!
  • 🏵️ Contribution guidelines - open an issue and given the go-ahead submit a PR.
  • 👩‍💻 Some repos have specific contribution guidlines.
  • 📝 Remember to cite if you use one of our resources.

Pinned

  1. ARBML ARBML Public

    Implementation of many Arabic NLP and CV projects. Providing real time experience using many interfaces like web, command line and notebooks.

    JavaScript 375 45

  2. klaam klaam Public

    Arabic speech recognition, classification and text-to-speech.

    Jupyter Notebook 314 65

  3. masader masader Public

    The largest public catalogue for Arabic NLP and speech datasets. There are +500 datasets annotated with more than 25 attributes.

    JavaScript 135 21

  4. Calliar Calliar Public

    A dataset for online Arabic calligraphy. A collection of 2500 annotated calligraphic styles.

    Jupyter Notebook 138 14

  5. tkseem tkseem Public

    Arabic Tokenization Library. It provides many tokenization algorithms.

    Jupyter Notebook 78 17

  6. CIDAR CIDAR Public

    Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.

    Jupyter Notebook 26 3

Repositories

Showing 10 of 30 repositories

Top languages

Loading…

Most used topics

Loading…