A curated list of resources about the Kurdish language, culture, science, and technology.

HappyHackingSpace/awesome-kurdish

Awesome Kurdish

A curated list of awesome resources for the Kurdish language, including tools, libraries, datasets, and works in the computer science field to help the academic advancement of the language.

Kurdish is an Indo-European language spoken by the Kurdish people. This repository aims to gather significant resources related to the Kurdish language in the fields of computer science, linguistics, and technology.

Language Resources

Dictionaries and Corpora

  • Glosbe - A platform for translating Kurdish (Kurmanji) into various languages.
  • Ferheng Kurdi - A platform for translating Kurdish (Kurmanji) into various languages.
  • Ferhengco - Ferheng.co is a Kurdish (Kurmanji)-Turkish dictionary.
  • Roj Dictionary - An English-to-Kurdish (Sorani) dictionary.

Datasets

Language Learning

  • 50Languages - A free website for learning Kurdish.

Natural Language Processing

Libraries and Tools

  • Character Convertor - A Kurdish language library that converts Persian, English, and Arabic characters and digits to Kurdish and vice versa.
  • KurdishHunspell - A morphological analyzer and spell checker for Kurdish in Hunspell.
  • kurdinusLibrary - Kurdînûs is a set of pure JavaScript tools for Kurdish-language text.
  • Kurdish number to words - Converts numbers (including decimals) into words in Central Kurdish. It can also express the numbers as currency amounts in words.
  • Kurdish-BLARK - This project consists of a set of basic tools developed in Python 2.7 as part of the Kurdish BLARK project and a corpus for the Kurmanji and Sorani dialects of Kurdish. The tools include a transliterator, tokenizer, stemmer, word-level translator using a bidialectal dictionary, proper names recognizer, and utilities for building and sorting dictionaries.
  • KurdishTokenization - A Tokenization System for the Kurdish Language (Sorani & Kurmanji dialects).
  • kurdish-llama - An attempt to fine-tune the Llama model released by Meta for Central Kurdish. The initial model was then further fine-tuned on the instruction set provided by Stanford's Alpaca project.
  • Kurdish Language Processing Toolkit - The Kurdish Language Processing Toolkit (KLPT) is a natural language processing (NLP) toolkit in Python for the Kurdish language. The current version comes with four core modules (preprocess, stem, transliterate, and tokenize) and addresses basic language processing tasks such as text preprocessing, stemming, tokenization, spell-checking, and morphological analysis for the Sorani and Kurmanji dialects of Kurdish.
  • kurdi - Various Kurdish-related work by Kurdish developers.
  • kurdish_news - Kurdish news sources.
  • AI2001_Category-Linguistics-SC-Kurdish - The linguistic:Kurdish category of AI2001, containing Kurdish linguistic datasets.
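Several of the tools above convert digits between scripts. The following is a minimal sketch of that idea in plain Python; it is not the API of any library listed here. The function names are hypothetical, and the sketch assumes that "Kurdish" digits means the Arabic-Indic digits U+0660 to U+0669, as commonly used in Sorani text written in the Arabic script.

```python
# Illustrative sketch only: hypothetical helpers, not taken from any listed library.
# Assumption: Sorani text in Arabic script uses Arabic-Indic digits (U+0660..U+0669).

LATIN_DIGITS = "0123456789"
# Build digit strings from code points to avoid bidirectional-text typing mistakes.
ARABIC_INDIC_DIGITS = "".join(chr(0x0660 + i) for i in range(10))
PERSIAN_DIGITS = "".join(chr(0x06F0 + i) for i in range(10))

# Map Latin and Persian digits onto the assumed Kurdish (Arabic-Indic) digits.
_TO_KURDISH = str.maketrans(LATIN_DIGITS + PERSIAN_DIGITS, ARABIC_INDIC_DIGITS * 2)
# Map Arabic-Indic and Persian digits back onto Latin digits.
_TO_LATIN = str.maketrans(ARABIC_INDIC_DIGITS + PERSIAN_DIGITS, LATIN_DIGITS * 2)


def to_kurdish_digits(text: str) -> str:
    """Replace Latin and Persian digits with Arabic-Indic digits; leave other characters alone."""
    return text.translate(_TO_KURDISH)


def to_latin_digits(text: str) -> str:
    """Replace Arabic-Indic and Persian digits with Latin digits; leave other characters alone."""
    return text.translate(_TO_LATIN)


if __name__ == "__main__":
    print(to_kurdish_digits("2024"))
    print(to_latin_digits(to_kurdish_digits("2024")))
```

A translation table keeps the conversion a single pass over the string and leaves letters and punctuation untouched. A real converter would also need to handle letters and decimal separators, which this sketch does not attempt.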

Machine Translation

(This section is intentionally left empty.)

Speech Recognition

(This section is intentionally left empty.)

Text-to-Speech

(This section is intentionally left empty.)

Academic Research

Programming Resources

Localization Projects

Communities and Organizations

(This section is intentionally left empty.)

Contributing

Contributions are welcome! Please feel free to submit a pull request or open an issue to add new resources or suggest improvements.
