OCR (Optical Character Recognition)

Developing systems to digitize and recognize Kurdish text from printed documents, handwritten text, and digital images

January 15, 2023
Kurdish Language Research
Completed
1 min read

Optical Character Recognition (OCR) for Kurdish language represents a significant advancement in digitizing Kurdish textual content. This research focuses on developing robust systems capable of accurately recognizing and converting Kurdish text from various sources including printed documents, handwritten materials, and digital images.

Research Objectives

The primary goal of this research is to create an efficient OCR system specifically tailored for the Kurdish language, addressing the unique challenges posed by Kurdish script variations and linguistic characteristics.

Key Features

  • Support for multiple Kurdish dialects and script variations
  • High accuracy recognition of both printed and handwritten text
  • Integration with modern deep learning architectures
  • Real-time processing capabilities

Impact and Applications

This OCR system has significant implications for preserving Kurdish cultural heritage, enabling the digitization of historical documents, and facilitating modern digital workflows for Kurdish language content.

Research Team

Isra Mahdi

MSc • Salahaddin University-Erbil

Research Focus: Handwritten OCR (Optical Character Recognition) for Kurdish Language

Completed

Publications

This research presents a comprehensive study on Kurdish handwritten character recognition using deep learning approac...

Kurdish Handwritten character recognition using deep learning techniques

Rebin M. Ahmed, Tarik A. Rashid, Polla Fattah, Abeer Alsadoon, Nebojsa Bacanin, Seyedali Mirjalili, S. Vimal, Amit Chhabra

December 2022

Handwriting recognition is regarded as a dynamic and inspiring topic in the exploration of pattern recognition and im...

An Extensive Dataset of Handwritten Central Kurdish Isolated Characters

Rebin Ahmed, Tarik Rashid, Polla Fatah, Abeer Alsadoon, Seyedali Mirjaliligh

December 2021

To collect the handwritten format of separate Kurdish characters, each character has been printed on a grid of 14 × 9...