Enhancing Script Identification in Dravidian Languages using Ensemble of Deep and Texture Features

Authors

  • Satishkumar Mallappa Author
  • Chandrashekar Gudada Department of Mathematical and Computational Sciences, Sri Sathya Sai University for Human Excellence Kalaburagi, Karnataka, India, Email: chandrashekar.vg@sssuhe.ac.in , chandrugudada@gmail.com , ORCID: https://orcid.org/0000-0003-2469-8274 Author https://orcid.org/0000-0003-2469-8274
  • P. Megana Santhoshi Department of Computer Science Engineering, KLM College of Engineering for Women, Kadapa-Pulivendula Road, YSR Kadapa Dt, Andhra Pradesh, 516003, India, Email: pmsanthoshi@gmail.com , ORCID: https://orcid.org/0000-0002-1376-7601 Author https://orcid.org/0000-0002-1376-7601
  • Raghavendra College of Horticulture (UHSB), Munirabad, Koppal, Karnataka, India, Email: Raghavendra.yadav@uhsbagalkot.edu.in, yadav156@gmail.com Author

DOI:

https://doi.org/10.71426/jcdt.v1.i1.pp50-58

Keywords:

Script languages, Data processing, Machine Learning, Convolutional Neural Network (CNN), Histogram of Oriented Gradients (HOG), Support vector machine (SVM), Local binary patterns (LBP)

Abstract

Dravidian languages, including Tamil, Telugu, Kannada, and Malayalam, have complex orthographic structures, making script identification challenging particularly for camera-based document images. This study proposes a hybrid approach that combines deep learning and texture-based methods for robust script recognition. The GoogLeNet convolutional neural network (CNN) model is used to extract deep features, while local binary patterns (LBP) and histogram of oriented gradients (HOG) capture texture characteristics. These features are fused and classified using support vector machine (SVM) classifier. Results show that CNN features alone achieve 84.50% accuracy, LBP achieves 85.90%, and HOG achieves 76.10%, while their fusion significantly improves accuracy to 92.10%. The combination of CNN and HOG features reaches 95.00% accuracy, demonstrating the effectiveness of integrating deep learning with texture-based approaches. This method has applications in OCR systems and assistive technologies for the visually impaired.

Downloads

Published

2025-06-30

How to Cite

Satishkumar Mallappa, Chandrashekar Gudada, P. Megana Santhoshi, & Raghavendra. (2025). Enhancing Script Identification in Dravidian Languages using Ensemble of Deep and Texture Features. Journal of Computing and Data Technology, 1(1), 50-58. https://doi.org/10.71426/jcdt.v1.i1.pp50-58