Character segmentation is an important step in an optical character recognition (OCR) system. The segmentation of broken characters is one of the key factors affecting recognition performance, so OCR systems need to segment broken characters more effectively and accurately. This paper explains the challenges of broken character segmentation for printed Gujarati documents.
The features and limitations of various broken character segmentation methods are discussed, and the different types of broken characters in the Gujarati language are explained. Several challenges specific to Gujarati broken character segmentation are described, which should be useful when implementing a broken character segmentation method.