Title :
Ticker text extraction from Bangla news videos
Author :
Tiwari, Aditya ; Ghosh, Hiranmay
Author_Institution :
Innovation Labs., Tata Consultancy Services, Delhi, India
Abstract :
In this paper, a framework for recognition of Bangla ticker text1 from the Bangla news videos is presented. Tesseract OCR [1] has been used for Bangla script recognition. Tesseract OCR gives good results for text recognition in documents. But in case of images and videos, some processing is required beforehand. Approach here is to provide processed images to the Tesseract OCR to get better results than directly providing the raw video frames to the Tesseract OCR. The ticker text recognized can further be used for indexing of news videos on the basis of recognized keywords. Indexing of news videos is important for news monitoring agencies. Till now this is done manually. Automation of the monitoring process and indexing the news videos can save a lot of time as well as the efficiency of the news monitoring system.
Keywords :
feature extraction; monitoring; optical character recognition; text analysis; video signal processing; Bangla news video; Tesseract OCR; monitoring process system; text recognition; ticker text extraction; Character recognition; Image segmentation; Monitoring; Optical character recognition software; Pixel; Text recognition; Videos; Optical character recognition; Tesseract; Ticker text; Video analytics;
Conference_Titel :
India Conference (INDICON), 2010 Annual IEEE
Conference_Location :
Kolkata
Print_ISBN :
978-1-4244-9072-1
DOI :
10.1109/INDCON.2010.5712595