Title :
A video based interface to textual information for the visually impaired
Author :
Zandifar, Ali ; Duraiswami, Ramani ; Chahine, Antoine ; Davis, Larry S.
Author_Institution :
Perceptual Interfaces & Reality Lab., Maryland Univ., College Park, MD, USA
Abstract :
We describe the development of an interface to textual information for the visually impaired that uses video, image processing, optical-character-recognition (OCR) and text-to-speech (TTS). The video provides a sequence of low resolution images in which text must be detected, rectified and converted into high resolution rectangular blocks that are capable of being analyzed via off-the-shelf OCR. To achieve this, various problems related to feature detection, mosaicing, auto-focus, zoom, and systems integration were solved in the development of the system.
Keywords :
feature extraction; handicapped aids; image segmentation; image sequences; optical character recognition; speech synthesis; user interfaces; video signal processing; OCR; auto-focus; feature detection; high resolution rectangular blocks; image processing; low resolution image sequence; mosaicing; systems integration; text-to-speech; textual information; video based interface; visually impaired; zoom; Books; Digital cameras; Educational institutions; Image recognition; Image resolution; Laboratories; Layout; Optical character recognition software; Speech synthesis; Text recognition;
Conference_Titel :
Multimodal Interfaces, 2002. Proceedings. Fourth IEEE International Conference on
Print_ISBN :
0-7695-1834-6
DOI :
10.1109/ICMI.2002.1167016