DocumentCode :
2345892
Title :
Generating Subtitles Automatically Using Audio Extraction and Speech Recognition
Author :
Mathur, Abhinav ; Saxena, Tanya ; Krishnamurthi, Rajalakshmi
Author_Institution :
Dept. of Comput. Sci., Jaypee Inst. of Inf. Technol., Noida, India
fYear :
2015
fDate :
13-14 Feb. 2015
Firstpage :
621
Lastpage :
626
Abstract :
In the present scenario, video plays a vital role in helping people understand and comprehend information, for example songs, movies, video lectures, or any other multimedia data relevant to the user. Hence, it becomes important to make videos accessible to people with auditory problems and, even more, to help viewers bridge the gap between the video's language and their native language. This can best be done through subtitles for the video. However, downloading subtitles for a video from the internet is a monotonous process. Consequently, generating subtitles automatically through the software itself, without the use of the internet, is a valid subject of research. This paper addresses the issue through three distinct modules. Audio Extraction converts an input file of any format supported by MPEG standards to .wav format; here a 24% reduction in the size of the song was achieved after extraction. Speech Recognition is then performed on the extracted .wav file. Finally, Subtitle Generation produces a .txt/.srt file that is synchronized with the input file.
Keywords :
audio signal processing; feature extraction; handicapped aids; speech recognition; video signal processing; .srt file; .txt file; MPEG standards; audio extraction; auditory problems; native language; subtitle automatic generation; video; .wav format; Acoustics; Bit rate; Filter banks; Hidden Markov models; Psychoacoustic models; Subtitle Generation; Subtitles
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Computational Intelligence & Communication Technology (CICT), 2015 IEEE International Conference on
Conference_Location :
Ghaziabad
Print_ISBN :
978-1-4799-6022-4
Type :
conf
DOI :
10.1109/CICT.2015.46
Filename :
7078779