DocumentCode
238081
Title
Automatic text summarization with statistical and linguistic features using successive thresholds
Author
PadmaLahari, E. ; Siva Kumar, D.V.N. ; Prasad, Santasriya
Author_Institution
C.S.E Dept., Vignan Univ., Guntur, India
fYear
2014
fDate
8-10 May 2014
Firstpage
1519
Lastpage
1524
Abstract
Text summarization is an emerging technique for producing a summary of a text document. It has many uses: because the amount of information on the Internet keeps growing, it is difficult for users to go through everything available on the web, and summarization techniques are needed to reduce the time users spend reading it. In this paper, we propose an automatic text summarization technique that combines linguistic and statistical features with successive thresholds to find the summary, i.e., the important sentences of the given input text document. Sentences are selected for the summary based on their weight, which is calculated from the statistical and linguistic features. Our approach assigns scores to the sentences by weighting features such as term frequency, word occurrences, noun weight, and phrases. The number of sentences in the summary equals the number of paragraphs in the text document, which is achieved by our successive threshold approach.
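The abstract does not give the feature weights or the threshold schedule, so the following Python sketch is only an illustration of the general idea, not the authors' method. It assumes a single feature (average term frequency), a naive punctuation-based sentence splitter, paragraphs separated by blank lines, and a threshold that starts at the top score and is lowered in fixed steps until at least as many sentences pass as there are paragraphs. The function names and the `step` parameter are hypothetical.

```python
import re
from collections import Counter


def split_sentences(text):
    # Naive splitter (assumption): break after '.', '!' or '?' followed by whitespace.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def score_sentences(sentences):
    # Term-frequency feature only (assumption): the average document-level
    # frequency of the words in each sentence.
    tokenised = [re.findall(r"[a-z]+", s.lower()) for s in sentences]
    tf = Counter(w for words in tokenised for w in words)
    return [
        sum(tf[w] for w in words) / len(words) if words else 0.0
        for words in tokenised
    ]


def summarize(text, step=0.05):
    # Target summary length = number of paragraphs (blank-line separated).
    paragraphs = [p for p in re.split(r"\n\s*\n", text) if p.strip()]
    target = len(paragraphs)

    sentences = split_sentences(" ".join(text.split()))
    scores = score_sentences(sentences)
    if not scores:
        return []

    # Successive thresholds (assumed schedule): start at the top score and
    # lower the cut-off by a fixed fraction until enough sentences pass.
    top = max(scores)
    threshold = top
    selected = [i for i, sc in enumerate(scores) if sc >= threshold]
    while len(selected) < target and threshold > 0:
        threshold = max(0.0, threshold - step * top)
        selected = [i for i, sc in enumerate(scores) if sc >= threshold]

    # One step may admit several sentences at once: keep the `target`
    # best-scoring ones, then restore document order.
    best = sorted(selected, key=lambda i: -scores[i])[:target]
    return [sentences[i] for i in sorted(best)]
```

Calling `summarize(text)` on a two-paragraph input returns two sentences in their original order. The linguistic features named in the abstract (noun weight, phrases) would need a part-of-speech tagger and are left out of this sketch.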
Keywords
Internet; natural language processing; statistical analysis; text analysis; Internet; World Wide Web; automatic text summarization; linguistic feature; noun weight; phrases; statistical feature; successive threshold approach; term frequency; word occurrences; Computers; Conferences; Feature extraction; Manuals; Media; Pragmatics; Time-frequency analysis; Linguistic Features; Statistical Features; Successive Threshold; Summarization;
fLanguage
English
Publisher
IEEE
Conference_Titel
Advanced Communication Control and Computing Technologies (ICACCCT), 2014 International Conference on
Conference_Location
Ramanathapuram
Print_ISBN
978-1-4799-3913-8
Type
conf
DOI
10.1109/ICACCCT.2014.7019360
Filename
7019360