Title :
Novel signal representation of descriptors dependency in information retrieval based on wavelet spectral analysis
Author :
El Hassani, Ibtissam ; Masrour, Tawfik
Author_Institution :
Doctoral Studies Center, Moulay Ismail Univ., Meknes, Morocco
Abstract :
The representation of texts in current methods of information extraction, and TextMining in general, does not always reflect the dependencies between descriptors. In the vector representation, for example, related descriptors are often treated as either totally independent or totally similar. This type of approach can be regarded as a coarse-resolution view of the document. In this paper, we propose a new method of information retrieval based on a signal representation of documents and spectral processing at different levels of resolution. It is a new way to exploit the power and properties of the multiresolution analysis provided by the wavelet transform. To illustrate the potential of this approach, we applied it to an Arabic corpus. In this context, our approach demonstrates the ability to achieve higher accuracy than the standard vector representation.
Keywords :
data mining; information retrieval; natural language processing; signal representation; text analysis; wavelet transforms; Arabic corpus; TextMining; information extraction; information retrieval; multiresolution analysis; signal representation; text representation; vector representation; wavelet spectral analysis; wavelet transform; Context; Information retrieval; Signal representation; Signal resolution; Transforms; Vectors; Wavelet analysis; Arabic TextMining; Information Retrieval; Signal Analysis; Wavelet Transform;
Conference_Title :
Computer Systems and Applications (AICCSA), 2013 ACS International Conference on
Conference_Location :
Ifrane
DOI :
10.1109/AICCSA.2013.6616445