DocumentCode
3628495
Title
Building a search engine model with morphological normalization support
Author
Jure Mijic;Bojana Dalbelo Basic;Jan Snajder
Author_Institution
Faculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, 10000, Croatia
fYear
2008
Firstpage
619
Lastpage
624
Abstract
Searching a collection of documents can seem like an easy task, but manipulating textual data can be difficult because the data are mostly unstructured. We undertook the task of building an effective search engine for a collection of Croatian legislative documents. The developed search engine model supports multiple modules for information retrieval. To improve the effectiveness of the retrieval, we used a morphological normalization module that uses an inflectional lexicon automatically acquired from a document corpus. As we do not have a gold standard for our legislative document collection, we evaluated our search engine on three English test collections, explored the effects of stemming, and compared the results to the vector space model.
Keywords
"Search engines","Indexes","Databases","Information retrieval","Buildings","Law","Indexing"
Publisher
ieee
Conference_Titel
Information Technology Interfaces, 2008. ITI 2008. 30th International Conference on
ISSN
1330-1012
Print_ISBN
978-953-7138-12-7
Type
conf
DOI
10.1109/ITI.2008.4588481
Filename
4588481
Link To Document