Title :
Midas for government: Integration of government spending data on Hadoop
Author :
Sala, Antonio ; Lin, Calvin ; Ho, Howard
Author_Institution :
Univ. of Modena & Reggio Emilia, Modena, Italy
Abstract :
We describe our experience in developing a Hadoop based integration flow to collect and integrate publicly available government datasets related to government spending. The objective is to enable users, U.S. taxpayers in this case, to easily access the data their government discloses on the web in different websites. We also provide users with easy-to-use tools to query and explore this data to gather information from the integrated data that allows for evaluation of how tax money is spent.
Keywords :
Web sites; government data processing; Hadoop based integration flow; Midas; Web sites; World Wide Web; data query; government datasets; government spending data; tax money; Aggregates; Contracts; Data mining; Database languages; File systems; Information retrieval; Libraries; US Government; User interfaces; Web pages;
Conference_Titel :
Data Engineering Workshops (ICDEW), 2010 IEEE 26th International Conference on
Conference_Location :
Long Beach, CA
Print_ISBN :
978-1-4244-6522-4
Electronic_ISBN :
978-1-4244-6521-7
DOI :
10.1109/ICDEW.2010.5452758