Author_Institution :
Electron. & Inf. Eng. Dept., Changsha Normal Univ., Changsha, China
Abstract :
With the rapid development of cloud computing, massive amounts of information are distributed across "cloud storage" structures. This produces heterogeneous data sources, and a "lost phenomenon" may occur during information retrieval. To solve this problem, this paper builds a heterogeneous data integration model for the cloud computing environment. The model comprises three layers and five function modules, including a data acquisition and analysis interface in the cloud, a data organization task scheduling engine, and an ontology-based heterogeneous data integration interface. The paper also presents common technologies used for unified retrieval of heterogeneous data resources, such as parallel retrieval technology, server cluster retrieval technology, webpage deep mining technology, and retrieval database sharing technology.
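As an illustration of the parallel retrieval idea named in the abstract, the following is a minimal sketch and not the paper's implementation. It assumes three hypothetical in-memory sources (`relational`, `documents`, `web`) whose records store their text under different field names, and a hypothetical `TEXT_FIELD` map that stands in for the unifying integration layer. Each source is queried in its own thread and the hits are merged into one result list.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical stand-ins for heterogeneous cloud data sources (assumption:
# real sources would be databases, document stores, or crawled pages).
SOURCES = {
    "relational": [{"title": "Cloud storage basics"}, {"title": "Server clusters"}],
    "documents": [{"name": "Heterogeneous data in the cloud"}],
    "web": [{"heading": "Deep web mining"}, {"heading": "Cloud search engines"}],
}

# Assumed mapping from each source to the field that holds its searchable text.
TEXT_FIELD = {"relational": "title", "documents": "name", "web": "heading"}


def search_source(source, query):
    """Case-insensitive substring search within one source, in a unified shape."""
    field = TEXT_FIELD[source]
    needle = query.lower()
    return [
        {"source": source, "text": record[field]}
        for record in SOURCES[source]
        if needle in record[field].lower()
    ]


def parallel_search(query):
    """Query every source concurrently and merge the hits, ordered by source name."""
    names = sorted(SOURCES)
    with ThreadPoolExecutor(max_workers=len(names)) as pool:
        per_source = pool.map(lambda name: search_source(name, query), names)
        return [hit for hits in per_source for hit in hits]


if __name__ == "__main__":
    for hit in parallel_search("cloud"):
        print(hit["source"], "|", hit["text"])
```

Threads are used here because retrieval from remote sources is typically I/O-bound. A server cluster deployment would distribute the same per-source calls across machines instead of threads.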
Keywords :
cloud computing; data acquisition; data analysis; data integration; information retrieval; search engines; storage management; Web page; cloud computing environment; cloud storage; data organization task scheduling engine; heterogeneous data integrated interface; heterogeneous data sources; massive information; parallel retrieval technology; retrieval database sharing technology; search engine; server cluster retrieval technology; Computational modeling; Distributed databases; Servers; Heterogeneous Data;