DocumentCode :
3101264
Title :
Mining protein data using parallel/distributed association rules
Author :
Bahamish, Hesyam Awadh Abdallah ; Salam, Rosalina Abdul ; Abdullah, Rosni ; Osman, Mohd Azarn ; Rashid, Nur´Aini Abdul
Author_Institution :
Sch. of Comput. Sci., Universiti Sains Malaysia, Penang, Malaysia
fYear :
2004
fDate :
19-23 April 2004
Firstpage :
461
Lastpage :
462
Abstract :
Data Mining is used to extract the hidden information from large amount databases. Parallel/distributed computing is used which achieves scalability and improves the performance of compute intensive algorithms. A parallel version of ITL-Mine algorithm is proposed and implemented on distributed memory (shared nothing) architecture. The parallel ITL-Mine algorithm achieved the reduction of the execution time because of the distribution of the data over the processors where each processor worked on its data and communicates with other processors to complete its work. The proposed ITL-mine algorithm can be used in other data mining task such as sequential patterns, max-patterns and frequent closed patterns, classification and clustering.
Keywords :
data mining; medical information systems; parallel architectures; proteins; very large databases; ITL-Mine algorithm; compute intensive algorithms; distributed memory architecture; hidden information extraction; large databases; parallel-distributed association rules; protein data mining; Association rules; Clustering algorithms; Computer architecture; Concurrent computing; Data mining; Databases; Distributed computing; Memory architecture; Proteins; Scalability;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information and Communication Technologies: From Theory to Applications, 2004. Proceedings. 2004 International Conference on
Print_ISBN :
0-7803-8482-2
Type :
conf
DOI :
10.1109/ICTTA.2004.1307831
Filename :
1307831
Link To Document :
بازگشت