Title : 
Multi-string search in BSP
         
        
            Author : 
Ferragina, Paolo ; Luccio, Fabrizio
         
        
            Author_Institution : 
Dipt. di Inf., Pisa Univ., Italy
         
        
            Abstract : 
We have studied the worst-case complexity of the multi-string search problem in the bulk synchronous parallel (BSP) model (Valiant, 1990). For this purpose, we have devised a very simple way to distribute the blind trie data structure among the p processors so that the communication cost is balanced. Given the very efficient algorithms and data structures known for external memory, and those designed for the BSP model in this paper, a challenging next task is to investigate the multi-string search problem in the parallel disk model (Vitter and Shriver, 1994), which combines I/O, computation, and communication complexities. In this setting, it would also be interesting to study the dynamic version of the multi-string search problem, in which the set of indexed texts can be changed by inserting or deleting individual texts (Ferragina and Grossi, 1995). Another interesting research direction is to investigate the multi-string search problem on variants of the BSP model that were previously introduced to encourage the use of spatial locality. In our setting, pieces of strings must be moved among the processors to perform the lexicographic comparisons, so spatial locality may help exploit the bandwidth of the routers.
         
        
            Keywords : 
computational complexity; multiprocessing systems; parallel algorithms; tree data structures; tree searching; BSP; I/O; bandwidth; blind trie data structure; bulk synchronous parallel model; communication complexities; communication cost; data structures; dynamic version; indexed texts; lexicographic comparisons; multi-string search; parallel disk model; routers; spatial locality; worst-case complexity; Algorithm design and analysis; Costs; Data processing; Data structures; Databases; Design methodology; Parallel processing; Phase change random access memory; Search problems; Software libraries;
         
        
            Conference_Title : 
Compression and Complexity of Sequences 1997. Proceedings
         
        
            Conference_Location : 
Salerno
         
        
            Print_ISBN : 
0-8186-8132-2
         
        
        
            DOI : 
10.1109/SEQUEN.1997.666919