Title :
Active disks for large-scale data processing
Author :
Riedel, Erik ; Faloutsos, Christos ; Gibson, Garth A. ; Nagle, David
Author_Institution :
Hewlett-Packard Co., Palo Alto, CA, USA
fDate :
6/1/2001 12:00:00 AM
Abstract :
As processor performance increases and memory cost decreases, system intelligence continues to move away from the CPU and into peripherals. Storage system designers use this trend toward excess computing power to perform more complex processing and optimizations inside storage devices. To date, such optimizations take place at relatively low levels of the storage protocol. Trends in storage density, mechanics, and electronics eliminate the hardware bottleneck and put pressure on interconnects and hosts to move data more efficiently. We propose using an active disk storage device that combines on-drive processing and memory with software downloadability to allow disks to execute application-level functions directly at the device. Moving portions of an application´s processing to a storage device significantly reduces data traffic and leverages the parallelism already present in large systems, dramatically reducing the execution time for many basic data mining tasks
Keywords :
data mining; disc drives; disc storage; parallel memories; random-access storage; storage management; active disk storage device; application-level function execution; data mining; large-scale data processing; on-drive memory; on-drive processing; parallelism; software downloadability; Costs; Data mining; Data processing; Design optimization; Hardware; Intelligent systems; Large-scale systems; Parallel processing; Power system interconnection; Protocols;