DocumentCode
1478353
Title
A Hybrid Algorithm of Backward Hashing and Automaton Tracking for Virus Scanning
Author
Lin, Po-Ching ; Lin, Ying-Dar ; Lai, Yuan-Cheng
Author_Institution
Dept. of Comput. Sci. & Inf. Eng., Nat. Chung Cheng Univ., Chiayi, Taiwan
Volume
60
Issue
4
fYear
2011
fDate
4/1/2011 12:00:00 AM
Firstpage
594
Lastpage
601
Abstract
Virus scanning involves computationally intensive string matching against a large number of signatures of different characteristics. Matching a variety of signatures challenges the selection of matching algorithms, as each approach has better performance than others for different signature characteristics. We propose a hybrid approach that partitions the signatures into long and short ones in the open-source ClamAV for virus scanning. An algorithm enhanced from the Wu-Manber algorithm, namely the Backward Hashing algorithm, is responsible for only long patterns to lengthen the average skip distance, while the Aho-Corasick algorithm scans for only short patterns to reduce the automaton sizes. The former utilizes the bad-block heuristic to exploit long shift distance and reduce the verification frequency, so it is much faster than the original WM implementation in ClamAV. The latter increases the AC performance by around 50 percent due to better cache locality. We also rank the factors to indicate their importance for the string matching performance.
Keywords
automata theory; computer viruses; cryptography; digital signatures; public domain software; string matching; Aho-Corasick algorithm; Wu-Manber algorithm; automata tracking; backward hashing algorithm; characteristic signature; open source ClamAV; string matching; virus scanning; String matching; automaton; filtering; virus scanning.;
fLanguage
English
Journal_Title
Computers, IEEE Transactions on
Publisher
ieee
ISSN
0018-9340
Type
jour
DOI
10.1109/TC.2010.95
Filename
5453354
Link To Document