DocumentCode
594629
Title
A semantics aware approach to automated reverse engineering unknown protocols
Author
Yipeng Wang ; Xiaochun Yun ; Shafiq, M. Zubair ; Liyan Wang ; Liu, Alex X. ; Zhibin Zhang ; Danfeng Yao ; Yongzheng Zhang ; Li Guo
Author_Institution
Inst. of Comput. Technol., Beijing, China
fYear
2012
fDate
Oct. 30 2012-Nov. 2 2012
Firstpage
1
Lastpage
10
Abstract
Extracting the protocol message format specifications of unknown applications from network traces is important for a variety of applications such as application protocol parsing, vulnerability discovery, and system integration. In this paper, we propose ProDecoder, a network trace based protocol message format inference system that exploits the semantics of protocol messages without the executable code of application protocols. ProDecoder is based on the key insight that the n-grams of protocol traces exhibit highly skewed frequency distribution that can be leveraged for accurate protocol message format inference. In ProDecoder, we first discover the latent relationship among n-grams by first grouping protocol messages with the same semantics and then inferring message formats by keyword based clustering and cluster sequence alignment. We implemented and evaluated ProDecoder to infer message format specifications of SMB (a binary protocol) and SMTP (a textual protocol). Our experimental results show that ProDecoder accurately parses and infers SMB protocol with 100% precision and recall. For SMTP, ProDecoder achieves approximately 95% precision and recall.
Keywords
formal specification; inference mechanisms; pattern clustering; protocols; reverse engineering; ProDecoder; SMB; SMTP; application protocol parsing; automated reverse engineering; binary protocol; cluster sequence alignment; keyword based clustering; network traces; protocol message format inference system; protocol message format specifications; semantics aware approach; system integration; textual protocol; vulnerability discovery; Electronic mail; Natural language processing; Postal services; Protocols; Reverse engineering; Semantics; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Network Protocols (ICNP), 2012 20th IEEE International Conference on
Conference_Location
Austin, TX
Print_ISBN
978-1-4673-2445-8
Electronic_ISBN
978-1-4673-2446-5
Type
conf
DOI
10.1109/ICNP.2012.6459963
Filename
6459963
Link To Document