Title :
An Efficient Algorithm of Chinese String Sort in User-Defined Sequence
Author :
Haijun Zhang ; Shumin Shi
Author_Institution :
Sch. of Comput. Sci. & Technol., Xinjiang Normal Univ., Urumqi, China
Abstract :
Existing sort algorithms make it difficult to sort Chinese strings in a user-defined sequence. This paper proposes an efficient method for sorting strings in a user-defined character order. Based on the consecutive numbers used to define the custom order of characters, a hash table is employed to convert each string into a corresponding array of integers. By taking the maximum number of characters as the new radix, the Radix sort algorithm is used to sort the strings quickly in the user-defined order. Theoretical analysis and experiments show that the proposed algorithm sorts Chinese strings in user-defined order with linear time and space complexity. It achieves better time performance than the Quick Sort algorithm and can be readily extended to string sort applications in other languages.
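
The approach outlined in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`build_order_table`, `radix_sort_strings`), the reserved padding code 0 for short strings, and the choice of radix as the number of ordered characters plus one are all assumptions made for illustration. The sketch maps each character to a consecutive integer through a hash table (a `dict`), encodes each string as an integer array, and applies a least-significant-digit Radix sort over the character positions.

```python
def build_order_table(alphabet):
    """Hash table mapping each character to a consecutive integer.

    Codes start at 1; code 0 is reserved (an assumption of this sketch)
    for padding, so a shorter string sorts before its extensions.
    """
    return {ch: code for code, ch in enumerate(alphabet, start=1)}


def radix_sort_strings(strings, alphabet):
    """Sort strings by the character order given in `alphabet`.

    Hypothetical helper: characters missing from `alphabet` raise KeyError.
    """
    table = build_order_table(alphabet)
    radix = len(table) + 1  # one bucket per character, plus the padding bucket

    # Convert every string into its array of integer codes.
    keys = [[table[ch] for ch in s] for s in strings]
    max_len = max((len(k) for k in keys), default=0)

    # LSD radix sort: process positions from last to first, keeping each
    # pass stable by appending to buckets in the current order.
    order = list(range(len(strings)))
    for pos in range(max_len - 1, -1, -1):
        buckets = [[] for _ in range(radix)]
        for idx in order:
            key = keys[idx]
            code = key[pos] if pos < len(key) else 0  # pad short strings
            buckets[code].append(idx)
        order = [idx for bucket in buckets for idx in bucket]

    return [strings[i] for i in order]


if __name__ == "__main__":
    # Custom order: 丙 < 乙 < 甲 (the reverse of the conventional order).
    print(radix_sort_strings(["甲乙", "乙", "丙甲", "甲", "丙"], "丙乙甲"))
```

Each pass costs time proportional to the number of strings plus the radix, so the total cost in this sketch grows with the number of strings multiplied by the longest string length, which is linear in the input size when string lengths are bounded.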
Keywords :
computational complexity; file organisation; natural language processing; sorting; Chinese string sort method; Radix sort algorithm; consecutive numbers; custom order; hash table structure; integer array; linear time complexity; space complexity; user-defined character order; user-defined sequence; Algorithm design and analysis; Arrays; Educational institutions; Presses; Sorting; Time complexity; Radix sort; hash table structure; string sort; user-defined order
Conference_Title :
Asian Language Processing (IALP), 2013 International Conference on
Conference_Location :
Urumqi
DOI :
10.1109/IALP.2013.33