The invention provides a backward word segmentation method and device based on Chinese retrieval and relates to the field of
processing of webpage character information in computer networks. According to the backward word segmentation method and device based on the Chinese retrieval, professional word banks are established in a
robot dictionary, the value of the MAX_Length is determined firstly according to the maximum lengths of proper nouns in the word banks, a backward matching
algorithm is formed through a backward maximum matching
algorithm, and in order to solve the problems of word segmentation
ambiguity and incomplete matching during backward matching, a maximum length matching
algorithm is improved. According to the backward word segmentation method and device based on the Chinese retrieval, word segmentation is carried out on a Chinese character string which is S=C1C2C3C4...Cn through the device which is composed of a
central processing unit, input-and-output equipment, a register, a mechanized dictionary, a window counter and a memorizer, accuracy segmentation of Chinese character strings can be achieved on the premise that the semantic of the Chinese character strings is not lost, a word segmentation result is quite accurate when a
sentence is quite long, and searching accuracy can be improved. The backward word segmentation method and device based on the Chinese retrieval can be applied to an automatic abstracting and sorting
system in the field of
information retrieval.