中文信息处置就是运用盘算机对汉语信息停止主动处置。在中文信息处置中,处理汉字输出是一项基本而又主要的任务。固然曾经有不依附键盘输出汉字的产物问世,但汉字键盘输出法照样最普及的措施,也是中文信息处置范畴中一个很主要的课题。在已有的键盘输出法中,有以字、词为单元输出的,也有后来以短语和句子为单元来输出的,然则这些输出法在智能处置方面都不太幻想。所以设计了汉字语法语义智能输出法,目标是经由过程应用汉语的词语搭配常识、语法和语义搭配常识来进步输出法的智能性。本文所做的研究就是设计汉字语法语义输出法所运用的这些词语搭配常识库、语法和语义搭配常识库。具体内容以下1、设计并完成两词语搭配常识库。静态设定远、近间隔搭配窗口,统计窗口内的候选搭配词语,然后依据改良的几个统计模子近间隔搭配强度、远间隔搭配强度、近间隔搭配团圆度、远间隔搭配团圆度,各个地位上的尖峰值停止候选搭配词的初步挑选,法语论文,最初依据一些语法语义常识停止进一步的挑选,得出的终究成果填写到本文的两词语搭配常识库中。2、设计并完成三词语搭配常识库。对两词语搭配库中的每个词语搭配对作为一个症结词语对其反复两词语搭配库统计的进程步调,得出的终究成果存入到本文的三词语搭配常识库中。3、设计并完成语法搭配常识库。短语和句子外部都有必定的语法构造关系,先树立一系列的语法搭配规矩模板,然后对国民日报语料库停止模板婚配,主动抽掏出一系列的详细语法搭配实例,存入语法搭配常识库中。4、设计并完成语义搭配常识库。词语搭配对之间也存在必定的语义关系。起首借助同义词词林对语义常识停止编码,界说语义搭配的编码情势,采取这类编码措施对两词语搭配常识库中的节点词语实例指派适合的义类,然后给搭配实例中的搭配词指派适合的义类,最初对一切的义类搭配停止归并和统计,获得终究的语义搭配常识库。 Abstract: Chinese information processing is the application of computer to stop the active disposal of Chinese information. In Chinese information processing, processing the output of Chinese characters is a basic and main task. Although there is no dependence on the keyboard output of Chinese characters come out, but the Chinese character keyboard output method is still the most popular way, but also a very important topic in the field of Chinese information processing. In the existing keyboard output method, there are words, words for the unit output, but also to the phrase and sentence as the unit to the output, but the output of these methods are not very fancy intelligent disposal. Therefore, the design of Chinese grammar semantic intelligent output method, the goal is to use the Chinese language through the process of collocation common sense, syntax and semantic collocation common sense to improve the output of intelligent. The research of this paper is the design of the Chinese grammar semantic output method used by the collocation common sense of these words, syntax and semantic collocation common sense library. The specific content of the following 1, design and complete the two words collocation common sense library. Static set far, nearly interval collocational window, in the statistics window candidate collocations and according to modified several statistical models closely spaced collocation strength, distant collocational strength nearly interval collocation reunion, long-distance collocation reunion degree, status of each spike value stop candidate collocations of initial selection, initially based on some knowledge of semantic grammar to stop further selection, that eventually results to fill the two collocation knowledge base in. 2, design and complete the three words collocation common sense library. Collocation two words in the collocation library for each word collocation as a key word to its repeated two words of the process of the process of statistical process, the results obtained after the end of the three words collocation common sense library. 3, design and complete the grammar collocation common sense library. Certain grammatical structure and the relationship between phrases and sentences outside, to establish a series of grammatical collocation rule template and to the national daily "corpus stop template matching, active pumping out a series of detailed grammatical collocation examples, deposited in the grammatical collocation knowledge base. 4, design and complete the semantic collocation knowledge base. There is also a certain semantic relationship between collocation and the word. First and foremost by the Tongyici Cilin "of semantic knowledge to stop encoding, definition of semantic collocation coding situation, take this kind of coding method for two collocation knowledge base node word instances assigned for the semantic class, then to match instance collocation assigned for the semantic class, initially to all the righteous collocation stop merging and statistics, obtained after all the semantic collocation knowledge base. 目录: |