Korean Title Named Entity Recognition and Dictionary Construction: Books, Movies, and TV Programs (한국어 제목 개체명 인식 및 사전 구축: 도서, 영화, TV프로그램)

Named entity recognition is used to improve the performance of information retrieval, question answering, machine translation, and similar systems. Its targets are usually PLO entities (persons, locations, and organizations). These are typically proper nouns or unregistered words, and traditional named entity recognizers exploit these characteristics to find named entity candidates. Titles of books, movies, and TV programs have different characteristics from PLO entities: a title may span multiple phrases, be a full sentence, or contain special characters. This makes it difficult to find the boundaries of title candidates.
In this work, we propose a method to extract title named entities from news articles and to automatically build a named entity dictionary of titles. For candidate identification, word phrases enclosed in special symbols in a sentence are first extracted and then verified by an SVM using feature words and their distances. For classification of the extracted title candidates, an SVM is used with the mutual information of word contexts.
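The first step of candidate identification, pulling out phrases enclosed in special symbols, can be illustrated with a minimal sketch. The symbol pairs in `BRACKET_PAIRS` and the function name `extract_candidates` are assumptions made for illustration; the abstract does not list which symbols are used, and the SVM verification step that follows extraction is not shown here.

```python
import re

# Hypothetical symbol pairs; the abstract does not say which symbols are used.
BRACKET_PAIRS = [
    ("'", "'"), ('"', '"'), ("‘", "’"), ("“", "”"),
    ("<", ">"), ("《", "》"), ("「", "」"), ("『", "』"),
]


def extract_candidates(sentence):
    """Return phrases enclosed in any assumed symbol pair, in sentence order."""
    found = []
    for open_sym, close_sym in BRACKET_PAIRS:
        # Match the shortest span between an opening and a closing symbol
        # that does not itself contain either symbol.
        inner = "[^" + re.escape(open_sym) + re.escape(close_sym) + "]+?"
        pattern = re.escape(open_sym) + "(" + inner + ")" + re.escape(close_sym)
        for match in re.finditer(pattern, sentence):
            found.append((match.start(1), match.group(1)))
    found.sort()  # restore left-to-right order across symbol pairs
    return [text for _, text in found]


candidates = extract_candidates("영화 <괴물>과 소설 『토지』가 화제다.")
print(candidates)
```

Each candidate returned this way would still need to be verified, since quoted or bracketed phrases in news text are often not titles.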
The experiment used 19K news articles, with 90% as training data and 10% as test data. Evaluation was performed on 200 sentences randomly selected from the test data. Evaluated as separate modules, title identification achieves an F1-score of 81.17% and title classification achieves 92.92%. The integrated module achieves an F1-score of 81.09%. Dictionary construction performance, measured after removing duplicate extracted titles, is 71.01% in F1-score.
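As a rough illustration of how a de-duplicated dictionary score could be computed, the sketch below collapses extracted titles into a set before comparing them with a gold set of titles. The set-based comparison, the function name `precision_recall_f1`, and the sample titles are assumptions; the abstract states only that duplicates are deleted and that F1-score is reported.

```python
def precision_recall_f1(predicted, gold):
    """Compute precision, recall, and F1 over de-duplicated title sets."""
    pred_set, gold_set = set(predicted), set(gold)  # duplicates collapse here
    true_positives = len(pred_set & gold_set)
    precision = true_positives / len(pred_set) if pred_set else 0.0
    recall = true_positives / len(gold_set) if gold_set else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Made-up example data: one title is extracted twice and counts only once.
extracted = ["괴물", "괴물", "토지", "무한도전"]
gold_titles = ["괴물", "토지", "태백산맥"]
precision, recall, f1 = precision_recall_f1(extracted, gold_titles)
print(f"precision={precision:.4f} recall={recall:.4f} f1={f1:.4f}")
```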
