Wikipedia-based Named Entity Recognition System for Turkish
|
Authors
|
KÜÇÜK Doğan, ARICI Nursal
|
Source
|
Journal of Polytechnic - 2016 - Volume: 19 - Issue: 3 - Pages: 325-332
|
Abstract
|
Named entity recognition is a problem in the research area of natural language processing and is usually defined as the automatic extraction of the names of people, locations, and organizations in natural language texts. In this study, a Wikipedia-based named entity recognition system for Turkish is introduced. It is well known that resources like Wikipedia, which are created by internet users, are considerably important for topics like named entity recognition. We first automatically compiled a large list of person names from Turkish Wikipedia. Then, we developed a Wikipedia-based named entity recognition system for Turkish which utilizes this large list together with other lists of person, location, and organization names obtained from Turkish Wikipedia and a former rule-based named entity recognizer for Turkish. We evaluated our system on different types of datasets and obtained promising results. Our system is a significant contribution to information extraction on Turkish texts, since only a limited number of related studies have been carried out so far.
|
Keywords
|
Named entity recognition, information extraction, Turkish, automatic text processing
|
Address
|
Gazi Üniversitesi, Teknoloji Fakültesi, Bilgisayar Mühendisliği Bölümü, Turkey
|
Email
|
nursal@gazi.edu.tr
|