|
|
News classification with human annotators: A case study
|
|
|
|
|
نویسنده
|
fuddoly a. ,jaafar j. ,zamin n.
|
منبع
|
jurnal teknologi - 2015 - دوره : 74 - شماره : 10 - صفحه:21 -28
|
چکیده
|
The need to classify textual documents has become an increasingly vibrant research field due to the development of online news. while most of the news in news websites are categorised manually,the task becomes more strenuous considering the tremendous surge of data updates every day. this paper addresses the question of how text classification algorithms can substitute the particular task over manual classification methods. a combined method using bracewell's algorithm and top-n method is demonstrated and tested using indonesian language corpus. the experiment also uses human evaluation as the benchmark. the result from the human evaluation is further investigated in order to understand how the annotators classify documents and the aspects that can be improved to enhance the method in the future. the results indicate that the method can outperform human annotators by 13% in terms of accuracy. © 2015 penerbit utm press. all rights reserved.
|
کلیدواژه
|
Bracewell Algorithm; Category classification; Human annotator; Indonesian news classification; Text classification; Topic identification
|
آدرس
|
department of computer and information sciences,universiti teknologi petronas, Malaysia, department of computer and information sciences,universiti teknologi petronas, Malaysia, department of computer and information sciences,universiti teknologi petronas, Malaysia
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Authors
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|