|
|
|
|
a method for the automatic extraction of keywords in legislative documents using statistical, semantic, and clustering relationships
|
|
|
|
|
|
|
|
نویسنده
|
naseri jaber ,hassanpour hamid ,ghanbari ali
|
|
منبع
|
international journal of nonlinear analysis and applications - 2021 - دوره : 12 - شماره : Special Is - صفحه:265 -278
|
|
چکیده
|
Using smart methods for the automatic generation of keywords in legislative documents has attracted the attention of many researchers over the past few decades. with the increasing development of legislative documents and the large volume of unstructured texts, the need for rapid access to these documents has become more significant. extracting the keywords in legislative documents will accelerate the legislative process and reduce costs. the present study attemptes to extract meaningful keywords from texts by using the thesaurus, which has a structured system to improve the classification of legislative documents. in this method, the semantic relationships in the thesaurus and document clustering were used and the statistical features of different words were calculated to extract keywords. after pre-processing the texts, first the keywords in the text are selected using statistical methods. then, the phrases derived from the keywords are extracted using semantic terms in the thesaurus. after that, a numerical weight is assigned to each word to determine the relative importance of the words and indicate the effect of the word in relation to the text and compared to other words. finally, the final keywords are selected using the relationships in the thesaurus and clustering methods. the results of testing various texts from the parliament of iran and the deputy for presidential laws indicate the high accuracy of the proposed method in meaningful keywords extraction.
|
|
کلیدواژه
|
text mining ,keyword extraction ,thesaurus ,semantic relationships ,clustering
|
|
آدرس
|
shahroud university of technology, faculty of computer engineering, iran, shahroud university of technology, faculty of computer engineering, iran, university of science and technology of mazandaran, iran
|
|
پست الکترونیکی
|
ali.ganbari289@gmail.com
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Authors
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|