Automatic Extraction of Synonymous Collocation Pairs from a Text Corpus

dc.contributor.authorKhairova, N. F.en
dc.contributor.authorPetrasova, S. V.en
dc.contributor.authorLewoniewski, Włodzimierzpl
dc.contributor.authorMamyrbayev, Orkenen
dc.contributor.authorMukhsina, Kuralayen
dc.date.accessioned2020-12-14T11:02:51Z
dc.date.available2020-12-14T11:02:51Z
dc.date.issued2018
dc.description.abstractAutomatic extraction of synonymous collocation pairs from text corpora is a challenging task of NLP. In order to search collocations of similar meaning in English texts, we use logical-algebraic equations. These equations combine grammatical and semantic characteristics of words of substantive, attributive and verbal collocations types. With Stanford POS tagger and Stanford Universal Dependencies parser, we identify the grammatical characteristics of words. We exploit WordNet synsets to pick synonymous words of collocations. The potential synonymous word combinations found are checked for compliance with grammatical and semantic characteristics of the proposed logical-linguistic equations. Our dataset includes more than half a million Wikipedia articles from a few portals. The experiment shows that the more frequent synonymous collocations occur in texts, the more related topics of the texts might be. The precision of synonymous collocations search in our experiment has achieved the results close to other studies like ours.en
dc.identifier.citationAutomatic Extraction of Synonymous Collocation Pairs from a Text Corpus / N. Khairova [et al.] // Proceedings of the 2018 Federated Conference on Computer Science and Information Systems September (FedCSIS 2018), September 9-12, 2018, Poznań, Poland. Vol.15: Annals of Computer Science and Information Systems / ed. M. Ganzha, L. Maciaszek, M. Paprzycki. – Warsaw : PTI, 2018. – P. 485-488.en
dc.identifier.doidoi.org/10.15439/2018F186
dc.identifier.urihttps://repository.kpi.kharkov.ua/handle/KhPI-Press/49822
dc.language.isoen
dc.publisherPolskie Towarzystwo Informatyczne, Polandpl
dc.titleAutomatic Extraction of Synonymous Collocation Pairs from a Text Corpusen
dc.typeThesisen

Файли

Контейнер файлів

Зараз показуємо 1 - 1 з 1
Ескіз
Назва:
Khairova_Automatic_extraction_2018.pdf
Розмір:
145.66 KB
Формат:
Adobe Portable Document Format
Опис:

Ліцензійна угода

Зараз показуємо 1 - 1 з 1
Ескіз недоступний
Назва:
license.txt
Розмір:
11.25 KB
Формат:
Item-specific license agreed upon to submission
Опис: