The Influence of Various Text Characteristics on the Readability and Content Informativeness
dc.contributor.author | Khairova, N. F. | en |
dc.contributor.author | Kolesnyk, Anastasiia | en |
dc.contributor.author | Mamyrbayev, Orken | en |
dc.contributor.author | Mukhsina, Kuralay | en |
dc.date.accessioned | 2020-12-14T13:03:54Z | |
dc.date.available | 2020-12-14T13:03:54Z | |
dc.date.issued | 2019 | |
dc.description.abstract | Currently, businesses increasingly use various external big data sources for extracting and integrating information into their own enterprise information systems to make correct economic decisions, to understand customer needs, and to predict risks. The necessary condition for obtaining useful knowledge from big data is analysing high-quality data and using quality textual data. In the study, we focus on the influence of readability and some particular features of the texts written for a global audience on the texts quality assessment. In order to estimate the influence of different linguistic and statistical factors on the text readability, we reviewed five different text corpora. Two of them contain texts from Wikipedia, the third one contains texts from Simple Wikipedia and two last corpora include scientific and educational texts. We show linguistic and statistical features of a text that have the greatest influence on the text quality for business corporations. Finally, we propose some directions on the way to automatic predicting the readability of texts in the Web. | en |
dc.identifier.citation | The Influence of Various Text Characteristics on the Readability and Content Informativeness [Electronic resource] / N. Khairova [et al.] // Proceedings of the 21st International Conference on Enterprise Information Systems (ICEIS 2019), May 3-5, 2019, Crete, Greece. Vol. 1 / ed. J. Filipe [et al.]. – Electron. text data. – Heraklion, 2019. – P. 462-469. – URL: https://www.scitepress.org/Papers/2019/77550/77550.pdf, free (accessed 14.12.2020). | en |
dc.identifier.doi | doi.org/10.5220/0007755004620469 | |
dc.identifier.orcid | https://orcid.org/0000-0002-9826-0286 | |
dc.identifier.orcid | https://orcid.org/0000-0001-5817-0844 | |
dc.identifier.orcid | https://orcid.org/0000-0001-8318-3794 | |
dc.identifier.orcid | https://orcid.org/0000-0002-8627-1949 | |
dc.identifier.uri | https://repository.kpi.kharkov.ua/handle/KhPI-Press/49832 | |
dc.language.iso | en | |
dc.subject | text quality | en |
dc.subject | readability indexes | en |
dc.subject | linguistic features | en |
dc.subject | statistical characteristics of a document | en |
dc.subject | simple Wikipedia | en |
dc.subject | Enterprise Information Systems | en |
dc.title | The Influence of Various Text Characteristics on the Readability and Content Informativeness | en |
dc.type | Thesis | en |
Файли
Контейнер файлів
1 - 1 з 1
- Назва:
- Khairova_The_influence_2019.pdf
- Розмір:
- 417.83 KB
- Формат:
- Adobe Portable Document Format
- Опис:
Ліцензійна угода
1 - 1 з 1
Ескіз недоступний
- Назва:
- license.txt
- Розмір:
- 11.25 KB
- Формат:
- Item-specific license agreed upon to submission
- Опис: