Construction and Analysis of Berber Text Corpus
dc.contributor.author | Zayd, Khayi | |
dc.contributor.author | Orobinska, Olena | |
dc.date.accessioned | 2024-02-14T20:12:11Z | |
dc.date.available | 2024-02-14T20:12:11Z | |
dc.date.issued | 2020 | |
dc.description.abstract | This work is devoted to constructing a tool for analyzing different aspects of Berber languages, based on the grammatical parameters of these languages. A collection of more than 500 texts covering a long historical period was assembled. The corpus, which contains more than 200,000 words, is freely available and will be useful for further research on the Tamazight language. It was converted to XML format for the purpose of standardization. Based on linguistic rules and statistical methods, an original user interface and a software prototype were developed by combining web design technologies with object-oriented programming in Python. | |
dc.identifier.citation | Zayd K. Construction and Analysis of Berber Text Corpus [Electronic resource] / K. Zayd, O. Orobinska // Computational Linguistics and Intelligent Systems (COLINS 2020) : proc. of the 4th Intern. Conf., April 23-24, 2020. Vol. 2. – Electronic text data. – Lviv, 2020. – P. 230-231. – Access mode: https://colins.in.ua/wp-content/uploads/2020/06/preface_colins_volume2_2020_part6.pdf, free (date of the application 14.02.2024.). | |
dc.identifier.orcid | https://orcid.org/0000-0001-8396-4136 | |
dc.identifier.uri | https://repository.kpi.kharkov.ua/handle/KhPI-Press/74110 | |
dc.language.iso | en | |
dc.subject | Tamazight language | |
dc.subject | corpus linguistics |
dc.subject | grammar rules | |
dc.subject | statistical methods | |
dc.subject | xml-structure | |
dc.subject | xml-format | |
dc.subject | Python | |
dc.subject | software | |
dc.title | Construction and Analysis of Berber Text Corpus | |
dc.type | Article |
Files
File bundle
- Name:
- Zayd_Construction_and_analysis_2020.pdf
- Size:
- 231.06 KB
- Format:
- Adobe Portable Document Format