Construction and Analysis of Berber Text Corpus

dc.contributor.authorZayd, Khayi
dc.contributor.authorOrobinska, Olena
dc.date.accessioned2024-02-14T20:12:11Z
dc.date.available2024-02-14T20:12:11Z
dc.date.issued2020
dc.description.abstractThis work is devoted to constructing a tool to analyze the different aspects of Berber languages. It is based on grammatical parameters of these languages. The text collection containing more than 500 texts that cover long historic period was collected. The corpus is free available and it will useful for further investigations on Tamazigh language. It was transformed into xml-format standardization goal. The corpus counts more than 200 000 of words. Based on the linguistic rules and statistic methods, original user interface and software prototype were developed by combining the technologies of web design and object programming in Python.
dc.identifier.citationZayd K. Construction and Analysis of Berber Text Corpus [Electronic resource] / K. Zayd, O. Orobinska // Computational Linguistics and Intelligent Systems (COLINS 2020) : proc. of the 4th Intern. Conf., April 23-24, 2020. Vol. 2. – Electronic text data. – Lviv, 2020. – P. 230-231. – Access mode: https://colins.in.ua/wp-content/uploads/2020/06/preface_colins_volume2_2020_part6.pdf, free (date of the application 14.02.2024.).
dc.identifier.orcidhttps://orcid.org/0000-0001-8396-4136
dc.identifier.urihttps://repository.kpi.kharkov.ua/handle/KhPI-Press/74110
dc.language.isoen
dc.subjectTamazight language
dc.subjectcorpus linguistic
dc.subjectgrammar rules
dc.subjectstatistical methods
dc.subjectxml-structure
dc.subjectxml-format
dc.subjectPython
dc.subjectsoftware
dc.titleConstruction and Analysis of Berber Text Corpus
dc.typeArticle

Файли

Контейнер файлів
Зараз показуємо 1 - 1 з 1
Вантажиться...
Ескіз
Назва:
Zayd_Construction_and_analysis_2020.pdf
Розмір:
231.06 KB
Формат:
Adobe Portable Document Format
Ліцензійна угода
Зараз показуємо 1 - 1 з 1
Ескіз недоступний
Назва:
license.txt
Розмір:
11.25 KB
Формат:
Item-specific license agreed upon to submission
Опис: