Evaluating effectiveness of linguistic technologies of knowledge identification in text collections
Дата
2014
ORCID
DOI
item.page.thesis.degree.name
item.page.thesis.degree.level
item.page.thesis.degree.discipline
item.page.thesis.degree.department
item.page.thesis.degree.grantor
item.page.thesis.degree.advisor
item.page.thesis.degree.committeeMember
Назва журналу
Номер ISSN
Назва тому
Видавець
ITHEA, Poland
Анотація
The possibility of using integral coefficients of recall and precision to evaluate effectiveness of linguistic
technologies of knowledge identification in texts is analyzed in the paper. An approach is based on the method of test collections, which is used for experimental validation of received effectiveness coefficients, and
on methods of mathematical statistics. The problem of maximizing the reliability of sample results in their
propagation on the general population of the tested text collection is studied. The method for determining
the confidence interval for the attribute proportion, which is based on Wilson’s formula, and the method
for determining the required size of the relevant sample under specified relative error and confidence probability, are considered.
Опис
Ключові слова
recall, precision, relevance, confidence interval, sample size
Бібліографічний опис
Khairova N. Evaluating effectiveness of linguistic technologies of knowledge identification in text collections / N. Khairova, G. Shepelyov, S. Petrasova // Information science and computing : Intern. bk. ser. Bk. 29 : Transactions on Business and Engineering Intelligent Applications / ed.: G. Setlak, K. Markov. – Rzeszow : ITHEA, 2014. – P. 71-75.