Evaluating effectiveness of linguistic technologies of knowledge identification in text collections

Date

2014

Publisher

ITHEA, Poland

Abstract

The paper analyzes whether integral recall and precision coefficients can be used to evaluate the effectiveness of linguistic technologies for identifying knowledge in texts. The approach combines the test-collection method, which is used to validate the obtained effectiveness coefficients experimentally, with methods of mathematical statistics. The paper studies how to maximize the reliability of sample results when they are extrapolated to the general population of the tested text collection. It considers a method for determining the confidence interval for an attribute proportion, based on Wilson’s formula, and a method for determining the required size of the relevant sample for a specified relative error and confidence probability.
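
As a rough illustration of the two statistical tools named in the abstract, the sketch below computes a Wilson score interval for a proportion (such as precision or recall measured on a sample) and a required sample size for a given relative error. It is not taken from the paper: the function names are hypothetical, and the sample-size formula is the common normal-approximation bound n ≥ z²(1 − p) / (ε²p), assumed here because the abstract does not state the authors' exact expression.

```python
import math
from statistics import NormalDist


def z_score(confidence: float) -> float:
    """Two-sided standard normal quantile for a confidence probability."""
    return NormalDist().inv_cdf((1.0 + confidence) / 2.0)


def wilson_interval(successes: int, n: int, confidence: float = 0.95):
    """Wilson score interval for a proportion observed as successes / n."""
    if n <= 0:
        raise ValueError("sample size n must be positive")
    z = z_score(confidence)
    p = successes / n
    denom = 1.0 + z * z / n
    center = (p + z * z / (2.0 * n)) / denom
    half = z * math.sqrt(p * (1.0 - p) / n + z * z / (4.0 * n * n)) / denom
    return center - half, center + half


def required_sample_size(p: float, rel_error: float, confidence: float = 0.95) -> int:
    """Sample size so that the half-width is at most rel_error * p.

    Assumption: normal approximation to the binomial, not necessarily
    the formula used in the paper.
    """
    if not 0.0 < p < 1.0 or rel_error <= 0.0:
        raise ValueError("need 0 < p < 1 and rel_error > 0")
    z = z_score(confidence)
    return math.ceil(z * z * (1.0 - p) / (rel_error ** 2 * p))
```

For example, if 50 of 100 sampled items are judged relevant, `wilson_interval(50, 100)` gives roughly (0.404, 0.596) at a confidence probability of 0.95.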

Keywords

recall, precision, relevance, confidence interval, sample size

Bibliographic citation

Khairova N. Evaluating effectiveness of linguistic technologies of knowledge identification in text collections / N. Khairova, G. Shepelyov, S. Petrasova // Information science and computing : Intern. bk. ser. Bk. 29 : Transactions on Business and Engineering Intelligent Applications / ed.: G. Setlak, K. Markov. – Rzeszow : ITHEA, 2014. – P. 71-75.