Evaluating effectiveness of linguistic technologies of knowledge identification in text collections

Ескіз

Дата

2014

ORCID

DOI

item.page.thesis.degree.name

item.page.thesis.degree.level

item.page.thesis.degree.discipline

item.page.thesis.degree.department

item.page.thesis.degree.grantor

item.page.thesis.degree.advisor

item.page.thesis.degree.committeeMember

Назва журналу

Номер ISSN

Назва тому

Видавець

ITHEA, Poland

Анотація

The possibility of using integral coefficients of recall and precision to evaluate effectiveness of linguistic technologies of knowledge identification in texts is analyzed in the paper. An approach is based on the method of test collections, which is used for experimental validation of received effectiveness coefficients, and on methods of mathematical statistics. The problem of maximizing the reliability of sample results in their propagation on the general population of the tested text collection is studied. The method for determining the confidence interval for the attribute proportion, which is based on Wilson’s formula, and the method for determining the required size of the relevant sample under specified relative error and confidence probability, are considered.

Опис

Ключові слова

recall, precision, relevance, confidence interval, sample size

Бібліографічний опис

Khairova N. Evaluating effectiveness of linguistic technologies of knowledge identification in text collections / N. Khairova, G. Shepelyov, S. Petrasova // Information science and computing : Intern. bk. ser. Bk. 29 : Transactions on Business and Engineering Intelligent Applications / ed.: G. Setlak, K. Markov. – Rzeszow : ITHEA, 2014. – P. 71-75.

item.page.endorsement

item.page.review

item.page.supplemented

item.page.referenced