SciELO - Scientific Electronic Library Online

vol.19 número3Identificación de documentos multilingües relacionados mediante algoritmos de clustering de hormigasAnálisis de rendimiento académico estudiantil usando data warehouse y redes neuronales índice de autoresíndice de assuntospesquisa de artigos
Home Pagelista alfabética de periódicos  

Ingeniare. Revista chilena de ingeniería

versão On-line ISSN 0718-3305


JARA, José Luis; CHACON, Max  e  ZELAYA, Gonzalo. Empirical evaluation of three machine learning method for automatic classification of neoplastic diagnoses. Ingeniare. Rev. chil. ing. [online]. 2011, vol.19, n.3, pp. 359-368. ISSN 0718-3305.

Diagnoses are a valuable source of information for evaluating a health system. However, they are not used extensively by information systems because diagnoses are normally written in natural language. This work empirically evaluates three machine learning methods to automatically assign codes from the International Classification of Diseases (10th Revision) to 3,335 distinct diagnoses of neoplasms obtained from UMLS®. This evaluation is conducted on three different types of preprocessing. The results are encouraging: a well-known rule induction method and maximum entropy models achieve 90% accuracy in a balanced cross-validation experiment.

Palavras-chave : Clinical coding; controlled vocabulary; international classification of diseases; machine learning; natural language processing.

        · resumo em Espanhol     · texto em Inglês     · Inglês ( pdf )


Creative Commons License Todo o conteúdo deste periódico, exceto onde está identificado, está licenciado sob uma Licença Creative Commons