Use of Natural Language Processing for Precise Retrieval of Key Elements of Health IT Evaluation Studies
Having precise information about health IT evaluation studies is important for evidence-based decisions in medical informatics. In a former feasibility study, we used a faceted search based on ontological modeling of key elements of studies to retrieve precisely described health IT evaluation studies. However, extracting the key elements manually for the modeling of the ontology was time and resource-intensive. We now aimed at applying natural language processing to substitute manual data extraction by automatic data extraction. Four methods (Named Entity Recognition, Bag-of-Words, Term-Frequency-Inverse-Document-Frequency, and Latent Dirichlet Allocation Topic Modeling were applied to 24 health IT evaluation studies. We evaluated which of these methods was best suited for extracting key elements of each study. As gold standard, we used results from manual extraction. As a result, Named Entity Recognition is promising but needs to be adapted to the existing study context. After the adaption, key elements of studies could be collected in a more feasible, time- and resource-saving way.
DOI: 10.3233/SHTI200502
Projects: Management of health information systems
Publication type: Journal article
Journal: Studies in health technology and informatics
Human Diseases: No Human Disease specified
Citation: Studies in health technology and informatics 272:95–98
Date Published: 2020
Registered Mode: imported from a bibtex file
Views: 2387
Created: 10th Nov 2020 at 14:58
Last updated: 7th Dec 2021 at 17:58
This item has not yet been tagged.
None