ANNO: A General Annotation Tool for Bilingual Clinical Note Information Extraction
- Authors
- 이계화; Lee, Hyunsung; 박진혁; Kim, Yi-Jun; 이영호
- Issue Date
- Jan-2022
- Publisher
- 대한의료정보학회
- Keywords
- Medical Records; Data Mining; Information Storage and Retrieval; Personal Health Records; Information Storage and Retrieval
- Citation
- Healthcare Informatics Research, v.28, no.1, pp.89 - 94
- Journal Title
- Healthcare Informatics Research
- Volume
- 28
- Number
- 1
- Start Page
- 89
- End Page
- 94
- URI
- https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/83804
- DOI
- 10.4258/hir.2022.28.1.89
- ISSN
- 2093-3681
- Abstract
- Objectives: This study was conducted to develop a generalizable annotation tool for bilingual complex clinical text annotation,which led to the design and development of a clinical text annotation tool, ANNO. Methods: We designed ANNO toenable human annotators to support the annotation of information in clinical documents efficiently and accurately. First,annotations for different classes (word or phrase types) can be tagged according to the type of word using the dictionaryfunction. In addition, it is possible to evaluate and reconcile differences by comparing annotation results between humanannotators. Moreover, if the regular expression set for each class is updated during annotation, it is automatically reflectedin the new document. The regular expression set created by human annotators is designed such that a word tagged once isautomatically labeled in new documents. Results: Because ANNO is a Docker-based web application, users can use it freelywithout being subjected to dependency issues. Human annotators can share their annotation markups as regular expressionsets with a dictionary structure, and they can cross-check their annotated corpora with each other. The dictionary-basedregular expression sharing function, cross-check function for each annotator, and standardized input (Microsoft Excel) andoutput (extensible markup language [XML]) formats are the main features of ANNO. Conclusions: With the growing needfor massively annotated clinical data to support the development of machine learning models, we expect ANNO to be helpfulto many researchers.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - IT융합대학 > 컴퓨터공학과 > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.