Automatic Annotation of Narrative Radiology Reports

Krsnik, Ivan; Glavaš, Goran; Krsnik, Marina; Miletić, Damir; Štajduhar, Ivan

doi:10.3390/diagnostics10040196

prikaz prve stranice dokumenta Automatic Annotation of Narrative Radiology Reports

Preuzmi
PDF 535.2 KB

Znanstveni rad - Izvorni znanstveni rad

Automatic Annotation of Narrative Radiology Reports

Diagnostics, 10 (2020), 4; 196. https://doi.org/10.3390/diagnostics10040196

Krsnik, Ivan; Glavaš, Goran; Krsnik, Marina; Miletić, Damir; Štajduhar, Ivan

Citirajte ovaj rad

APA 6th Edition

Krsnik, I., Glavaš, G., Krsnik, M., Miletić, D. i Štajduhar, I. (2020). Automatic Annotation of Narrative Radiology Reports. Diagnostics, 10. (4). doi: 10.3390/diagnostics10040196

MLA 8th Edition

Krsnik, Ivan, et al. "Automatic Annotation of Narrative Radiology Reports." Diagnostics, vol. 10, br. 4, 2020. https://doi.org/10.3390/diagnostics10040196

Chicago 17th Edition

Krsnik, Ivan, Goran Glavaš, Marina Krsnik, Damir Miletić i Ivan Štajduhar. "Automatic Annotation of Narrative Radiology Reports." Diagnostics 10, br. 4 (2020). https://doi.org/10.3390/diagnostics10040196

Harvard

Krsnik, I., et al. (2020) 'Automatic Annotation of Narrative Radiology Reports', Diagnostics, 10(4). doi: 10.3390/diagnostics10040196

Vancouver

Krsnik I, Glavaš G, Krsnik M, Miletić D, Štajduhar I. Automatic Annotation of Narrative Radiology Reports. Diagnostics [Internet]. 01.04.2020. [pristupljeno 06.12.2024.];10(4). doi: 10.3390/diagnostics10040196

IEEE

I. Krsnik, G. Glavaš, M. Krsnik, D. Miletić i I. Štajduhar, "Automatic Annotation of Narrative Radiology Reports", Diagnostics, vol. 10, br. 4, Travanj 2020. [Online]. Dostupno na: https://urn.nsk.hr/urn:nbn:hr:184:982562. [Citirano: 06.12.2024.]

Za citiranje koristite ovu mrežnu adresu: https://urn.nsk.hr/urn:nbn:hr:184:982562

Prijavite se u repozitorij kako biste mogli spremiti objekt u svoju listu.

Podaci o radu

Naslov (engleski)	Automatic Annotation of Narrative Radiology Reports
Autor	Ivan Krsnik
Autor	Goran Glavaš
Autor	Marina Krsnik
Autor	Damir Miletić
Autor	Ivan Štajduhar
Autorova ustanova	Sveučilište u Rijeci Medicinski fakultet (Katedra za radiologiju)
Znanstveno / umjetničko područje, polje i grana	BIOMEDICINA I ZDRAVSTVO Kliničke medicinske znanosti Radiologija
Znanstveno / umjetničko područje, polje i grana	TEHNIČKE ZNANOSTI Računarstvo
Sažetak (engleski)	Narrative texts in electronic health records can be efficiently utilized for building decision support systems in the clinic, only if they are correctly interpreted automatically in accordance with a specified standard. This paper tackles the problem of developing an automated method of labeling free-form radiology reports, as a precursor for building query-capable report databases in hospitals. The analyzed dataset consists of 1295 radiology reports concerning the condition of a knee, retrospectively gathered at the Clinical Hospital Centre Rijeka, Croatia. Reports were manually labeled with one or more labels from a set of 10 most commonly occurring clinical conditions. After primary preprocessing of the texts, two sets of text classification methods were compared: (1) traditional classification models—Naive Bayes (NB), Logistic Regression (LR), Support Vector Machine (SVM), and Random Forests (RF)—coupled with Bag-of-Words (BoW) features (i.e., symbolic text representation) and (2) Convolutional Neural Network (CNN) coupled with dense word vectors (i.e., word embeddings as a semantic text representation) as input features. We resorted to nested 10-fold cross-validation to evaluate the performance of competing methods using accuracy, precision, recall, and F 1 score. The CNN with semantic word representations as input yielded the overall best performance, having a micro-averaged F 1 score of 86 . 7 % . The CNN classifier yielded particularly encouraging results for the most represented conditions: degenerative disease ( 95 . 9 % ), arthrosis ( 93 . 3 % ), and injury ( 89 . 2 % ). As a data-hungry deep learning model, the CNN, however, performed notably worse than the competing models on underrepresented classes with fewer training instances such as multicausal disease or metabolic disease. LR, RF, and SVM performed comparably well, with the obtained micro-averaged F 1 scores of 84 . 6 % , 82 . 2 % , and 82 . 1 % , respectively.
Ključne riječi (engleski)
Jezik	engleski
Vrsta publikacije	Znanstveni rad - Izvorni znanstveni rad
Status objave	Objavljen
Vrsta recenzije	Recenziran - međunarodna recenzija
Verzija publikacije	Objavljena verzija rada (izdavačev PDF)
Naslov časopisa	Diagnostics
Brojčani podaci	vol. 10, br. 4, 196
e-ISSN	2075-4418
DOI	https://doi.org/10.3390/diagnostics10040196
URN:NBN	urn:nbn:hr:184:982562
Datum objave publikacije	2020-04-01
URL dokumenta	https://www.mdpi.com/2075-4418/10/4/196
Vrsta resursa	Tekst
Prava pristupa	Otvoreni pristup
Uvjeti korištenja
Datum i vrijeme pohrane	2020-04-02 16:12:57