University of Sussex
Browse
paper6.pdf (469.33 kB)

Annotating a corpus of clinical text records for learning to recognize symptoms automatically

Download (469.33 kB)
chapter
posted on 2023-06-08, 00:08 authored by Rob Koeling, John Carroll, Rosemary Tate, Amanda Nicholson
We report on a research effort to create a corpus of clinical free text records enriched with annotation for symptoms of a particular disease (ovarian cancer). We describe the original data, the annotation procedure and the resulting corpus. The data (approximately 192K words) was annotated by three clinicians and a procedure was devised to resolve disagreements. We are using the corpus to investigate the amount of symptom-related information in clinical records that is not coded, and to develop techniques for recognizing these symptoms automatically in unseen text.

History

Publication status

  • Published

File Version

  • Published version

Publisher

Norwegian University of Science and Technology

Volume

744

Page range

43-50

Pages

82.0

Event name

Louhi 2011: The third international workshop on health documentation text mining and information analysis

Event location

Bled, Slovenia

Event type

conference

Event date

July 6, 2011

Book title

Proceedings of LOUHI 2011 Third International Workshop on Health Document Text Mining and Information Analysis

Place of publication

Trondheim, Norway

ISBN

1613-0073

Series

CEUR Workshop Proceedings

Department affiliated with

  • Primary Care and Public Health Publications

Notes

E-publication

Full text available

  • Yes

Peer reviewed?

  • Yes

Editors

Laura Slaughter, Øystein Nytrø, Hans Moen

Legacy Posted Date

2012-02-06

First Open Access (FOA) Date

2016-03-22

First Compliant Deposit (FCD) Date

2016-03-22

Usage metrics

    University of Sussex (Publications)

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC