Using Distributional Similarity to Organise BioMedical Terminology

Weeds, Julie, Dowdall, James, Schneider, Gerold, Keller, Bill and Weir, David (1980) Using Distributional Similarity to Organise BioMedical Terminology. In: Application-Driven Terminology Engineering.

Full text not available from this repository.

Abstract

We investigate an application of distributional similarity techniques to the problem of
structural organisation of biomedical terminology. Our application domain is the relatively
small GENIA corpus. Using terms that have been accurately marked-up by hand within the
corpus, we consider the problem of automatically determining semantic proximity.
Terminological units are defined for our purposes as normalised classes of individual terms. Syntactic
analysis of the corpus data is carried out using the Pro3Gres parser and provides the data
required to calculate distributional similarity using a variety of different measures. Evaluation
is performed against a hand-crafted gold standard for this domain in the form of the GENIA
ontology. We show that distributional similarity can be used to predict semantic type with a
good degree of accuracy.
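
The general idea in the abstract can be illustrated with a minimal sketch. It assumes each term is represented by counts of the syntactic contexts it occurs in, uses cosine similarity as a stand-in for the measures the authors compare (the abstract does not name them), and predicts a term's semantic type from its most similar labelled neighbour. The terms, counts, type labels, and the helper names `cosine` and `predict_type` are all hypothetical and are not taken from the paper or from GENIA.

```python
from collections import Counter
from math import sqrt


def cosine(u, v):
    """Cosine similarity between two sparse feature-count vectors."""
    dot = sum(u[f] * v[f] for f in u if f in v)
    norm_u = sqrt(sum(c * c for c in u.values()))
    norm_v = sqrt(sum(c * c for c in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def predict_type(term, vectors, types):
    """Return the semantic type of the labelled term most similar to `term`."""
    candidates = [t for t in vectors if t != term and t in types]
    nearest = max(candidates, key=lambda t: cosine(vectors[term], vectors[t]))
    return types[nearest]


# Hypothetical toy data: features are (grammatical relation, co-occurring word)
# pairs of the kind a dependency parser could supply; the counts are invented.
vectors = {
    "IL-2": Counter({("subj", "activate"): 3, ("obj", "express"): 2}),
    "interleukin-4": Counter({("subj", "activate"): 2, ("obj", "express"): 1}),
    "T cell": Counter({("subj", "proliferate"): 4, ("obj", "stimulate"): 2}),
    "B lymphocyte": Counter({("subj", "proliferate"): 3,
                             ("obj", "stimulate"): 1,
                             ("obj", "express"): 1}),
}

# Hypothetical gold-standard labels for the terms whose type is known.
types = {
    "interleukin-4": "protein",
    "T cell": "cell_type",
    "B lymphocyte": "cell_type",
}

predicted = predict_type("IL-2", vectors, types)
```

In this toy setting the unlabelled term shares most of its contexts with "interleukin-4", so it inherits that term's type. An evaluation in the spirit of the abstract would compare such predictions against the types recorded in a hand-built ontology.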

Item Type: Book Section
Schools and Departments: School of Engineering and Informatics > Informatics
Depositing User: Julie Weeds
Date Deposited: 06 Feb 2012 20:53
Last Modified: 13 Apr 2012 11:45
URI: http://sro.sussex.ac.uk/id/eprint/28565