Using automatically acquired predominant senses for word sense disambiguation

McCarthy, Diana; Koeling, Rob; Weeds, Julie; Carroll, John

File(s) not publicly available

Using automatically acquired predominant senses for word sense disambiguation

presentation

posted on 2023-06-07, 23:35 authored by Diana McCarthy, Rob Koeling, Julie WeedsJulie Weeds, John Carroll

In word sense disambiguation (WSD), the heuristic of choosing the most common sense is extremely powerful because the distribution of the senses of a word is often skewed. The first (or predominant) sense heuristic assumes the availability of handtagged data. Whilst there are hand-tagged corpora available for some languages, these are relatively small in size and many word forms either do not occur, or occur infrequently. In this paper we investigate the performance of an unsupervised first sense heuristic where predominant senses are acquired automatically from raw text. We evaluate on both the SENSEVAL-2 and SENSEVAL-3 English allwords data. For accurate WSD the first sense heuristic should be used only as a back-off, where the evidence from the context is not strong enough. In this paper however, we examine the performance of the automatically acquired first sense in isolation since it turned out that the first sense taken from SemCor outperformed many systems in SENSEVAL-2.

History

Publication status

Published

Publisher URL

http://acl.ldc.upenn.edu/w/w04/w04-0837.pdf

Pages

4.0

Presentation Type

paper

Event name

3rd International Workshop on the Evaluation of Systems for the Semantic Analysis of Text (SENSEVAL)

Event location

Barcelona, Spain

Event type

conference

Department affiliated with

Informatics Publications

Full text available

No

Peer reviewed?

Yes

Legacy Posted Date

2012-02-06

Usage metrics

Keywords

Uncategorised value

Licence

Copyright not evaluated

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

File(s) not publicly available

Using automatically acquired predominant senses for word sense disambiguation

History

Publication status

Publisher URL

Pages

Presentation Type

Event name

Event location

Event type

Department affiliated with

Full text available

Peer reviewed?

Legacy Posted Date

Usage metrics

Categories

Keywords

Licence

Exports