University of Sussex
Browse

File(s) not publicly available

Automatic extraction of subcategorization from corpora

presentation
posted on 2023-06-07, 21:39 authored by Ted Briscoe, John Carroll
We describe a novel technique and implemented system for constructing a subcategorization dictionary from textual corpora. Each dictionary entry encodes the relative frequency of occurrence of a comprehensive set of subcategorization classes for English. An initial experiment, on a sample of 14 verbs which exhibit multiple complementation patterns, demonstrates that the technique achieves accuracy comparable to previous approaches, which are all limited to a highly restricted set of subcategorization classes. We also demonstrate that a subcategorization dictionary built with the system improves the accuracy of a parser by an appreciable amount.

History

Publication status

  • Published

Page range

356-363

Presentation Type

  • paper

Event name

Proceedings of the 5th ACL Conference on Applied Natural Language Processing (ANLP'97) Washington DC.

Event location

Washington DC.

Event type

conference

Department affiliated with

  • Informatics Publications

Full text available

  • No

Peer reviewed?

  • Yes

Legacy Posted Date

2012-02-06

Usage metrics

    University of Sussex (Publications)

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC