Can subcategorisation probabilities help a statistical parser?

Carroll, John; Minnen, Guido; Briscoe, Ted

conference contribution

posted on 2023-06-07, 07:08 authored by John Carroll, Guido Minnen, Ted Briscoe

Research into the automatic acquisition of lexical information from corpora is starting to produce large-scale computational lexicons containing data on the relative frequencies of subcategorisation alternatives for individual verbal predicates. However, the empirical question of whether this type of frequency information can in practice improve the accuracy of a statistical parser has not yet been answered. In this paper we describe an experiment, using a wide-coverage statistical grammar and parser for English together with subcategorisation frequencies acquired from ten million words of text, which shows that this information can significantly improve parse accuracy.

History

Publication status

Published

File Version

Published version

Journal

Proceedings of the Sixth Workshop on Very Large Corpora

Publisher

Association for Computational Linguistics (ACL)

Publisher URL

https://www.aclweb.org/anthology/W98-1114/

Page range

118-126

Event name

6th Workshop on Very Large Corpora, Montreal, Canada, 1998

Event location

Montreal, Canada

Event type

conference

Event date

15th - 16th August 1998

Department affiliated with

Informatics Publications

Full text available

Yes

Peer reviewed?

Yes

Editors

Eugene Charniak

Legacy Posted Date

2020-06-01

First Open Access (FOA) Date

2023-05-04

First Compliant Deposit (FCD) Date

2020-06-01

Keywords

cmp-lg cs.CL

Licence

CC BY-NC-SA 4.0
