W96-0209.pdf (870.51 kB)
Apportioning development effort in a probabilistic LR parsing system through evaluation
conference contribution
posted on 2023-06-10, 00:33 authored by John Carroll, Ted BriscoeWe describe an implemented system for robust domain-independent syntactic parsing of English, using a unification-based grammar of part-of speech and punctuation labels coupled with a probabilistic LR parser. We present evaluations of the system's performance along several different dimensions; these enable us to assess the contribution that each individual part is making to the success of the system as a whole, and thus prioritise the effort to be devoted to its further enhancement. Currently, the system is able to parse around 80% of sentences in a substantial corpus of general text containing a number of distinct genres. On a random sample of 250 such sentences the system has a mean crossing bracket rate of 0.71 and recall and precision of 83% and 84~0 respectively when evaluated against manually-disambiguated analyses.
History
Publication status
- Published
File Version
- Published version
Journal
Conference on Empirical Methods in Natural Language ProcessingPublisher
ACL AnthologyPublisher URL
Page range
92-100Event name
Conference on Empirical Methods in Natural Language ProcessingEvent location
Philadelphia, Pa. USAEvent type
conferenceEvent date
17-18 May 1996Department affiliated with
- Informatics Publications
Full text available
- Yes
Peer reviewed?
- Yes
Legacy Posted Date
2021-08-05First Open Access (FOA) Date
2021-08-05First Compliant Deposit (FCD) Date
2021-08-05Usage metrics
Categories
No categories selectedKeywords
Licence
Exports
RefWorks
BibTeX
Ref. manager
Endnote
DataCite
NLM
DC