University of Sussex


The BNC parsed with RASP4UIMA

presentation
Posted on 2023-06-08, 11:22; authored by O Andersen, J Nioche, E Briscoe, John Carroll
We have integrated the RASP system with the UIMA framework (RASP4UIMA) and used it to parse the XML-encoded version of the British National Corpus (BNC). All original annotation is preserved, and parsing information, mainly in the form of grammatical relations, is added in an XML format. We briefly discuss a few adaptations of the system that give better results on the BNC. The RASP4UIMA system is publicly available and can be used to parse other corpora or document collections, and the final parsed version of the BNC will be deposited with the Oxford Text Archive.

History

Publication status

  • Published

Page range

865-869

Presentation Type

  • paper

Event name

Proceedings of the Sixth Language Resources and Evaluation Conference (LREC)

Event location

Marrakech, Morocco.

Event type

conference

Department affiliated with

  • Informatics Publications

Full text available

  • No

Peer reviewed?

  • Yes

Legacy Posted Date

2012-04-30
