University of Sussex
Browse

File(s) not publicly available

English for the computer: SUSANNE corpus and analytic scheme

book
posted on 2023-06-07, 14:14 authored by Geoffrey Richard Sampson
Computer processing of natural language is a burgeoning field, but until now there has been no agreement on a standardized classification of the diverse structural elements that occur in real-life language material. This book attempts to define a "Linnaean taxonomy" for the English language: an annotation scheme, the SUSANNE scheme, which yields a labelled constituency structure for any string of English, comprehensively identifying all of its surface and logical structural properties. The structure is specified with sufficient rigour that analysts working independently must produce identical annotations for a given example. The scheme is based on large sample of real-life use of British and American written and spoken English. The book also describes the SUSANNE electronic corpus of English which is annotated in accordance with the scheme. It is freely available as a research resource to anyone working at a computer conected to Internet, and since 1992 has come into widespread use in academic and commerical research environments on four continents.

History

Publication status

  • Published

Publisher

Oxford University Press

Pages

512.0

Place of publication

USA

ISBN

0198240236

Department affiliated with

  • Informatics Publications

Full text available

  • No

Peer reviewed?

  • Yes

Legacy Posted Date

2008-02-29

Usage metrics

    University of Sussex (Publications)

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC