University of Sussex
Browse
coling2016.pdf (181.06 kB)

Using linguistic data for English and Spanish verb-noun combination identification

Download (181.06 kB)
conference contribution
posted on 2023-06-09, 03:29 authored by Uxoa Iñurrieta, Arantza Díaz de Ilarraza, Gorka Labaka, Kepa Sarasola, Itziar Aduriz, John Carroll
We present a linguistic analysis of a set of English and Spanish verb+noun combinations (VNCs), and a method to use this information to improve VNC identification. Firstly, a sample of frequent VNCs are analysed in-depth and tagged along lexico-semantic and morphosyntactic dimensions, obtaining satisfactory inter-annotator agreement scores. Then, a VNC identification experiment is undertaken, where the analysed linguistic data is combined with chunking information and syntactic dependencies. A comparison between the results of the experiment and the results obtained by a basic detection method shows that VNC identification can be greatly improved by using linguistic information, as a large number of additional occurrences are detected with high precision.

History

Publication status

  • Published

File Version

  • Published version

Journal

Proceedings of the 26th International Conference on Computational Linguistics: Technical Papers

Publisher

International Committee on Computational Linguistics (ICCL)

Page range

857-867

Pages

11.0

Event name

COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers

Event location

Osaka, Japan

Event type

conference

Book title

Proceedings of the 26th International Conference on Computational Linguistics (COLING)

ISBN

9784879747020

Department affiliated with

  • Informatics Publications

Research groups affiliated with

  • Data Science Research Group Publications

Full text available

  • Yes

Peer reviewed?

  • Yes

Legacy Posted Date

2016-10-11

First Open Access (FOA) Date

2016-10-11

First Compliant Deposit (FCD) Date

2016-10-11

Usage metrics

    University of Sussex (Publications)

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC