University of Sussex
Browse
2022.coling-1.361.pdf (786.97 kB)

Towards structure-aware paraphrase identification with phrase alignment using sentence encoders

Download (786.97 kB)
conference contribution
posted on 2023-06-10, 04:51 authored by Qiwei Peng, David WeirDavid Weir, Julie WeedsJulie Weeds
Previous works have demonstrated the effectiveness of utilising pre-trained sentence encoders based on their sentence representations for meaning comparison tasks. Though such representations are shown to capture hidden syntax structures, the direct similarity comparison between them exhibits weak sensitivity to word order and structural differences in given sentences. A single similarity score further makes the comparison process hard to interpret. Therefore, we here propose to combine sentence encoders with an alignment component by representing each sentence as a list of predicate-argument spans (where their span representations are derived from sentence encoders), and decomposing the sentence-level meaning comparison into the alignment between their spans for paraphrase identification tasks. Empirical results show that the alignment component brings in both improved performance and interpretability for various sentence encoders. After closer investigation, the proposed approach indicates increased sensitivity to structural difference and enhanced ability to distinguish non-paraphrases with high lexical overlap.

History

Publication status

  • Published

File Version

  • Published version

Journal

29th International Conference on Computational Linguistics

Publisher

International Committee on Computational Linguistics

Page range

4113-4123

Event name

29th International Conference on Computational Linguistics

Event location

Korea

Event type

conference

Event date

October 12-17, 2022

Series

COLING'2022

Department affiliated with

  • Informatics Publications

Full text available

  • Yes

Peer reviewed?

  • Yes

Legacy Posted Date

2022-09-27

First Open Access (FOA) Date

2022-10-19

First Compliant Deposit (FCD) Date

2022-09-27

Usage metrics

    University of Sussex (Publications)

    Categories

    No categories selected

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC