2022.coling-1.359.pdf (3.06 MB)
Testing large language models on compositionality and inference with phrase-level adjective-noun entailment
conference contribution
posted on 2023-06-10, 04:54 authored by Lorenzo Scott Bertolini, Julie Weeds, David Weir

Previous work has demonstrated that pre-trained large language models (LLMs) acquire knowledge during pre-training which enables reasoning over relationships between words (e.g., hyponymy) and more complex inferences over larger units of meaning such as sentences. Here, we investigate whether lexical entailment (LE, i.e. hyponymy, or the "is a" relation between words) can be generalised in a compositional manner. Accordingly, we introduce PLANE (Phrase-Level Adjective-Noun Entailment), a new benchmark to test models on fine-grained compositional entailment using adjective-noun phrases. Our experiments show that knowledge extracted via in-context and transfer learning is not enough to solve PLANE. However, an LLM trained on PLANE can generalise well to out-of-distribution sets, since the required knowledge can be stored in the representations of subword (SW) tokens.
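The compositional phenomenon the benchmark targets can be illustrated with a minimal sketch. The adjective classes and the `phrase_entails_noun` helper below are illustrative assumptions, not PLANE's actual labelling scheme or data: intersective and subsective adjectives preserve the "is a" relation ("a red car is a car"), while privative adjectives break it ("a fake gun is not a gun").

```python
# Illustrative adjective classes (assumed for this sketch, not taken from PLANE).
ADJ_CLASS = {
    "red": "intersective",     # a red car is a car
    "small": "subsective",     # a small elephant is an elephant
    "fake": "privative",       # a fake gun is not a gun
    "former": "privative",     # a former senator is not a senator
}

def phrase_entails_noun(adjective: str, noun: str) -> bool:
    """Return True if '<adjective> <noun>' entails '<noun>' under a
    simplified class-based rule: privative adjectives cancel entailment."""
    return ADJ_CLASS.get(adjective, "intersective") != "privative"

print(phrase_entails_noun("red", "car"))   # True
print(phrase_entails_noun("fake", "gun"))  # False
```

A model that has only learned word-level hyponymy (e.g. "car is a vehicle") cannot solve such cases without composing the adjective's semantics with the noun, which is exactly what the benchmark probes.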
History
Publication status
- Published
File Version
- Published version
Journal
Journal
Proceedings of the 29th International Conference on Computational Linguistics
Publisher
International Committee on Computational Linguistics
Publisher URL
Page range
4084-4100
Event name
The 29th International Conference on Computational Linguistics (COLING)
Event location
Gyeongju, Republic of Korea
Event type
conference
Event date
October 12-17, 2022
Place of publication
Gyeongju, Republic of Korea
Series
COLING'2022
Department affiliated with
- Informatics Publications
Full text available
- Yes
Peer reviewed?
- Yes
Legacy Posted Date
2022-09-29
First Open Access (FOA) Date
2022-10-19
First Compliant Deposit (FCD) Date
2022-09-29
Categories
No categories selected
Keywords
Licence