Probing neural language models for understanding of words of estimative probability

Damien Sileo; Marie-Francine Moens

doi:10.18653/v1/2023.starsem-1.41

Communication Dans Un Congrès Année : 2023

Probing neural language models for understanding of words of estimative probability

(1, 2, 3, 4, 5) , (6)

1
2
3
4
5
6

Damien Sileo

Fonction : Auteur
PersonId : 1247304
IdHAL : damien-sileo

Inria Lille - Nord Europe

Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189

Université de Lille

Centrale Lille

Machine Learning in Information Networks

Marie-Francine Moens

Fonction : Auteur

Catholic University of Leuven = Katholieke Universiteit Leuven

Résumé

Words of Estimative Probability (WEP) are phrases used to express the plausibility of a statement. Examples include terms like probably, maybe, likely, doubt, unlikely, and impossible. Surveys have shown that human evaluators tend to agree when assigning numerical probability levels to these WEPs. For instance, the term highly likely equates to a median probability of 0.90±0.08 according to a survey by Fagen-Ulmschneider (2015). In this study, our focus is to gauge the competency of neural language processing models in accurately capturing the consensual probability level associated with each WEP. Our first approach is utilizing the UNLI dataset (Chen et al., 2020), which links premises and hypotheses with their perceived joint probability p. From this, we craft prompts in the form: "[PREMISE]. [WEP], [HYPOTHESIS]." This allows us to evaluate whether language models can predict if the consensual probability level of a WEP aligns closely with p. In our second approach, we develop a dataset based on WEP-focused probabilistic reasoning to assess if language models can logically process WEP compositions. For example, given the prompt "[EVENTA] is likely. [EVENTB] is impossible.", a wellfunctioning language model should not conclude that [EVENTA&B] is likely. Through our study, we observe that both tasks present challenges to out-of-the-box English language models. However, we also demonstrate that fine-tuning these models can lead to significant and transferable improvements.

Domaines

Informatique et langage [cs.CL]

Fichier principal

2023.starsem-1.41.pdf (286.92 Ko)

Origine	Fichiers produits par l'(les) auteur(s)

Damien SILEO : Connectez-vous pour contacter le contributeur

https://hal.science/hal-04290243

Soumis le : jeudi 16 novembre 2023-17:50:43

Dernière modification le : vendredi 31 mai 2024-18:32:03

Dates et versions

hal-04290243 , version 1 (16-11-2023)

Licence

Paternité

Identifiants

HAL Id : hal-04290243 , version 1
DOI : 10.18653/v1/2023.starsem-1.41

Citer

Damien Sileo, Marie-Francine Moens. Probing neural language models for understanding of words of estimative probability. Proceedings of the 12th Joint Conference on Lexical and Computational Semantics (*SEM 2023), Jul 2023, Toronto, France. pp.469-476, ⟨10.18653/v1/2023.starsem-1.41⟩. ⟨hal-04290243⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA CRISTAL INRIA2 CRISTAL-MAGNET UNIV-LILLE

18 Consultations

18 Téléchargements

Probing neural language models for understanding of words of estimative probability

Résumé

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager