Skip to Main content Skip to Navigation
Preprints, Working Papers, ...

A White Box Analysis of ColBERT

Abstract : Transformer-based models are nowadays state-of-the-art in ad-hoc Information Retrieval, but their behavior is far from being understood. Recent work has claimed that BERT does not satisfy the classical IR axioms. However, we propose to dissect the matching process of ColBERT, through the analysis of term importance and exact/soft matching patterns. Even if the traditional axioms are not formally verified, our analysis reveals that ColBERT: (i) is able to capture a notion of term importance; (ii) relies on exact matches for important terms.
Complete list of metadata

https://hal.sorbonne-universite.fr/hal-03084279
Contributor : Benjamin Piwowarski <>
Submitted on : Monday, December 21, 2020 - 7:39:14 AM
Last modification on : Wednesday, December 23, 2020 - 3:37:47 AM

Links full text

Identifiers

  • HAL Id : hal-03084279, version 1
  • ARXIV : 2012.09650

Collections

Citation

Thibault Formal, Benjamin Piwowarski, Stéphane Clinchant. A White Box Analysis of ColBERT. 2020. ⟨hal-03084279⟩

Share

Metrics

Record views

18