On the Minimal Degree Bias in Generalization on the Unseen for non-Boolean Functions - Inria EPFL
Preprints, Working Papers, ... Year: 2024


Abstract

We investigate the out-of-domain generalization of random feature (RF) models and Transformers. We first prove that in the 'generalization on the unseen' (GOTU) setting, where training data is fully seen in some part of the domain but testing is performed on another part, and for RF models in the small feature regime, convergence takes place to interpolators of minimal degree, as in the Boolean case (Abbe et al., 2023). We then consider the sparse target regime and explain how it relates to the small feature regime, but with a different regularization term that can alter the picture in the non-Boolean case. We show two different outcomes for the sparse regime with q-ary data tokens: (1) if the data is embedded with roots of unity, then a min-degree interpolator is learned, as in the Boolean case for RF models; (2) if the data is not embedded as such, e.g., simply as integers, then RF models and Transformers may not learn minimal degree interpolators. This shows that the Boolean setting and its roots-of-unity generalization are special cases in which the minimal degree interpolator offers a rare characterization of how learning takes place. For more general integer- and real-valued settings, a more nuanced picture remains to be fully characterized.
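A minimal sketch (not from the paper; function names are illustrative) of the embedding distinction the abstract draws: mapping a q-ary token k to the complex root of unity exp(2πik/q) makes the monomials z^0, ..., z^(q-1) an orthonormal character basis under the uniform distribution on tokens, whereas the plain integer embedding yields non-orthogonal monomials. This orthogonality is what lets degree play the role it does in the Boolean/roots-of-unity analysis.

```python
import numpy as np

def roots_of_unity_embedding(tokens, q):
    """Map q-ary tokens k in {0, ..., q-1} to exp(2*pi*i*k/q) on the unit circle."""
    return np.exp(2j * np.pi * np.asarray(tokens) / q)

def integer_embedding(tokens):
    """Embed the same tokens simply as real integers."""
    return np.asarray(tokens, dtype=float)

q = 5
tokens = np.arange(q)

# Roots-of-unity embedding: the Gram matrix of the monomials z^a under the
# uniform measure on tokens is the identity (an orthonormal character basis).
z = roots_of_unity_embedding(tokens, q)
G = np.array([[np.mean(z**a * np.conj(z**b)) for b in range(q)] for a in range(q)])
print(np.allclose(G, np.eye(q)))  # → True

# Integer embedding: the monomials 1, x, ..., x^(q-1) are far from orthogonal,
# so minimal monomial degree no longer has the same distinguished meaning.
x = integer_embedding(tokens)
H = np.array([[np.mean(x**a * x**b) for b in range(q)] for a in range(q)])
print(np.allclose(H, np.eye(q)))  # → False
```

For a ≠ b the entry G[a, b] averages the q-th roots of unity exp(2πi(a-b)k/q) over k, which sums to zero; that cancellation is exactly what fails for the integer embedding.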
Main file: ICML_2024_camera_ready-1.pdf (653.04 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-04619375, version 1 (20-06-2024)

Identifiers

  • HAL Id: hal-04619375, version 1

Cite

Denys Pushkin, Raphaël Berthier, Emmanuel Abbe. On the Minimal Degree Bias in Generalization on the Unseen for non-Boolean Functions. 2024. ⟨hal-04619375⟩