Probing transfer learning with a model of synthetic correlated datasets

Gerace, Federica and Saglietti, Luca and Sarao Mannelli, Stefano and Saxe, Andrew and Zdeborová, Lenka (2022) Probing transfer learning with a model of synthetic correlated datasets. Machine Learning: Science and Technology, 3 (1). 015030. ISSN 2632-2153

[thumbnail of Gerace_2022_Mach._Learn.__Sci._Technol._3_015030.pdf] Text
Gerace_2022_Mach._Learn.__Sci._Technol._3_015030.pdf - Published Version

Download (1MB)

Abstract

Transfer learning can significantly improve the sample efficiency of neural networks, by exploiting the relatedness between a data-scarce target task and a data-abundant source task. Despite years of successful applications, transfer learning practice often relies on ad-hoc solutions, while theoretical understanding of these procedures is still limited. In the present work, we re-think a solvable model of synthetic data as a framework for modeling correlation between data-sets. This setup allows for an analytic characterization of the generalization performance obtained when transferring the learned feature map from the source to the target task. Focusing on the problem of training two-layer networks in a binary classification setting, we show that our model can capture a range of salient features of transfer learning with real data. Moreover, by exploiting parametric control over the correlation between the two data-sets, we systematically investigate under which conditions the transfer of features is beneficial for generalization.

Item Type: Article
Subjects: Digital Academic Press > Multidisciplinary
Depositing User: Unnamed user with email support@digiacademicpress.org
Date Deposited: 11 Jul 2023 04:38
Last Modified: 24 Sep 2025 03:55
URI: http://core.ms4sub.com/id/eprint/1666

Actions (login required)

View Item
View Item