首页> 外军国防科技报告 >Revisiting Generalization for Deep Learning: PAC-Bayes, Flat Minima, and Generative Models

【2h】

Revisiting Generalization for Deep Learning: PAC-Bayes, Flat Minima, and Generative Models

机译：重新审视深度学习的泛化：paC-Bayes，Flat minima和Generative models

代理获取

代理获取并翻译 | 示例

页面导航

摘要
著录项
相关主题

摘要

In this work, we construct generalization bounds to understand existing learning algorithms and propose new ones. Generalization bounds relate empirical performance to future expected performance. The tightness of these bounds vary widely, and depends on the complexity of the learning task and the amount of data available, but also on how much information the bounds take into consideration. We are particularly concerned with data and algorithm- dependent bounds that are quantitatively nonvacuous. We begin with an analysis of stochastic gradient descent (SGD) in supervised learning. By formalizing the notion of flat minima using PAC-Bayes generalization bounds, we obtain nonvacuous generalization bounds for stochastic classifiers based on SGD solutions. Despite strong empirical performance in many settings, SGD rapidly overfits in others. By combining nonvacuous generalization bounds and structural risk minimization, we arrive at an algorithm that trades-off accuracy and generalization guarantees. We also study generalization in the context of unsupervised learning. We propose to use a two sample test statistic for training neural network generator models and bound the gap between the population and the empirical estimate of the statistic.

著录项

作者

展开▼
作者单位

展开▼
年(卷),期 2019(),
年度 2019
页码
总页数 146
原文格式 PDF
正文语种
中图分类
网站名称剑桥大学机构知识库
栏目名称所有文件
关键词
Deep learning; statistical learning theory; Generalization in neural networks; adversarial learning; PAC-Bayesian bounds; generative models; Thesis;
入库时间 2022-08-19 16:59:41

Revisiting Generalization for Deep Learning: PAC-Bayes, Flat Minima, and Generative Models

摘要

著录项

相关主题

期刊订阅