Machine Learning: 4.2 Underfitting and Overfitting

Author: Cache_wood | Published: 2022-04-14 11:43

Training error: model error on the training data
Generalization error: model error on new data

| | Training error: Low | Training error: High |
|---|---|---|
| Generalization error: Low | Good | Bug? |
| Generalization error: High | Overfitting | Underfitting |

| | Data complexity: Low | Data complexity: High |
|---|---|---|
| Model complexity: Low | Normal | Underfitting |
| Model complexity: High | Overfitting | Normal |
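
The regimes in the two tables can be reproduced on toy data. The sketch below is only an illustration under stated assumptions: it uses a hand-written k-nearest-neighbour classifier (not a model from these notes), synthetic 1-D data with 20% label noise, and a second sample from the same distribution as the "new data". In k-NN a smaller `k` means a more complex model, so `k=1` should memorise the training set while a very large `k` should behave like a constant predictor.

```python
import random

def make_data(n, rng, noise=0.2):
    """1-D points in [0, 1); true label is 1 when x > 0.5, flipped with probability `noise`."""
    data = []
    for _ in range(n):
        x = rng.random()
        y = int(x > 0.5)
        if rng.random() < noise:
            y = 1 - y
        data.append((x, y))
    return data

def knn_predict(train, x, k):
    """Majority vote among the k training points nearest to x."""
    nearest = sorted(train, key=lambda p: abs(p[0] - x))[:k]
    votes = sum(y for _, y in nearest)
    return int(2 * votes > k)

def error_rate(train, data, k):
    """Fraction of points in `data` misclassified by k-NN fitted on `train`."""
    wrong = sum(knn_predict(train, x, k) != y for x, y in data)
    return wrong / len(data)

rng = random.Random(0)
train = make_data(200, rng)
test = make_data(200, rng)   # stands in for unseen data

results = {}
for k in (1, 15, 199):       # high, medium, low model complexity
    results[k] = (error_rate(train, train, k), error_rate(train, test, k))
    print(f"k={k:3d}  train error={results[k][0]:.2f}  test error={results[k][1]:.2f}")
```

With `k=1` the training error is zero because every training point is its own nearest neighbour, yet the error on the second sample is not, which is the overfitting cell of the first table. With `k=199` both errors stay high, which is the underfitting cell.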

Model Complexity

The capacity of a set of functions to fit data points
In ML, model complexity usually refers to:
- The number of learnable parameters
- The value range for those parameters
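
As a rough illustration of the first point, the number of learnable parameters of a fully connected network can be counted directly from its layer widths. The helper name `mlp_param_count` and the layer sizes are made up for this sketch; it assumes every layer has a weight matrix plus one bias per output unit.

```python
def mlp_param_count(layer_sizes):
    """Weights plus biases of a fully connected network with the given layer widths."""
    return sum((n_in + 1) * n_out
               for n_in, n_out in zip(layer_sizes, layer_sizes[1:]))

print(mlp_param_count([784, 256, 10]))       # 203530
print(mlp_param_count([784, 256, 256, 10]))  # 269322
```

Adding one hidden layer of width 256 adds 65,792 parameters here, so by this measure the second network is the more complex one.
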
It’s hard to compare between different types of ML models
- E.g. trees vs. neural networks
A more precise measure of complexity: VC dimension
- VC dim for classification model:
  the maximum number of examples the model can shatter
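
Shattering can be checked by brute force for a very small model class. The sketch below assumes a class not discussed in the notes, closed intervals on the real line (a point is labelled 1 if it lies inside the interval), and tests whether every possible labelling of a point set can be realised. The function names are hypothetical.

```python
from itertools import product

def interval_can_realize(points, labels):
    """True if some interval [a, b] labels exactly the 1-labelled points as positive."""
    positives = [x for x, y in zip(points, labels) if y == 1]
    if not positives:
        return True  # an interval containing none of the points labels everything 0
    lo, hi = min(positives), max(positives)
    # The tightest interval covering the positives must not capture a negative point.
    return all(not (lo <= x <= hi) for x, y in zip(points, labels) if y == 0)

def is_shattered(points):
    """True if intervals realise all 2**n labelings of the given points."""
    return all(interval_can_realize(points, labels)
               for labels in product([0, 1], repeat=len(points)))

print(is_shattered([0.2, 0.7]))       # True
print(is_shattered([0.1, 0.5, 0.9]))  # False
```

Two points can be shattered, but for three points the labelling (1, 0, 1) is impossible, so the VC dimension of intervals on a line is 2.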

Data Complexity

Multiple factors matter
- # of examples
- # of features in each example
- the separability of the classes
Again, hard to compare among very different data
- E.g. a char vs. a pixel
A more precise measure: Kolmogorov complexity
- Data is simple if it can be generated by a short program
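
Kolmogorov complexity itself is not computable, but the size of a compressed copy gives a crude upper-bound proxy. The sketch below assumes `zlib` as the compressor and compares a highly repetitive byte string with pseudo-random bytes of the same length. Both inputs are invented for the illustration.

```python
import random
import zlib

def compressed_size(data: bytes) -> int:
    """Length in bytes of the zlib-compressed data, a rough proxy for its complexity."""
    return len(zlib.compress(data, 9))

simple = b"ab" * 5000                                       # generated by a very short program
rng = random.Random(0)
noisy = bytes(rng.getrandbits(8) for _ in range(10_000))    # no obvious short description

print(len(simple), compressed_size(simple))
print(len(noisy), compressed_size(noisy))
```

Both inputs are 10,000 bytes long, yet the repetitive one compresses to a tiny fraction of its size while the pseudo-random one barely shrinks.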

Generalization error

Generalization error bound (an informal statement)

|error on unseen data - training error| $\leq \sqrt{\frac{D}{N}(\log(\frac{2N}{D})+1)}$
- D: VC-dim, N: number of training examples
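
To get a feel for the bound, it can be evaluated for a fixed D and growing N. This is a minimal sketch that assumes the natural logarithm (the informal statement does not specify the base). The function name `vc_bound` and the value D = 10 are chosen only for illustration.

```python
import math

def vc_bound(vc_dim, n_examples):
    """Right-hand side of the informal bound: sqrt(D/N * (log(2N/D) + 1))."""
    ratio = vc_dim / n_examples
    return math.sqrt(ratio * (math.log(2 * n_examples / vc_dim) + 1))

for n in (100, 1_000, 10_000, 100_000):
    print(f"N={n:>6}  bound={vc_bound(10, n):.3f}")
```

For fixed D the bound shrinks as N grows, which matches the intuition that more data narrows the gap between training error and error on unseen data.
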
Generalization error also depends on the training algorithm
- Adding regularization can penalize complex models
- Models trained with stochastic gradient methods tend to generalize better
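
One way to see how regularization penalizes complex models is L2 (ridge) regularization on a one-parameter linear model, where the solution has a closed form. The sketch assumes a model y ≈ w·x without intercept and synthetic data. Neither comes from the notes.

```python
import random

def ridge_slope(xs, ys, lam):
    """Minimiser of sum((y - w*x)**2) + lam * w**2 for a 1-D model without intercept."""
    sxy = sum(x * y for x, y in zip(xs, ys))
    sxx = sum(x * x for x in xs)
    return sxy / (sxx + lam)

rng = random.Random(0)
xs = [rng.uniform(-1, 1) for _ in range(50)]
ys = [2.0 * x + rng.gauss(0, 0.5) for x in xs]   # true slope is 2

slopes = {lam: ridge_slope(xs, ys, lam) for lam in (0.0, 1.0, 10.0, 100.0)}
for lam, w in slopes.items():
    print(f"lambda={lam:6.1f}  w={w:.3f}")
```

A larger penalty pulls the weight toward zero, which restricts the value range of the parameter and so lowers the effective model complexity.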

Model Selection

Pick a model with a proper complexity for your data
- Minimize the generalization error
- Also consider business metrics
Pick a model family, then select proper hyper-parameters
- Trees: #trees, maximum depth
- Neural networks: architecture, depth (#layers), width (#hidden units), regularizations
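
Hyper-parameter selection is often done by comparing candidates on held-out validation data. The sketch below is a toy stand-in rather than a real tree learner: it assumes a piecewise-constant regressor with `2**depth` equal-width bins (loosely mimicking a tree of that depth), synthetic sine data, and a single train/validation split. All names are hypothetical.

```python
import math
import random

def make_data(n, rng):
    """Noisy samples of a sine wave on [0, 1)."""
    data = []
    for _ in range(n):
        x = rng.random()
        data.append((x, math.sin(2 * math.pi * x) + rng.gauss(0, 0.3)))
    return data

def fit_bins(train, depth):
    """Fit 2**depth equal-width bins; each bin predicts the mean target of its points."""
    n_bins = 2 ** depth
    global_mean = sum(y for _, y in train) / len(train)
    sums, counts = [0.0] * n_bins, [0] * n_bins
    for x, y in train:
        b = min(int(x * n_bins), n_bins - 1)
        sums[b] += y
        counts[b] += 1
    # Bins without training points fall back to the global mean.
    return [s / c if c else global_mean for s, c in zip(sums, counts)]

def mse(model, data):
    """Mean squared error of the binned model on `data`."""
    n_bins = len(model)
    return sum((model[min(int(x * n_bins), n_bins - 1)] - y) ** 2
               for x, y in data) / len(data)

rng = random.Random(0)
train, valid = make_data(100, rng), make_data(100, rng)

depths = range(0, 9)
val_errors = {d: mse(fit_bins(train, d), valid) for d in depths}
best_depth = min(val_errors, key=val_errors.get)

for d in depths:
    print(f"depth={d}  validation MSE={val_errors[d]:.3f}")
print("selected depth:", best_depth)
```

The validation error here serves as an estimate of the generalization error. In practice a separate test set would still be needed to report the final number, since the selected depth has been tuned on the validation data.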

Summary

We care about generalization error
Model complexity: the ability to fit various functions
Data complexity: the richness of information
Model selection: match model and data complexities

Article link: https://www.haomeiwen.com/subject/hieosrtx.html
