02 Overfitting and Underfitting

Overfitting and Underfitting

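Model Capacity

Model capacity is a machine learning model's ability to fit functions of varying complexity. It is usually tied to the model's complexity and reflects the size of the function space the model can represent.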

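Simple Models (Low Capacity)

A straight line: suppose we fit a set of data points with a straight line. This is a very simple model. It represents linear relationships well, but if the data is not strictly linear, the line may fail to capture the trend. For example, when the points form a U shape, no straight line fits them well.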

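Complex Models (High Capacity)

A curve: now suppose we are allowed to fit the points with a curve, such as the quadratic $y = ax^2 + bx + c$. This is a slightly more complex model, and it can fit nonlinear relationships such as U-shaped data. Allowing higher-degree polynomials, such as cubic or quartic ones, makes the model more complex still and lets it fit more complicated relationships.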
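
The contrast between the straight line and the quadratic can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the noise-free U-shaped data set and the helper names `fit_polynomial`, `predict`, and `mse` are hypothetical, and the fit solves the least-squares normal equations with the standard library alone.

```python
def fit_polynomial(xs, ys, degree):
    """Least-squares polynomial fit via the normal equations.

    Returns coefficients [c0, c1, ..., c_degree] of c0 + c1*x + ... (hypothetical helper).
    """
    n = degree + 1
    # Normal equations A c = b with A[i][j] = sum(x^(i+j)) and b[i] = sum(y * x^i).
    a = [[sum(x ** (i + j) for x in xs) for j in range(n)] for i in range(n)]
    b = [sum(y * x ** i for x, y in zip(xs, ys)) for i in range(n)]
    # Gaussian elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        b[col], b[pivot] = b[pivot], b[col]
        for row in range(col + 1, n):
            factor = a[row][col] / a[col][col]
            for k in range(col, n):
                a[row][k] -= factor * a[col][k]
            b[row] -= factor * b[col]
    # Back substitution.
    coeffs = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(a[i][j] * coeffs[j] for j in range(i + 1, n))
        coeffs[i] = (b[i] - tail) / a[i][i]
    return coeffs


def predict(coeffs, x):
    """Evaluate the polynomial with the given coefficients at x."""
    return sum(c * x ** k for k, c in enumerate(coeffs))


def mse(coeffs, xs, ys):
    """Mean squared error of the polynomial on the points (xs, ys)."""
    return sum((predict(coeffs, x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


# Assumed toy data: a noise-free U shape, y = x^2.
xs = [-3, -2, -1, 0, 1, 2, 3]
ys = [x * x for x in xs]

line = fit_polynomial(xs, ys, 1)  # low capacity: straight line
quad = fit_polynomial(xs, ys, 2)  # higher capacity: quadratic

print("line MSE:", mse(line, xs, ys))
print("quadratic MSE:", mse(quad, xs, ys))
```

On this symmetric data the best straight line is flat, so its error stays large, while the quadratic has enough capacity to follow the U shape.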

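The Effect of Model Capacity

A model that can memorize all of the training data is not necessarily a good one: the data may contain a lot of noise, and paying too much attention to irrelevant details hurts generalization.
The core idea in deep learning: first make the model large enough, then use various techniques to control its capacity so that the generalization error ultimately goes down.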
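
The point about memorizing noisy data can be illustrated with a small sketch. The assumptions here are all hypothetical: the true relationship is taken to be y = x, the training labels carry a fixed alternating noise of ±0.5 so the run is deterministic, and the "memorizing" model is the polynomial that passes exactly through every training point (Lagrange interpolation). It is compared with a plain least-squares line.

```python
def lagrange_predict(train_x, train_y, x):
    """Evaluate the unique degree-(n-1) polynomial through all n training points."""
    total = 0.0
    for i, (xi, yi) in enumerate(zip(train_x, train_y)):
        term = yi
        for j, xj in enumerate(train_x):
            if j != i:
                term *= (x - xj) / (xi - xj)
        total += term
    return total


def fit_line(xs, ys):
    """Closed-form least-squares line; returns (slope, intercept)."""
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def mse(model, xs, ys):
    """Mean squared error of a callable model on the points (xs, ys)."""
    return sum((model(x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


# Assumed toy data: y = x plus alternating +0.5 / -0.5 noise on the training labels.
train_x = [float(i) for i in range(8)]
train_y = [x + (0.5 if i % 2 == 0 else -0.5) for i, x in enumerate(train_x)]
# Held-out points lie halfway between training points and are noise-free.
test_x = [x + 0.5 for x in train_x[:-1]]
test_y = list(test_x)

slope, intercept = fit_line(train_x, train_y)


def memorizer(x):
    """High-capacity model: reproduces every training label exactly."""
    return lagrange_predict(train_x, train_y, x)


def line(x):
    """Low-capacity model: a straight line."""
    return slope * x + intercept


for name, model in [("interpolating polynomial", memorizer), ("straight line", line)]:
    print(f"{name}: train MSE = {mse(model, train_x, train_y):.3f}, "
          f"test MSE = {mse(model, test_x, test_y):.3f}")
```

The interpolating polynomial reaches zero training error because it reproduces the noise, yet it does worse than the line on the held-out points. The line cannot memorize the noise, so its training error stays above zero while its test error is smaller.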

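Data Complexity

Several factors matter:
- Number of samples (animal species)
- Number of elements per sample (number of each animal)
- Temporal and spatial structure (video)
- Diversity (classification categories)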

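Summary

Model capacity needs to match data complexity; otherwise the result may be underfitting or overfitting.
Statistical machine learning provides mathematical tools for measuring model complexity.
In practice, we usually rely on observing the training error and the validation error.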
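
One way to picture the last point is a sweep over model capacity that records both errors. The following is a minimal sketch, not a prescribed procedure: the data (a U shape with small fixed noise), the polynomial degrees tried, and the helper names `fit_polynomial`, `predict`, and `mse` are all assumptions made for illustration.

```python
def fit_polynomial(xs, ys, degree):
    """Least-squares polynomial fit via the normal equations (hypothetical helper)."""
    n = degree + 1
    a = [[sum(x ** (i + j) for x in xs) for j in range(n)] for i in range(n)]
    b = [sum(y * x ** i for x, y in zip(xs, ys)) for i in range(n)]
    # Gaussian elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        b[col], b[pivot] = b[pivot], b[col]
        for row in range(col + 1, n):
            factor = a[row][col] / a[col][col]
            for k in range(col, n):
                a[row][k] -= factor * a[col][k]
            b[row] -= factor * b[col]
    # Back substitution.
    coeffs = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(a[i][j] * coeffs[j] for j in range(i + 1, n))
        coeffs[i] = (b[i] - tail) / a[i][i]
    return coeffs


def predict(coeffs, x):
    """Evaluate the polynomial with the given coefficients at x."""
    return sum(c * x ** k for k, c in enumerate(coeffs))


def mse(coeffs, xs, ys):
    """Mean squared error of the polynomial on the points (xs, ys)."""
    return sum((predict(coeffs, x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


# Assumed toy data: y = x^2 plus small fixed noise, split into train and validation.
train_x = [-1 + 0.25 * i for i in range(9)]
val_x = [x + 0.125 for x in train_x[:-1]]
noise_train = [0.05, -0.03, 0.04, -0.05, 0.02, -0.04, 0.03, -0.02, 0.05]
noise_val = [-0.04, 0.05, -0.02, 0.03, -0.05, 0.04, -0.03, 0.02]
train_y = [x * x + e for x, e in zip(train_x, noise_train)]
val_y = [x * x + e for x, e in zip(val_x, noise_val)]

# Record (training error, validation error) for each capacity level.
results = {}
for degree in range(1, 7):
    coeffs = fit_polynomial(train_x, train_y, degree)
    results[degree] = (mse(coeffs, train_x, train_y), mse(coeffs, val_x, val_y))

for degree, (train_err, val_err) in results.items():
    print(f"degree {degree}: train MSE = {train_err:.5f}, val MSE = {val_err:.5f}")

# Pick the capacity with the lowest validation error.
best_degree = min(results, key=lambda d: results[d][1])
print("lowest validation error at degree", best_degree)
```

Reading the table: when both errors are high (degree 1 here), the model is underfitting. Training error can only shrink as the degree grows, so a training error that keeps dropping while the validation error stops improving is the usual sign of overfitting.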