当前位置：首页 > news >正文

机器学习 - 提高模型 (代码）

news 2026/2/10 19:28:06

如果模型出现了 underfitting 问题，就得提高模型了。

Model improvement technique	What does it do?
Add more layers	Each layer potentially increases the learning capabilities of the model with each layer being able to learn some kind of new pattern in the data, more layers is often referred to as making your neural network deeper.
Add more hidden units	More hidden units per layer means a potential increase in learning capabilities of the model, more hidden units is often referred to as making your neural network wider.
Fitting for longer (more epochs)	Your model might learn more if it had more opportunities to look at the data.
Changing the activation functions	Some data just can’t be fit with only straight lines, using non-linear activation functions can help with this.
Change the learning rate	Less model specific, but still related, the learning rate of the optimizer decides how much a model should change its parameter each step, too much and the model overcorrects, too little and it doesn’t learn enough.
Change the loss function	Less model specific but still important, different problems require different loss functions. For example, a binary cross entropy loss function won’t work with a multi-class classification problem.
Use transfer learning	Take a pretrained model from a problem domain similar to yours and adjust it to your own problem.

举个例子，代码如下：

class CircleModelV1(nn.Module):def __init__(self):super().__init__()self.layer_1 = nn.Linear(in_features = 2, out_features = 10)self.layer_2 = nn.Linear(in_features = 10, out_features = 10)self.layer_3 = nn.Linear(in_features = 10, out_features = 1)def forward(self, x):return self.layer_3(self.layer_2(self.layer_1(x)))model_1 = CircleModelV1().to("cpu")
print(model_1)loss_fn = nn.BCEWithLogitsLoss()
optimizer = torch.optim.SGD(model_1.parameters(), lr=0.1)torch.manual_seed(42)epochs = 1000X_train, y_train = X_train.to("cpu"), y_train.to("cpu")
X_test, y_test = X_test.to("cpu"), y_test.to("cpu")for epoch in range(epochs):### Training# 1. Forward pass y_logits = model_1(X_train).squeeze()y_pred = torch.round(torch.sigmoid(y_logits))  # logits -> probabilities -> prediction labels # 2. Calculate loss/accuracy loss = loss_fn(y_logits, y_train)acc = accuracy_fn(y_true = y_train, y_pred = y_pred)# 3. Optimizer zero grad optimizer.zero_grad()# 4. Loss backwards loss.backward()# 5. Optimizer step optimizer.step() ### Testing model_1.eval()with torch.inference_mode():# 1. Forward pass test_logits = model_1(X_test).squeeze()test_pred = torch.round(torch.sigmoid(test_logits))# 2. Calculate loss/accuracy test_loss = loss_fn(test_logits, y_test)test_acc = accuracy_fn(y_true = y_test, y_pred = test_pred)if epoch % 100 == 0:print(f"Epoch: {epoch} | Loss: {loss:.5f}, Accuracy: {acc:.2f}%")# 结果如下
CircleModelV1((layer_1): Linear(in_features=2, out_features=10, bias=True)(layer_2): Linear(in_features=10, out_features=10, bias=True)(layer_3): Linear(in_features=10, out_features=1, bias=True)
)
Epoch: 0 | Loss: 0.69528, Accuracy: 51.38%
Epoch: 100 | Loss: 0.69325, Accuracy: 47.88%
Epoch: 200 | Loss: 0.69309, Accuracy: 49.88%
Epoch: 300 | Loss: 0.69303, Accuracy: 50.50%
Epoch: 400 | Loss: 0.69300, Accuracy: 51.38%
Epoch: 500 | Loss: 0.69299, Accuracy: 51.12%
Epoch: 600 | Loss: 0.69298, Accuracy: 51.50%
Epoch: 700 | Loss: 0.69298, Accuracy: 51.38%
Epoch: 800 | Loss: 0.69298, Accuracy: 51.50%
Epoch: 900 | Loss: 0.69298, Accuracy: 51.38%

都看到这了，点个赞呗~

机器学习 - 提高模型 (代码）

如果模型出现了 underfitting 问题，就得提高模型了。 Model improvement techniqueWhat does it do?Add more layersEach layer potentially increases the learning capabilities of the model with each layer being able to learn some kind of new pattern in…...

编程日记 2024/3/30 13:33:50

数值代数及方程数值解：预备知识——二进制及浮点数

文章目录二进制IEEE浮点数本篇文章的前置知识：数学分析二进制命题：二进制转化为十进制二进制的数字表示为 ⋯ b 2 b 1 b 0 . b − 1 b − 2 ⋯ \cdots b_2b_1b_0.b_{-1}b_{-2}\cdots ⋯b2b1b0.b−1b−2⋯这等价于十进制下的 ⋯ b 2 2 …...

编程日记 2024/3/30 13:30:47

新数字时代的启示：揭开Web3的秘密之路

在当今数字时代，随着区块链技术的不断发展，Web3作为下一代互联网的概念正逐渐引起人们的关注和探索。本文将深入探讨新数字时代的启示，揭开Web3的神秘之路，并探讨其在未来的发展前景。 1. Web3的定义与特点 Web3是对互联网未来发…...

编程日记 2024/3/30 13:28:46

算法——动态规划：01背包

原始01背包见下面这篇文章：http://t.csdnimg.cn/a1kCL 01背包的变种：. - 力扣（LeetCode） 给你一个只包含正整数的非空数组 nums 。请你判断是否可以将这个数组分割成两个子集，使得两个子集的元素和相等。简化一…...

编程日记 2024/3/30 13:25:43

写作类AI推荐（二）

本章要介绍的写作AI如下： 火山写作主要功能： AI智能创作：告诉 AI 你想写什么，立即生成你理想中的文章AI智能改写：选中段落句子，可提升表达、修改语气、扩写、总结、缩写等文章内容优化：根据全文…...

编程日记 2024/3/30 13:24:41

分寝室（20分）（JAVA）

目录题目描述输入格式： 输出格式： 输入样例 1： 输出样例 1： 输入样例 2： 输出样例 2： 题解： 题目描述学校新建了宿舍楼，共有 n 间寝室。等待分配的学生中，有女…...

编程日记 2024/3/30 13:23:40

Spring 源码调试问题 ( List.of(“bin“, “build“, “out“)； )

Spring 源码调试问题文章目录 Spring 源码调试问题一、问题描述二、解决方案一、问题描述错误：springframework\buildSrc\src\main\java\org\springframework\build\CheckstyleConventions.java:68: 错误: 找不到符号 List<String> buildFolders List.of…...

编程日记 2024/3/30 13:21:38

Centos7安装RTL8111网卡驱动

方法一： // 安装pciutils # yum install -y pciutils // 查看pci设备信息 # lspci | grep -i Ethernet 09:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168/8411 PCI Express Gigabit Ethernet Controller (rev 03) // 上面看到是Re…...

编程日记 2024/3/30 13:20:37

吉时利KEITHLEY2460数字源表

181/2461/8938产品概述： Keithley 2460 高电流源表源测量单元 (SMU) 将先进的触摸、测试和发明技术带到您的指尖。Keithley 2460 将创新的图形用户界面 (GUI) 与电容式触摸屏技术相结合，使测试变得直观并最大限度地缩短学习曲线，从而帮助工程…...

编程日记 2024/3/30 13:19:36

数据库原理(含思维导图)

数据库原理笔记，html与md笔记已上传 1.绪论发展历程记住数据怎么保存，谁保存数据，共享性如何，独立性如何人工管理阶段数据不保存应用程序管理数据数据不共享数据不具有独立性文件系统阶段数据可以长期保存文件系统管…...

编程日记 2024/3/30 13:15:32

数据结构(六)——图

六、图 6.1 图的基本概念图的定义图：图G由顶点集V和边集E组成，记为G (V, E)，其中V(G)表示图G中顶点的有限非空集；E(G) 表示图G中顶点之间的关系（边）集合。若V {v1, v2, … , vn}，则用|V|…...

编程日记 2024/3/30 13:13:30

Android-AR眼镜屏幕显示

Android-AR眼镜前提：Android手持设备需要具备DP高清口 1、创建Presentation（双屏异显） public class MyPresentation extends Presentation {private PreviewSingleBinding binding;private ScanActivity activity;public MyPresentatio…...

编程日记 2024/3/30 13:11:28

蓝桥集训之货币系统

蓝桥集训之货币系统核心思想：背包 #include <iostream>#include <cstring>#include <algorithm>using namespace std;const int N 30,M 10010;typedef long long LL;LL f[M];int w[N];int n,m;int main(){cin>>n>>m;for(int i1;i&…...

编程日记 2024/3/30 13:10:27

基于微信小程序的校园服务平台设计与实现（程序+论文）

本文以校园服务平台为研究对象，首先分析了当前校园服务平台的研究现状，阐述了本系统设计的意义和背景，运用微信小程序开发工具和云开发技术，研究和设计了一个校园服务平台，以满足学生在校园生活中的多样化需求。通过引…...

编程日记 2024/3/30 13:08:25

QT+Opencv+yolov5实现监测

功能说明：使用QTOpencvyolov5实现监测仓库链接：https://gitee.com/wangyoujie11/qt_yolov5.git git本仓库到本地一、环境配置 1.opencv配置将OpenCV-MinGW-Build-OpenCV-4.5.2-x64文件夹放在自己的一个目录下，如我的路径： …...

编程日记 2024/3/30 13:07:24

【Python-Docx库】Word与Python的完美结合

【Python-Docx库】Word与Python的完美结合今天给大家分享Python处理Word的第三方库：Python-Docx。什么是Python-Docx？ Python-Docx是用于创建和更新Microsoft Word（.docx）文件的Python库。日常需要经常处理Word文档&#xf…...

编程日记 2024/3/30 13:06:23

吴恩达深度学习笔记：浅层神经网络(Shallow neural networks)3.6-3.8

目录第一门课：神经网络和深度学习 (Neural Networks and Deep Learning)第三周：浅层神经网络(Shallow neural networks)3.6 激活函数（Activation functions）3.7 为什么需要非线性激活函数？（why need a non…...

编程日记 2024/3/30 13:02:19

盘点最适合做剧场版的国漫，最后一部有望成为巅峰

最近《完美世界》动画官宣首部剧场版，主要讲述石昊和火灵儿的故事。这个消息一出，引发了很多漫迷的讨论，其实现在已经有好几部国漫做过剧场版，还有是观众一致希望未来会出剧场版的。那么究竟是哪些国漫呢，下面就一起来…...

编程日记 2024/3/30 13:00:16

Altium Designer许可需求分析

在电子设计的世界中，Altium Designer已成为设计师们的得力助手。然而，如何进行有效的许可需求分析，以确保软件的高效使用和企业的可持续发展？本文将带您了解如何进行Altium Designer的许可需求分析，让您在设计的道路上…...

编程日记 2024/3/30 12:58:13

[c++]类和对象常见题目详解

本专栏内容为：C学习专栏，分为初阶和进阶两部分。通过本专栏的深入学习，你可以了解并掌握C。 💓博主csdn个人主页：小小unicorn ⏩专栏分类：C 🚚代码仓库：小小unicorn的代码仓库&…...

编程日记 2024/3/30 12:51:07

Vim 调用外部命令学习笔记

Vim 外部命令集成完全指南文章目录 Vim 外部命令集成完全指南核心概念理解命令语法解析语法对比常用外部命令详解文本排序与去重文本筛选与搜索高级 grep 搜索技巧文本替换与编辑字符处理高级文本处理编程语言处理其他实用命令范围操作示例指定行范围处理复合命令示例实用技…...

编程新知 2025/11/16 8:24:16

如何在看板中体现优先级变化

在看板中有效体现优先级变化的关键措施包括：采用颜色或标签标识优先级、设置任务排序规则、使用独立的优先级列或泳道、结合自动化规则同步优先级变化、建立定期的优先级审查流程。其中，设置任务排序规则尤其重要，因为它让看板视觉上直观地体…...

编程新知 2026/1/23 12:42:28

iPhone密码忘记了办？iPhoneUnlocker，iPhone解锁工具Aiseesoft iPhone Unlocker 高级注册版分享

平时用 iPhone 的时候，难免会碰到解锁的麻烦事。比如密码忘了、人脸识别 / 指纹识别突然不灵，或者买了二手 iPhone 却被原来的 iCloud 账号锁住，这时候就需要靠谱的解锁工具来帮忙了。Aiseesoft iPhone Unlocker 就是专门解决这些问题的软件&…...

编程新知 2026/1/29 10:22:28

服务器硬防的应用场景都有哪些？

服务器硬防是指一种通过硬件设备层面的安全措施来防御服务器系统受到网络攻击的方式，避免服务器受到各种恶意攻击和网络威胁，那么，服务器硬防通常都会应用在哪些场景当中呢？ 硬防服务器中一般会配备入侵检测系统和预防系统&#x…...

编程新知 2025/11/9 19:17:07

【快手拥抱开源】通过快手团队开源的 KwaiCoder-AutoThink-preview 解锁大语言模型的潜力

引言： 在人工智能快速发展的浪潮中，快手Kwaipilot团队推出的 KwaiCoder-AutoThink-preview 具有里程碑意义——这是首个公开的AutoThink大语言模型（LLM）。该模型代表着该领域的重大突破，通过独特方式融合思考与非思考…...

编程新知 2026/2/6 19:29:20

Vue2 第一节_Vue2上手_插值表达式{{}}_访问数据和修改数据_Vue开发者工具

文章目录 1.Vue2上手-如何创建一个Vue实例,进行初始化渲染2. 插值表达式{{}}3. 访问数据和修改数据4. vue响应式5. Vue开发者工具--方便调试 1.Vue2上手-如何创建一个Vue实例,进行初始化渲染准备容器引包创建Vue实例 new Vue()指定配置项 ->渲染数据准备一个容器,例如: …...

编程新知 2026/2/7 10:59:19

【C++从零实现Json-Rpc框架】第六弹 —— 服务端模块划分

一、项目背景回顾前五弹完成了Json-Rpc协议解析、请求处理、客户端调用等基础模块搭建。本弹重点聚焦于服务端的模块划分与架构设计，提升代码结构的可维护性与扩展性。二、服务端模块设计目标高内聚低耦合：各模块职责清晰，便于独立开发…...

编程新知 2025/10/13 4:15:41

基于 TAPD 进行项目管理

起因自己写了个小工具，仓库用的Github。之前在用markdown进行需求管理，现在随着功能的增加，感觉有点难以管理了，所以用TAPD这个工具进行需求、Bug管理。操作流程注册 TAPD，需要提供一个企业名新建一个项目&#…...

编程新知 2026/1/24 14:15:44

PAN/FPN

import torch import torch.nn as nn import torch.nn.functional as F import mathclass LowResQueryHighResKVAttention(nn.Module):"""方案 1: 低分辨率特征 (Query) 查询高分辨率特征 (Key, Value).输出分辨率与低分辨率输入相同。"""def __…...

编程新知 2025/10/20 4:39:36

iview框架主题色的应用

1.下载 less要使用3.0.0以下的版本 npm install less2.7.3 npm install less-loader4.0.52./src/config/theme.js文件 module.exports {yellow: {theme-color: #FDCE04},blue: {theme-color: #547CE7} }在sass中使用theme配置的颜色主题，无需引入，直接可…...

编程新知 2026/1/31 9:29:45

相关文章：