当前位置：首页 > news >正文

【论文阅读】Self-Paced Boost Learning for Classification

news 2026/5/16 13:23:41

论文下载
bib:

@INPROCEEDINGS{PiLi2016SPBL,title		= {Self-Paced Boost Learning for Classification},author	= {Te Pi and Xi Li and Zhongfei Zhang and Deyu Meng and Fei Wu and Jun Xiao and Yueting Zhuang},booktitle	= {IJCAI},year		= {2016},pages     = {1932--1938}
}

GitHub

1. 摘要

Effectiveness and robustness are two essential aspects of supervised learning studies.

For effective learning, ensemble methods are developed to build a strong effective model from ensemble of weak models.

For robust learning, self-paced learning (SPL) is proposed to learn in a self-controlled pace from easy samples to complex ones.

Motivated by simultaneously enhancing the learning effectiveness and robustness, we propose a unified framework, Self-Paced Boost Learning (SPBL).

With an adaptive from-easy-to-hard pace in boosting process, SPBL asymptotically guides the model to focus more on the insufficiently learned samples with higher reliability.

Via a max-margin boosting optimization with self-paced sample selection, SPBL is capable of capturing the intrinsic inter-class discriminative patterns while ensuring the reliability of the samples involved in learning.

We formulate SPBL as a fully-corrective optimization for classification.

The experiments on several real-world datasets show the superiority of SPBL in terms of both effectiveness and robustness.

Note:

将Self-paced learning（自步学习，从容易到难的学习）和Boost（集成学习）融合在一起，同时保证有效性与鲁棒性。

2. 算法

问题：多分类问题
$\widetilde{y}(x) = \argmax_{r \in \{1, \dots, C\} }F_r(x; \Theta) \tag{1}$

${(x_i, y_i)\}_{i=1}^n$ 表示带标签的训练数据，其中又 $n$ 个带标签的样本。 $x_i \in \mathbb{R}^d$ 是第 $i$ 个样本的特征， $y_i \in \{1, \dots, C\}$ 表示第个样本的标签。
$F_r(\cdot):\mathbb{R}^d \rightarrow \mathbb{R}$ 表示将样本 $x$ 分类到类别 $r$ 的置信度得分。值得注意的是, 这里相当于将多分类问题转化为了 $C$ 个二分类问题，对应于OvA策略。优点是只用训练类别数目 $C$ 个分类器，缺点是，会出现类别不平衡的问题（A对应类别样本多）。
最后的多分类预测则是预测样本对应最大评分的类。在实际操作中，可以理解为softmax操作后对应最大概率的类（threshold）。

boost:
boost是一种集成学习中的一个方法，目的是集成多个弱学习器成为一个强学习器。
$F_r(x;W) = \sum_{j=1}^k w_{rj}h_j(x), r \in \{1, \dots, C\} \tag{2}$

$h_j (x) : \mathbb{R}^d \rightarrow \{0, 1\}$ ，表示一个弱二分类器， $w_{rj}$ 学习器对应权重，是一个学习参数。
$[w_1, \dots, w_C ] \in \mathbb{R}^{k \times C}$ with each $w_r = [w_{r1}, \dots, w_{r_k}]^{\mathsf{T}}$ .

general objective of SPBL:
$\min_{W, v}\sum^{n}_{i=1}v_i\sum^{C}_{r=1}L(\rho_{ir}) + \sum^{n}_{i=1}g(v_i;\lambda) + \upsilon R(W) s.t. \forall i,r, \rho_{i,r} = H_{i:}w_{y_i} - H_{i:}w_{r}; W \geq 0; v \in [0, 1]^n \tag{3}$ .

$\in \mathbb{R}^{n \times k}$ with each item $H_{ij} = h_j(x_i)$ .
$H_{i:}w_{y_i} = H_{i:} \times w_{y_i}, w_{y_i} = [w_{y_i1}, \dots, w_{y_ik}]^{\mathsf{T}}$ .

speciﬁc formulation:
$\min_{W, v}\sum_{i, r}v_i \ln(1+ \exp(-\rho_{ir})) + \sum^{n}_{i=1}g(v_i;\lambda) + \upsilon \|W\|_{2, 1}$
$\text{s.t.} \forall i,r, \rho_{i,r} = H_{i:}w_{y_i} - H_{i:}w_{r}; W \geq 0; v \in [0, 1]^n \tag{3}$

$\|W\|_{2, 1}\| = \sum_{j=1}^k \|W_{j:}\|_2$ ，鼓励矩阵行列都稀疏。
the logistic loss. 我的理解该损失就是简单的对差值求 $\exp$ 。区别在于现有的是二分类的概率，概率值是由 $\text{sigmod} = \frac{1}{1+ e^{-x}}$ 计算的，即 $\ln{(\text{sigmod})} = -\ln(1+ \exp(-x))$ 。

3. 总结

关于优化目标的求解，涉及到了对偶问题（dual problem），实在是懂不了了。

【论文阅读】Self-Paced Boost Learning for Classification

1. 摘要

2. 算法

3. 总结

相关文章：

【论文阅读】Self-Paced Boost Learning for Classification

通过CSIG—走进合合信息探讨生成式AI及文档图像处理的前景和价值

流程图拖拽视觉编程--概述

深度学习中的卷积神经网络

vue3的介绍和两种创建方式（cli和vite）

camunda工作流user task如何使用

三元运算符

Vue3 Element-plus el-menu无限级菜单组件封装

( “树” 之 BST) 669. 修剪二叉搜索树 ——【Leetcode每日一题】

【C语言】浅涉结构体（声明、定义、类型、定义及初始化、成员访问及传参）

设计模式-结构型模式之装饰模式

【Chatgpt4 教学】 NLP（自然语言处理）第九课朴素贝叶斯分类器的工作原理机器学习算法

基于html+css的图片展示17

Jupyter Notebook小知识

redis原理及进化之路

ai智能写作助手-ai自动写作软件

redis持久化

Vue项目基于driverjs实现新用户导航

自编码器简单介绍—使用PyTorch库实现一个简单的自编码器，并使用MNIST数据集进行训练和测试

redis单机最大并发量

MonitorControl：终极解决方案！让你的Mac外接显示器亮度调节变得如此简单

别再为OSGB数据导入SuperMap iDesktop发愁了！手把手教你搞定倾斜摄影配置文件生成与常见报错

从零构建装饰艺术视觉系统：Midjourney + Figma联动作业流，1小时产出完整海报/包装/UI组件库

基于MCP协议连接AI与Postal邮件服务器的自动化实践

Coolapk-UWP 深度解析：基于MVVM架构的Windows桌面酷安客户端开发实战指南

这3个降AI提示词千万别用！让你的知网AI率反涨10个点过不了AIGC检测

如何在5分钟内用Python获取同花顺问财金融数据？

文档下载革命：kill-doc浏览器脚本让你的学习资料一键保存

金蝶云星空日常使用功能

Kubernetes二进制文件管理工具：自动化安装与多版本切换实践