当前位置：首页 > news >正文

基于可解释性特征矩阵与稀疏采样全局特征组合的人体行为识别

news 2026/2/9 0:50:26

论文还未发表，不细说，欢迎讨论。

Title: A New Solution to Skeleton-Based Human Action Recognition via the combination usage of explainable feature extraction and sparse sampling global features.

Abstract: With the development of deep learning technology, the vision-based applications of human action recognition (HAR) have received great progress. Many methods followed the idea of data-driven and tried their best to include more and more motion features in consideration for higher accuracy purposes. However, the thought of “the more features adopted, the higher accuracy will be”will inevitably result in the ever-increasing requirement of computing power and decreasing efficiency. In this paper, in order to effectively recognize human actions with only a few of the most sensitive motion features, the explainable features, the combining usage of local and global features, and a multi-scale shallow network are proposed. First, the explainable features let a deep neural network be finetuned in the input stage, and an action represented by these features are easier to find priori theory of physics and kinematics for data augmentation purpose. Second, although criticism of the global features never stops, it is universally acknowledged that the context information included in the global feature is essential to HAR. The proposed SMHI—motion history image generated in a sparse sampling way, can not only reduce the time-cost, but also effectively reflect the motion tendency. It is suggested to be a useful complementary of local features. Third, full experiments were conducted to find out the best feature combination for HAR. The results have proved that feature selection is more important than computing all features. The proposed method is evaluated on three datasets. The experiment results proved the effectiveness and efficiency of our proposed method. Moreover, the only usage of human skeleton motion data provides privacy assurances to users.

现在大多数方法有两个问题：1. 将尽可能多的特征纳入到输入端，虽然可以增强准确率，但增加了计算负担，而且模型越来越臃肿；2. 全局特征一直处于被抛弃的境地，而其包含的上下文信息却有非常重要。针对这两点，我尝试用物理学和运动学中的先验知识提取人体行为动作特征，使其具备可解释性，然后对其优化和数据增强。并进一步找到其最有效的组合。同时，通过稀疏采样的方式构建MHI，即：只提取其运动趋势特征。使之作为local feature的有效补充。实验结果良好，特别是在效率方面有质的提升。本文的主要创新点在于跳出了主流“数据驱动”特征越多越好的传统思路，通过实验证明：特征选择远比计算所有特征更为重要。

基于可解释性特征矩阵与稀疏采样全局特征组合的人体行为识别

相关文章：

基于可解释性特征矩阵与稀疏采样全局特征组合的人体行为识别

OpenCV4（C++）—— 仿射变换、透射变换和极坐标变换

http.header.Set()与Add（）区别；

vue-7-vuex

SSO单点登录和OAuth2.0区别

【轻松玩转MacOS】基本操作篇

华为ICT——第三章图像处理基本任务

（C++）引用的用法总结

Charles：移动端抓包 / windows客户端 iOS手机 / 手机访问PC本地项目做调试

【AI】深度学习——人工智能、深度学习与神经网络

RK3288：BT656 RN6752调试

LLMs 蒸馏, 量化精度, 剪枝模型优化以用于部署 Model optimizations for deployment

Milvus踩坑笔记

什么是轴电流？轴电流对轴承有什么危害？

react create-react-app v5配置 px2rem （不暴露 eject方式）

.net中用标志位解决socket粘包问题

【Ubuntu】Systemctl 管理 MinIO 服务器的启动和停止

《golang设计模式》第二部分·结构型模式-07-代理模式（Proxy）

Jmeter常用线程组设置策略

【Spring】Spring MVC 程序开发

wordpress后台更新后前端没变化的解决方法

[特殊字符] 智能合约中的数据是如何在区块链中保持一致的？

Python｜GIF 解析与构建（5）：手搓截屏和帧率控制

测试微信模版消息推送

用docker来安装部署freeswitch记录

OPenCV CUDA模块图像处理-----对图像执行均值漂移滤波（Mean Shift Filtering）函数meanShiftFiltering()

LeetCode - 199. 二叉树的右视图

HarmonyOS运动开发：如何用mpchart绘制运动配速图表

在Mathematica中实现Newton-Raphson迭代的收敛时间算法（一般三次多项式）

JS手写代码篇----使用Promise封装AJAX请求