当前位置：首页 > news >正文

Factorization Machines(论文笔记)

news 2025/7/22 6:16:15

样例一：

一个简单的例子,train是一个字典，先将train进行“one-hot” coding，然后输入相关特征向量，可以预测相关性。

from pyfm import pylibfm
from sklearn.feature_extraction import DictVectorizer
import numpy as np
train = [{"user": "1", "item": "5", "age": 19},{"user": "2", "item": "43", "age": 33},{"user": "3", "item": "20", "age": 55},{"user": "4", "item": "10", "age": 20},
]
v = DictVectorizer()
X = v.fit_transform(train)
print(X.toarray())
y = np.repeat(1.0,X.shape[0])
#print(X.shape[0])
fm = pylibfm.FM()
fm.fit(X,y)
fm.predict(v.transform({"user": "1", "item": "10", "age": 40}))
输出：
[[19.  0.  0.  0.  1.  1.  0.  0.  0.][33.  0.  0.  1.  0.  0.  1.  0.  0.][55.  0.  1.  0.  0.  0.  0.  1.  0.][20.  1.  0.  0.  0.  0.  0.  0.  1.]]
4
Creating validation dataset of 0.01 of training for adaptive regularization
-- Epoch 1
Training log loss: 0.37518
array([0.9999684])

样例二：

是基于真实的电影评分数据来训练。数据集点击下载即可。

import numpy as np
from sklearn.feature_extraction import DictVectorizer
from pyfm import pylibfm# Read in data
def loadData(filename,path="ml-100k/"):data = []y = []users=set()items=set()with open(path+filename) as f:for line in f:(user,movieid,rating,ts)=line.split('\t')data.append({ "user_id": str(user), "movie_id": str(movieid)})y.append(float(rating))users.add(user)items.add(movieid)return (data, np.array(y), users, items)(train_data, y_train, train_users, train_items) = loadData("ua.base")
(test_data, y_test, test_users, test_items) = loadData("ua.test")
v = DictVectorizer()
X_train = v.fit_transform(train_data)
X_test = v.transform(test_data)# Build and train a Factorization Machine
fm = pylibfm.FM(num_factors=10, num_iter=100, verbose=True, task="regression", initial_learning_rate=0.001, learning_rate_schedule="optimal")fm.fit(X_train,y_train)# Evaluate
preds = fm.predict(X_test)
from sklearn.metrics import mean_squared_error
print("FM MSE: %.4f" % mean_squared_error(y_test,preds))
输出：
Creating validation dataset of 0.01 of training for adaptive regularization
-- Epoch 1
Training MSE: 0.59525
-- Epoch 2
Training MSE: 0.51804
-- Epoch 3
Training MSE: 0.49046
-- Epoch 4
Training MSE: 0.47458
-- Epoch 5
Training MSE: 0.46416
-- Epoch 6
Training MSE: 0.45662
-- Epoch 7
Training MSE: 0.45099
-- Epoch 8
Training MSE: 0.44639
-- Epoch 9
Training MSE: 0.44264
-- Epoch 10
Training MSE: 0.43949
-- Epoch 11
Training MSE: 0.43675
-- Epoch 12
Training MSE: 0.43430
-- Epoch 13
Training MSE: 0.43223
-- Epoch 14
Training MSE: 0.43020
-- Epoch 15
Training MSE: 0.42851
-- Epoch 16
Training MSE: 0.42691
-- Epoch 17
Training MSE: 0.42531
-- Epoch 18
Training MSE: 0.42389
-- Epoch 19
Training MSE: 0.42255
-- Epoch 20
Training MSE: 0.42128
-- Epoch 21
Training MSE: 0.42003
-- Epoch 22
Training MSE: 0.41873
-- Epoch 23
Training MSE: 0.41756
-- Epoch 24
Training MSE: 0.41634
-- Epoch 25
Training MSE: 0.41509
-- Epoch 26
Training MSE: 0.41391
-- Epoch 27
Training MSE: 0.41274
-- Epoch 28
Training MSE: 0.41149
-- Epoch 29
Training MSE: 0.41032
-- Epoch 30
Training MSE: 0.40891
-- Epoch 31
Training MSE: 0.40774
-- Epoch 32
Training MSE: 0.40635
-- Epoch 33
Training MSE: 0.40495
-- Epoch 34
Training MSE: 0.40354
-- Epoch 35
Training MSE: 0.40203
-- Epoch 36
Training MSE: 0.40047
-- Epoch 37
Training MSE: 0.39889
-- Epoch 38
Training MSE: 0.39728
-- Epoch 39
Training MSE: 0.39562
-- Epoch 40
Training MSE: 0.39387
-- Epoch 41
Training MSE: 0.39216
-- Epoch 42
Training MSE: 0.39030
-- Epoch 43
Training MSE: 0.38847
-- Epoch 44
Training MSE: 0.38655
-- Epoch 45
Training MSE: 0.38461
-- Epoch 46
Training MSE: 0.38269
-- Epoch 47
Training MSE: 0.38068
-- Epoch 48
Training MSE: 0.37864
-- Epoch 49
Training MSE: 0.37657
-- Epoch 50
Training MSE: 0.37459
-- Epoch 51
Training MSE: 0.37253
-- Epoch 52
Training MSE: 0.37045
-- Epoch 53
Training MSE: 0.36845
-- Epoch 54
Training MSE: 0.36647
-- Epoch 55
Training MSE: 0.36448
-- Epoch 56
Training MSE: 0.36254
-- Epoch 57
Training MSE: 0.36067
-- Epoch 58
Training MSE: 0.35874
-- Epoch 59
Training MSE: 0.35690
-- Epoch 60
Training MSE: 0.35511
-- Epoch 61
Training MSE: 0.35333
-- Epoch 62
Training MSE: 0.35155
-- Epoch 63
Training MSE: 0.34992
-- Epoch 64
Training MSE: 0.34829
-- Epoch 65
Training MSE: 0.34675
-- Epoch 66
Training MSE: 0.34538
-- Epoch 67
Training MSE: 0.34393
-- Epoch 68
Training MSE: 0.34258
-- Epoch 69
Training MSE: 0.34129
-- Epoch 70
Training MSE: 0.34006
-- Epoch 71
Training MSE: 0.33885
-- Epoch 72
Training MSE: 0.33773
-- Epoch 73
Training MSE: 0.33671
-- Epoch 74
Training MSE: 0.33564
-- Epoch 75
Training MSE: 0.33468
-- Epoch 76
Training MSE: 0.33375
-- Epoch 77
Training MSE: 0.33292
-- Epoch 78
Training MSE: 0.33211
-- Epoch 79
Training MSE: 0.33131
-- Epoch 80
Training MSE: 0.33065
-- Epoch 81
Training MSE: 0.33002
-- Epoch 82
Training MSE: 0.32930
-- Epoch 83
Training MSE: 0.32882
-- Epoch 84
Training MSE: 0.32813
-- Epoch 85
Training MSE: 0.32764
-- Epoch 86
Training MSE: 0.32722
-- Epoch 87
Training MSE: 0.32677
-- Epoch 88
Training MSE: 0.32635
-- Epoch 89
Training MSE: 0.32591
-- Epoch 90
Training MSE: 0.32550
-- Epoch 91
Training MSE: 0.32513
-- Epoch 92
Training MSE: 0.32481
-- Epoch 93
Training MSE: 0.32451
-- Epoch 94
Training MSE: 0.32421
-- Epoch 95
Training MSE: 0.32397
-- Epoch 96
Training MSE: 0.32363
-- Epoch 97
Training MSE: 0.32341
-- Epoch 98
Training MSE: 0.32319
-- Epoch 99
Training MSE: 0.32293
-- Epoch 100
Training MSE: 0.32268
FM MSE: 0.8873

样例三：是一个分类的样例

import numpy as np
from sklearn.feature_extraction import DictVectorizer
from sklearn.model_selection import train_test_split
from pyfm import pylibfmfrom sklearn.datasets import make_classificationX, y = make_classification(n_samples=1000,n_features=100, n_clusters_per_class=1)
data = [ {v: k for k, v in dict(zip(i, range(len(i)))).items()}  for i in X]X_train, X_test, y_train, y_test = train_test_split(data, y, test_size=0.1, random_state=42)v = DictVectorizer()
X_train = v.fit_transform(X_train)
X_test = v.transform(X_test)fm = pylibfm.FM(num_factors=50, num_iter=10, verbose=True, task="classification", initial_learning_rate=0.0001, learning_rate_schedule="optimal")fm.fit(X_train,y_train)from sklearn.metrics import log_loss
print("Validation log loss: %.4f" % log_loss(y_test,fm.predict(X_test)))
输出：
Creating validation dataset of 0.01 of training for adaptive regularization
-- Epoch 1
Training log loss: 2.12467
-- Epoch 2
Training log loss: 1.74185
-- Epoch 3
Training log loss: 1.42232
-- Epoch 4
Training log loss: 1.16085
-- Epoch 5
Training log loss: 0.94964
-- Epoch 6
Training log loss: 0.78052
-- Epoch 7
Training log loss: 0.64547
-- Epoch 8
Training log loss: 0.53758
-- Epoch 9
Training log loss: 0.45132
-- Epoch 10
Training log loss: 0.38187
Validation log loss: 1.3678

代码：pyFM/pyfm/pylibfm.py at master · coreylynch/pyFM (github.com)

Factorization Machines(论文笔记)

样例一： 一个简单的例子,train是一个字典，先将train进行“one-hot” coding，然后输入相关特征向量，可以预测相关性。 from pyfm import pylibfm from sklearn.feature_extraction import DictVectorizer import numpy as np tra…...

编程日记 2023/7/27 6:12:07

Qt开发(5)——使用QTimer定时触发槽函数

实现效果软件启动之后，开始计时，到达预定时间后，调用其他类的某个函数。类的分工 BaseType：软件初始化的调用类 FuncType: 功能函数所在类具体函数 // FuncType.h class FuncType: public QObject {Q_OBJECT public: publ…...

编程日记 2023/7/27 6:11:06

2023年JAVA最新面试题

2023年JAVA最新面试题 1 JavaWeb基础1.1 HashMap的底层实现原理？1.2 HashMap 和 HashTable的异同？1.5 Collection 和 Collections的区别？1.6 Collection接口的两种区别1.7 ArrayList、LinkedList、Vector者的异同？1.8 String、Str…...

编程日记 2023/7/27 6:10:04

（四）RabbitMQ高级特性（消费端限流、利用限流实现不公平分发、消息存活时间、优先级队列

Lison <dreamlison163.com>, v1.0.0, 2023.06.23 RabbitMQ高级特性（消费端限流、利用限流实现不公平分发、消息存活时间、优先级队列文章目录 RabbitMQ高级特性（消费端限流、利用限流实现不公平分发、消息存活时间、优先级队列消费端限流利用限流…...

编程日记 2023/7/27 6:09:03

Vue如何配置eslint

eslint官网: eslint.bootcss.com eslicate如何配置 1、选择新的配置： 2、选择三个必选项 3、再选择Css预处理器 4、之后选择处理器 5、选择是提交的时候就进行保存模式 6、放到独立的配置文件上去 7、最后一句是将自己的数据存为预设 8、配合console不要出现的规则…...

编程日记 2023/7/27 6:08:02

Elasticsearch查询文档

GET查询索引单个文档 GET /索引/_doc/ID GET /ffbf/_doc/123返回结果如下，查到了有数据"found" : true表示 {"_index" : "ffbf","_type" : "_doc","_id" : "123","_version" : 2...

编程日记 2023/7/27 6:07:01

面向对象编程：多态性的理论与实践

文章目录 1. 修饰词和访问权限2. 多态的概念3. 多态的使用现象4. 多态的问题与解决5. 多态的意义在面向对象编程中，多态是一个重要的概念，它允许不同的对象以不同的方式响应相同的消息。本文将深入探讨多态的概念及其应用，以及在Java中如何实…...

编程日记 2023/7/27 6:06:00

linux：filezilla root密码登陆

问题： 如题参考： 亚马逊服务器FileZilla登录失败解决办法_亚马逊云 ssh链接秘钥认证不了 ubuntu拒绝root用户ssh远程登录解决办法总结： vi /etc/ssh/sshd_config，修改配置： PermitRootLogin yes PasswordAuthenticat…...

编程日记 2023/7/27 6:04:59

在nginx上部署nuxt项目

先安装Node.js 我安的18.17.0。安装完成后，可以使用cmd，winr然cmd进入，测试是否安装成功。安装在哪个盘都可以测试。测试输入node -v 和 npm -v，（中间有空格）出现下图版本提示就是完成了NodeJS的安装…...

编程日记 2023/7/27 6:03:58

嵌入式linux通用spi驱动之spidev使用总结

Linux内核集成了spidev驱动，提供了SPI设备的用户空间API。支持用于半双工通信的read和write访问接口以及用于全双工通信和I/O配置的ioctl接口。使用时，只需将SPI从设备的compatible属性值添加到spidev区动的spidev dt ids[]数组中，即可将该SP…...

编程日记 2023/7/27 6:02:56

【Nodejs】Puppeteer\爬虫实践

puppeteer 文档:puppeteer.js中文文档|puppeteerjs中文网|puppeteer爬虫教程 Puppeteer本身依赖6.4以上的Node，但是为了异步超级好用的async/await，推荐使用7.6版本以上的Node。另外headless Chrome本身对服务器依赖的库的版本要求比较高，c…...

编程日记 2023/7/27 6:01:55

Windows Active Directory密码同步

大多数 IT 环境中，员工需要记住其默认 Windows Active Directory （AD） 帐户以外的帐户的单独凭据，最重要的是，每个密码还受不同的密码策略和到期日期的约束，为不同的帐户使用单独的密码会增加用户忘记密码和…...

编程日记 2023/7/27 6:00:54

安科瑞能源物联网以能源供应、能源管理、设备管理、能耗分析的能源流向为主线-安科瑞黄安南

摘要：随着科学技术的发展，我国的物联网技术有了很大进展。为了提升电力抄表服务的稳定性，保障电力抄表数据的可靠性，本文提出并实现了基于物联网的智能电力抄表服务平台，结合云计算、大数据等技术，提供电力…...

编程日记 2023/7/27 5:59:53

FPGA设计时序分析一、时序路径

目录一、前言二、时序路径 2.1 时序路径构成 2.2 时序路径分类 2.3 数据捕获 2.4 Fast corner/Slow corner 2.5 Vivado时序报告三、参考资料一、前言时序路径字面容易简单地理解为时钟路径，事实时钟存在的意义是为了数据的处理、传输，因此严…...

编程日记 2023/7/27 5:58:53

spring复习：（52）注解方式下，ConfigurationClassPostProcessor是怎么被添加到容器的？

进入AnnotationConfigApplicationContext的构造方法： 进入AnnotatedBeanDefinitionReader的构造方法： 进入this(registry, getOrCreateEnvironment(registry));代码如下： 进入AnnotationConfigUtils.registerAnnotationConfigProcessors方…...

编程日记 2023/7/27 5:57:52

全国大学生数据统计与分析竞赛2021年【本科组】-B题：用户消费行为价值分析

目录摘要 1 任务背景与重述 1.1 任务背景 1.2 任务重述 2 任务分析 3 数据假设 4 任务求解 4.1 任务一：数据预处理 4.1.1 数据清洗 4.1.2 数据集成 4.1.3 数据变换 4.2 任务二：对用户城市分布情况与分布情况可视化分析 4.2.1 城市分布情况可视化分析 4…...

编程日记 2023/7/27 5:56:51

力扣1667. 修复表中的名字

表： Users ------------------------- | Column Name | Type | ------------------------- | user_id | int | | name | varchar | ------------------------- 在 SQL 中，user_id 是该表的主键。该表包含用户的 ID 和名字。…...

编程日记 2023/7/27 5:55:49

【设计模式】详解观察者模式

文章目录 1、简介2、观察者模式简单实现抽象主题（Subject）具体主题（ConcreteSubject）抽象观察者（Observer）具体观察者（ConcrereObserver）测试： 观察者设计模式优缺点观察…...

编程日记 2023/7/27 5:54:48

用html+javascript打造公文一键排版系统8：附件及标题排版

最近工作有点忙，所以没能及时完善公文一键排版系统，现在只好熬夜更新一下。有时公文有包括附件，招照公文排版规范： 附件应当另面编排，并在版记之前，与公文正文一起装订。“附件”二字及附件顺序号用3号黑…...

编程日记 2023/7/27 5:53:47

微服务体系＜1＞

我们的微服务架构我们的微服务架构和单体架构的区别什么是微服务架构微服务就是吧我们传统的单体服务分成订单模块库存模块账户模块单体模块是本地调用从订单模块调用到库存模块再到账户模块这三个模块都是调用的同一个数据库这就是我们的单体架构微服务就是…...

编程日记 2023/7/27 5:52:46

conda相比python好处

Conda 作为 Python 的环境和包管理工具，相比原生 Python 生态（如 pip 虚拟环境）有许多独特优势，尤其在多项目管理、依赖处理和跨平台兼容性等方面表现更优。以下是 Conda 的核心好处： 一、一站式环境管理&#xff1a…...

编程新知 2025/6/15 5:36:36

Java 8 Stream API 入门到实践详解

一、告别 for 循环！ 传统痛点： Java 8 之前，集合操作离不开冗长的 for 循环和匿名类。例如，过滤列表中的偶数： List<Integer> list Arrays.asList(1, 2, 3, 4, 5); List<Integer> evens new ArrayList…...

编程新知 2025/7/7 9:09:17

前端倒计时误差!

提示：记录工作中遇到的需求及解决办法文章目录前言一、误差从何而来？二、五大解决方案1. 动态校准法（基础版）2. Web Worker 计时3. 服务器时间同步4. Performance API 高精度计时5. 页面可见性API优化三、生产环境最佳实践四、终极解决方案架构前言前几天听说公司某个项…...

编程新知 2025/7/13 22:34:29

华为云Flexus+DeepSeek征文｜DeepSeek-V3/R1 商用服务开通全流程与本地部署搭建

华为云FlexusDeepSeek征文｜DeepSeek-V3/R1 商用服务开通全流程与本地部署搭建前言如今大模型其性能出色，华为云 ModelArts Studio_MaaS大模型即服务平台华为云内置了大模型，能助力我们轻松驾驭 DeepSeek-V3/R1，本文中将分享如何…...

编程新知 2025/7/17 20:04:26

Git 3天2K星标：Datawhale 的 Happy-LLM 项目介绍（附教程）

引言在人工智能飞速发展的今天，大语言模型（Large Language Models, LLMs）已成为技术领域的焦点。从智能写作到代码生成，LLM 的应用场景不断扩展，深刻改变了我们的工作和生活方式。然而，理解这些模型的内部…...

编程新知 2025/7/21 14:47:16

WebRTC从入门到实践 - 零基础教程

WebRTC从入门到实践 - 零基础教程目录 WebRTC简介基础概念工作原理开发环境搭建基础实践三个实战案例常见问题解答 1. WebRTC简介 1.1 什么是WebRTC？ WebRTC（Web Real-Time Communication）是一个支持网页浏览器进行实时语音…...

编程新知 2025/7/13 10:03:09

LangFlow技术架构分析

🔧 LangFlow 的可视化技术栈前端节点编辑器底层框架：基于 （一个现代化的 React 节点绘图库） 功能： 拖拽式构建 LangGraph 状态机实时连线定义节点依赖关系可视化调试循环和分支逻辑与 LangGraph 的深…...

编程新知 2025/6/10 21:26:51

永磁同步电机无速度算法--基于卡尔曼滤波器的滑模观测器

一、原理介绍传统滑模观测器采用如下结构： 传统SMO中LPF会带来相位延迟和幅值衰减，并且需要额外的相位补偿。采用扩展卡尔曼滤波器代替常用低通滤波器(LPF)，可以去除高次谐波，并且不用相位补偿就可以获得一个误差较小的转子位…...

编程新知 2025/7/20 6:59:20

虚幻基础：角色旋转

能帮到你的话，就给个赞吧 😘 文章目录移动组件使用控制器所需旋转：组件使用控制器旋转将旋转朝向运动：组件使用移动方向旋转控制器旋转和移动旋转缺点移动旋转：必须移动才能旋转，不移动不旋转控制器…...

编程新知 2025/7/14 6:35:45

XXE漏洞知识

目录 1.XXE简介与危害 XML概念 XML与HTML的区别 1.pom.xml 主要作用 2.web.xml 3.mybatis 2.XXE概念与危害案例：文件读取（需要Apache >5.4版本） 案例：内网探测（鸡肋） 案例：执行命…...

编程新知 2025/7/20 6:20:42

相关文章：