当前位置：首页 > news >正文

李宏毅机器学习2022-HW8-Anomaly Detection

news 2026/5/13 2:59:50

文章目录

Task
Baseline
Report
- Question2
Code Link

Task

异常检测Anomaly Detection

在这里插入图片描述

将data经过Encoder，在经过Decoder，根据输入和输出的差距来判断异常图像。training data是100000张人脸照片，testing data有大约10000张跟training data相同分布的人脸照片(label 0)，还有10000张不同分布的照片(anomaly, label 1)，每张照片都是(64,64,3)，.npy file

以训练集的前三张照片为例，auto-encoder的输入和输出如下：

在这里插入图片描述

Baseline

Auto-encoder model一共有五种模型

fcn: fully-connected network
cnn: convolutional network
VAE
Resnet
Multi-encoder autoencoder
- encoder(fcn+fcn+fcn)+decoder(fcn)
- encoder(cnn+cnn+cnn)+decoder(cnn)
- encoder(fcn+fcn+conv)+decoder(fcn)

通过FCN+调节超参数的方式可以轻易的达到strong，Resnet也是但是Multi-encoder的方式表现并不好，也许是我处理方式有问题，具体代码可以参考GitHub中的文件

Report

Question2

Train a fully connected autoencoder and adjust at least two different element of the latent representation. Show your model architecture, plot out the original image, the reconstructed images for each adjustment and describe the differences.

import matplotlib.pyplot as plt
# sample = train_dataset[random.randint(0,100000)]
sample = train_dataset[0]
print("sample shape:{}".format(sample.size()))
sample = sample.reshape(1,3,64,64)model.eval()
with torch.no_grad():img = sample.cuda()# 只调整fcn中的latent representation的其中两维，其他模型都是正常输出if model_type in ['res']:output = model(img)output = decoder(output)print("res output shape:{}".format(output.size()))output = output[0] # 第一个重建图像，当然只有一个图像if model_type in ['fcn']:img = img.reshape(img.shape[0], -1)x = model.encoder(img)x[0][2] = x[0][2]*3output = model.decoder(x)print("fcn output shape:{}".format(output.size()))output = output.reshape(3,64,64)if model_type in ['vae']:output = model(img)print("vae output shape:{}".format(output.size()))output = output[0][0] # output[0]是重建后的图像，output[0][0]重建后的第一个图像if model_type in ['cnn']:output = model(img)[0]print("output shape:{}".format(output.size()))sample = sample.reshape(3,64,64)   # 创建画布
fig, axes = plt.subplots(nrows=1, ncols=2, figsize=(5, 5))# plt sample image
axes[0].imshow(transforms.ToPILImage()((sample+1)/2)) #imshow的输入(H,W,C)
axes[0].axis('off')
axes[0].annotate('sample input', xy=(0.5, -0.15), xycoords='axes fraction',ha='center', va='center')
# plt output image
axes[1].imshow(transforms.ToPILImage()((output+1)/2))
axes[1].axis('off')
axes[1].annotate('sample output', xy=(0.5, -0.15), xycoords='axes fraction',ha='center', va='center')plt.show()

在这里插入图片描述

Code Link

具体代码在Github

李宏毅机器学习2022-HW8-Anomaly Detection

文章目录

Task

Baseline

Report

Question2

Code Link

相关文章：

李宏毅机器学习2022-HW8-Anomaly Detection

用户体验分享 | YashanDB V23.2.3安装部署

【漏洞复现】泛微OA E-Office /E-mobile/App/init.php 任意文件上传漏洞

SpringCloudEureka实战：搭建EurekaServer

DataLight（V1.4.5）版本更新，新增 Ranger、Solr

深度解析：Python蓝桥杯青少组精英赛道与高端题型概览

如何使用SCCMSecrets识别SCCM策略中潜在的安全问题

Qt 信号重载问题--使用lambda表达式--解决方法

并行编程实战——TBB框架的应用之一Supra的基础

std::vector

Java Web 之 Cookie 详解

linux系统下让.py文件开机自启动

linux远程桌面：xrdp 安装失败

9.30Python基础-元组（补充）、字典、集合

桥接模式和NET模式的区别

Pigar：Python 项目的依赖管理利器

泰勒图 ——基于相关性与标准差的多模型评价指标可视化比较-XGBoost、sklearn

记录｜Modbus-TCP产品使用记录【摩通传动】

工业交换机的RMON

生态遥感数据下载分享

知识付费浪潮下的技术学习：是捷径，还是新的信息茧房？

ARM动态内存控制器与SDRAM地址映射技术详解

工程师实战：Windows 8工作站部署、驱动危机与专业工具兼容性全解析

图解人工智能（11）让人惊讶的AI

利用Google可编程搜索引擎API实现免费高效的Python搜索自动化

taotoken模型广场功能体验与主流模型选型建议

必看！移动岗亭厂家交货及时性测评，日硕科技排名第一！

正点原子 RK3562 Android14 集成 GStreamer 1.24.13（CLI + V4L2 插件）完整移植方案

终极指南：10分钟快速上手Ghidra逆向工程工具安装与配置

win10打印机不能共享报0x0000011b/0x00000709修复工具合集分享，亲测解决Windows打印机共享报错问题