当前位置：首页 > news >正文

Diffusers 使用 LoRA

news 2026/4/12 1:07:30

使用diffusers 加载 LoRA，实现文生图功能。摘自 diffusers文档。
模型可以根据名称去modelscope找对应资源下载。使用的时候需要替换成具体路径。虽然modelscope和diffusers都使用了模型id，但是并不能通用。
不同的LoRA对应了不同的“trigger” words，在prompt中加入这个“trigger” words才能生成正确的结果。
比如使用了toy-face的LoRA模型，那么就要在prompt加入“toy face”。

from diffusers import DiffusionPipeline
import torchpipe_id = "stabilityai/stable-diffusion-xl-base-1.0"
pipe = DiffusionPipeline.from_pretrained(pipe_id, torch_dtype=torch.float16).to("cuda")#模型下载
#from modelscope import snapshot_download
#model_dir = snapshot_download('SDXL-LoRA/CiroN2022-toy-face')
#加载lora
pipe.load_lora_weights("CiroN2022/toy-face", weight_name="toy_face_sdxl.safetensors", adapter_name="toy")
# 加入 “toy face”
prompt = "toy_face of a hacker with a hoodie"
lora_scale = 0.9
image = pipe(prompt, num_inference_steps=30, cross_attention_kwargs={"scale": lora_scale}, generator=torch.manual_seed(0)
).images[0]
image

也可以再加载另一个模型，pipe.set_adapters() 函数确定使用哪个模型。不同的模型根据adapter_name来区分。

pipe.load_lora_weights("nerijs/pixel-art-xl", weight_name="pixel-art-xl.safetensors", adapter_name="pixel")
pipe.set_adapters("pixel")prompt = "a hacker with a hoodie, pixel art"
image = pipe(prompt, num_inference_steps=30, cross_attention_kwargs={"scale": lora_scale}, generator=torch.manual_seed(0)
).images[0]
image

也可以同时使用，adapter_weights设置不同的权重，prompt也应该包括全部的“trigger” words。

pipe.set_adapters(["pixel", "toy"], adapter_weights=[0.5, 1.0])
prompt = "toy_face of a hacker with a hoodie, pixel art"
image = pipe(prompt, num_inference_steps=30, cross_attention_kwargs={"scale": 1.0}, generator=torch.manual_seed(0)
).images[0]
image

在这里插入图片描述

更准确的控制LoRA的影响强度。unet一般包括down、mid、up，不同的部分对图片的细节、纹理、风格等影响不同。

pipe.enable_lora()  # enable lora again, after we disabled it above
prompt = "toy_face of a hacker with a hoodie, pixel art"
adapter_weight_scales = { "unet": { "down": 1, "mid": 0, "up": 0} }
pipe.set_adapters("pixel", adapter_weight_scales)
image = pipe(prompt, num_inference_steps=30, generator=torch.manual_seed(0)).images[0]
image

从左到右分别对应{ “down”: 1, “mid”: 0, “up”: 0}，{ “down”: 0, “mid”: 1, “up”: 0}，{ “down”: 0, “mid”: 0, “up”: 1}
在这里插入图片描述

更细粒度的控制。要根据具体的模型结构，不同的模型结构不同。

adapter_weight_scales_toy = 0.5
adapter_weight_scales_pixel = {"unet": {"down": 0.9,  # all transformers in the down-part will use scale 0.9# "mid"  # because, in this example, "mid" is not given, all transformers in the mid part will use the default scale 1.0"up": {"block_0": 0.6,  # all 3 transformers in the 0th block in the up-part will use scale 0.6"block_1": [0.4, 0.8, 1.0],  # the 3 transformers in the 1st block in the up-part will use scales 0.4, 0.8 and 1.0 respectively}}
}
pipe.set_adapters(["toy", "pixel"], [adapter_weight_scales_toy, adapter_weight_scales_pixel])
image = pipe(prompt, num_inference_steps=30, generator=torch.manual_seed(0)).images[0]
image

在这里插入图片描述

常用函数

#不使用
pipe.disable_lora()
#使用
pipe.enable_lora()
#正在使用的
pipe.get_active_adapters()
#列表
pipe.get_list_adapters()
#删除
pipe.delete_adapters("toy")

Diffusers 使用 LoRA

相关文章：

Diffusers 使用 LoRA

云安全博客阅读（二）

SpringCloud系列教程：微服务的未来（六）docker教程快速入门、常用命令

Vue 快速入门：开启前端新征程

UVM:uvm_component methods configure

LLM 训练中存储哪些矩阵：权重矩阵，梯度矩阵，优化器状态

大模型思维链推理的进展、前沿和未来分析

NLP 技术的突破与未来：从词嵌入到 Transformer

嵌入式中QT实现文本与线程控制方法

云备份项目--服务端编写

Node.js——fs（文件系统）模块

SAP BC 同服务器不同client之间的传输SCC1

CentOS: RPM安装、YUM安装、编译安装（详细解释+实例分析！！！）

linux音视频采集技术: v4l2

MySQL使用navicat新增触发器

voice agent实现方案调研

TCP通信原理学习

Three.js 基础概念：构建3D世界的核心要素

如何用代码提交spark任务并且获取任务权柄

关于Mac中的shell

数据分析三件套：Numpy、Pandas、Matplotlib

Profinet协议在工业自动化中的无线通信应用解析

LVM磁盘扩容实战：如何在已有逻辑卷上直接扩展存储空间

从停车场管理系统看STM32项目开发：如何规划你的第一个物联网硬件Demo？

一个简洁易用的 Delphi JSON 封装库，基于 System.JSON`单元封装，提供更直观的 API行

Visualized BGE批量推理实战：如何用Python代码将图片编码速度提升3倍

华为政务云时空信息平台PPT(37页)

Phi-4-mini-reasoning实战案例：从数学计算到商业分析，小白也能用的AI大脑

如何快速掌握PDF差异对比工具：diff-pdf终极指南

告别官方API：手把手教你从零封装YOLOv8-Pose的推理代码（附完整Python脚本）