
A Simple Walkthrough of Fine-Tuning Qwen2.5 with the Llama Factory Command Line on Ubuntu

article 2026/5/12 2:44:30

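More than half a year ago I wrote a tutorial: "Basic steps for fine-tuning Llama 3 with Llama Factory on Windows" (CSDN blog).

If you work from the command line, the earlier steps can follow that post. Once the environment is installed, fine-tune a LoRA module with the self-cognition dataset, data/identity.json. Its format is easy to understand: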

{
  "instruction": "你是谁？",
  "input": "",
  "output": "您好，我是 {{name}}，一个由 {{author}} 发明的人工智能助手。我可以回答各种问题，提供实用的建议和帮助，帮助用户完成各种任务。"
},

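You can use find-and-replace in VS Code to fill in the name and author placeholders above, save the result as a new file, and add a matching entry to data/dataset_info.json. Mine looks like this (the new file is named identity_tpri.json):

"identity_tpri": {
  "file_name": "identity_tpri.json"
},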
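The manual find-and-replace and the dataset_info.json edit can also be scripted. The following is a minimal sketch, not part of the original workflow: the function names and the example values passed for name and author are hypothetical, and it assumes the script runs from the Llama Factory repository root so that data/identity.json and data/dataset_info.json exist.

```python
import json
import re
from pathlib import Path


def fill_identity(raw_text: str, name: str, author: str) -> str:
    """Replace the {{name}} / {{author}} placeholders in the raw JSON text."""
    # Tolerate stray spaces between the braces, e.g. "{ {name}}".
    text = re.sub(r"\{\s*\{\s*name\s*\}\s*\}", lambda _: name, raw_text)
    return re.sub(r"\{\s*\{\s*author\s*\}\s*\}", lambda _: author, text)


def register_dataset(info: dict, key: str, file_name: str) -> dict:
    """Return a copy of the dataset_info mapping with the new entry added."""
    updated = dict(info)
    updated[key] = {"file_name": file_name}
    return updated


def customize(data_dir: Path, name: str, author: str, key: str = "identity_tpri") -> None:
    """Write <data_dir>/<key>.json and register it in dataset_info.json."""
    raw = (data_dir / "identity.json").read_text(encoding="utf-8")
    filled = fill_identity(raw, name, author)
    json.loads(filled)  # fail early if the replacement broke the JSON
    (data_dir / f"{key}.json").write_text(filled, encoding="utf-8")

    info_path = data_dir / "dataset_info.json"
    info = json.loads(info_path.read_text(encoding="utf-8"))
    info = register_dataset(info, key, f"{key}.json")
    info_path.write_text(
        json.dumps(info, ensure_ascii=False, indent=2), encoding="utf-8"
    )
```

A call such as customize(Path("data"), name="MyBot", author="MyTeam") would then produce data/identity_tpri.json; the two values are placeholders to be replaced with your own.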

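Save a copy of examples/train_qlora/llama3_lora_sft_awq.yaml under a new name, then point it at the model files you have already downloaded (incidentally, model files can be downloaded from the ModelScope community (魔搭社区), where download speeds should be good). My modified version follows; the updated values were marked in red in the original post. Apart from the fine-tuning dataset, the model path, and the LoRA output location, the setting to watch is num_train_epochs: the default of 3 proved too small in my tests.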

### model
model_name_or_path: /home/quyu/Qwen2.5-7B-Instruct/
trust_remote_code: true

### method
stage: sft
do_train: true
finetuning_type: lora
lora_rank: 8
lora_target: all

### dataset
dataset: identity_tpri
template: qwen
cutoff_len: 2048
max_samples: 1000
overwrite_cache: true
preprocessing_num_workers: 16

### output
output_dir: saves/qwen-7b/lora/sft
logging_steps: 10
save_steps: 500
plot_loss: true
overwrite_output_dir: true

### train
per_device_train_batch_size: 1
gradient_accumulation_steps: 8
learning_rate: 1.0e-4
num_train_epochs: 20.0
lr_scheduler_type: cosine
warmup_ratio: 0.1
bf16: true
ddp_timeout: 180000000

### eval
# val_size: 0.1
# per_device_eval_batch_size: 1
# eval_strategy: steps
# eval_steps: 500
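A rough calculation helps explain why a small num_train_epochs can be too little for a tiny dataset. The sketch below estimates the effective batch size and the number of optimizer steps from the values in the config. The dataset size of 100 examples is a hypothetical figure, not taken from the post, and the rounding is an approximation of what the trainer does rather than its exact logic.

```python
import math


def training_steps(num_samples, per_device_batch, grad_accum, num_gpus, epochs):
    """Approximate the effective batch size and total optimizer steps."""
    effective_batch = per_device_batch * grad_accum * num_gpus
    steps_per_epoch = math.ceil(num_samples / effective_batch)
    return effective_batch, math.ceil(steps_per_epoch * epochs)


# Hypothetical dataset size of 100 examples on 2 GPUs (an assumption).
for epochs in (3.0, 20.0):
    batch, steps = training_steps(100, 1, 8, 2, epochs)
    print(f"epochs={epochs}: effective batch {batch}, about {steps} optimizer steps")
```

Under these assumed numbers, 3 epochs amounts to only about 21 weight updates, while 20 epochs gives about 140. Both are below save_steps: 500, so intermediate checkpoints would not be expected; the adapter is written at the end of training.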

Then run it (my renamed file is qwen_lora.yaml):

llamafactory-cli train examples/train_qlora/qwen_lora.yaml
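Before launching a run, it can save time to check that the paths in the config actually resolve. The sketch below is a hypothetical pre-flight helper, not part of Llama Factory: it reads only the flat key: value lines used in this config (it is not a general YAML parser) and reports a missing model directory or a dataset name that is not registered in data/dataset_info.json.

```python
import json
from pathlib import Path


def read_flat_yaml(text: str) -> dict:
    """Parse the flat `key: value` lines used in this config (not general YAML)."""
    config = {}
    for line in text.splitlines():
        line = line.split(" #", 1)[0].strip()  # drop trailing comments
        if not line or line.startswith("#") or ":" not in line:
            continue
        key, value = line.split(":", 1)
        config[key.strip()] = value.strip()
    return config


def preflight(config: dict, dataset_info: dict) -> list:
    """Return a list of readable problems; an empty list means none were found."""
    problems = []
    model_path = config.get("model_name_or_path", "")
    if not model_path or not Path(model_path).is_dir():
        problems.append(f"model directory not found: {model_path}")
    for name in config.get("dataset", "").split(","):
        if name.strip() not in dataset_info:
            problems.append(f"dataset not registered: {name.strip()}")
    return problems


def check_files(config_path: str, data_dir: str = "data") -> list:
    """Load the config and dataset_info.json from disk and run the checks."""
    config = read_flat_yaml(Path(config_path).read_text(encoding="utf-8"))
    info_text = (Path(data_dir) / "dataset_info.json").read_text(encoding="utf-8")
    return preflight(config, json.loads(info_text))
```

Calling check_files("examples/train_qlora/qwen_lora.yaml") from the repository root would return an empty list when both the model directory and the dataset entry are in place.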

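If GPU memory is insufficient, the run may fail with an error (for example when training the 32B model); I will cover that in a later post. With enough memory, you get the fine-tuned LoRA module directly; on my two 3090s, training takes just over a minute. Next, copy examples/inference/llama3_lora_sft.yaml, rename the copy, and change its contents to: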

model_name_or_path: /home/quyu/Qwen2.5-7B-Instruct
adapter_name_or_path: saves/qwen-7b/lora/sft
template: qwen
infer_backend: huggingface # choices: [huggingface, vllm]
trust_remote_code: true

Then run it (my renamed file is qwen2_lora.yaml; name it however you like):

llamafactory-cli chat examples/inference/qwen2_lora.yaml

Now ask the model "你是谁？" ("Who are you?") and you should see the effect of the fine-tuning.
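If the answer has not changed, one thing worth checking is whether the adapter directory actually contains the LoRA weights. The sketch below is a hypothetical helper that looks for the files PEFT normally writes when saving an adapter (adapter_config.json plus adapter_model.safetensors, or adapter_model.bin in older versions). It assumes the relative path from the config is resolved from the same working directory used for training.

```python
from pathlib import Path


def missing_adapter_files(adapter_dir):
    """List expected LoRA adapter files that are absent from adapter_dir."""
    folder = Path(adapter_dir)
    missing = []
    if not (folder / "adapter_config.json").is_file():
        missing.append("adapter_config.json")
    # PEFT writes safetensors by default; older versions wrote a .bin file.
    weights = ("adapter_model.safetensors", "adapter_model.bin")
    if not any((folder / w).is_file() for w in weights):
        missing.append("adapter_model.safetensors")
    return missing


# Path taken from output_dir / adapter_name_or_path in the configs.
print(missing_adapter_files("saves/qwen-7b/lora/sft"))
```

An empty list suggests the adapter was saved; since saves/qwen-7b/lora/sft is a relative path, running the chat command from a different directory than the training command is a likely cause of a missing adapter.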
