wan-14b-laezel open-source fine-tuning module - free, customizable video generation for the Lae'zel character

Wan 14b Laezel

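Developed by fofr

A LoRA fine-tuning module designed for the Wan2.1 14B video generation model, used to generate videos of a specific character (Lae'zel).

Text-to-Video | Multilingual | License: Apache-2.0 | #LoRA character fine-tuning #High-motion video generation #Multi-framework compatible

Downloads: 6

Published: 3/12/2025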

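Model Overview

A LoRA fine-tuning module based on the Wan2.1 14B video generation model. It focuses on generating videos of the Lae'zel character and supports both text-to-video and image-to-video tasks.

Model Features

Character-specific generation

Tuned specifically for the Lae'zel character, so that generated videos keep the character's appearance.

Multi-framework compatibility

Works with both Diffusers and ComfyUI.

Cloud optimization

Optimized for the Replicate platform for efficient cloud inference.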

Model Capabilities

Text-to-video

Image-to-video

Character-specific video generation

LoRA fine-tuning application

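Use Cases

Character video creation

Character scene generation

Generate videos of the Lae'zel character in a specific setting, such as a cafe.

Can produce videos with effects such as a warm atmosphere and a blurred background.

Game content creation

Game character animation

Generate animation clips for game characters.

Can quickly produce prototype character animations.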

🚀 Wan 14B Laezel

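This project is a LoRA for video generation, based on the Wan2.1 14b video generation model. It can be used for text-to-video and image-to-video tasks, together with diffusers or ComfyUI.

🚀 Quick Start

About this LoRA

This is a LoRA for the Wan2.1 14b video generation model. It can be used with diffusers or ComfyUI, and it can be loaded into both the text-to-video and the image-to-video Wan2.1 models. It was trained on Replicate using AI Toolkit: https://replicate.com/ostris/wan-lora-trainer/train

Trigger word

Include LAEZEL in your prompt to trigger generation of the character.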
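
A scene description can be combined with the trigger word programmatically so the trigger is never left out. The helper below is a minimal sketch: `with_trigger` is a hypothetical name, and the whole-word, case-sensitive match and the comma separator are assumptions made for illustration, not documented behavior of this LoRA.

```python
import re

TRIGGER_WORD = "LAEZEL"


def with_trigger(prompt: str, trigger: str = TRIGGER_WORD) -> str:
    """Return the prompt with the trigger word present, prepending it if missing."""
    cleaned = prompt.strip()
    if not cleaned:
        # An empty prompt falls back to the bare trigger word.
        return trigger
    # Whole-word, case-sensitive check (an assumption; see the note above).
    if re.search(rf"\b{re.escape(trigger)}\b", cleaned):
        return cleaned
    return f"{trigger}, {cleaned}"


print(with_trigger("sitting in a cafe, warm light, blurred background"))
```

The returned string can then be passed as the `prompt` value in either of the usage paths described in this card.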

Using this LoRA

Replicate hosts a collection of Wan2.1 models optimized for speed and cost, and they can also be used with this LoRA:

https://replicate.com/collections/wan-video
https://replicate.com/fofr/wan2.1-with-lora

Run this LoRA with Replicate's API

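import replicate

# "model" is an input of fofr/wan2.1-with-lora, so it goes in the input dict
# rather than being passed as a keyword argument to replicate.run().
model_input = {
    "model": "14b",
    "prompt": "LAEZEL",
    "lora_url": "https://huggingface.co/fofr/wan-14b-laezel/resolve/main/wan2.1-14b-laezel-lora.safetensors"
}

output = replicate.run(
    "fofr/wan2.1-with-lora:f83b84064136a38415a3aff66c326f94c66859b8ad7a2cb432e2822774f07b08",
    input=model_input
)

# Save each returned video file to disk.
for index, item in enumerate(output):
    with open(f"output_{index}.mp4", "wb") as file:
        file.write(item.read())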
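
The save loop above can be wrapped in a small helper that writes into a chosen directory and returns the resulting paths. This is only a sketch: `save_videos` is a hypothetical name, and the items are assumed to be file-like objects whose `read()` returns bytes, which is what the loop above already relies on.

```python
from pathlib import Path
from typing import Iterable, List


def save_videos(items: Iterable, out_dir: str = ".", stem: str = "output") -> List[Path]:
    """Write each file-like item to <out_dir>/<stem>_<index>.mp4 and return the paths."""
    target = Path(out_dir)
    target.mkdir(parents=True, exist_ok=True)  # create the directory if needed
    paths: List[Path] = []
    for index, item in enumerate(items):
        path = target / f"{stem}_{index}.mp4"
        path.write_bytes(item.read())  # item.read() is assumed to return bytes
        paths.append(path)
    return paths
```

With the Replicate example, this would be called as `save_videos(output, "videos")` in place of the loop.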

Use with Diffusers

pip install git+https://github.com/huggingface/diffusers.git

import torch
from diffusers.utils import export_to_video
from diffusers import AutoencoderKLWan, WanPipeline
from diffusers.schedulers.scheduling_unipc_multistep import UniPCMultistepScheduler

model_id = "Wan-AI/Wan2.1-T2V-14B-Diffusers"
# Load the VAE in float32 and the rest of the pipeline in bfloat16.
vae = AutoencoderKLWan.from_pretrained(model_id, subfolder="vae", torch_dtype=torch.float32)
pipe = WanPipeline.from_pretrained(model_id, vae=vae, torch_dtype=torch.bfloat16)
flow_shift = 3.0  # 5.0 for 720P, 3.0 for 480P
pipe.scheduler = UniPCMultistepScheduler.from_config(pipe.scheduler.config, flow_shift=flow_shift)

# Choose one device placement strategy, not both.
pipe.to("cuda")
# pipe.enable_model_cpu_offload()  # use instead of pipe.to("cuda") in low-VRAM environments

# Load the LoRA weights into the pipeline.
pipe.load_lora_weights("fofr/wan-14b-laezel")

prompt = "LAEZEL"
negative_prompt = "Bright tones, overexposed, static, blurred details, subtitles, style, works, paintings, images, static, overall gray, worst quality, low quality, JPEG compression residue, ugly, incomplete, extra fingers, poorly drawn hands, poorly drawn faces, deformed, disfigured, misshapen limbs, fused fingers, still picture, messy background, three legs, many people in the background, walking backwards"

# Generate 81 frames at 832x480 and export them as a 16 fps video.
output = pipe(
    prompt=prompt,
    negative_prompt=negative_prompt,
    height=480,
    width=832,
    num_frames=81,
    guidance_scale=5.0,
).frames[0]
export_to_video(output, "output.mp4", fps=16)
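
The `flow_shift` comment and the `num_frames` and `fps` values in the example imply two small calculations, which the sketch below makes explicit. `pick_flow_shift` and `clip_seconds` are hypothetical helpers; the 720-pixel height threshold is an assumption read from the "5.0 for 720P, 3.0 for 480P" comment, which only covers those two resolutions.

```python
def pick_flow_shift(height: int) -> float:
    """Return 5.0 for 720p-class output and 3.0 otherwise, following the example's comment."""
    # Assumption: any height of 720 or more is treated as 720P.
    return 5.0 if height >= 720 else 3.0


def clip_seconds(num_frames: int, fps: int) -> float:
    """Return the playback length in seconds of a clip exported at the given frame rate."""
    if fps <= 0:
        raise ValueError("fps must be positive")
    return num_frames / fps


print(pick_flow_shift(480))   # 3.0
print(clip_seconds(81, 16))   # 5.0625
```

With the settings used in the example (81 frames exported at 16 fps), the resulting clip is just over five seconds long.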