开源reddy-v4模型 - 免费生成高质量女性形象图像

首页

Reddy V4

由 Unmapped2895 开发

基于FLUX.1-dev的标准PEFT LoRA模型，专注于生成高质量女性形象图像

图像生成开源协议:其他 #女性时尚写真 #高分辨率图像生成 #LoRA微调模型

下载量 59

发布时间 : 4/4/2025

模型简介

这是一个基于FLUX.1-dev的LoRA微调模型，专门用于生成各种场景下的高质量女性形象图像，支持文本生成图像和图像生成图像任务。

模型特点

高质量图像生成

能够生成细节丰富、风格多样的高质量女性形象图像

LoRA微调

采用低秩适应(LoRA)技术对基础模型进行高效微调

多场景支持

支持多种场景下的图像生成，包括瑜伽、赛博朋克、奇幻等不同风格

流匹配技术

采用流匹配(Flow Matching)技术进行训练，提升生成质量

模型能力

文本生成图像

图像生成图像

高质量人物形象生成

多风格图像生成

使用案例

创意设计

时尚摄影生成

生成高端时尚摄影风格的内衣模特图像

可生成具有专业摄影质感的时尚图像

角色设计

为游戏或影视作品生成各种风格的角色形象

可生成赛博朋克、奇幻等多种风格的角色形象

内容创作

社交媒体内容

为社交媒体生成吸引眼球的网红风格图像

可生成适合Instagram等平台的高质量内容

🚀 reddy-v4

reddy-v4 是一个基于 black-forest-labs/FLUX.1-dev 的标准 PEFT LoRA 模型，可用于文本到图像的生成任务。

🚀 快速开始

推理示例

import torch
from diffusers import DiffusionPipeline

model_id = 'black-forest-labs/FLUX.1-dev'
adapter_id = 'Unmapped2895/reddy-v4'
pipeline = DiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.bfloat16) # loading directly in bf16
pipeline.load_lora_weights(adapter_id)

prompt = "Realistic wide shot photo of woman posing in a luxurious satin lingerie set, featuring a plunging bra, delicate thong and a classic garter belt with black stockings. The satin lingerie shimmers softly in the light, and the cut emphasizes both sophistication and a hint of allure. The lingerie is detailed with fine lace edges, highlighting her alluring figure. She elegantly styled hair as if getting ready for a formal event. The photo has a cinematic quality with rays of light and dramatic play of shadow and light"


## Optional: quantise the model to save on vram.
## Note: The model was not quantised during training, so it is not necessary to quantise it during inference time.
#from optimum.quanto import quantize, freeze, qint8
#quantize(pipeline.transformer, weights=qint8)
#freeze(pipeline.transformer)
    
pipeline.to('cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu') # the pipeline is already in its target precision level
model_output = pipeline(
    prompt=prompt,
    num_inference_steps=20,
    generator=torch.Generator(device='cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu').manual_seed(42),
    width=832,
    height=1216,
    guidance_scale=3.5,
).images[0]

model_output.save("output.png", format="PNG")

✨ 主要特性

基于 black-forest-labs/FLUX.1-dev 基础模型构建，属于标准 PEFT LoRA 模型。
支持文本到图像、图像到图像等多种生成任务。
训练和推理过程中提供了详细的参数设置。

📦 安装指南

文档未提及具体安装步骤，暂不提供。

💻 使用示例

基础用法

import torch
from diffusers import DiffusionPipeline

model_id = 'black-forest-labs/FLUX.1-dev'
adapter_id = 'Unmapped2895/reddy-v4'
pipeline = DiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.bfloat16) # loading directly in bf16
pipeline.load_lora_weights(adapter_id)

prompt = "Realistic wide shot photo of woman posing in a luxurious satin lingerie set, featuring a plunging bra, delicate thong and a classic garter belt with black stockings. The satin lingerie shimmers softly in the light, and the cut emphasizes both sophistication and a hint of allure. The lingerie is detailed with fine lace edges, highlighting her alluring figure. She elegantly styled hair as if getting ready for a formal event. The photo has a cinematic quality with rays of light and dramatic play of shadow and light"

pipeline.to('cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu') # the pipeline is already in its target precision level
model_output = pipeline(
    prompt=prompt,
    num_inference_steps=20,
    generator=torch.Generator(device='cuda' if torch.cuda.is_available() else 'mps' if torch.backends.mps.is_available() else 'cpu').manual_seed(42),
    width=832,
    height=1216,
    guidance_scale=3.5,
).images[0]

model_output.save("output.png", format="PNG")

高级用法

文档未提及高级用法示例，暂不提供。

📚 详细文档

验证设置

CFG：3.5
CFG Rescale：0.0
步数：20
采样器：FlowMatchEulerDiscreteScheduler
随机种子：42
分辨率：832x1216
跳过层引导：无

注意：验证设置不一定与训练设置相同。

训练设置

属性	详情
训练轮数	10
训练步数	2000
学习率	0.0001
学习率调度	常量
热身步数	500
最大梯度值	2.0
有效批量大小	1
微批量大小	1
梯度累积步数	1
GPU 数量	1
梯度检查点	启用
预测类型	flow - matching (额外参数=['shift=3', 'flux_guidance_mode=constant', 'flux_guidance_value=1.0', 'flow_matching_loss=compatible', 'flux_lora_target=all'])
优化器	adamw_bf16
可训练参数精度	Pure BF16
基础模型精度	`no_change`
字幕丢弃概率	10.0%
LoRA 秩	16
LoRA Alpha	无
LoRA 丢弃率	0.1
LoRA 初始化风格	默认