fashion-object-detection: Open-Source Fashion Object Detection Model

Fashion Object Detection

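Developed by yainage90

A fashion object detection model fine-tuned from microsoft/conditional-detr-resnet-50, built specifically to detect fashion items.

Object Detection

Transformers

Language: English. License: MIT. Tags: #fashion-item-detection #high-mAP #e-commerce-visual-search

Downloads: 1,710

Released: 8/24/2024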

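Model Overview

This is an object detection model tuned for the fashion domain. It identifies fashion items in an image, such as bags, bottoms, and dresses.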

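Model Features

Fashion-specific detection

A detector optimized for fashion items that recognizes 7 classes of fashion items.

High performance

Reaches 0.7542 mAP in testing.

Multi-dataset training

Trained on a combination of two fashion datasets, ModaNet and Fashionpedia.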

Model Capabilities

Fashion item detection

Image analysis

Bounding box localization

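Use Cases

E-commerce

Automatic fashion product tagging

Automatically detect and tag fashion items in product images on e-commerce platforms.

Speeds up product listing and reduces manual annotation cost.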
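
As a rough sketch of the tagging step, the helper below turns a list of detections into a sorted set of product tags. The `detections_to_tags` name, the `(label, score)` pair format, and the 0.5 cut-off are illustrative assumptions, not part of the model or of the transformers API.

```python
def detections_to_tags(detections, min_score=0.5):
    """Return the sorted, de-duplicated labels whose score is at least min_score.

    detections: iterable of (label_name, score) pairs (hypothetical format).
    """
    tags = {label for label, score in detections if score >= min_score}
    return sorted(tags)


# Made-up detections for a single product photo.
example = [("dress", 0.91), ("bag", 0.62), ("bag", 0.55), ("hat", 0.31)]
print(detections_to_tags(example))  # ['bag', 'dress']
```

A stricter `min_score` trades missed tags for fewer wrong ones; the right value depends on the catalog and would need to be checked on real images.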

Visual search

Search for similar fashion items using an image.

Improves user experience and conversion rate.
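
One common way to build such a search, sketched here under assumptions, is to embed each detected item and rank catalog items by cosine similarity. The `rank_catalog` helper, the item ids, and the toy 3-dimensional vectors are hypothetical; real embeddings would come from a separate feature extractor, not from this detection model.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_catalog(query_embedding, catalog, top_k=3):
    """Return up to top_k (item_id, similarity) pairs, most similar first.

    catalog: dict mapping a hypothetical item id to its embedding.
    """
    scored = [
        (item_id, cosine_similarity(query_embedding, embedding))
        for item_id, embedding in catalog.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Toy embeddings; a real system would store one vector per catalog image.
catalog = {
    "sku-1": [1.0, 0.0, 0.0],
    "sku-2": [0.0, 1.0, 0.0],
    "sku-3": [0.9, 0.1, 0.0],
}
best = rank_catalog([1.0, 0.0, 0.0], catalog, top_k=2)
print([item_id for item_id, _ in best])  # ['sku-1', 'sku-3']
```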

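Fashion analytics

Fashion trend analysis

Analyze how popular individual fashion items are in social media images.

Provides market insight for fashion brands.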
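
A minimal sketch of the popularity count, assuming the labels detected in each image have already been collected into lists. The `label_popularity` name and the sample data are made up for illustration.

```python
from collections import Counter


def label_popularity(per_image_labels):
    """Count in how many images each label appears at least once."""
    counts = Counter()
    for labels in per_image_labels:
        counts.update(set(labels))  # set(): count a label once per image
    return counts


# Made-up labels detected in three images.
images = [["bag", "top", "bottom"], ["dress", "bag", "bag"], ["top", "shoes"]]
popularity = label_popularity(images)
print(popularity["bag"], popularity["dress"])  # 2 1
```

Counting once per image keeps a single photo with many bags from dominating the statistic; counting every detection instead would be an equally valid choice for other questions.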

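🚀 Fashion Object Detection Model

This project provides a fine-tuned fashion object detection model built on the transformers library. Based on microsoft/conditional-detr-resnet-50, it detects items such as bags, tops, and pants in fashion images, which makes it useful for applications such as fashion search.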

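🚀 Quick Start

This model is a fine-tuned version of microsoft/conditional-detr-resnet-50.

Details about the model are available in this GitHub repository -> fashion-visual-search

The companion fashion image feature extractor model is also available -> yainage90/fashion-image-feature-extractor

The model was trained on a combination of two datasets: modanet and fashionpedia

The labels the model can detect are ['bag', 'bottom', 'dress', 'hat', 'shoes', 'outer', 'top']

The best score, a mean average precision (mAP) of 0.7542, was reached at epoch 96 of 100. Because the best result came so close to the end of training, the model likely still has some room for improvement.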
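
For context, mAP scores are built on intersection over union (IoU), which measures how well a predicted box overlaps a ground-truth box. The sketch below computes IoU for boxes in (x_min, y_min, x_max, y_max) format. The source does not state which IoU thresholds or evaluation protocol produced the 0.7542 figure, so this illustrates the underlying measure only, not the author's evaluation code.

```python
def box_iou(box_a, box_b):
    """Intersection over union of two (x_min, y_min, x_max, y_max) boxes."""
    # Corners of the overlapping rectangle (may be empty).
    ix_min = max(box_a[0], box_b[0])
    iy_min = max(box_a[1], box_b[1])
    ix_max = min(box_a[2], box_b[2])
    iy_max = min(box_a[3], box_b[3])

    inter = max(0.0, ix_max - ix_min) * max(0.0, iy_max - iy_min)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


# Two 10x10 boxes that share half their area: 50 / 150, about 0.333.
print(box_iou((0, 0, 10, 10), (5, 0, 15, 10)))
```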

💻 Usage Examples

Basic Usage

from PIL import Image
import torch
from transformers import AutoImageProcessor, AutoModelForObjectDetection

# Pick the best available device: CUDA GPU, then Apple MPS, otherwise CPU.
device = torch.device('cpu')
if torch.cuda.is_available():
    device = torch.device('cuda')
elif torch.backends.mps.is_available():
    device = torch.device('mps')

# Load the image processor and the fine-tuned detector.
ckpt = 'yainage90/fashion-object-detection'
image_processor = AutoImageProcessor.from_pretrained(ckpt)
model = AutoModelForObjectDetection.from_pretrained(ckpt).to(device)

image = Image.open('<path/to/image>').convert('RGB')

with torch.no_grad():
    inputs = image_processor(images=[image], return_tensors="pt")
    outputs = model(**inputs.to(device))
    # PIL's image.size is (width, height); post-processing expects (height, width).
    target_sizes = torch.tensor([[image.size[1], image.size[0]]])
    # Keep detections scoring above 0.4 and rescale boxes to pixel coordinates.
    results = image_processor.post_process_object_detection(outputs, threshold=0.4, target_sizes=target_sizes)[0]

    # Collect (score, label id, [x_min, y_min, x_max, y_max]) for each detection.
    items = []
    for score, label, box in zip(results["scores"], results["labels"], results["boxes"]):
        score = score.item()
        label = label.item()
        box = [i.item() for i in box]
        print(f"{model.config.id2label[label]}: {round(score, 3)} at {box}")
        items.append((score, label, box))
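
A possible next step is cropping each detected item, for example to pass the crop to a feature extractor. The boxes above are floats and can extend slightly past the image edge, so the sketch below converts one into integer pixel bounds inside the image. The `to_crop_box` helper is a hypothetical name introduced here, not part of transformers or Pillow.

```python
import math


def to_crop_box(box, image_width, image_height):
    """Convert a float (x_min, y_min, x_max, y_max) box to int bounds inside the image."""
    x_min, y_min, x_max, y_max = box
    left = max(0, math.floor(x_min))
    top = max(0, math.floor(y_min))
    right = min(image_width, math.ceil(x_max))
    bottom = min(image_height, math.ceil(y_max))
    return left, top, right, bottom


# A made-up box that spills over the edges of a 100x200 image.
print(to_crop_box([-3.2, 10.7, 120.4, 250.9], image_width=100, image_height=200))
# (0, 10, 100, 200)
```

With the `image` and `items` from the basic usage example, `image.crop(to_crop_box(box, *image.size))` would return the region for one detection, since Pillow's `Image.crop` takes a (left, upper, right, lower) tuple and `image.size` is (width, height).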