detr-resnet-50-sku110k开源目标检测模型 - 免费部署助力商品货架检测

首页

Detr Resnet 50 Sku110k

由 isalia99 开发

该DETR模型在SKU110K目标检测数据集上进行了端到端训练，查询数设置为400，适用于商品货架检测等场景。

目标检测

Transformers

开源协议:Apache-2.0 #密集商品检测 #400查询数优化 #零售场景专用

下载量 4,066

发布时间 : 3/14/2024

模型简介

这是一个基于DETR架构的目标检测模型，使用ResNet-50作为骨干网络，在SKU110K数据集上进行了微调训练，专门用于零售场景中的商品检测。

模型特点

400查询数设计

相比原始DETR模型，该版本将查询数设置为400，可能更适合处理密集场景的目标检测任务。

SKU110K数据集优化

专门针对零售商品检测场景进行优化，在SKU110K数据集上表现良好。

两阶段训练策略

采用先微调解码器再微调整个网络的训练策略，可能有助于提高模型性能。

模型能力

商品目标检测

零售场景图像分析

密集物体识别

使用案例

零售行业

货架商品检测

自动识别和定位零售货架上的商品

在SKU110K验证集上达到58.9 mAP

库存管理

通过图像分析自动统计商品数量和位置

🚀 DETR（端到端目标检测）模型

本项目是一个基于ResNet - 50骨干网络的DETR（端到端目标检测）模型，在SKU110K数据集上进行训练，设置了400个查询数（num_queries）。该模型能够有效解决目标检测问题，为相关领域的应用提供了强大的支持。

🚀 快速开始

DETR（Detection Transformer）模型在SKU110K目标检测数据集（包含8000张标注图像）上进行了端到端的训练。与原始模型相比，主要区别在于设置了400个查询数（num_queries），并且在SKU110K数据集上进行了预训练。

模型使用方法

以下是使用该模型的示例代码：

from transformers import DetrImageProcessor, DetrForObjectDetection
import torch
from PIL import Image, ImageOps
import requests

url = "https://github.com/Isalia20/DETR-finetune/blob/main/IMG_3507.jpg?raw=true"
image = Image.open(requests.get(url, stream=True).raw)
image = ImageOps.exif_transpose(image)

# you can specify the revision tag if you don't want the timm dependency
processor = DetrImageProcessor.from_pretrained("facebook/detr-resnet-50", revision="no_timm")
model = DetrForObjectDetection.from_pretrained("isalia99/detr-resnet-50-sku110k")
model = model.eval()
inputs = processor(images=image, return_tensors="pt")
outputs = model(**inputs)

# convert outputs (bounding boxes and class logits) to COCO API
# let's only keep detections with score > 0.8
target_sizes = torch.tensor([image.size[::-1]])
results = processor.post_process_object_detection(outputs, target_sizes=target_sizes, threshold=0.8)[0]

for score, label, box in zip(results["scores"], results["labels"], results["boxes"]):
        box = [round(i, 2) for i in box.tolist()]
        print(
                f"Detected {model.config.id2label[label.item()]} with confidence "
                f"{round(score.item(), 3)} at location {box}"
        )

代码运行后，预期输出如下：

Detected LABEL_1 with confidence 0.983 at location [665.49, 480.05, 708.15, 650.11]
Detected LABEL_1 with confidence 0.938 at location [204.99, 1405.9, 239.9, 1546.5]
...
Detected LABEL_1 with confidence 0.998 at location [772.85, 169.49, 829.67, 372.18]
Detected LABEL_1 with confidence 0.999 at location [828.28, 1475.16, 874.37, 1593.43]

目前，特征提取器和模型均支持PyTorch。