detr-resnet-101-dc5-sku110k开源目标检测模型

首页

Detr Resnet 101 Dc5 Sku110k

由 isalia99 开发

这是一个基于DETR架构的目标检测模型，使用ResNet-101-DC5作为骨干网络，在SKU110K数据集上训练，查询数设置为400。

目标检测

Transformers

开源协议:Apache-2.0 #零售货架检测 #400查询数优化 #DETR架构

下载量 129

发布时间 : 3/18/2024

模型简介

该模型专门用于目标检测任务，特别适用于零售商品检测场景。

模型特点

400查询数设计

相比原始DETR模型，该模型将查询数设置为400，可能提高了检测密集小目标的能力

SKU110K数据集预训练

专门针对零售商品检测场景进行优化，在SKU110K数据集上进行了端到端训练

端到端训练

采用DETR的端到端训练方式，无需复杂的后处理流程

模型能力

目标检测

零售商品识别

密集小物体检测

使用案例

零售行业

货架商品检测

自动检测和识别零售货架上的商品

在SKU110K验证集上mAP达到59.8

库存管理

辅助零售商店进行自动化库存盘点

🚀 DETR（端到端目标检测）模型：基于ResNet - 101 - DC5骨干网络，在SKU110K数据集上训练，num_queries为400

本项目的DETR（Detection Transformer）模型在SKU110K目标检测数据集（包含8000张带注释图像）上进行了端到端训练。与原始模型的主要区别在于，本模型的num_queries设置为400，并且在SKU110K数据集上进行了预训练。

🚀 快速开始

模型使用方法

以下是使用该模型的示例代码：

from transformers import DetrImageProcessor, DetrForObjectDetection
import torch
from PIL import Image, ImageOps
import requests

url = "https://github.com/Isalia20/DETR-finetune/blob/main/IMG_3507.jpg?raw=true"
image = Image.open(requests.get(url, stream=True).raw)
image = ImageOps.exif_transpose(image)

# you can specify the revision tag if you don't want the timm dependency
processor = DetrImageProcessor.from_pretrained("facebook/detr-resnet-101-dc5")
model = DetrForObjectDetection.from_pretrained("isalia99/detr-resnet-101-dc5-sku110k")
model = model.eval()
inputs = processor(images=image, return_tensors="pt")
outputs = model(**inputs)

# convert outputs (bounding boxes and class logits) to COCO API
# let's only keep detections with score > 0.8
target_sizes = torch.tensor([image.size[::-1]])
results = processor.post_process_object_detection(outputs, target_sizes=target_sizes, threshold=0.8)[0]

for score, label, box in zip(results["scores"], results["labels"], results["boxes"]):
        box = [round(i, 2) for i in box.tolist()]
        print(
                f"Detected {model.config.id2label[label.item()]} with confidence "
                f"{round(score.item(), 3)} at location {box}"
        )

运行上述代码，输出示例如下：

Detected LABEL_1 with confidence 0.983 at location [665.49, 480.05, 708.15, 650.11]
Detected LABEL_1 with confidence 0.938 at location [204.99, 1405.9, 239.9, 1546.5]
...
Detected LABEL_1 with confidence 0.998 at location [772.85, 169.49, 829.67, 372.18]
Detected LABEL_1 with confidence 0.999 at location [828.28, 1475.16, 874.37, 1593.43]

目前，特征提取器和模型均支持PyTorch。