nsfw-image-detection-384 open-source model: lightweight, high-accuracy image detection, free to use

Nsfw Image Detection 384

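Developed by Marqo

A lightweight NSFW image detection model with 98.56% accuracy, roughly 18 to 20 times smaller than comparable models

Image classification · License: Apache-2.0 · #lightweight-NSFW-detection #high-accuracy-image-classification #diverse-content-recognition

Downloads: 158.92k

Released: 11/20/2024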

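Model overview

A classification model designed to identify not-safe-for-work (NSFW) images across a wide range of content types

Model features

Lightweight and efficient

Roughly 18 to 20 times smaller than other open-source NSFW detectors

High accuracy

Reaches 98.56% accuracy on Marqo's dataset of 220,000 images

Diverse content support

Detects NSFW content in real photos, drawings, AI-generated images, and other content types

Adjustable threshold

The probability threshold can be tuned to balance precision against recall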

Model capabilities

NSFW image detection

Image content classification

Real-time image filtering

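Use cases

Content moderation

Social media content filtering

Automatically identify and filter inappropriate content uploaded by users

Reduce the manual review workload

Corporate network filtering

Block access to inappropriate images in the workplace

Keep the workplace environment safe

Safe search

Search engine safety filtering

Filter NSFW content out of image search results

Provide a safer search experience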

🚀 nsfw-image-detection-384

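nsfw-image-detection-384 is a lightweight image classification model designed to identify NSFW images. It is roughly 18 to 20 times smaller than other open-source models and reaches 98.56% accuracy on our dataset.

🚀 Quick start

Install dependencies

pip install timm

Code example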

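from urllib.request import urlopen

from PIL import Image
import timm
import torch

# Load a sample image and force RGB, since the model expects 3-channel input.
img = Image.open(urlopen(
    'https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/beignets-task-guide.png'
)).convert("RGB")

# Download the pretrained weights from the Hugging Face Hub and switch to inference mode.
model = timm.create_model("hf_hub:Marqo/nsfw-image-detection-384", pretrained=True)
model = model.eval()

# Build the preprocessing pipeline that matches the model's configuration.
data_config = timm.data.resolve_model_data_config(model)
transforms = timm.data.create_transform(**data_config, is_training=False)

# Run a forward pass on a batch of one image and convert logits to probabilities.
with torch.no_grad():
    output = model(transforms(img).unsqueeze(0)).softmax(dim=-1).cpu()

# Map the highest-probability index back to its class name.
class_names = model.pretrained_cfg["label_names"]
print("Probabilities:", output[0])
print("Class:", class_names[output[0].argmax().item()])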

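✨ Key features

Lightweight: roughly 18 to 20 times smaller than other open-source models.
High accuracy: reaches 98.56% accuracy on our dataset.
Input format: takes 384x384 pixel images as input and processes them as 16x16 pixel patches.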
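
As a quick illustration of the input format above, the following sketch works out how many patches a 384x384 image yields at a 16x16 patch size. The extra class token is an assumption based on the standard ViT design; it is not stated in this model card.

```python
# Patch arithmetic for a ViT with 384x384 input and 16x16 patches.
image_size = 384   # input side length in pixels (from the model card)
patch_size = 16    # patch side length in pixels (from the model card)

patches_per_side = image_size // patch_size
num_patches = patches_per_side ** 2

# Assumption: one extra class token is prepended, as in the standard ViT design.
num_tokens = num_patches + 1

print(f"{patches_per_side} x {patches_per_side} = {num_patches} patches, {num_tokens} tokens")
```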

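📚 Detailed documentation

Evaluation

This model outperforms existing NSFW detectors on our dataset. The figure compares it with AdamCodd/vit-base-nsfw-detector and Falconsai/nsfw_image_detection.

[Figure: evaluation comparison with other models]

Threshold tuning and precision-recall

Adjusting the threshold on the NSFW probability trades off precision, recall, and accuracy against one another, which can be useful in different applications.

[Figures: threshold evaluation; precision and recall curves]

Training details

The model was fine-tuned from timm/vit_tiny_patch16_384.augreg_in21k_ft_in1k.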
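
The threshold trade-off can be sketched in a few lines of plain Python. The helper names `classify` and `precision_recall` are hypothetical, and the scores and labels are made-up illustration data, not model output. With the real model, the NSFW probability would be read from the softmax output at whichever index `label_names` assigns to the NSFW class.

```python
def classify(nsfw_prob: float, threshold: float = 0.5) -> str:
    """Label an image NSFW when its NSFW probability reaches the threshold."""
    return "NSFW" if nsfw_prob >= threshold else "SFW"


def precision_recall(probs, labels, threshold):
    """Compute precision and recall for the NSFW class at a given threshold."""
    preds = [classify(p, threshold) for p in probs]
    tp = sum(1 for pr, y in zip(preds, labels) if pr == "NSFW" and y == "NSFW")
    fp = sum(1 for pr, y in zip(preds, labels) if pr == "NSFW" and y == "SFW")
    fn = sum(1 for pr, y in zip(preds, labels) if pr == "SFW" and y == "NSFW")
    precision = tp / (tp + fp) if (tp + fp) else 1.0
    recall = tp / (tp + fn) if (tp + fn) else 1.0
    return precision, recall


# Made-up scores and labels, for illustration only (not model output).
probs = [0.95, 0.80, 0.60, 0.40, 0.30, 0.10]
labels = ["NSFW", "NSFW", "SFW", "NSFW", "SFW", "SFW"]

for t in (0.3, 0.5, 0.7):
    p, r = precision_recall(probs, labels, t)
    print(f"threshold={t:.1f} precision={p:.2f} recall={r:.2f}")
```

On this toy data, raising the threshold improves precision at the cost of recall, and lowering it does the reverse. A moderation pipeline that must not miss NSFW content would lean toward a lower threshold.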

Training parameters

batch_size: 256
color_jitter: 0.2
color_jitter_prob: 0.05
cutmix: 0.1
drop: 0.1
drop_path: 0.05
epoch_repeats: 0.0
epochs: 20
gaussian_blur_prob: 0.005
hflip: 0.5
lr: 5.0e-05
mixup: 0.1
mixup_mode: batch
mixup_prob: 1.0
mixup_switch_prob: 0.5
momentum: 0.9
num_classes: 2
opt: adamw
remode: pixel
reprob: 0.5
sched: cosine
smoothing: 0.1
warmup_epochs: 2
warmup_lr: 1.0e-05
warmup_prefix: false
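
To make the scheduler settings above more concrete, here is a simplified sketch of a cosine schedule with linear warmup using the listed `lr`, `warmup_lr`, `warmup_epochs`, and `epochs`. It is not timm's exact implementation: the minimum learning rate of 0 and the per-epoch granularity are assumptions, and `lr_at_epoch` is a hypothetical helper.

```python
import math

# Values from the training parameters listed above.
EPOCHS = 20
WARMUP_EPOCHS = 2
BASE_LR = 5.0e-05
WARMUP_LR = 1.0e-05
MIN_LR = 0.0  # assumption: the minimum learning rate is not listed


def lr_at_epoch(epoch: int) -> float:
    """Learning rate at the start of a given epoch (0-indexed)."""
    if epoch < WARMUP_EPOCHS:
        # Linear warmup from WARMUP_LR towards BASE_LR.
        return WARMUP_LR + (BASE_LR - WARMUP_LR) * epoch / WARMUP_EPOCHS
    # Cosine decay over the full run; the warmup epochs count as part of
    # the cosine period (the assumed meaning of warmup_prefix: false).
    cosine = 0.5 * (1 + math.cos(math.pi * epoch / EPOCHS))
    return MIN_LR + (BASE_LR - MIN_LR) * cosine


for epoch in range(EPOCHS):
    print(f"epoch {epoch:2d}: lr = {lr_at_epoch(epoch):.2e}")
```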

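Dataset

The model was trained on a proprietary dataset of 220,000 images. The training set contains 100,000 NSFW examples and 100,000 safe-for-work (SFW) examples; the test set contains 10,000 NSFW examples and 10,000 SFW examples. The dataset is diverse and includes real photos, drawings, Rule 34 material, memes, and AI-generated images.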

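Notes

⚠️ Important

Like all models, this one can make mistakes. NSFW content is subjective and context-dependent, and the model is intended only to help identify such content. Use it at your own risk.

💡 Usage tip

The definition of NSFW varies and sometimes depends on context. Our dataset includes challenging examples, but its definition may not fit every use case. We recommend experimenting with different thresholds to determine whether the model suits your needs.

📄 License

This model is released under the Apache 2.0 license.

📚 Citation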

@article{dosovitskiy2020vit,
  title={An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale},
  author={Dosovitskiy, Alexey and Beyer, Lucas and Kolesnikov, Alexander and Weissenborn, Dirk and Zhai, Xiaohua and Unterthiner, Thomas and Dehghani, Mostafa and Minderer, Matthias and Heigold, Georg and Gelly, Sylvain and Uszkoreit, Jakob and Houlsby, Neil},
  journal={ICLR},
  year={2021}
}

@misc{rw2019timm,
  author = {Ross Wightman},
  title = {PyTorch Image Models},
  year = {2019},
  publisher = {GitHub},
  journal = {GitHub repository},
  doi = {10.5281/zenodo.4414861},
  howpublished = {\url{https://github.com/huggingface/pytorch-image-models}}
}