coin-clip-vit-base-patch32: an open-source coin image retrieval model

Coin Clip Vit Base Patch32

Developed by breezedeus

A coin image retrieval model fine-tuned from CLIP, with improved feature extraction for coin images

Image-to-Text

Transformers

License: Apache-2.0 #CoinImageRetrieval #MultimodalFeatureExtraction #Numismatics

Downloads: 886

Released: 11/26/2023

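Model overview

Coin-CLIP is a specialized model fine-tuned from OpenAI's CLIP model that focuses on coin image retrieval and recognition. It was trained with contrastive learning on a large coin image dataset, which significantly improves the accuracy of coin image search.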
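The exact training objective used for Coin-CLIP is not described here. As a rough illustration of what contrastive learning means in practice, the sketch below implements a generic symmetric InfoNCE-style loss over a batch of paired embeddings using NumPy. The function name `info_nce_loss`, the `temperature` default, and the pairing scheme are assumptions made for illustration, not the author's training code.

```python
import numpy as np


def info_nce_loss(anchors: np.ndarray, positives: np.ndarray,
                  temperature: float = 0.07) -> float:
    """Generic symmetric InfoNCE loss (illustrative, not Coin-CLIP's actual code).

    Row i of `anchors` and row i of `positives` are treated as a matching pair;
    every other row in the batch acts as a negative.
    """
    # L2-normalize so that dot products are cosine similarities
    a = anchors / np.linalg.norm(anchors, axis=1, keepdims=True)
    p = positives / np.linalg.norm(positives, axis=1, keepdims=True)

    # (B, B) similarity matrix, sharpened by the temperature
    logits = a @ p.T / temperature

    def diag_cross_entropy(scores: np.ndarray) -> float:
        # Numerically stable log-softmax over each row
        shifted = scores - scores.max(axis=1, keepdims=True)
        log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
        # The correct "class" for row i is column i
        return float(-np.mean(np.diag(log_probs)))

    # Average the anchor->positive and positive->anchor directions
    return 0.5 * (diag_cross_entropy(logits) + diag_cross_entropy(logits.T))
```

Minimizing a loss of this shape pulls matching embeddings together and pushes non-matching ones apart, which is what makes nearest-neighbor search over the resulting features meaningful.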

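Model features

Dedicated coin image retrieval

Feature extraction optimized for coin images, which significantly improves retrieval accuracy

Multimodal learning

Inherits CLIP's multimodal learning capability and supports associating images with text

Large-scale training data

Fine-tuned on a dataset of more than 340,000 coin images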

Model capabilities

Coin image feature extraction

Image-based coin search

Coin recognition

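Use cases

Numismatics

Coin image database retrieval

Quickly find similar coins in a large coin image library

Retrieval accuracy is significantly higher than with a general-purpose CLIP model

Coin authentication assistance

Helps identify rare or historical coins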

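🚀 Coin-CLIP 🪙 : Improving coin image retrieval with CLIP

Coin-CLIP is a model built specifically for coin image retrieval. It is based on OpenAI's CLIP model and fine-tuned on a large amount of coin image data, so it extracts coin image features more accurately and enables efficient image-to-image search.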

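✨ Key features

State-of-the-art coin image retrieval;
Improved feature extraction for coin images;
Seamless integration with CLIP's multimodal learning capability.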

📦 Installation

Install the Python package

pip install coin_clip

💻 Usage examples

Basic usage

from PIL import Image

import torch
import torch.nn.functional as F
from transformers import CLIPProcessor, CLIPModel

# Load the fine-tuned model and its preprocessor from the Hugging Face Hub
model = CLIPModel.from_pretrained("breezedeus/coin-clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("breezedeus/coin-clip-vit-base-patch32")

image_fp = "path/to/coin_image.jpg"
image = Image.open(image_fp).convert("RGB")

# Extract the image embedding; gradients are not needed for inference
inputs = processor(images=image, return_tensors="pt")
with torch.no_grad():
    img_features = model.get_image_features(**inputs)
# L2-normalize so that dot products between embeddings are cosine similarities
img_features = F.normalize(img_features, dim=1)
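Once two coin images have been embedded this way, they can be compared by cosine similarity. The sketch below is a minimal NumPy version. The helper name `cosine_similarity` is hypothetical, and the random vectors are stand-ins for real embeddings (for example `img_features.numpy()` from the snippet above).

```python
import numpy as np


def cosine_similarity(feat_a, feat_b) -> float:
    """Cosine similarity between two embedding vectors (hypothetical helper)."""
    a = np.asarray(feat_a, dtype=np.float64).ravel()
    b = np.asarray(feat_b, dtype=np.float64).ravel()
    # Re-normalize defensively; this is a no-op for already normalized features
    a = a / np.linalg.norm(a)
    b = b / np.linalg.norm(b)
    return float(a @ b)


# Stand-in embeddings with the same dimensionality as the model output (512)
rng = np.random.default_rng(0)
coin_a = rng.normal(size=512)
coin_b = coin_a + 0.1 * rng.normal(size=512)  # a slightly perturbed copy
coin_c = rng.normal(size=512)                 # an unrelated vector

print(cosine_similarity(coin_a, coin_b))  # close to 1 for similar vectors
print(cosine_similarity(coin_a, coin_c))  # near 0 for unrelated random vectors
```

Scores range from -1 to 1, and higher values mean the two images are closer in feature space. Any threshold for deciding that two images show the same coin would have to be tuned on real data.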

Advanced usage

from coin_clip import CoinClip

# Automatically download the model from Huggingface
model = CoinClip(model_name='breezedeus/coin-clip-vit-base-patch32')
images = ['examples/10_back.jpg', 'examples/16_back.jpg']
img_feats, success_ids = model.get_image_features(images)
print(img_feats.shape)  # --> (2, 512)
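A feature matrix like `img_feats` can serve as a small search index: embed a query image the same way, score it against every row, and keep the best matches. The sketch below shows that ranking step with NumPy only. The function `top_k_similar` and the toy gallery are illustrative assumptions and are not part of the `coin_clip` package.

```python
import numpy as np


def top_k_similar(query_feat, gallery_feats, k: int = 5):
    """Rank gallery rows by cosine similarity to the query (hypothetical helper).

    Returns a list of (gallery_row_index, similarity) pairs, best match first.
    """
    gallery = np.asarray(gallery_feats, dtype=np.float64)
    query = np.asarray(query_feat, dtype=np.float64).ravel()

    # Normalize both sides so that the dot product is a cosine similarity
    gallery = gallery / np.linalg.norm(gallery, axis=1, keepdims=True)
    query = query / np.linalg.norm(query)

    scores = gallery @ query              # one score per gallery image
    k = min(k, len(scores))
    order = np.argsort(-scores)[:k]       # indices sorted by descending score
    return [(int(i), float(scores[i])) for i in order]


# Toy gallery of three 4-dimensional "embeddings" standing in for real features
gallery = np.array([
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.0],
])
query = np.array([0.9, 0.1, 0.0, 0.0])

for index, score in top_k_similar(query, gallery, k=2):
    print(index, round(score, 3))
```

A brute-force scan like this is usually adequate for a few thousand images. For much larger collections an approximate nearest-neighbor index is the common alternative, though nothing here states which approach the `coin_clip` package itself uses.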