all-MiniLM-L2-v2开源模型 - 推理提速近2倍，CPU和GPU上精准度高

首页

All MiniLM L2 V2

由 tabularisai 开发

该模型是从all-MiniLM-L12-v2蒸馏而来，推理速度提升近2倍，同时在CPU和GPU上保持较高的准确度。

文本嵌入

Safetensors

支持多种语言开源协议:Apache-2.0 #高速文本嵌入 #检索增强生成 #轻量级模型

下载量 5,063

发布时间 : 5/5/2025

模型简介

一个高效的文本嵌入模型，适用于句子相似度计算和检索增强生成等任务。

模型特点

高速推理

相比all-MiniLM-L6-v2模型，推理速度提升近2倍

高准确度

在保持高速推理的同时，准确度接近原模型

轻量级

模型体积小，适合资源受限的环境

模型能力

文本嵌入

句子相似度计算

语义检索

使用案例

信息检索

检索增强生成(RAG)

作为检索器用于RAG流程，快速找到相关文档

提高检索速度和系统响应时间

语义分析

句子相似度计算

计算两个句子之间的语义相似度

可用于问答系统、重复检测等场景

🚀 最快的文本嵌入模型：tabularisai/all-MiniLM-L2-v2

本模型是从 sentence-transformers/all-MiniLM-L12-v2 蒸馏而来，与最小的 all-MiniLM-L6-v2 模型相比，推理速度几乎快了 2 倍，同时在 CPU 和 GPU 上都能保持较高的准确性。

🚀 快速开始

本模型可用于文本嵌入和检索增强生成（RAG）等任务，下面将详细介绍其使用方法。

💻 使用示例

基础用法

检索增强生成（RAG）示例

可将此模型用作 RAG 管道中的检索器：

from sentence_transformers import SentenceTransformer, util
import faiss
import numpy as np

# Load embedding model
model = SentenceTransformer("tabularisai/all-MiniLM-L2-v2")

# Your 5 simple documents
documents = [
    "Renewable energy comes from natural sources.",
    "Solar panels convert sunlight into electricity.",
    "Wind turbines harness wind power.",
    "Fossil fuels are non-renewable sources of energy.",
    "Hydropower uses water to generate electricity."
]

# Embed documents
doc_embeddings = model.encode(documents, convert_to_numpy=True)

# Create FAISS index
dim = doc_embeddings.shape[1]
index = faiss.IndexFlatL2(dim)
index.add(doc_embeddings)

# Query
query = "What are the benefits of renewable energy?"
query_embedding = model.encode([query], convert_to_numpy=True)

# Search top 3 similar docs
D, I = index.search(query_embedding, k=3)

# Print results
print("Query:", query)
print("\nTop 3 similar documents:")
for rank, idx in enumerate(I[0]):
    print(f"{rank+1}. {documents[idx]} (score: {D[0][rank]:.4f})")

高级用法

句子嵌入示例

首先安装库：

pip install -U sentence-transformers

然后加载模型并对句子进行编码：

from sentence_transformers import SentenceTransformer

model = SentenceTransformer("tabularisai/all-MiniLM-L2-v2")

sentences = [
    "The weather is lovely today.",
    "It's so sunny outside!",
    "He drove to the stadium.",
]

embeddings = model.encode(sentences)
print(embeddings.shape)  # [3, 384]

similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)  # [3, 3]