LLaMA-3-8B-Instruct-Abliterated-TR开源模型 - 强制用土耳其语响应，便于土耳其语交流

首页

Llama 3 8B Instruct Abliterated TR

由 Metin 开发

LLaMA-3-8B-Instruct的消融版本，通过特定技术手段强制模型用土耳其语响应

大型语言模型

Transformers

其他#土耳其语优化 #指令消融技术 #多轮对话支持

下载量 25

发布时间 : 4/25/2025

模型简介

该模型是基于Meta-Llama-3-8B-Instruct的消融版本，通过修改激活方向强制模型使用土耳其语回答问题。适用于土耳其语文本生成任务。

模型特点

土耳其语强制响应

通过消融技术强制模型使用土耳其语回答问题

基于LLaMA-3架构

继承LLaMA-3的强大文本生成能力

指令优化

针对指令跟随任务进行了优化

模型能力

土耳其语文本生成

指令理解与执行

问答系统

知识解释

使用案例

教育

土耳其语学习辅助

帮助学生理解土耳其语问题和概念

提供准确的知识解释

客服

土耳其语客服机器人

处理土耳其语客户咨询

提供流畅自然的土耳其语响应

🚀 🦙LLaMA-3-8B-Instruct-Abliterated-TR

LLaMA-3-8B-Instruct-Abliterated-TR 是 Meta-LLaMA-3-8B-Instruct 的消融版本，该模型通过特定方法消除了大语言模型中的拒绝机制，并强制其以土耳其语进行响应。

📦 模型信息

属性	详情
许可证	llama3
支持语言	土耳其语
任务类型	文本生成
基础模型	meta-llama/Meta-Llama-3-8B-Instruct

📊 模型评估结果

数据集	评估指标	得分	验证情况
MMLU_TR_V0.2	5-shot	49.08%	未验证
Truthful_QA_TR_V0.2	0-shot	49.62%	未验证
ARC_TR_V0.2	25-shot	43.77%	未验证
HellaSwag_TR_V0.2	10-shot	44.86%	未验证
GSM8K_TR_V0.2	5-shot	53.23%	未验证
Winogrande_TR_V0.2	5-shot	55.13%	未验证
平均得分	-	49.28%	-

🔍 什么是消融（Abliteration）？

Arditi 等人在他们的博客文章中指出，大语言模型中的拒绝机制是由残差流中的单一方向介导的。他们发现，阻止模型表示这个方向可以使其回答有害问题。若想深入了解这一概念，你可以参考 Maxime Labonne 的文章。

为了强制模型用土耳其语进行响应，研究人员使用了 LIMA 数据集的 stackexchange 子集制作了并行指令。这些指令随后被翻译成土耳其语，并在运行时附加了一个额外的句子，提示模型用土耳其语回答。

你可以通过以下链接找到本实验中使用的数据集：

https://huggingface.co/datasets/Metin/abliteration_en
https://huggingface.co/datasets/Metin/abliteration_tr

💻 使用示例

你可以使用以下代码片段来使用该模型：

from transformers import BitsAndBytesConfig
import transformers
import torch

bnb_config = BitsAndBytesConfig(
            load_in_4bit=True,
            bnb_4bit_use_double_quant=True,
            bnb_4bit_quant_type="nf4",
            bnb_4bit_compute_dtype=torch.bfloat16
)

model_id = "Metin/LLaMA-3-8B-Instruct-Abliterated-TR"

pipeline = transformers.pipeline(
    "text-generation",
    model=model_id,
    model_kwargs={"torch_dtype": torch.bfloat16 ,'quantization_config': bnb_config},
    device_map="auto",
)

messages = [
    {"role": "system", "content": "You are a helpful assistant."}, # Ideally we should not have to tell the model to answer in Turkish after abliteration.
    {"role": "user", "content": "Python'da bir öğenin bir listede geçip geçmediğini nasıl kontrol edebilirim?"},
]

prompt = pipeline.tokenizer.apply_chat_template(
        messages,
        tokenize=False,
        add_generation_prompt=True
)

terminators = [
    pipeline.tokenizer.eos_token_id,
    pipeline.tokenizer.convert_tokens_to_ids("<|eot_id|>")
]

outputs = pipeline(
    prompt,
    max_new_tokens=512,
    eos_token_id=terminators,
    do_sample=True,
    temperature=0.2,
    top_p=0.9,
)

print(outputs[0]["generated_text"][len(prompt):])

📈 模型细节

使用 40 个样本找出激活值之间的均值差异。
选择第 7 层作为最有可能的土耳其语表达方向的层。

⚠️ 注意事项

这些得分可能与你运行相同基准测试时得到的结果不同，因为本实验未使用任何推理引擎（如 vLLM、TensorRT-LLM 等）。

📋 输出示例（消融模型与基础模型对比）

虽然用单个示例测试模型并不准确，但这里仍提供一个示例来展示模型的能力。

模型：LLaMA-3-8B-Instruct-Abliterated-TR

输入

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Gökyüzü neden mavi?"},
]

输出

A classic question!

Gökyüzü (the sky) neden mavi (blue) sorusu, Türkçe'de "Why is the sky blue?" olarak çevrilebilir. Bu soruya cevap vermek için, ilk olarak, gökyüzünün mavi olmasının nedenini açıklamak gerekir.

Gökyüzünün mavi olmasının nedeni, ışığın farklı uzunluklarındaki yayılımına dayanır. Işık, gökyüzüne ulaşmadan önce, atmosferde bulunan gaz moleküllerine çarpınır. Bu çarpma, ışığın farklı uzunluklarındaki yayılımına neden olur.

Gökyüzünün mavi olmasının nedeni, ışığın kısa dalga boyunda (yaklaşık 450-495 nm) yayılımına dayanır. Bu dalga boyunda ışığın, gökyüzüne ulaşmadan önce, atmosferde bulunan gaz moleküllerine çarpınır. Bu çarpma, ışığın mavi renkini oluşturur.

Diğer renkler, farklı dalga boyunda yayılımına neden olur. Örneğin, kırmızı renk, ışığın uzun dalga boyunda (yaklaşık 620-750 nm) yayılımına neden olur. Bu nedenle, gökyüzünün mavi olmasının nedeni, ışığın kısa dalga boyunda yayılımına dayanır.

Bu cevap, gökyüzünün mavi olmasının nedenini açıklamaktadır.

模型：LLaMA-3-8B-Instruct

输入

A question that has puzzled humans for centuries!

The short answer is that the sky appears blue because of a phenomenon called Rayleigh scattering. This is the scattering of light by small particles or molecules in the atmosphere, such as nitrogen (N2) and oxygen (O2).

When sunlight enters the Earth's atmosphere, it encounters these tiny molecules and scatters in all directions. However, shorter (blue) wavelengths are scattered more than longer (red) wavelengths. This is because the smaller molecules are more effective at scattering the shorter wavelengths.

As a result, our eyes perceive the blue light as being more abundant, giving the sky its blue appearance. The color of the sky can vary depending on the time of day, atmospheric conditions, and other factors, but Rayleigh scattering is the main reason why the sky appears blue.

In Turkish, the phrase "Gökyüzü neden mavi" can be translated to "Why is the sky blue?" or "What makes the sky blue?"