llama3-8B-medical开源医疗问答模型 - 免费支持精准医疗问题解答

首页

Llama3 8B Medical

由 ruslanmv 开发

基于LLAMA-3-8B模型的医疗微调4比特量化版本，专为医疗问答设计

大型语言模型

Transformers

英语开源协议:Apache-2.0 #医疗问答微调 #4比特量化 #多语言医疗助手

下载量 132

发布时间 : 4/24/2024

模型简介

这是一个针对医疗领域优化的语言模型，能够提供专业的医疗问题解答和建议。模型基于LLAMA-3架构，经过4比特量化处理，降低了计算资源需求。

模型特点

医疗专注

专门针对医疗健康领域优化，能够提供专业的医疗咨询回答

4比特量化

采用4比特量化技术，显著降低模型运行所需的计算资源

多语言支持

在多语言任务中表现提升，特别优化了中文医疗问答能力

安全回答

当问题无意义或事实不连贯时，会说明原因而非给出错误答案

模型能力

医疗问题解答

症状分析

医疗建议生成

专业医学术语理解

使用案例

医疗咨询

症状分析

根据用户描述的症状，分析可能的疾病原因

能识别常见疾病的典型症状并提供初步判断

医疗建议

针对特定健康问题提供一般性建议

能给出合理的医疗保健建议，并提示咨询专业医生

健康教育

疾病知识普及

解释特定疾病的特征、原因和预防措施

能提供准确、易懂的医学知识

🚀 医疗-Llama3-8B-4bit：针对医疗问答微调的Llama3模型

本项目是基于LLAMA-3-8B的医疗微调版本，使用常见的开源数据集将其量化为4位。在多语言任务上表现出了显著的提升。采用了标准的比特量化技术进行微调后量化，降低了运行模型所需的计算时间复杂度和空间复杂度。整体架构完全基于LLAMA-3。

本仓库提供了强大的Llama3 8B模型的微调版本，专门设计用于以信息丰富的方式回答医疗问题。它利用了AI医疗聊天机器人数据集（ruslanmv/ai-medical-chatbot）中包含的丰富知识。

🚀 快速开始

✨ 主要特性

专注医疗领域：针对健康相关的询问进行了优化。
知识储备丰富：在全面的医疗聊天机器人数据集上进行训练。
文本生成能力：生成信息丰富且可能有帮助的回复。

📦 安装指南

该模型可通过Hugging Face Transformers库访问。使用pip进行安装：

pip install git+https://github.com/huggingface/accelerate.git
pip install git+https://github.com/huggingface/transformers.git
pip install  bitsandbytes

💻 使用示例

基础用法

以下是一个Python代码片段，展示了如何与llama3-8B-medical模型进行交互，并生成医疗问题的答案：

from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
import torch

# Load tokenizer and model
model_id = "ruslanmv/llama3-8B-medical"

quantization_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.bfloat16
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")

model = AutoModelForCausalLM.from_pretrained(model_id, config=quantization_config)

def create_prompt(user_query):
  B_INST, E_INST = "<s>[INST]", "[/INST]"
  B_SYS, E_SYS = "<<SYS>>\n", "\n<</SYS>>\n\n"
  DEFAULT_SYSTEM_PROMPT = """\
  You are an AI Medical Chatbot Assistant, provide comprehensive and informative responses to your inquiries.
  If a question does not make any sense, or is not factually coherent, explain why instead of answering something not correct. If you don't know the answer to a question, please don't share false information."""
  SYSTEM_PROMPT = B_SYS + DEFAULT_SYSTEM_PROMPT + E_SYS
  instruction = f"User asks: {user_query}\n"
  prompt = B_INST + SYSTEM_PROMPT + instruction + E_INST
  return prompt.strip()

def generate_text(model, tokenizer, prompt,
                  max_length=200,
                  temperature=0.8,
                  num_return_sequences=1):
    prompt = create_prompt(user_query)
    # Tokenize the prompt
    input_ids = tokenizer.encode(prompt, return_tensors="pt").to(device)  # Move input_ids to the same device as the model
    # Generate text
    output = model.generate(
        input_ids=input_ids,
        max_length=max_length,
        temperature=temperature,
        num_return_sequences=num_return_sequences,
        pad_token_id=tokenizer.eos_token_id,  # Set pad token to end of sequence token
        do_sample=True
    )    
    # Decode the generated output
    generated_text = tokenizer.decode(output[0], skip_special_tokens=True)
  
   # Split the generated text based on the prompt and take the portion after it
    generated_text = generated_text.split(prompt)[-1].strip()

    return generated_text
# Example usage
# - Context: First describe your problem.
# - Question: Then make the question.
user_query = "I'm a 35-year-old male experiencing symptoms like fatigue, increased sensitivity to cold, and dry, itchy skin. Could these be indicative of hypothyroidism?"
generated_text = generate_text(model, tokenizer, user_query)    
print(generated_text)

回答类型如下：

Yes, it is possible. Hypothyroidism can present symptoms like increased sensitivity to cold, dry skin, and fatigue. These symptoms are characteristic of hypothyroidism. I recommend consulting with a healthcare provider. 2. Hypothyroidism can present symptoms like fever, increased sensitivity to cold, dry skin, and fatigue. These symptoms are characteristic of hypothyroidism.