Llama-3_3-Nemotron-Super-49B-v1-Q6_K-GGUF开源模型 - 免费部署助力文本生成任务

首页

Llama 3 3 Nemotron Super 49B V1 Q6 K GGUF

由 openfree 开发

该模型是基于NVIDIA的Llama-3_3-Nemotron-Super-49B-v1转换而来的GGUF格式版本，适用于文本生成任务。

大型语言模型英语开源协议:其他 #49B大参数 #GGUF高效推理 #多语言文本生成

下载量 2,495

发布时间 : 3/30/2025

模型简介

这是一个大型语言模型，主要用于文本生成任务。它基于NVIDIA的Llama-3_3-Nemotron-Super-49B-v1模型，通过llama.cpp转换为GGUF格式，便于在各种硬件上运行。

模型特点

GGUF格式

模型已转换为GGUF格式，便于在各种硬件上高效运行。

大参数规模

49B参数的规模使其具备强大的文本生成能力。

跨平台支持

通过llama.cpp支持在多种平台上运行，包括Mac和Linux。

模型能力

文本生成

语言理解

上下文推理

使用案例

内容创作

文章生成

生成各种主题的文章和内容

高质量、连贯的文本输出

对话系统

构建智能对话助手

自然流畅的对话体验

研究与开发

语言模型研究

用于自然语言处理相关研究

🚀 openfree/Llama-3_3-Nemotron-Super-49B-v1-Q6_K-GGUF

本模型是使用llama.cpp，通过ggml.ai的GGUF-my-repo空间，从nvidia/Llama-3_3-Nemotron-Super-49B-v1转换为GGUF格式的。有关该模型的更多详细信息，请参考原始模型卡片。

🚀 快速开始

与llama.cpp一起使用

通过brew安装llama.cpp（适用于Mac和Linux）

brew install llama.cpp

调用llama.cpp服务器或命令行界面（CLI）。

命令行界面（CLI）：

llama-cli --hf-repo openfree/Llama-3_3-Nemotron-Super-49B-v1-Q6_K-GGUF --hf-file llama-3_3-nemotron-super-49b-v1-q6_k.gguf -p "The meaning to life and the universe is"

服务器：

llama-server --hf-repo openfree/Llama-3_3-Nemotron-Super-49B-v1-Q6_K-GGUF --hf-file llama-3_3-nemotron-super-49b-v1-q6_k.gguf -c 2048

注意：您也可以直接通过Llama.cpp仓库中列出的使用步骤使用此检查点。

步骤1：从GitHub克隆llama.cpp。

git clone https://github.com/ggerganov/llama.cpp

步骤2：进入llama.cpp文件夹，并使用LLAMA_CURL=1标志以及其他特定于硬件的标志（例如，在Linux上使用Nvidia GPU时使用LLAMA_CUDA=1）进行编译。

cd llama.cpp && LLAMA_CURL=1 make

步骤3：通过主二进制文件运行推理。

./llama-cli --hf-repo openfree/Llama-3_3-Nemotron-Super-49B-v1-Q6_K-GGUF --hf-file llama-3_3-nemotron-super-49b-v1-q6_k.gguf -p "The meaning to life and the universe is"

或者

./llama-server --hf-repo openfree/Llama-3_3-Nemotron-Super-49B-v1-Q6_K-GGUF --hf-file llama-3_3-nemotron-super-49b-v1-q6_k.gguf -c 2048

📦 安装指南

通过brew安装llama.cpp

brew install llama.cpp

💻 使用示例

基础用法

命令行界面（CLI）

llama-cli --hf-repo openfree/Llama-3_3-Nemotron-Super-49B-v1-Q6_K-GGUF --hf-file llama-3_3-nemotron-super-49b-v1-q6_k.gguf -p "The meaning to life and the universe is"

服务器

llama-server --hf-repo openfree/Llama-3_3-Nemotron-Super-49B-v1-Q6_K-GGUF --hf-file llama-3_3-nemotron-super-49b-v1-q6_k.gguf -c 2048

高级用法

从GitHub克隆并编译llama.cpp后运行推理

git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp && LLAMA_CURL=1 make
./llama-cli --hf-repo openfree/Llama-3_3-Nemotron-Super-49B-v1-Q6_K-GGUF --hf-file llama-3_3-nemotron-super-49b-v1-q6_k.gguf -p "The meaning to life and the universe is"

或者

git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp && LLAMA_CURL=1 make
./llama-server --hf-repo openfree/Llama-3_3-Nemotron-Super-49B-v1-Q6_K-GGUF --hf-file llama-3_3-nemotron-super-49b-v1-q6_k.gguf -c 2048

📄 许可证

本项目使用的许可证为nvidia-open-model-license，您可以通过此链接查看详细的许可证信息。

📋 模型信息

属性	详情
基础模型	nvidia/Llama-3_3-Nemotron-Super-49B-v1
语言	en
库名称	transformers
许可证	other
许可证名称	nvidia-open-model-license
许可证链接	https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/
任务类型	文本生成
标签	nvidia, llama-3, pytorch, llama-cpp, gguf-my-repo