Llama series

Provided by NVIDIA · 7 variants

llama-nemotron-rerank-1b-v2 is an AI model served for inference on the NVIDIA Build platform. It is free to use, subject to rate limits.

Model	Variant	Positioning	Context	Speed (t/s)	Input price ($/M tokens)	Output price ($/M tokens)	Thinking
llama-nemotron-rerank-1b-v2 — NVIDIA Build	Standard	Downloadable	-	-	$0.0000	$0.0000	-
llama-nemotron-embed-1b-v2 — NVIDIA Build	Standard	Downloadable	-	-	$0.0000	$0.0000	-
llama-guard-4-12b — NVIDIA Build	Standard	Free Endpoint	-	-	$0.0000	$0.0000	-
Llama Guard 3 8B — advanced chat model	Standard	-	131.1K	-	$0.0200	$0.0600	-
Llama Guard 3 8B — ultra-fast inference model	Standard	Groq LPU high-speed inference, Meta open-source model	8.2K	765	$0.2000	$0.2000	-
llama-nemotron-rerank-vl-1b-v2 — NVIDIA Build	Vision	Downloadable	-	-	$0.0000	$0.0000	-
llama-nemotron-embed-vl-1b-v2 — NVIDIA Build	Vision	Downloadable	-	-	$0.0000	$0.0000	-
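
The per-token prices in the table can be turned into a per-request estimate with simple arithmetic. The sketch below assumes that "/M" means "per one million tokens" and that input and output tokens are billed separately; the dictionary keys and the function name `request_cost` are hypothetical labels chosen for illustration, not identifiers used by any provider.

```python
# Prices copied from the table, in USD per million tokens (assumed unit).
# The keys are illustrative labels for table rows, not official model IDs.
PRICES = {
    "llama-guard-3-8b-131k": (0.02, 0.06),   # row with 131.1K context
    "llama-guard-3-8b-groq": (0.20, 0.20),   # row served on Groq LPU
    "llama-guard-4-12b": (0.0, 0.0),         # free endpoint
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from per-million-token prices."""
    input_price, output_price = PRICES[model]
    total = input_tokens * input_price + output_tokens * output_price
    return total / 1_000_000


if __name__ == "__main__":
    # Example: 4,000 input tokens and 500 output tokens on each priced row.
    for name in PRICES:
        print(name, round(request_cost(name, 4_000, 500), 6))
```

Under these assumptions, the free NVIDIA Build rows always cost zero, so only the rate limit constrains usage there.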