模型列表
更新时间:2025-04-25
千帆AI应用开发者中心已上线,期待您的点击!
推荐模型
旗舰模型 | ERNIE-X1-Turbo-32K | ERNIE-4.5-Turbo-128K | ERNIE-4.5-Turbo-VL-32K | DeepSeek-R1 |
---|---|---|---|---|
使用场景 | 核心定位:深度思考模型,具备更强的理解、规划、反思、进化能力。 适用场景: 在中文知识问答、文学创作、文稿写作、日常对话、逻辑推理、复杂计算及工具调用等方面表现尤为出色。 |
核心定位:更好的满足多轮长历史对话处理、长文档理解问答任务。 适用场景: 1)复杂语义理解:支持中文知识问答、文学创作,尤其擅长文档理解(如DocVQA任务)。 2)数学推理:在中文数学问题(CMath基准)表现突出。 |
核心定位:多模态基础模型,支持文本、图像跨模态输入与生成。 适用场景:结合图文生成营销文案、视频脚本设计等。 |
核心定位:专业优化推理模型,聚焦数学与逻辑任务。 适用场景: 复杂数学问题:如高等数学题求解、科学计算模拟。 逻辑拆解与规划:业务流程自动化、学术研究中的假设验证。 STEM领域应用:物理建模、金融量化分析等需高精度推理的场景。 |
上下文长度 (Tokene数) |
32k | 128k | 32k | 64k |
最大输出长度 (Token数) |
16k | 12k | 12k | 8k 默认4k |
原生多模态
模型名称 | model 入参 | 上下文长度 (token) |
最大输入 | 最大输出 (token) |
默认流控 |
---|---|---|---|---|---|
ERNIE 4.5 Turbo VL | ernie-4.5-turbo-vl-32k | 32k | 27k token 270000字符 |
[2,12288] 默认 2k |
RPM = 1000 TPM = 200000 |
ERNIE 4.5 | ernie-4.5-8k-preview | 8k | 5k token 50000字符 |
[2,2048] 默认 2k |
RPM = 100 TPM = 100000 |
Llama-4-Maverick | llama-4-maverick-17b-128e-instruct | 128k | 131072字符 | [2,8192] 默认 4k |
RPM = 120 TPM = 150000 |
Llama-4-Scout | llama-4-scout-17b-16e-instruct | 128k | 131072字符 | [2,8192] 默认 4k |
RPM = 120 TPM = 150000 |
文本生成
ERNIE系列-旗舰模型
模型名称 | model 入参 | 上下文长度 (token) | 最大输入 | 最大输出 (token) | 默认流控 |
---|---|---|---|---|---|
ERNIE 4.5 Turbo | ernie-4.5-turbo-128k | 128k | 123k token 1230000字符 |
[2,12288] 默认 2k |
RPM = 5000 TPM = 400000 |
ERNIE 4.5 Turbo | ernie-4.5-turbo-32k | 32k | 27k token 270000字符 |
[2,12288] 默认 2k |
RPM = 5000 TPM = 400000 |
ERNIE 4.5 | ernie-4.5-8k-preview | 8k | 5k token 50000字符 |
[2,2048] 默认 2k |
RPM = 1000 TPM = 50000 |
ERNIE 4.0 | ernie-4.0-8k | 8k | 5k token 20000字符 |
[2,2048] 默认 2k |
RPM = 10000 TPM = 800000 |
ERNIE 4.0 | ernie-4.0-8k-0613 | 8k | 5k token 20000字符 |
[2,2048] 默认 2k |
RPM = 300 TPM = 300000 |
ERNIE 4.0 | ernie-4.0-8k-latest | 8k | 5k token 20000字符 |
[2,2048] 默认 2k |
RPM = 120 TPM = 120000 |
ERNIE 4.0 | ernie-4.0-8k-preview | 8k | 6800 token 20000字符 |
[2,2048] 默认 2k |
RPM = 300 TPM = 300000 |
ERNIE 4.0 Turbo | ernie-4.0-turbo-128k | 128k | 124k token 507904字符 |
[2,4096] 默认 4k |
RPM = 5000 TPM = 400000 |
ERNIE 4.0 Turbo | ernie-4.0-turbo-8k | 8k | 6k token 24000字符 |
[2,2048] 默认 2k |
RPM = 10000 TPM = 800000 |
ERNIE 4.0 Turbo | ernie-4.0-turbo-8k-0628 | 8k | 6k token 24000字符 |
[2,2048] 默认 2k |
RPM = 60 TPM = 60000 |
ERNIE 4.0 Turbo | ernie-4.0-turbo-8k-0927 | 8k | 6k token 24000字符 |
[2,2048] 默认 2k |
RPM = 60 TPM = 60000 |
ERNIE 4.0 Turbo | ernie-4.0-turbo-8k-latest | 8k | 8k token 24000字符 |
[2,2048] 默认 2k |
RPM = 60 TPM = 60000 |
ERNIE 4.0 Turbo | ernie-4.0-turbo-8k-preview | 8k | 6k token 24000字符 |
[2,2048] 默认 2k |
RPM = 60 TPM = 60000 |
ERNIE 3.5 | ernie-3.5-128k | 128k | 119k token 487424字符 |
[2,8192] 默认 4k |
RPM = 5000 TPM = 400000 |
ERNIE 3.5 | ernie-3.5-128k-preview | 128k | 124k token 507904字符 |
[2,4096] 默认 4k |
RPM = 60 TPM = 150000 |
ERNIE 3.5 | ernie-3.5-8k | 8k | 5k token 20000字符 |
[2,2048] 默认 2k |
RPM = 10000 TPM = 800000 |
ERNIE 3.5 | ernie-3.5-8k-0613 | 8k | 5k token 20000字符 |
[2,2048] 默认 2k |
RPM = 300 TPM = 300000 |
ERNIE 3.5 | ernie-3.5-8k-0701 | 8k | 5k token 20000字符 |
[2,2048] 默认 2k |
RPM = 120 TPM = 120000 |
ERNIE 3.5 | ernie-3.5-8k-preview | 8k | 6800 token 20000字符 |
[2,2048] 默认 2k |
RPM = 300 TPM = 300000 |
ERNIE系列-主力模型
模型名称 | model 入参 | 上下文长度 (token) | 最大输入 | 最大输出 (token) | 默认流控 |
---|---|---|---|---|---|
ERNIE Speed | ernie-speed-128k | 128k | 124k token 507904字符 |
[2,4096] 默认 4k |
RPM = 500 TPM = 200000 |
ERNIE Speed | ernie-speed-8k | 8k | 6k token 24000字符 |
[2,2048] 默认 1k |
RPM = 500 TPM = 200000 |
ERNIE Speed | ernie-speed-pro-128k | 128k | 124k token 507904字符 |
[2,4096] 默认 4k |
RPM = 10000 TPM = 800000 |
ERNIE Lite | ernie-lite-8k | 8k | 6k token 24000字符 |
[2,2048] 默认 1k |
RPM = 500 TPM = 200000 |
ERNIE Lite | ernie-lite-pro-128k | 128k | 124k token 507904字符 |
[2,4096] 默认 4k |
RPM = 10000 TPM = 800000 |
ERNIE系列-轻量模型
模型名称 | model 入参 | 上下文长度 (token) | 最大输入 | 最大输出 (token) | 默认流控 |
---|---|---|---|---|---|
ERNIE Tiny | ernie-tiny-8k | 8k | 6k token 24000字符 |
[2,2048] 默认 1k |
RPM = 10000 TPM = 800000 |
ERNIE系列-垂直场景模型
模型名称 | model 入参 | 上下文长度 (token) | 最大输入 | 最大输出 (token) | 默认流控 |
---|---|---|---|---|---|
ERNIE Character | ernie-character-8k | 8k | 7k token 24000字符 |
[2,2048] 默认 1k |
RPM = 60 TPM = 60000 |
ERNIE Character | ernie-character-8k-1010 | 8k | 6k token 24000字符 |
[2,2048] 默认 1k |
RPM = 60 TPM = 60000 |
ERNIE Character | ernie-character-fiction-8k | 8k | 8k token 32768字符 |
[2,2048] 默认 1k |
RPM = 300 TPM = 300000 |
ERNIE Character | ernie-character-fiction-8k-preview | 8k | 8k token 32768字符 |
[2,2048] 默认 1k |
RPM = 60 TPM = 6000 |
ERNIE Novel | ernie-novel-8k | 8k | 5k token 20000字符 |
[2,2048] 默认 2k |
RPM = 60 TPM = 60000 |
QianFan系列
模型名称 | model 入参 | 上下文长度 (token) | 最大输入 | 最大输出 (token) | 默认流控 |
---|---|---|---|---|---|
Qianfan-8B | qianfan-8b | 32k | 32k token 131072字符 |
[2,16384] 默认 4k |
RPM = 60 TPM = 60000 |
Qianfan-70B | qianfan-70b | 32k | 32k token 128000字符 |
[2,16384] 默认 4k |
RPM = 60 TPM = 60000 |
Qianfan Agent | qianfan-agent-lite-8k | 8k | 7k token 28000字符 |
[2,2048] 默认 1k |
RPM = 60 TPM = 60000 |
Qianfan Agent | qianfan-agent-speed-32k | 32k | 28k token 112000字符 |
[2.4096] | RPM = 5000 TPM = 400000 |
Qianfan Agent | qianfan-agent-speed-8k | 8k | 7k token 28000字符 |
[2,2048] 默认 1k |
RPM = 180 TPM = 180000 |
Qianfan BLOOMZ | qianfan-bloomz-7b-compressed | 2k | 4800字符 | [2,2048] 默认 1k |
RPM = 180 TPM = 180000 |
Qianfan Chinese Llama | qianfan-chinese-llama-2-13b | 2k | 4800字符 | [2,2048] 默认 1k |
RPM = 60 TPM = 60000 |
Qianfan Chinese Llama | qianfan-chinese-llama-2-70b | 2k | 124000字符 | [2,2048] 默认 1k |
RPM = 60 TPM = 60000 |
Qianfan Chinese Llama | qianfan-chinese-llama-2-7b | 2k | 4800字符 | [2,2048] 默认 1k |
RPM = 180 TPM = 180000 |
Qianfan Sug | qianfan-sug-8k | 8k | 6k token 24000字符 |
[2,2048] 默认 1k |
RPM = 500 TPM = 200000 |
DeepSeek系列
模型名称 | 版本 | model 入参 | 上下文长度 (token) | 最大输入 | 最大输出 (token) | 默认流控 |
---|---|---|---|---|---|---|
DeepSeek-Chat | DeepSeek-V3-250324 | deepseek-v3 | 64k | 64k token 104860字符 |
8k 默认4k |
RPM = 1500 TPM = 300000 |
DeepSeek-Chat | DeepSeek-V3-241226 | deepseek-v3-241226 | 64k | 64k token 104860字符 |
8k 默认4k |
RPM = 120 TPM = 10000 |
其他
模型名称 | 版本 | model 入参 | 上下文长度 (token) |
最大输入 | 最大输出 (token) |
默认流控 |
---|---|---|---|---|---|---|
GLM-4 | GLM-4-32B-0414 | glm-4-32b-0414 | 32k | 16k token 64000字符 |
[2,8192] 默认 4k |
RPM = 120 TPM = 60000 |
Llama-4-Maverick | Llama-4-Maverick-17B-128E-Instruct | llama-4-maverick-17b-128e-instruct | 128k | 131072字符 | [2,8192] 默认 4k |
RPM = 120 TPM = 150000 |
Llama-4-Scout | Llama-4-Scout-17B-16E-Instruct | llama-4-scout-17b-16e-instruct | 128k | 131072字符 | [2,8192] 默认 4k |
RPM = 120 TPM = 150000 |
Qwen2.5 | Qwen2.5-7B-Instruct | qwen2.5-7b-instruct | 32k | 24k token 64000字符 |
[2,8192] 默认4k |
RPM = 60 TPM = 60000 |
Gemma | Gemma-7B-It | gemma-7b-it | 8k | 11200字符 | 1k | RPM = 60 TPM = 60000 |
Llama 2 | Llama-2-13B-Chat | llama-2-13b-chat | 2k | 4800字符 | 1k | RPM = 180 TPM = 180000 |
Llama 2 | Llama-2-70B-Chat | llama-2-70b-chat | 2k | 4800字符 | 500 | RPM = 180 TPM = 180000 |
Llama 2 | Llama-2-7B-Chat | llama-2-7b-chat | 2k | 4800字符 | 1500 | RPM = 180 TPM = 180000 |
Meta-Llama-3 | Meta-Llama-3-70B | meta-llama-3-70b | 8k | 20000字符 | 500 | RPM = 120 TPM = 120000 |
Meta-Llama-3 | Meta-Llama-3-8B | meta-llama-3-8b | 8k | 20000字符 | 1500 | RPM = 60 TPM = 60000 |
Mixtral | Mixtral-8x7B-Instruct | mixtral-8x7b-instruct | 8k | 11200字符 | 500 | RPM = 60 TPM = 60000 |
AquilaChat | AquilaChat-7B | aquilachat-7b | 2k | 4800字符 | 1k | RPM = 180 TPM = 180000 |
BLOOMZ | BLOOMZ-7B | bloomz-7b | 2k | 4800字符 | 1500 | RPM = 180 TPM = 180000 |
ChatGLM2 | ChatGLM2-6B-32K | chatglm2-6b-32k | 32k | 32500字符 | 1k | RPM = 180 TPM = 180000 |
CodeLlama | CodeLlama-7B-Instruct | codellama-7b-instruct | 2k | 4800字符 | 1k | RPM = 60 TPM = 60000 |
SQLCoder | SQLCoder-7B | sqlcoder-7b | 2k | 4800字符 | 1k | RPM = 60 TPM = 60000 |
XuanYuan | XuanYuan-70B-Chat-4bit | xuanyuan-70b-chat-4bit | 8k | 11200字符 | 1k | RPM = 60 TPM = 60000 |
Yi | Yi-34B-Chat | yi-34b-chat | 2k | 4800字符 | 768 | RPM = 60 TPM = 60000 |
图像理解
模型名称 | 版本 | model 入参 |
上下文长度 (token) |
最大输入 | 最大输出 (token) |
默认流控 |
---|---|---|---|---|---|---|
ERNIE 4.5 | ERNIE-4.5-8K-Preview | ernie-4.5-8k-preview | 8k | 5k token 50000字符 |
2k 默认 2k |
RPM = 2200 TPM = 200000 |
Qianfan Llama VL | Qianfan-Llama-VL-8B | qianfan-llama-vl-8b | 32k | 32k token 320000字符 |
16k 默认 2k |
RPM = 120 TPM = 150000 |
Llama-4-Maverick | Llama-4-Maverick-17B-128E-Instruct | llama-4-maverick-17b-128e-instruct | 128k | 131072字符 | 2,8192] 默认 4k |
RPM = 120 TPM = 150000 |
Llama-4-Scout | Llama-4-Scout-17B-16E-Instruct | llama-4-scout-17b-16e-instruct | 128k | 131072字符 | 2,8192] 默认 4k |
RPM = 120 TPM = 150000 |
DeepSeek-VL2 | DeepSeek-VL2 | deepseek-vl2 | 4k | 12000字符 | 2k 默认2k |
RPM = 60 TPM = 60000 |
DeepSeek-VL2 | DeepSeek-VL2-Small | deepseek-vl2-small | 4k | 38400字符 | 2k 默认2k |
RPM = 60 TPM = 60000 |
Qwen2.5-VL | Qwen2.5-VL-32B-Instruct | qwen2.5-vl-32b-instruct | 32k | 64000字符 | 8k 默认2k |
RPM = 60 TPM = 60000 |
Qwen2.5-VL | Qwen2.5-VL-7B-Instruct | qwen2.5-vl-7b-instruct | 16k | 38400字符 | 4k 默认2k |
RPM = 60 TPM = 60000 |
InternVL2_5 | InternVL2_5-38B-MPO | internvl2.5-38b-mpo | 32k | 64000字符 | 4k 默认2k |
RPM = 60 TPM = 60000 |
Fuyu-8B | Fuyu-8B | fuyu-8b | 2k | 4800字符 | 768 | QPS = 1 |
深度思考
模型名称 | 版本 | model 入参 |
上下文长度 (token) |
最大输入 | 最大输出 (token) |
思维链长度 (token) |
默认流控 |
---|---|---|---|---|---|---|---|
ERNIE X1 Turbo | ERNIE-X1-Turbo-32K | ernie-x1-turbo-32k | 32k | 24k token 240000字符 |
[2,16384] 默认 2k |
16k | RPM = 900 TPM = 300000 |
ERNIE X1 | ERNIE-X1-32K | ernie-x1-32k | 32k | 24k token 240000字符 |
[2,16384] 默认 2k |
16k | RPM = 300 TPM = 100000 |
ERNIE X1 | ERNIE-X1-32K-Preview | ernie-x1-32k-preview | 32k | 24k token 240000字符 |
[2,16384] 默认 2k |
16k | RPM = 300 TPM = 100000 |
DeepSeek-Reasoner | DeepSeek-R1 | deepseek-r1 | 64k | 64k token 104860字符 |
8k 默认 4k |
32k | RPM = 1500 TPM = 300000 |
DeepSeek-R1-Distill | DeepSeek-R1-Distill-Qianfan-70B | deepseek-r1-distill-qianfan-70b | 32k | 16k token 64000字符 |
[2,8192] 默认 8k |
16k | RPM = 1000 TPM = 60000 |
DeepSeek-R1-Distill | DeepSeek-R1-Distill-Qianfan-8B | deepseek-r1-distill-qianfan-8b | 32k | 16k token 64000字符 |
[2,8192] 默认 8k |
16k | RPM = 1000 TPM = 60000 |
DeepSeek-R1-Distill | DeepSeek-R1-Distill-Qianfan-Llama-70B | deepseek-r1-distill-qianfan-llama-70b | 32k | 64000字符 | 8k 默认 4k |
32k | RPM = 1000 TPM = 10000 |
DeepSeek-R1-Distill | DeepSeek-R1-Distill-Qianfan-Llama-8B | deepseek-r1-distill-qianfan-llama-8b | 32k | 64000字符 | 8k 默认 4k |
32k | RPM = 1000 TPM = 10000 |
DeepSeek-R1-Distill | DeepSeek-R1-Distill-Llama-70B | deepseek-r1-distill-llama-70b | 32k | 64000字符 | 8k 默认 4k |
32k | RPM = 1000 TPM = 10000 |
DeepSeek-R1-Distill | DeepSeek-R1-Distill-Llama-8B | deepseek-r1-distill-llama-8b | 32k | 64000字符 | 8k 默认 4k |
32k | RPM = 1000 TPM = 10000 |
DeepSeek-R1-Distill | DeepSeek-R1-Distill-Qwen-32B | deepseek-r1-distill-qwen-32b | 32k | 64000字符 | 8k 默认 4k |
32k | RPM = 1000 TPM = 10000 |
DeepSeek-R1-Distill | DeepSeek-R1-Distill-Qwen-14B | deepseek-r1-distill-qwen-14b | 32k | 64000字符 | 8k 默认 4k |
32k | RPM = 1000 TPM = 10000 |
DeepSeek-R1-Distill | DeepSeek-R1-Distill-Qwen-7B | deepseek-r1-distill-qwen-7b | 32k | 64000字符 | 8k 默认 4k |
32k | RPM = 1000 TPM = 10000 |
DeepSeek-R1-Distill | DeepSeek-R1-Distill-Qwen-1.5B | deepseek-r1-distill-qwen-1.5b | 32k | 64000字符 | 8k 默认 4k |
32k | RPM = 1000 TPM = 10000 |
GLM-Z1-32B-0414 | GLM-Z1-32B-0414 | glm-z1-32b-0414 | 32k | 16k token 64000字符 |
[2,8192] 默认 4k |
16k | RPM=120 TPM=60000 |
GLM-Z1-Rumination-32B-0414 | GLM-Z1-Rumination-32B-0414 | glm-z1-rumination-32b-0414 | 128k | 64k token 256000字符 |
[2,8192] 默认 4k |
32k | RPM = 120 TPM = 150000 |
QWQ-32B | QWQ-32B | qwq-32b | 32k | 65536字符 | 8k 默认 4k |
32k | RPM = 120 TPM = 100000 |
图像生成
模型名称 | 版本 | model 入参 |
最大输入(字符) | 默认流控 |
---|---|---|---|---|
ERINE iRAG | ERNIE-iRAG-1.0 | irag-1.0 | 200字符 | 6RPM |
Stable-Diffusion-XL | stable-diffusion-xl-base-1.0 | -- | 1024字符 | 180RPM |
向量
模型名称 | 版本 | model 入参 |
最大输入文本数量 | 每个文本上下文长度 (token) |
默认流控 |
---|---|---|---|---|---|
Embedding-V1 | Embedding-V1 | embedding-v1 | 1 | 384 | RPM = 1800 TPM = 800000 |
tao-8k | tao-8k | tao-8k | 16 | 8192 | RPM = 1800 TPM = 800000 |
bge-large-zh | bge-large-zh | bge-large-zh | 16 | 512 | RPM = 1800 TPM = 800000 |
bge-large-en | bge-large-en | bge-large-en | 16 | 512 | RPM = 1800 TPM = 800000 |
重排序
模型名称 | 版本 | model 入参 |
最大输入 | 默认流控 |
---|---|---|---|---|
bce-reranker-base | bce-reranker-base | bce-reranker-base | query:400 tokens/1600字符 document:1K tokens/4K字符 |
RPM = 1800 TPM = 800000 |
价格
模型价格参考 价格 文档。