Qwen/Qwen3-Next-80B-A3B-Thinking

API Usage Instructions

Model Information

Model ID: Qwen/Qwen3-Next-80B-A3B-Thinking

Provider: cloudrift

Description: Model served by provider: cloudrift

Context Length: 4,096 tokens

Max Output Length: 4,096 tokens

Created: 2025-09-17 06:37:25

cURL Example

curl -X POST "https://inferline.cloudrift.ai/api/completions" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "Qwen/Qwen3-Next-80B-A3B-Thinking",
    "prompt": "Hello, how are you?"
  }'
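For callers working from Python, the same request can be sketched with the standard library alone. This is a minimal illustration, not an official client: the helper names `build_request` and `send` are hypothetical, the 300-second timeout is an assumed value chosen because requests are queued, and the response schema is not documented on this page, so the parsed JSON body is returned unchanged.

```python
import json
import urllib.request
from typing import Any

API_URL = "https://inferline.cloudrift.ai/api/completions"
MODEL_ID = "Qwen/Qwen3-Next-80B-A3B-Thinking"


def build_request(prompt: str) -> urllib.request.Request:
    """Build the same POST request as the cURL example, without sending it."""
    body = json.dumps({"model": MODEL_ID, "prompt": prompt}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request: urllib.request.Request, timeout_s: float = 300.0) -> Any:
    """Send the request and return the decoded JSON body as-is.

    timeout_s is an assumed value; queued requests may need more or less.
    """
    with urllib.request.urlopen(request, timeout=timeout_s) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (performs a real network request):
# result = send(build_request("Hello, how are you?"))
```

Keeping request construction separate from sending makes it possible to inspect the URL, headers, and body before anything goes over the network.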

Request Parameters

| Parameter | Type | Required | Description |
|---|---|---|---|
| model | string | Yes | The model ID to use for completion |
| prompt | string | Yes | The text prompt to complete |
| max_tokens | integer | No | Maximum number of tokens to generate (optional) |
| temperature | float | No | Sampling temperature (optional, 0.0-2.0) |

Note: This is a queued inference system. Requests may take some time to process depending on provider availability and queue status. Check queue statistics for current load.
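The request parameters listed above can be assembled and checked on the client side before a request enters the queue. The sketch below is an illustration under stated assumptions: `build_payload` is a hypothetical helper, the temperature bounds come from the parameter table, and capping `max_tokens` at the model's listed Max Output Length (4,096) is an assumption about what the server accepts rather than documented behavior.

```python
from typing import Optional

MODEL_ID = "Qwen/Qwen3-Next-80B-A3B-Thinking"
MAX_OUTPUT_TOKENS = 4096  # "Max Output Length" listed for this model


def build_payload(
    prompt: str,
    max_tokens: Optional[int] = None,
    temperature: Optional[float] = None,
) -> dict:
    """Return a JSON-serializable request body; optional fields are omitted when unset."""
    if not prompt:
        raise ValueError("prompt is required")
    payload = {"model": MODEL_ID, "prompt": prompt}

    if max_tokens is not None:
        # Assumed cap: the model's listed maximum output length.
        if not 1 <= max_tokens <= MAX_OUTPUT_TOKENS:
            raise ValueError(f"max_tokens must be between 1 and {MAX_OUTPUT_TOKENS}")
        payload["max_tokens"] = max_tokens

    if temperature is not None:
        # Range taken from the parameter table (0.0-2.0).
        if not 0.0 <= temperature <= 2.0:
            raise ValueError("temperature must be between 0.0 and 2.0")
        payload["temperature"] = temperature

    return payload
```

Calling `build_payload("Hello, how are you?", max_tokens=256, temperature=0.7)` yields a dictionary that can be passed to `json.dumps` and used as the request body. Rejecting out-of-range values locally avoids waiting in the queue for a request that may be refused.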