Dashboard

Qwen: Qwen3 14B - NeuralHub | NeuralHub

Qwen: Qwen3 14B

Model Details

Company: qwen

Created: 12/11/2025

Description

Qwen3-14B is a dense 14.8B parameter causal language model from the Qwen3 series, designed for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for tasks like math, programming, and logical inference, and a "non-thinking" mode for general-purpose conversation. The model is fine-tuned for instruction-following, agent tool use, creative writing, and multilingual tasks across 100+ languages and dialects. It natively handles 32K token contexts and can extend to 131K tokens using YaRN-based scaling.

Technical Specifications

Context Window

41k tokens

Max Output

41k tokens

Pricing (Input / Output)

$0.000049999999999999996 / $0.00022 per 1M

Architecture

qwen3

Modality

text->text

API Usage

Example API Call

curl -X POST https://api.neuralhub.xyz/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer NEURALHUB_API_KEY" \
-d '{
  "model": "qwen/qwen3-14b",
  "messages": [
    { "role": "system", "content": "You are a helpful assistant." },
    { "role": "user", "content": "" }
  ],
  "temperature": 0.7,
  "max_tokens": 500,
  "top_p": 0.9
}'

Response Format

The API returns an OpenAI-compatible response. Example:

{
  "id": "chatcmpl-<uuid>",
  "object": "chat.completion",
  "created": 1765590421,
  "model": "qwen/qwen3-14b",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "The answer to life, the universe, and everything is famously 42..."
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 26,
    "completion_tokens": 169,
    "total_tokens": 195
  }
}