Dashboard

Meta: Llama 4 Scout - NeuralHub | NeuralHub

Meta: Llama 4 Scout

Model Details

Company: meta-llama

Created: 12/11/2025

Description

Llama 4 Scout 17B Instruct (16E) is a mixture-of-experts (MoE) language model developed by Meta, activating 17 billion parameters out of a total of 109B. It supports native multimodal input (text and image) and multilingual output (text and code) across 12 supported languages. Designed for assistant-style interaction and visual reasoning, Scout uses 16 experts per forward pass and features a context length of 10 million tokens, with a training corpus of ~40 trillion tokens. Built for high efficiency and local or commercial deployment, Llama 4 Scout incorporates early fusion for seamless modality integration. It is instruction-tuned for use in multilingual chat, captioning, and image understanding tasks. Released under the Llama 4 Community License, it was last trained on data up to August 2024 and launched publicly on April 5, 2025.

Technical Specifications

Context Window

328k tokens

Max Output

16k tokens

Pricing (Input / Output)

$0.00008 / $0.0003 per 1M

Architecture

transformer

Modality

text+image->text

API Usage

Example API Call

curl -X POST https://api.neuralhub.xyz/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer NEURALHUB_API_KEY" \
-d '{
  "model": "meta-llama/llama-4-scout",
  "messages": [
    { "role": "system", "content": "You are a helpful assistant." },
    { "role": "user", "content": "" }
  ],
  "temperature": 0.7,
  "max_tokens": 500,
  "top_p": 0.9
}'

Response Format

The API returns an OpenAI-compatible response. Example:

{
  "id": "chatcmpl-<uuid>",
  "object": "chat.completion",
  "created": 1765590371,
  "model": "meta-llama/llama-4-scout",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "The answer to life, the universe, and everything is famously 42..."
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 26,
    "completion_tokens": 169,
    "total_tokens": 195
  }
}