OpenAI: GPT-4.1 Nano

Model Details

Company: OpenAI

Created: 12/11/2025

Description

For tasks that demand low latency, GPT-4.1 nano is the fastest and cheapest model in the GPT-4.1 series. Despite its small size, it offers a 1 million token context window and scores 80.1% on MMLU, 50.3% on GPQA, and 9.8% on Aider polyglot coding, higher than GPT-4o mini. It is well suited to tasks such as classification or autocompletion.

Technical Specifications

Context Window

1048k tokens

Max Output

33k tokens

Pricing (Input / Output)

$0.0001 / $0.0004 per 1M tokens

Architecture

transformer

Modality

text+image->text
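
To make the pricing figures concrete, here is a minimal sketch of a per-request cost estimate. It assumes the listed rates are USD per 1M tokens, as stated on this page, and rounds them to 0.0001 and 0.0004. The function name and constants are illustrative and not part of any NeuralHub SDK. The token counts are taken from the sample response in the Response Format section.

```python
def estimate_cost_usd(prompt_tokens, completion_tokens,
                      input_rate_per_million, output_rate_per_million):
    """Estimate the cost of one request in USD from its token counts."""
    input_cost = prompt_tokens * input_rate_per_million / 1_000_000
    output_cost = completion_tokens * output_rate_per_million / 1_000_000
    return input_cost + output_cost


# Rates as listed on this page (assumed to be USD per 1M tokens).
INPUT_RATE = 0.0001
OUTPUT_RATE = 0.0004

# 26 prompt tokens and 169 completion tokens, as in the sample response.
cost = estimate_cost_usd(26, 169, INPUT_RATE, OUTPUT_RATE)
print(f"Estimated cost: ${cost:.10f}")
```

Passing the rates as arguments keeps the arithmetic reusable if the listed prices or their unit change.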

API Usage

Example API Call

curl -X POST https://api.neuralhub.xyz/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $NEURALHUB_API_KEY" \
-d '{
  "model": "openai/gpt-4.1-nano",
  "messages": [
    { "role": "system", "content": "You are a helpful assistant." },
    { "role": "user", "content": "Your prompt here" }
  ],
  "temperature": 0.7,
  "max_tokens": 500,
  "top_p": 0.9
}'
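
The same request can be sketched in Python using only the standard library. The endpoint, headers, and body fields mirror the curl call above. The helper names `build_request` and `send`, the timeout value, and reading the key from a `NEURALHUB_API_KEY` environment variable are assumptions made for illustration, not a documented NeuralHub client.

```python
import json
import urllib.request

API_URL = "https://api.neuralhub.xyz/v1/chat/completions"


def build_request(prompt, api_key, model="openai/gpt-4.1-nano"):
    """Build (but do not send) a chat completion request."""
    payload = {
        "model": model,
        "messages": [
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": prompt},
        ],
        "temperature": 0.7,
        "max_tokens": 500,
        "top_p": 0.9,
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def send(request, timeout=30):
    """Send the request and return the decoded JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return json.load(resp)


# Example usage (performs a real network call, so it is left commented out):
# import os
# req = build_request("Say hello in one sentence.", os.environ["NEURALHUB_API_KEY"])
# print(send(req)["choices"][0]["message"]["content"])
```

Separating request construction from sending makes it possible to inspect or log the payload without making a network call.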

Response Format

The API returns an OpenAI-compatible response. Example:

{
  "id": "chatcmpl-<uuid>",
  "object": "chat.completion",
  "created": 1765590280,
  "model": "openai/gpt-4.1-nano",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "The answer to life, the universe, and everything is famously 42..."
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 26,
    "completion_tokens": 169,
    "total_tokens": 195
  }
}
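
A short sketch of reading such a response in Python follows. It relies only on the fields shown in the example above. The helper name `summarize_completion` is hypothetical, and treating a `finish_reason` of `"length"` as a truncated reply follows the OpenAI convention, which is assumed here to carry over to this API.

```python
import json


def summarize_completion(response):
    """Pull the assistant text, finish reason, and token usage from a response."""
    choice = response["choices"][0]
    usage = response["usage"]
    return {
        "text": choice["message"]["content"],
        "finish_reason": choice["finish_reason"],
        # Assumption: "length" means the reply was cut off at max_tokens.
        "truncated": choice["finish_reason"] == "length",
        "total_tokens": usage["total_tokens"],
    }


# A trimmed copy of the example response above.
sample = json.loads("""
{
  "object": "chat.completion",
  "model": "openai/gpt-4.1-nano",
  "choices": [
    {
      "index": 0,
      "message": {"role": "assistant", "content": "The answer is famously 42..."},
      "finish_reason": "stop"
    }
  ],
  "usage": {"prompt_tokens": 26, "completion_tokens": 169, "total_tokens": 195}
}
""")

summary = summarize_completion(sample)
print(summary["finish_reason"], summary["total_tokens"])  # prints: stop 195
```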