System Documentation

Technical reference for the ZeroThink Sovereign Architecture. Integrated multi-provider routing, Oracle analysis, large code ingest, and the new cross-platform CLI stack.

Access Tiers

Free Tier

High-speed access to open-weights models. Ideal for coding and rapid inquiry.

  • Groq Cloud: GPT-OSS, Llama 3.3, Qwen 3
  • NVIDIA NIM: Qwen 3 Coder 480B, Gemma 4 31B, DeepSeek V3.2, Codestral, Devstral
  • CLI: Linked-account login or no-Google direct mode, folder-only or machine-wide access, SSH profiles, remote file workspace tools, Playwright MCP, named agents, autopilot runs, local plan board, verified self-update, and local or Vault-backed provider keys
  • ZeroThink Logic: Disabled
Sovereign Pro

Full access to the Pentagon Protocol and proprietary closed-source models.

  • ZeroThink Logic: Active (Reasoning Layer)
  • xAI: Grok 4, Grok 4 Fast, Grok Code Fast 1
  • OpenAI: GPT-5.4, GPT-5 Mini, GPT-5.1
  • Google: Gemini 2.5 Pro / Flash / Flash-Lite / Gemini 3 previews
  • Billing: Cancel any time. Monthly non-refundable. Yearly refund requests within 7 days via shaf@talktoai.org

The Model Matrix

ProviderModel NameSpecsAccessBest Use
GROQGPT-OSS 120B / CompoundCurrent fast cloud stackFreeGeneral reasoning, Oracle, broad strategic work
GROQDeepSeek R1 Distill 70B / Qwen 3 32B / Llama 3.3 70BOpen-weight fast laneFreeCoding, structured analysis, fast iterations
NVIDIAQwen 3 Coder 480B / Codestral 22B / Devstral 2Heavy coding + instructFreeLarge code work, technical drafting, deep analysis
NVIDIAGemma 4 31B / Gemma 3n E4B / DeepSeek V3.2 / Llama 3.3 70BCurrent NVIDIA frontier setFreeLong-form reasoning, multilingual work, coding and research
xAIGrok 4 / Grok Code Fast 1Current premium xAI lineProReasoning, coding, live premium use
GOOGLEGemini 2.5 Pro / Gemini 3 previewsMultimodal + long-contextProLarge documents, deeper analysis, mixed text/media work
OPENAIGPT-5.4 / GPT-5 Mini / GPT-5.1Current frontierProPremium reasoning and writing

Oracle Desk

Crypto + General

The Oracle route is tuned for both crypto scenarios and broader strategic calls. Modes include crypto, general, launch, and risk.

Default Engine

Oracle now defaults to Groq + GPT-OSS 120B for a strong speed / reasoning balance, while still letting you switch provider and model directly from the desk. Llama 3.3 70B, Compound, and the NVIDIA frontier set are there when you want a different read.

Support The Project

The Oracle page includes a Solana support card so people who use the desk heavily can donate directly to the ZeroThink build.

Direct Web Mode

If someone wants ZeroThink without Google login first, the lean browser route is exposed at /guest. That lane accepts local Groq or NVIDIA keys, keeps memory/history off, does not connect to the ZeroThink account/Vault, and stores the key only in the browser.

ZeroThink CLI

Cross-Platform + Guided

The ZeroThink CLI, aka Agent Zero, installs on Windows PowerShell, macOS, and Linux. It can either pair to the same ZeroThink account as the web app or run in a no-Google direct mode with a local Groq or NVIDIA key on the machine. It now includes a real first-run setup guide on plain zerothink, beginner help like zerothink tutorial, zerothink help quickstart, zerothink help sessions, zerothink help plan, zerothink help browser, zerothink help mcp, zerothink help remote, zerothink help agents, zerothink key status, zerothink status, zerothink runtime, zerothink models coding, in-terminal /guide, a local /plan board, local web search and page browse, a first-class Playwright MCP browser lane, smarter short-prefix slash handling like /he, slash-command autocomplete when the optional prompt-toolkit dependency is present, a real remote workspace layer for SSH, SCP, and remote file read/write/replace flows, plus an explicit autopilot layer with saved named workers like zero1, zero2, and zero3. In the interactive prompt, plain-English build/fix requests now route into the real tool stack so Agent Zero can actually create folders, write files, and carry out bounded coding tasks instead of only narrating commands, and you can switch between /steer auto, /steer plan, and /steer confirm if you want tasks staged before execution. Use /go to run a staged task and /stop to discard it or request stop for the latest saved agent. Slash commands like /help only work inside the interactive ZERO> prompt. The CLI now runs in clean operator mode by default, so the full ZeroThink reasoning trace stays in the web app and Studio instead of being dumped into terminal replies.

Large Code Friendly

The CLI supports direct file injection, workspace scans, file search, file writes and replacements, zip creation and extraction, remote SCP transfers, remote file inspection and edits on saved server profiles, repo-level guidance files like ZEROTHINK.md, CLAUDE.md, AGENTS.md, and .claude/CLAUDE.md, local web search, saved browser pages, Playwright MCP automation for dynamic pages, page attach flows, and the same packed file strategy used in the web app, so large source files and config dumps are easier to analyze without immediately blowing the window. The CLI reasoning layer now follows a stronger private orient → inspect → plan → act → verify → report workflow instead of staying in generic assistant mode.

Operator Modes

The public install and manual page is exposed at /cli. The CLI can run in a safer folder-only mode for one repo or switch to machine mode when you want shell and server operator access, and the guide now walks beginners through install, linked-account mode, no-Google direct mode, first prompt, help topics, model selection, autocomplete, local checkpoint/resume flows, local planning, and the limits of direct mode. It also has a built-in zerothink update command that verifies the published CLI hash before replacing the local script, zerothink status for a quick local readiness snapshot, a remote workspace lane via zerothink remote ..., a local MCP layer with a built-in Playwright browser profile via zerothink mcp add-playwright and zerothink playwright open https://example.com, plus self-driven agent workflows like zerothink autopilot "audit this repo", zerothink agent create zero1 "map the auth flow", zerothink agent log zero1, zerothink stop zero1, and in-prompt /stop. For coding, the current recommended defaults are openai/gpt-oss-120b on Groq and qwen/qwen3-coder-480b-a35b-instruct on NVIDIA, and the fastest way to inspect the curated shortlist is zerothink models coding. If you just type a normal task like make a new project folder for shafaet.com inside the interactive prompt, Agent Zero now routes that into execution mode automatically instead of staying in pure chat mode, unless you switch to a staging lane like /steer plan.

No-Google CLI Mode

If you skip Google login in the CLI, ZeroThink can still auto-detect GROQ_API_KEY or NVIDIA_API_KEY from your local shell, or you can store them locally with zerothink key set groq / zerothink key set nvidia. That lane does not import Vault keys, memory, saved history, linked-account session state, or Pro account state. Local CLI browser tools like zerothink search, zerothink browse, /search-web, /page, and local Playwright MCP browser commands still work because they are machine-local terminal helpers rather than website account features. For Groq and NVIDIA, the CLI can also fall back to that local direct-key lane automatically if the linked provider lane fails. If you ever want the exact live route instead of asking the model to guess, use zerothink runtime or /runtime. If you already have the CLI installed, the safe update command is simply zerothink update.

ZeroThink Logic (Pentagon Protocol)

Note: ZeroThink is not merely a prompt; it is a proprietary Reasoning Architecture. It forces the underlying model into a recursive dialectical state.

This architecture is the Intellectual Property of TalkToAi and is the core value proposition of the platform. Therefore, access to the Pentagon Protocol is reserved exclusively for the Sovereign Pro tier to maintain system integrity.

1. ALPHA (Logic)

Analyzes the pure logical constraints of the request.

2. BETA (Quantum)

Applies abstract association and probabilistic weighting.

3. DELTA (Ethics)

Checks alignment with the 11:11 Math of Goodness.

API Configuration

To activate the Scout Node (Internet Search) or the premium model engines, you must obtain API keys from the respective providers.

Groq Cloud (Recommended / Free)
Cost: Free Beta (High Limits)
Steps:
1. Go to console.groq.com
2. Sign up with Email/Google.
3. Click "Create API Key".
4. Paste into the ZeroThink Vault under "Groq API", or use zerothink key set groq for local direct mode in the CLI.
5. If you already export GROQ_API_KEY in your shell, the CLI can auto-detect it without Google login.
NVIDIA NIM (Free Credits)
Cost: 1,000 Free Credits on Signup
Steps:
1. Go to build.nvidia.com
2. Click "Login" -> Create Account.
3. Click "Get API Key" (Top Right).
4. Paste into the ZeroThink Vault under "NVIDIA NIM", or use zerothink key set nvidia for local direct mode in the CLI.
5. If you already export NVIDIA_API_KEY in your shell, the CLI can auto-detect it without Google login.
Google Gemini (Free Tier)
Cost: Free Tier Available / Pay-as-you-go
Steps:
1. Go to aistudio.google.com
2. Click "Create API Key".
3. Paste into the ZeroThink Vault under "Gemini API".
xAI (Grok)
Cost: Paid (Pre-load Credits)
Steps:
1. Go to console.x.ai
2. Add Billing Details.
3. Generate Key and paste into Vault under "xAI API".
OpenAI (GPT-5.4 / GPT-5 Mini / GPT-5.1)
Cost: Paid (Pay-as-you-go)
Steps:
1. Go to platform.openai.com
2. Create new Secret Key.
3. Paste into Vault under "OpenAI API".
Serper (Web Search)
Cost: 2,500 Free Queries
Steps:
1. Go to serper.dev
2. Sign up.
3. Copy API Key.
4. Paste into Vault under "Serper Web Key".
ZERO SYSTEM v35 // SOVEREIGN ARCHITECTURE
© 2026 Shaf Brady / TalkToAi