Local Execution Authority for Large Language Models

R1 Hardware Engine enforces hardware-aware limits before execution — preventing crashes, freezes, and unstable local runs.

Built for developers running LLMs locally who want certainty, not guesswork.

Install Free via Open VSX ⚡ Upgrade to Pro — $19 Source

Infrastructure Guarantees

Compatibility Matrix

Hardware Fit Preview

Preview UI. Numbers shown are illustrative. Actual authority decisions are computed live inside the extension using real hardware data.

24.5
TPS (Est.)
Status: Optimal Fit

Core Capabilities

Saved You From a Crash

Local LLM execution often fails after launch — when VRAM is exhausted or the operating system begins aggressive swapping.

R1 Hardware Engine prevents those failures by enforcing a safety boundary before execution begins.

  1. A local model configuration is requested.
  2. The authority engine predicts real memory pressure using live hardware data.
  3. If instability is likely, execution is denied to protect the system.

This prevents GPU resets, frozen desktops, and forced reboots caused by unsafe local runs.

What Pro Includes

Full Model Library

  • 22 models across 6 families (vs 6 free)
  • Llama 4, Qwen3, Mistral, Phi-4, Gemma 3
  • VRAM predictions for every model + quantization tier

Advanced Stabilization

  • 4-bit KV cache quantization
  • Aggressive context reduction strategies
  • CPU offload fallback for borderline configurations

Pro does not bypass hardware limits. If a configuration cannot be stabilized safely, it is still denied — even with Pro.

Free vs Pro

Free
Pro — $19
Model families
1 (DeepSeek R1)
6 families
Total models
6
28 (6 free + 22 pro)
Llama 4, Qwen3, Mistral, Phi-4, Gemma 3
VRAM predictions + quantization tiers
DeepSeek only
All 28 models
4-bit KV cache quantization
CPU offload fallback
Context reduction strategies
MCP server integration
Advisory
Enforced
License type
Free forever
One-time, lifetime

One-time $19. No subscription. Activates permanently on your machine.

⚡ Get Pro — $19

In Action

R1 Hardware Dashboard
── Hardware Scan ──────────────────
GPU     RTX 4080 (16 GB VRAM)
RAM     64 GB system
Mode    FREE
── Authority Status ───────────────
✓ Engine active — monitoring
⚡ Upgrade to Pro for full matrix

Live Hardware Dashboard (Free Mode)

R1: Validate & Run
Model   DeepSeek-R1 7B (Q4)
VRAM    4.1 GB / 16 GB
Context 8,192 tokens
✓ ALLOW — Safe to execute
Predicted 28.3 TPS · 12.1% VRAM pressure

Safe Execution — ALLOW

R1: Validate & Run
Model   DeepSeek-R1 70B (Q4)
VRAM    38.2 GB required
Available 16 GB
✗ DENY — Insufficient VRAM
Would cause GPU OOM + system freeze

Crash Prevention — DENY

R1: Validate & Run — PRO
Model   Qwen3 32B (Q8)
Requested 32K context
↓ DOWNGRADED to Q4 + 8K ctx
⚡ ALLOW w/ Stabilization
KV cache quantized · CPU offload active

Pro: Stabilization — DOWNGRADED + Recovered

Trust & Transparency

Model Library

Free — DeepSeek R1 Family

  • DeepSeek R1 1.5B, 7B, 14B, 32B
  • DeepSeek R1 70B, 671B MoE

Pro — Full Library (22 Models)

  • Meta Llama 4 — Scout 17B, Maverick 17B MoE, 3.3 70B, 3.2 3B
  • Alibaba Qwen3 — 7B, 14B, 32B, 72B, 235B MoE
  • Mistral AI — 7B, Small 24B
  • Microsoft Phi-4 — 14B, Mini 3.8B
  • Google Gemma 3 — 4B, 12B, 27B

One-time payment. Lifetime license. Activate in VS Code with a single key.

⚡ Get Pro — $19

How to Activate Pro

  1. 01.Purchase above — you'll receive a license key by email immediately.
  2. 02.In VS Code, press Ctrl+Shift+P and run R1 Hardware: 💎 Activate License Key.
  3. 03.Paste your key — Pro unlocks permanently on that machine.