NVIDIA and Microsoft Launch RTX Spark: Personal AI Computers That Run 120B+ LLMs On-Device

NVIDIA and Microsoft announced RTX Spark, a new platform for personal AI computers that can run large language models with over 120 billion parameters entirely on-device, ushering in what Jensen Huang called 'the personal AI computer era'.

What Is RTX Spark?

NVIDIA and Microsoft jointly announced RTX Spark on June 18, 2026 — a new hardware and software platform designed to bring enterprise-grade AI capabilities to personal computers. At the heart of RTX Spark is a dedicated AI accelerator built on NVIDIA's next-generation architecture, capable of running large language models with over 120 billion parameters entirely on-device without cloud connectivity. The platform includes a specialized GPU with 96 GB of high-bandwidth memory, a neural processing unit for always-on AI tasks, and tight integration with Microsoft Windows through a new secure runtime called OpenShell.

Jensen Huang, NVIDIA's CEO, declared: "The personal AI computer era has arrived. Just as the GPU transformed graphics from a niche application into a universal computing paradigm, RTX Spark transforms AI from a cloud service into a personal capability."

How It Works

RTX Spark combines three innovations. First, the hardware: a PCIe card (or integrated laptop module) featuring NVIDIA's new "Spark Core" — a chip designed specifically for transformer model inference, capable of processing 1,000+ tokens per second for models in the 70-120 billion parameter range. Second, the software: OpenShell, a Microsoft-created secure runtime that runs inside Windows, allowing AI agents to operate persistently with hardware-level memory isolation. Third, the model stack: RTX Spark ships pre-configured to run Llama 4, Microsoft Phi-4, and Mistral Large 2 models with full precision, while also supporting quantized versions of larger models like Llama 405B at reduced context lengths.

Feature	Specification
Model size support	Up to 120B parameters (native), 405B (quantized)
Inference speed	1,000+ tokens/sec (70-120B models)
Memory	96 GB HBM4e
Context window	Up to 1M tokens (using quantization)
Security runtime	OpenShell (hardware-isolated)
Power consumption	150-300W (varies by workload)

India Angle

RTX Spark has significant implications for India's technology ecosystem. India has over 300 million PC users and a rapidly growing market for AI-powered tools in education, healthcare, and business. On-device AI eliminates the need for expensive cloud API calls — a critical factor in a price-sensitive market where dollar-denominated API costs can be prohibitive. Indian software developers and AI startups can now build and test models locally without committing to cloud compute budgets. Educational institutions can deploy AI-powered tutoring systems that work offline, addressing India's connectivity challenges in smaller towns and rural areas. Microsoft has indicated that RTX Spark will be available in India through its OEM partners, with Dell, HP, and Lenovo expected to ship RTX Spark laptops by Q3 2026.

Industry Impact

The RTX Spark announcement sent ripples through the AI industry. Cloud AI providers — including OpenAI, Anthropic, and Google — face a future where an increasing share of AI inference moves from centralized data centers to edge devices. This reduces their addressable market for API-based inference while increasing demand for model optimization and distillation services. For enterprise customers, on-device AI eliminates data privacy concerns associated with sending sensitive documents to cloud APIs. For consumers, it means AI assistants that work offline with no latency and no usage fees. The India price point is expected to be approximately ₹85,000-1,50,000 for RTX Spark-equipped laptops, positioning them as premium but not inaccessible devices.

Sources

• NVIDIA Newsroom: NVIDIA and Microsoft launch RTX Spark
• RiskInfo.ai: AI insights — key global developments June 2026
• DevFlokers: Daily AI tech updates June 2026
• Kersai Research: June 2026 AI news roundup

Internal Links

• Meta partners with Reliance for AI data center in Jamnagar
• Anthropic's Claude Opus 4.8 and Mythos models
• Open-source AI momentum with Meituan's LongCat-Next

NVIDIA and Microsoft Launch RTX Spark: Personal AI Computers That Run 120B+ Parameter Models On-Device