AI chipmaker Groq has secured a new $650 million funding round as the company accelerates its transition toward AI inference cloud services following its high-profile technology licensing agreement with Nvidia and the departure of several senior executives. The latest investment marks the next phase of Groq’s evolution as it focuses on scaling its global AI infrastructure footprint and expanding its neocloud business.
The funding round was led by investment firms Disruptive and Infinitum. Disruptive founder Alex Davis, who also serves as Groq’s chairman, participated alongside the Fort Lauderdale-based hedge fund. The company did not disclose its updated valuation. Groq was previously valued at $6.9 billion following a $750 million funding round completed in September.
Groq Rebuilds Following Nvidia Talent and IP Deal
The new financing comes roughly six months after Nvidia entered a non-exclusive licensing agreement for Groq’s technology and recruited several of the company’s top executives, including founder and CEO Jonathan Ross and president Sunny Madra. Despite widespread speculation at the time, the transaction was not a full acquisition, allowing Groq to continue operating as an independent company. Ross is widely recognized in the AI hardware industry for helping develop Google’s Tensor Processing Unit (TPU) architecture before co-founding Groq alongside fellow Google engineer Doug Wightman. Following the Nvidia agreement, Wightman remained with the company and assumed leadership responsibilities as Groq repositioned its business strategy.
The licensing agreement also gave Nvidia access to Groq’s Language Processing Unit (LPU) technology, which had been specifically designed for AI inference workloads. Nvidia later unveiled the Nvidia Groq 3 LPX inference hardware system during its GTC conference, further integrating Groq’s technology into its expanding AI infrastructure portfolio.
Pivot From AI Chips to AI Inference Infrastructure
Rather than competing directly in the increasingly crowded AI chip market, Groq is now focusing on AI inference services, one of the fastest-growing segments of the AI infrastructure ecosystem. Inference refers to the process of running trained AI models and generating responses to user queries, a market many industry observers believe will ultimately exceed AI training workloads in scale.
The company has repositioned itself around its neocloud platform, offering developers and enterprises access to high-performance AI inference infrastructure powered by Groq technology. This strategic shift follows growing demand for inference capacity as organizations deploy AI applications into production environments.
Global AI Infrastructure Footprint Continues to Expand
Groq said its neocloud business has expanded rapidly and now operates across 13 data centers spanning North America, Europe, the Middle East, and the Asia-Pacific region. The company claims its infrastructure currently serves more than five million developers and thousands of AI-focused organizations worldwide. According to the company, its platform processes trillions of AI tokens every week, reflecting growing demand for inference infrastructure as enterprises increase adoption of generative AI, agentic AI, and large language model applications.
The latest funding will support further capacity expansion, deployment of new inference technologies, and continued growth of GroqCloud’s global infrastructure footprint. Groq is also expected to strengthen its position in the rapidly evolving AI inference market, where cloud providers and infrastructure companies are racing to support production-scale AI workloads.
AI Infrastructure Competition Intensifies
Groq’s latest raise highlights a broader trend across the AI industry. While much of the attention over the past two years has focused on training large AI models, infrastructure providers are increasingly shifting toward inference, where long-term demand is expected to be significantly larger. The company’s ability to attract fresh capital following the Nvidia licensing transaction suggests investors remain confident in the long-term growth potential of AI inference infrastructure. As hyperscalers, cloud providers, and enterprises continue scaling AI deployments, demand for specialized inference platforms is expected to rise sharply over the coming years.
