d-Matrix
  • Technology
  • Product
  • Ecosystem
  • Blog
  • About
  • Careers

d-Matrix Blog

Featured

Scaling AI the Right Way: Introducing Our Rack-Level Inference Solution  

Scaling AI the Right Way: Introducing Our Rack-Level Inference Solution  

October 14, 2025
What is AI Inference and why it matters in the age of Generative AI

What is AI Inference and why it matters in the age of Generative AI

June 4, 2025
The Complete Recipe to Unlock AI Reasoning at Enterprise Scale

The Complete Recipe to Unlock AI Reasoning at Enterprise Scale

February 13, 2025
Building apps on the phone: how heterogeneous pipelines enable speech-to-code experiences

Building apps on the phone: how heterogeneous pipelines enable speech-to-code experiences

Modern AI has finally enabled us to build advanced, seamless applications in final frontier of human interaction: phone calls. Multimodal agentic AI applications have finally turned voice-based experiences from pulling… Read More
May 28, 2026
Where heterogeneous pipelines power an agentic coding future

Where heterogeneous pipelines power an agentic coding future

How speculative decoding supercharged AI inference in disaggregated pipelines

How speculative decoding supercharged AI inference in disaggregated pipelines

d-Matrix Boosts Rack-scale AI Capabilities With  Acquisition of GigaIO Data Center Business

d-Matrix Boosts Rack-scale AI Capabilities With Acquisition of GigaIO Data Center Business

What does success mean for agentic networks?

What does success mean for agentic networks?

Going Vertical: Why we created a 3D DRAM solution to advance low latency AI inference

Going Vertical: Why we created a 3D DRAM solution to advance low latency AI inference

d-Matrix and Gimlet Labs to Deliver 10x Speed Ups, Massive Power Efficiency for Frontier AI Workloads

d-Matrix and Gimlet Labs to Deliver 10x Speed Ups, Massive Power Efficiency for Frontier AI Workloads

View All Posts >

Trending

Blazing the Trail Toward More Scalable, Affordable AI with 3DIMC 

How to Bridge Speed and Scale: Redefining AI Inference with Ultra-Low Latency Batched Throughput

What is AI Inference and why it matters in the age of Generative AI

Transforming AI: d-Matrix’s Pivotal Moments in Pursuit of Gen AI Inference At Scale

Why Datacenters are struggling to keep up with Generative AI

Featured Video

The DeepSeek Moment

In this short talk, d-Matrix CTO Sudeep Bhoja discusses the release of the Deep Seek R1 model, highlighting its impact on inference compute. He discusses the evolution of reasoning models and the significance of inference time compute in enhancing model performance.

Learn more about d-Matrix

From the Media

d-Matrix Boosts Rack-scale AI Capabilities With Acquisition of GigaIO Data Center Business

d-Matrix and Gimlet Labs to Deliver 10x Speed Ups, Massive Power Efficiency for Frontier AI Workloads

Alchip logo

d-Matrix and Alchip Announce Collaboration on World’s First 3D DRAM Solution to Supercharge AI Inference

Andes logo

d-Matrix and Andes Team on World’s Highest Performing, Most Efficient Accelerator for AI Inference at Scale

Datacenter

d-Matrix Raises $275 Million to Power the Age of AI Inference

SquadRack d-Matrix

d-Matrix Announces SquadRack, Industry’s First Rack-Scale Solution Purpose-Built for AI Inference at Datacenter Scale

View all media articles > For all press inquiries, please email pr@d-matrix.ai>
Transforming AI from
unsustainable to attainable.
  • Technology
  • Product
  • Ecosystem
  • About
  • Careers
  • Blog
  • Newsletter
  • Media Kit
  • Contact
  • Privacy Policy
  • Terms of Use
© d-Matrix, Inc. 2026
X Twitter Logo Streamline Icon: https://streamlinehq.com