Every ChatGPT query, every AI agent action, every generated video is based on inference. Training a model is a one-time ...
Microsoft has announced the launch of its latest chip, the Maia 200, which the company describes as a silicon workhorse ...
Hello Nvidia. Here comes Cerebras, the new darling of AI compute, with a $10 billion OpenAI contract and a new $1 billion in ...
Nvidia remains dominant in chips for training large AI models, while inference has become a new front in the competition.
Positron AI, the leader in energy-efficient AI inference hardware, today announced an oversubscribed $230 million Series B financing at a post-money valuation exceeding $1 billion.
OpenAI is reportedly looking beyond Nvidia for artificial intelligence chips, signalling a potential shift in its hardware ...
The seed round values the newly formed startup at $800 million.
OpenAI seeks inference chips beyond Nvidia's GPUs (The Chosun Ilbo on MSN): Reuters reported on the 2nd (local time) that OpenAI has been dissatisfied with certain performance aspects of Nvidia's ...
Dublin, Aug. 05, 2025 (GLOBE NEWSWIRE) -- The "AI inference - Company Evaluation Report, 2025" report has been added to ResearchAndMarkets.com's offering. The AI Inference Market Companies Quadrant is ...
SoftBank is positioning the internally developed Infrinia OS as a foundation for inference-as-a-service offerings. The Japanese giant suggests the stack will allow users to deploy services by ...
The Electric Power Research Institute (EPRI) is collaborating with Prologis, NVIDIA, and InfraPartners to study data centers designed for distributed inference. Distributed inference is a form of real ...