Nvidia and AWS have expanded their partnership to make AI cheaper to run at production scale, according to details Nvidia published on June 24.
As AI developers and cloud providers have launched server chips to lessen their dependence on Nvidia’s, some analysts and executives at these firms expected the chips to eat into Nvidia’s market share ...
Demand for AI inference compute workloads is increasing rapidly, and Nvidia is dominating the market despite competition from AMD and Broadcom.
Nvidia Corp. today stoked the fires of the emerging artificial intelligence factory trend with the announcement of Dynamo 1.0, an open-source platform the company is positioning as an essential ...
Nvidia is reportedly developing a specialized processor aimed at accelerating AI inference, a move that could reshape how companies like OpenAI deploy their models. The push comes as Nvidia has also ...
The company says its new architecture marks a shift from training-focused infrastructure to systems optimized for continuous, low-latency enterprise AI workloads. 2026 is predicted to be the year that ...
Nvidia has just unveiled its new Groq 3 LPX inference accelerator for AI. The product incorporates Groq's high memory bandwidth and Nvidia's processing power. The news comes less than three months ...
HPE announced it has outfitted its HPE ProLiant Compute DL394 Gen12 server with the Vera CPU—which is designed for agentic ...
Last October, Nvidia CEO Jensen Huang admitted that the company’s market share in China had fallen to effectively zero. However, it seems Nvidia is not giving up on the Chinese market, as it is now reportedly ...
Ahead of Nvidia Corp.’s GTC 2026 this week, we reiterate our thesis that the center of gravity in artificial intelligence is shifting from “How fast can you train?” to “How well can you serve?” ...