Calling it the highest-performance chip of any custom cloud accelerator, the company says Maia is optimized for AI inference across multiple models. Signaling that the future of AI may not just be how ...
Google researchers have warned that large language model (LLM) inference is hitting a wall amid fundamental memory and networking bottlenecks, not compute. In a paper authored by ...
As the National Retail Federation’s annual conference takes over New York this week, the conversations we’re having across SiliconANGLE, theCUBE, theCUBE Research and NYSE Wired all point to the same ...
Caterpillar is diving deeper into incorporating AI and automation into its fleet of construction machinery through a partnership with semiconductor giant Nvidia. The construction equipment giant is ...
Lenovo Group Ltd. is pushing to become the workhorse of the artificial intelligence industry after unveiling a slate of new, enterprise-grade server systems specifically for AI inference workloads.
Nvidia’s $20 billion strategic licensing deal with Groq represents one of the first clear moves in a four-front fight over the future AI stack. 2026 is when that fight becomes obvious to enterprise ...
Cory Benfield discusses the evolution of ...
Edge AI is a form of artificial intelligence that runs, at least in part, on local hardware rather than in a central data center or on cloud servers. It’s part of the broader paradigm of edge computing, in which ...
As organizations enter the next phase of AI maturity, IT leaders must step up to help turn promising pilots into scalable, trusted systems. In partnership with HPE. Training an AI model to predict ...
Firefly EC-AGX Orin is an NVIDIA Jetson AGX Orin 64GB-powered AI inference system, similar to the AAEON BOXER-8645AI and Vecow RAC-1000 rugged Edge AI systems, and designed for edge AI applications ...
Nvidia’s rack-scale Blackwell systems topped a new benchmark of AI inference performance, with the tech giant’s networking technologies playing a key role in the results. The InferenceMAX v1 ...