Up to
6.2x higher
real-time NLP inference performance (BERT) on 4th Gen Intel Xeon Platinum 8480+ with Intel Xeon AMX BF16 vs. prior generation with FP32³
Up to
10x
higher
PyTorch performance for both real-time inference and training workloads with built-in Intel AMX BF16 vs. prior generation with FP32³
70%
of data center AI inferencing runs on Intel Xeon processors²
Optimize AI infrastructure and implement solutions that unlock greater business value for system integrators faster, more simply, and with lower cost and risk.
Many AI initiatives fail to deliver value before the clock runs out. Streamline time to solution and optimize your customer's infrastructure for every phase of AI solution development. Deliver tangible results while helping system integrators avoid the unnecessary complexity, cost, and risk that come with specialized hardware or instances.
Scroll for More
Scroll for More
Intel has built ready-to-deploy systems and optimized developer tools that run out of the box on the most widely used AI inference server platform.
Habana® Gaudi®2 processor accelerates high- performance, high-efficiency deep learning training and inference and is particularly well suited for the scale and complexity of generative AI and large language models.
Intel® Data Center GPU Max Series is ideal for high performance computing (HPC) with AI. The GPU provides highly tuned, end-to-end AI and data pipelines. Enable the industry’s most flexible GPUs with the Intel® one API Toolkit, a core set of tools and libraries for developing high-performance, data-centric applications across diverse architectures.
Protect your AI initiative and comply with regulations with built-in security features, including confidential computing and Intel® Trusted Execution Technology (Intel® TXT). These hardware-enabled capabilities help protect sensitive data and models, comply with security and privacy regulations, and allow you to engage multiparty AI without exposing private data.
4th Gen Intel® Xeon® Scalable processors have more built-in accelerators than any other CPU to support AI and other demanding workloads. These accelerators include:
Intel® Advanced Matrix Extensions (Intel® AMX) accelerates AI capabilities on 4th Gen Intel Xeon Scalable processors, speeding up training and inference without additional hardware. This accelerator is ideal for workloads such as natural language processing, recommendation systems, and image recognition.
Intel® Advanced Vector Extensions 512 (Intel® AVX-512) accelerates performance for compute-intensive workloads. Intel AVX-512 is the latest x86 vector instruction set with up to two fused multiply-add (FMA) units and other optimizations to boost performance for the most demanding computational tasks.
Intel® Software Guard Extensions (Intel® SGX), accelerates performance for compute-intensive workloads. Intel AVX-512 is the latest x86 vector instruction set with up to two fused multiply-add (FMA) units and other optimizations to boost performance for the most demanding computational tasks.
Intel® Trusted Domain Extensions (Intel® TDX), offering confidentiality at the virtual machine (VM) level. Intel TDX isolates the guest OS and all VM applications from the cloud host, hypervisor, and other VMs on the platform. Intel TDX is designed so confidential VMs are easier to deploy and manage at scale than application enclaves.