Habana Labs Announces Gaudi AI Training Processor

Habana Labs, Ltd. (www.habana.ai), a leading developer of AI processors, today announced the Habana Gaudi™ AI Training Processor. Training systems based on Gaudi processors will deliver an increase in throughput of up to four times over systems built with equivalent number GPUs.

Gaudi’s innovative architecture enables near-linear scaling of training systems performance, as high throughput is maintained even at smaller batch sizes, thus allowing performance scaling of Gaudi-based systems from a single-device to large systems built with hundreds of Gaudi processors.

In addition to record-breaking performance, Gaudi brings another industry first to AI training: on-chip integration of RDMA over Converged Ethernet (RoCE v2) functionality within the AI processor, to enable the scaling of AI systems to any size, using standard Ethernet. With Gaudi, Habana Labs’ customers can now utilize standard Ethernet switching for both scaling-up and scaling-out AI training systems. Ethernet switches are multi-sourced, offering virtually unlimited scalability in speeds and port-count, and are already used in datacenters to scale compute and storage systems. In contrast to Habana’s standards-based approach, GPU-based systems rely on proprietary system interfaces, that inherently limit scalability and choice for system designers.

“With its new products, Habana has quickly extended from inference into training, covering the full range of neural-network functions,” commented Linley Gwennap, principal analyst of The Linley Group. “Gaudi offers strong performance and industry-leading power efficiency among AI training accelerators. As the first AI processor to integrate 100G Ethernet links with RoCE support, it enables large clusters of accelerators built using industry-standard components.”

Eitan Medina, Chief Business Officer of Habana Labs. Photo credit – Habana Labs

The Gaudi processor includes 32GB of HBM-2 memory and is currently offered in two forms:

HL-200 – a PCIe card supporting eight ports of 100Gb Ethernet;
HL-205 – a mezzanine card compliant with the OCP-OAM specification, supporting 10 ports of 100Gb Ethernet or 20 ports of 50Gb Ethernet.

Habana is also introducing an 8-Gaudi system called HLS-1, which includes eight HL-205 Mezzanine cards, with PCIe connectors for external Host connectivity and 24 100Gbps Ethernet ports for connecting to off-the-shelf Ethernet switches, thus allowing scaling-up in a standard 19’’ rack by populating multiple HLS-1 systems.

Gaudi is the second purpose-built AI processor to be launched by Habana Labs in the past year, following the Habana Goya™ AI Inference Processor. Goya has been shipping since Q4, 2018, and has demonstrated industry-leading inference performance, with the industry’s highest throughput, highest power efficiency (images-per-second per Watt), and real-time latency.

“Training AI models require exponentially higher compute every year, so it’s essential to address the urgent needs of the datacenter and cloud for radically improved productivity and scalability. With Gaudi’s innovative architecture, Habana delivers the industry’s highest performance while integrating standards-based Ethernet connectivity that enables unlimited scale,” said David Dahan, CEO and Co-founder of Habana Labs. “Gaudi will disrupt the status quo of the AI Training processor landscape.”

“Facebook is seeking to provide open platforms for innovation around which our industry can converge,” said Vijay Rao, Director of Technology, Strategy at Facebook. “We are pleased that the Habana Goya AI inference processor has implemented and open-sourced the backend for the Glow machine learning compiler and that the Habana Gaudi AI training processor is supporting the OCP Accelerator Module (OAM) specification.”

The Gaudi Processor is fully programmable and customizable, incorporating a second- generation Tensor Processing Core (TPC™) cluster, along with development tools, libraries, and a compiler, that collectively deliver a comprehensive and flexible solution. Habana Labs’ SynapseAI™ software stack consists of a rich kernel library and open toolchain for customers to add proprietary kernels.

Habana will be sampling the Gaudi to select customers in the second half of 2019. For more information on Gaudi AI Training and Goya AI inference processors, please visit www.habana.ai

Lihi

Next TowerJazz Expands its Leading-Edge High Voltage Low RDSON Power Platform »

Previous « Formula E: Audi in the thick of the title race

Published by

Lihi

7 years ago

DigiKey Collaborates with STMicroelectronics and Ultra Librarian to Deliver Enhanced eDesignSuite Integration

Free, browser-based design environment streamlines engineering workflows from simulation to BOM export and purchasing THIEF…

14 hours ago

LATEST NEWS

Flying electric vessel maker Candela raises €30M amid soaring oil prices

Candela’s largest funding round to date, with the World Bank’s IFC arm joining existing investors,…

2 days ago

LATEST NEWS

IBM Fellow and Quantum Pioneer Charles H. Bennett Receives A.M. Turing Award, Computing’s Highest Honor

• Charles H. Bennett helped pioneer the foundations of quantum information science alongside co-laureate Gilles…

5 days ago

LATEST NEWS

One Million Industrial Vehicles Charged

Delta Reaches New Milestone in Electrified Intralogistics Infrastructure Delta, a global leader in power and…

5 days ago

LATEST NEWS

Quantum Machines Launches Open Acceleration Stack Alongside NVIDIA, AMD and Riverlane to Deliver Next Level of Hybridization

First-of-its-kind framework enables seamless integration of quantum computers with advanced accelerators to support AI-native and…

5 days ago

LATEST NEWS

German Defence Force Expands Use of Driving Robots for Driverless Testing

The Bundeswehr acquires a fourth Automated Driverless Testing Solution from AB Dynamics The solution removes…

7 days ago

Habana Labs Announces Gaudi AI Training Processor

Related Post

Recent Posts

DigiKey Collaborates with STMicroelectronics and Ultra Librarian to Deliver Enhanced eDesignSuite Integration

Flying electric vessel maker Candela raises €30M amid soaring oil prices

IBM Fellow and Quantum Pioneer Charles H. Bennett Receives A.M. Turing Award, Computing’s Highest Honor

One Million Industrial Vehicles Charged

Quantum Machines Launches Open Acceleration Stack Alongside NVIDIA, AMD and Riverlane to Deliver Next Level of Hybridization

German Defence Force Expands Use of Driving Robots for Driverless Testing