Arista floats its answer to the strain AI puts on networks

11-10-2022

Arista floats its answer to the strain AI puts on networks

AI Spine relies on Arista’s switches with deep packet buffers and its networking software to control heavy AI traffic, reduce latency, and eliminate packet loss.

By Michael CooneySenior Editor, Network World | JAN 5, 2023 6:54 AM PST

Wired brain illustration - next step to artificial intelligence

If networks are to deliver the full power of AI they will need a combination of high-performance connectivity and no packet lossThe concern is that today’s traditional network interconnects cannot provide the required scale and bandwidth to keep up with AI requests, said Martin Hull, vice president of Cloud Titans and Platform Product Management with Arista Networks. Historically, the only option to connect processor cores and memory have been proprietary interconnects such as InfiniBand, PCI Express and other protocols that connect compute clusters with offloads but for the most part that won’t work with AI and its workload req uirements.

Arista AI Spine

To address these concerns, Arista is developing a technology it calls AI Spine, which calls for data-center switches with deep packet buffers and networking software that provides real-time monitoring to hep manage the buffers and efficiently control traffic.

[ Get daily insights by signing up for Network World newsletters. ]

“What we are starting to see is a wave of applications based on AI, natural language, machine learning, that involve a huge ingestion of data distributed across hundreds or thousands of processors—CPUs, GPUs—all taking on that compute task, slicing it up into pieces, each processing their piece of it, and sending it back again,” Hull said.“And if your network is guilty of dropping traffic, that means that the start of the AI workload is delayed because you've got to retransmit it. And if during the processing of those AI workloads, traffic is going backwards and forwards again, that slows down the AI jobs, and they may actually fail.”

AI Spine architecture

Arista’s AI Spine is based on its 7800R3 Series data-center switches, which, at the high end support 460Tbps of switching capacity and hundreds of 40Gbps, 50Gbps, 100Gbps, or 400Gbps interfaces along with 384GB of deep buffering. “Deep buffers are the key to keeping the traffic moving and not dropping anything,” Hull said. “Some worry about latency with large buffers, but our analytics don’t show that happening here.”

[ ‘IT has a new ‘It Crowd’: Join the CIO Tech Talk Community ]

AI Spine systems would be controlled by Arista’s core networking software, the Extensible Operating System (EOS), which enables high-bandwidth, lossless, low-latency, Ethernet-based networks that can interconnect thousands of GPUs at speeds of 100Gbps, 400Gbps, and 800Gbps along with buffer-allocation schemes, according to a white paper on AI Spine.To help support that, the switches and EOS package creates a fabric that breaks apart packets and reformats them into uniform-sized cells, “spraying” them evenly across the fabric, according to Arista. The aim is to ensure equal access to all available paths within the fabric and zero packet loss.“A cell-based fabric is not concerned with the front-panel connection speeds, making mixing and matching 100G, 200G, and 400G of little concern,” Arista wrote. “Moreover, the cell fabric makes it immune to the 'flow collision' problems of an Ethernet fabric. A distributed scheduling mechanism is used within the switch to ensure fairness for traffic flows contending for access to a congested output port.”Because each flow uses any available path to reach its destination, the fabric is well suited to handling the “elephant flow” of heavy traffic common to AI/ML applications, and as a result, “there are no internal hot spots in the network,” Arista wrote.

AI Spine models

To explain how AI Spine would work, Arista’s white paper provides two examples.In the first, a dedicated leaf-and-spine design with Arista 7800s tied to perhaps hundreds of server racks, EOS’s intelligent load-balancing capabilities would control the traffic among servers to avoid collisions.QoS classification, Explicit Congestion Notification (ECN), and Priority Flow Control (PFC) thresholds are configured on all the switches to avoid packet drops. Arista EOS’s Latency Analyzer (LANZ) determines the appropriate thresholds to avoid packet drops while keeping the throughput high and allows the network to scale while keeping latency predictive and low. The second use case, which could scale to hundreds of endpoints, connects all the GPU modes directly into the 7800R3 switches within AI Spine. The result is a fabric providing a single hop between all endpoints, driving down latency, and that enables a single, large, lossless network requiring no configuration or tuning, Arista wrote.

Challenges of networking AI

The need for the AI Spine architecture was primarily driven by technologies and applications such as server virtualization, application containerization, multi-cloud computing, Web 2.0, big data, and HPC. “To optimize and increase the performance of these new technologies, a distributed scale-out, deep-buffered IP fabric has been proven to provide consistent performance that scales to support extreme ‘East-West’ traffic patterns,” Arista wrote.While it may be early for most enterprises to worry about handling large-scale AI cluster workloads, some larger environments as well as hyperscaler, financial, virtual reality, gaming, and automotive development networks are already gearing up for the traffic disruption they might cause on traditional networks.As AI workloads grow they put increasing pressure on the network for scale and bandwidth, but also for the right storage and buffer depth, having predictable latency, and handling both small packets an elephant flows, Jayshree Ullal, CEO of Arista recently told a Goldman Sachs technology gathering. “This requires a tremendous amount of engineering to make traditional Ethernet run as a back-end network to support this technology for the future and the growing use of 400G is going to add additional fuel to this development,” Ullal said.

Keep in touch

FOR EVEN MORE FACTS!

Latest Post

10-02-2026

Top 10 shipping companies in the world as of 2025

10-02-2026

Prime Flight acquires GAT to expand ground handling services By Rebecca Jeffrey | 6 February 2026

10-02-2026

CMA CGM to split West Africa Express into three loops with new WAX1 and WAX2 from late March

10-02-2026

Royal Jordanian Cargo strengthens U.S. presence through ECS Group partnership

30-01-2023

How to rank the best freight forwarding services ?

03-01-2023

Machine learning in the digital age

18-10-2022

Freight News

16-10-2022

Tesla delivery time is longer on some China models after discounts

11-10-2022

Amazon UK warehouse shake-up places 1,200 jobs at risk

11-10-2022

Best Business Schools In The World For 2023

11-10-2022

Pioneer logistics Network Sponsoring Summer Camp in Jordan

11-10-2022

Arista floats its answer to the strain AI puts on networks

Arista floats its answer to the strain AI puts on networks

Arista floats its answer to the strain AI puts on networks

Arista floats its answer to the strain AI puts on networks

AI Spine relies on Arista’s switches with deep packet buffers and its networking software to control heavy AI traffic, reduce latency, and eliminate packet loss.

Arista AI Spine

AI Spine architecture

AI Spine models

Challenges of networking AI

CATEGORIES

Keep in touch

Latest Post

10-02-2026

10-02-2026

10-02-2026

10-02-2026

30-01-2023

03-01-2023

18-10-2022

16-10-2022

11-10-2022

11-10-2022

11-10-2022

11-10-2022

Quick Links

Explore More

Contacts

Keep in touch

Arista floats its answer to the strain AI puts on networks

Arista floats its answer to the strain AI puts on networks

Arista floats its answer to the strain AI puts on networks

AI Spine relies on Arista’s switches with deep packet buffers and its networking software to control heavy AI traffic, reduce latency, and eliminate packet loss.

Arista AI Spine

AI Spine architecture

AI Spine models

Challenges of networking AI

CATEGORIES

Keep in touch

Latest Post

10-02-2026

10-02-2026

10-02-2026

10-02-2026

30-01-2023

03-01-2023

18-10-2022

16-10-2022

11-10-2022

11-10-2022

11-10-2022

11-10-2022

Quick Links

Explore More

Contacts

Keep in touch

Sign In