IT teams that need fast, predictable cluster networking with clear control over growth use Nvidia InfiniBand Switches to keep AI, HPC and shared research environments responsive. The range covers managed and externally managed options, so teams can choose whether switch control sits locally or in central fabric tools as the estate scales.

In data centres and shared compute platforms, these switches support low-latency, high-bandwidth traffic between nodes, helping reduce bottlenecks, keep jobs moving and standardise operations across storage, leaf and spine layers without adding unnecessary management overhead.

Nvidia InfiniBand Switches Quick Specs & Key Features

  • Low-latency HDR fabric: QM8700 and QM8790 provide 40 x 200Gb/s InfiniBand ports for non-blocking cluster tiers, giving AI, HPC and render workloads fast node-to-node exchange so jobs complete sooner and the fabric is easier to own.
  • Centrally managed operation: QM8790 and QM9790 hand switch control to external fabric tools, letting operations teams standardise provisioning, telemetry and change control across larger estates so management overhead stays lower.
  • High-density NDR bandwidth: QM9700 and QM9790 deliver 64 x 400Gb/s ports in 1U, increasing radix and reducing hop count for dense scale-out clusters so latency falls and growth is simpler.
Read more
  • Compact 1U footprint: The switch platforms fit 40-port HDR or 64-port NDR fabric roles into one rack unit, which helps data centres and shared labs add cluster capacity without expanding space requirements.
  • Leaf and spine flexibility: The range supports HDR leaf, storage and smaller spine roles, or NDR leaf and compact spine roles, allowing buyers to match fabric design to workload scale without overbuilding the interconnect.
  • Quantum-class model range: The QM8700, QM8790, QM9700 and QM9790 cover managed and externally managed HDR and NDR options, giving IT teams a consistent InfiniBand platform that fits different governance models and future cluster growth.
Steel City Consulting logo

Comparing multiple platforms? Our experts are available to help.

No commitment needed, no hard sells. Just straightforward technical guidance tailored to your infrastructure.

Find your ideal Nvidia InfiniBand Switches

Full technical specifications are available on each product page.

Model Popularity Deployment Target Organisation Size Primary Use Case Access Ports Port Configuration Maximum Port Speed PoE Budget (W) Maximum Switching Capacity (Gbps) Stackable Operating System / Management Platform
Mellanox QM9700 Quantum-2 40-Port NDR 400Gb/s InfiniBand Switch Mellanox QM9700 Quantum-2 40-Port NDR 400Gb/s InfiniBand Switch Data Centre Service Provider Spine 64 64×QSFP56 200G 0, None 51200 Supported MLNX-OS View
Mellanox QM9790 Quantum-2 40-Port NDR 400Gb/s InfiniBand Switch Mellanox QM9790 Quantum-2 40-Port NDR 400Gb/s InfiniBand Switch Data Centre Service Provider Spine 64 64×QSFP56 200G 0, None 51200 Supported MLNX-OS View
Mellanox QM8700 Quantum 40-Port HDR 200Gb/s InfiniBand Switch Mellanox QM8700 Quantum 40-Port HDR 200Gb/s InfiniBand Switch Data Centre Service Provider Spine 40 40×QSFP56 200G 0, None 16000 Supported MLNX-OS View
Mellanox QM8790 Quantum 40-Port HDR 200Gb/s InfiniBand Switch Mellanox QM8790 Quantum 40-Port HDR 200Gb/s InfiniBand Switch Data Centre Service Provider Spine 40 40×QSFP56 200G 0, None 16000 Supported MLNX-OS View

Nvidia InfiniBand Switches Deployment Scenarios and Industries

Data Centres

Data centre teams need low-latency InfiniBand fabrics for AI, HPC and storage clusters, but not every estate is ready for the cost and scale of NDR. This category supports compact HDR deployments for fast cluster performance, or denser NDR fabrics where radix, hop count and cable reduction matter.

Media & Entertainment

Studio and service-provider teams need predictable node-to-node traffic for render, simulation and generative-media jobs. These switches help keep shared farms responsive, with HDR options for busy clusters and NDR options where larger scene data and faster interconnects need more headroom.

Healthcare

Research, genomics and imaging teams need tightly coupled cluster performance for parallel analysis, often within tighter power and budget limits. This category provides low-latency fabrics that support controlled HDR deployments, or centralised NDR fabrics for larger research estates with stronger governance needs.

Finance

Quant, risk and AI modelling teams need deterministic interconnect behaviour so distributed runs finish without network delays becoming a constraint. InfiniBand switches in this category support fast cluster fabrics with either local management or central control, depending on how tightly the environment is governed.

Software Development

AI and HPC development teams need realistic cluster networking for training, benchmarking and distributed code validation before workloads move to production. These switches give shared labs and platform teams the fabric performance to test at scale, with management options that fit either smaller labs or centrally run estates.

Nvidia InfiniBand Switches Management and Licensing Options

Nvidia AI Platforms & Software

Nvidia AI Enterprise, NIM, and AI Workbench give IT and data science teams validated, enterprise-ready platforms to deploy, serve, and manage AI workloads at scale. We help organisations implement and support these software environments so AI infrastructure remains performant, current, and straightforward to operate.

Explore Nvidia AI Software

Nvidia AI Infrastructure Services

We provide end-to-end services to design, deploy, and optimise GPU-accelerated AI environments. From initial architecture through to integration and scaling, we help infrastructure teams build Nvidia environments that are aligned to workload demands and positioned to grow alongside the business.

Discuss Infrastructure Services

Nvidia Support & Lifecycle Management

As an authorised Nvidia partner, we help organisations maximise uptime and performance across their Nvidia estate. From technical support and software updates through to lifecycle planning, we help IT teams keep AI infrastructure reliable, current, and aligned to long-term operational requirements.

Contact Our NVIDIA Specialists

We help organisations get more from their Nvidia investments — from initial architecture through to ongoing optimisation and support. Contact our Nvidia specialists for guidance today.

Shop All Nvidia InfiniBand Switches Models

Browse our full range below, or contact our team for tailored configuration advice.

Designing & Supporting Nvidia InfiniBand Switches Solutions

Backed by decades of expertise in the IT sector, our specialists support every stage of your deployment — from initial selection through to long-term lifecycle management.

  • AI Fabric & Workload Assessment: We assess AI workloads, GPU communication patterns, east-west traffic, storage traffic, and future scalability requirements before recommending a fabric approach. This helps align NVIDIA InfiniBand or high-performance Ethernet designs to the way your cluster will actually operate, rather than treating networking as a generic switch purchase.
  • NVIDIA Platform & Topology Selection: We help you evaluate the right NVIDIA networking platforms, port speeds, rack layout, and topology design for your environment. The aim is to avoid under-specced fabrics that restrict GPU performance, as well as oversized switching designs that add unnecessary cost and complexity.
  • Lead Time, Scalability & Cost Planning: Larger high-performance switching platforms can present availability, lead time, or budget challenges. Where appropriate, we help compare equivalent multi-switch architectures that can deliver similar scalability and performance while improving deployment flexibility and keeping infrastructure costs under control.
Read more
  • Deployment, Integration & Fabric Configuration: Our engineers support switch configuration, fabric setup, interoperability checks, and integration across compute, storage, and GPU environments. Whether you need full deployment support or validation around an internal build, we help reduce complexity and improve long-term operational stability.
  • Performance Optimisation & Lifecycle Support: As GPU clusters scale, congestion, traffic flow, firmware, redundancy, and expansion planning all affect performance. We help optimise NVIDIA networking environments for AI training, inference, and data-intensive workloads, while supporting future growth as operational requirements evolve.

Nvidia InfiniBand Switches FAQ

Why choose Nvidia InfiniBand Switches instead of standard Ethernet for AI and HPC clusters?

Standard Ethernet works well for general business traffic, but AI and HPC workloads create far heavier communication between systems during training, simulation and parallel processing. As clusters grow, latency and congestion can start affecting overall workload efficiency.

Nvidia InfiniBand Switches are designed for high-performance cluster fabrics, helping maintain predictable throughput across AI training environments, HPC platforms and high-performance storage deployments where the network directly affects compute performance.

When are HDR Nvidia InfiniBand Switches a better fit than moving straight to NDR?

Not every clustered environment needs 400Gb/s fabric capacity immediately. Departmental AI clusters, research environments and mid-sized HPC deployments often still need low-latency performance without the scale and infrastructure overhead associated with NDR.

The QM8700 and QM8790 provide 40 HDR 200Gb/s ports in a compact 1U design, making them well suited to clustered workloads that need high-performance InfiniBand without moving into larger NDR fabrics.

Why would an organisation choose the QM8790 instead of the QM8700?

Internally managed switches can work well in smaller or isolated environments. The challenge appears when multiple AI, HPC or research clusters need consistent operational control across a wider infrastructure estate.

The QM8790 is externally managed for use with centralised fabric management platforms, while the QM8700 includes onboard management. Both provide the same 40 HDR 200Gb/s connectivity, so the decision is mainly operational rather than performance-based.

What makes the QM9700 a better fit than HDR models for larger AI and HPC fabrics?

HDR fabrics remain effective while cluster sizes and traffic demands stay moderate. As environments scale, larger datasets and distributed workloads increase the need for higher bandwidth and flatter fabric designs.

The QM9700 provides 64 NDR 400Gb/s ports in 1U, helping support large AI training clusters, simulation platforms and research computing environments with fewer switching layers and lower infrastructure complexity.

Why choose the QM9790 instead of the QM9700?

Onboard switch management can suit smaller deployments, but larger clustered environments often need centralised operational control across the wider fabric.

The QM9790 delivers the same Quantum-2 NDR switching class as the QM9700, but is designed for externally managed environments using Nvidia Unified Fabric Manager tools for provisioning, monitoring and fabric maintenance across shared AI and HPC estates.

Are NDR Nvidia InfiniBand Switches only relevant for the largest AI environments?

Smaller AI and HPC environments may continue operating effectively on HDR infrastructure while cluster traffic remains manageable. The move to NDR becomes more relevant as workloads, datasets and inter-node communication demands increase.

The QM9700 and QM9790 provide 64 NDR 400Gb/s ports in 1U, supporting large AI training environments, simulation platforms and research computing estates where consistent node-to-node performance matters at scale.

Need a different solution?

If these options aren’t the right fit for your environment, we provide a wide portfolio of product series and solutions that may better suit your infrastructure. Explore below, or speak to our team and we’ll help you find the right match.

Ready to discuss your requirements?

Whether you know exactly what you need or you’re still evaluating options, our team is available for a no-obligation conversation.

A group discussing IT solutions