50% off 1st month on Instant Servers - code 50OFF Sales: +1‑917‑284‑6090
Build your server

GPU dedicated servers

AI-ready infrastructure with NVIDIA power.
Run TensorFlow, train LLMs, mine crypto, and virtualize workloads with ease.

Custom configurations

Choose any server and any GPU. Your server is built with HP enterprise parts, tested for full compatibility with your GPU of choice.

Consistent results

Your GPU dedicated server operates in a temperature- and humidity-controlled environment, delivering consistent results and maximum efficiency.

Fast network

Deploy your GPU dedicated server on a custom-built global network designed for low latency.

Support

Help is just a click or call away. Get access to instant support from helpful humans, available around the clock.

GPU Dedicated Servers with instant deployment and low prices

Low-cost GPU dedicated servers

GPU bare metal cloud servers, starting at $59/mo.

Get instant access to your bare-metal server

Deploy your server instantly, in a global network backed by a 99.9% uptime SLA.
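
To put the 99.9% figure in concrete terms, the short sketch below converts an uptime percentage into the downtime it allows. The 30-day billing month and the function name are assumptions for illustration; the SLA's actual measurement terms are not stated here.

```python
def allowed_downtime_minutes(sla_percent, days=30):
    """Maximum downtime, in minutes, that still meets the SLA over the period."""
    total_minutes = days * 24 * 60
    return total_minutes * (1 - sla_percent / 100)


# 30-day month and 365-day year are assumed measurement windows.
print(f"{allowed_downtime_minutes(99.9):.1f} minutes per 30-day month")
print(f"{allowed_downtime_minutes(99.9, days=365) / 60:.2f} hours per year")
```

Under these assumptions, a 99.9% SLA allows roughly 43 minutes of downtime per month.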

24/7 Support

A team of GPU experts is available around the clock, via phone or live chat.

See Configurations

Hourly price per A100 GPU

[Chart: GPU cost comparison, hourly price per A100 GPU]

*Pricing based on a single A100 GPU with 40GB VRAM.

Up to 4 matching GPUs per server
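
Once a multi-GPU server is running, one way to confirm that every installed GPU is the same model is to parse the output of `nvidia-smi`. The sketch below is illustrative only: the helper names and the sample output are assumptions, not part of any provisioning tool, and `query_local_gpus` assumes the NVIDIA driver is installed on the server.

```python
import subprocess

# Real nvidia-smi query flags; the helper names below are hypothetical.
QUERY = ["nvidia-smi", "--query-gpu=name,memory.total", "--format=csv,noheader"]


def parse_gpu_list(csv_text):
    """Turn 'name, memory' CSV lines into a list of (name, memory) tuples."""
    gpus = []
    for line in csv_text.strip().splitlines():
        name, memory = (field.strip() for field in line.split(",", 1))
        gpus.append((name, memory))
    return gpus


def gpus_match(gpus, max_per_server=4):
    """True if there are 1..max_per_server GPUs and all are identical."""
    return 0 < len(gpus) <= max_per_server and len(set(gpus)) == 1


def query_local_gpus():
    """Run nvidia-smi on this machine (requires the NVIDIA driver)."""
    result = subprocess.run(QUERY, capture_output=True, text=True, check=True)
    return parse_gpu_list(result.stdout)


# Illustrative sample output, not captured from a real server.
sample = "NVIDIA A100-PCIE-40GB, 40960 MiB\nNVIDIA A100-PCIE-40GB, 40960 MiB\n"
print(gpus_match(parse_gpu_list(sample)))
```

On a live server, `gpus_match(query_local_gpus())` would run the same check against the installed hardware.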

NVIDIA A40 / A100

NVIDIA's Ampere architecture, the successor to Volta, is built for AI acceleration. The NVIDIA A40 brings powerful features for virtual reality projects, ray-traced rendering, and more, with second-generation RT cores that deliver up to 2X the throughput of the previous generation. With 336 Tensor cores, AI applications get up to 5X the training throughput, and the GDDR6 memory supports massive workloads for game developers, data scientists, graphic designers, and more.

The NVIDIA A100 Tensor Core GPU provides acceleration at every scale, using NVIDIA's Multi-Instance GPU (MIG) technology. Third-generation Tensor cores deliver up to 20X the performance of the previous generation, and MIG enables multiple networks to run simultaneously on a single A100 GPU, maximizing utilization.

NVIDIA A40 Specifications

  • 48 GB GDDR6 with ECC
  • 10752 CUDA Cores
  • 336 Tensor Cores
  • 696 GB/s Max Bandwidth
  • NVIDIA GPU Boost

NVIDIA A100 Specifications

  • 40 GB HBM2
  • 6912 CUDA Cores
  • 432 Tensor Cores
  • 1555 GB/s Max Bandwidth
  • NVIDIA GPU Boost
Order configuration
NVIDIA H100

The NVIDIA H100 Tensor Core GPU represents the pinnacle of AI and high-performance computing acceleration. Built on the Hopper architecture, the H100 delivers unprecedented performance for large-scale AI training and inference workloads. With fourth-generation Tensor Cores and the revolutionary Transformer Engine, the H100 accelerates AI models up to 9x faster than previous generations. Advanced features like Multi-Instance GPU (MIG) technology enable optimal resource utilization by partitioning a single GPU into multiple instances, while NVLink and PCIe Gen5 connectivity ensure maximum data throughput for the most demanding enterprise applications.

NVIDIA H100 Specifications

  • 80 GB HBM3 with ECC
  • 16896 CUDA Cores
  • 528 Tensor Cores (4th Gen)
  • 3 TB/s Memory Bandwidth
  • Transformer Engine
  • NVIDIA GPU Boost
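
When sizing a server for LLM work, a quick first check is whether the model weights alone fit in GPU memory. The sketch below is a rough estimate under stated assumptions: it counts weights only (activations, KV cache, and optimizer state need additional memory), uses decimal gigabytes, and assumes weights can be split evenly across GPUs. The function names and the example model sizes are hypothetical.

```python
# Bytes needed to store one parameter at each precision.
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "int8": 1}


def weights_gb(params_billion, dtype="fp16"):
    """Approximate weight size in GB: 1e9 params * bytes each / 1e9 bytes per GB."""
    return params_billion * BYTES_PER_PARAM[dtype]


def weights_fit(params_billion, vram_gb=80, dtype="fp16", gpus=1):
    """True if the weights alone fit in the combined memory of the GPUs."""
    return weights_gb(params_billion, dtype) <= vram_gb * gpus


# Hypothetical model sizes, checked against 80 GB per GPU.
for size in (13, 70):
    for gpu_count in (1, 2, 4):
        verdict = "fits" if weights_fit(size, gpus=gpu_count) else "does not fit"
        print(f"{size}B fp16 on {gpu_count} GPU(s): {verdict}")
```

Training typically needs several times the weight footprint, so treat a "fits" result as a lower bound rather than a guarantee.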
Order configuration
NVIDIA RTX 6000 Pro

The NVIDIA RTX 6000 Pro delivers professional-grade performance engineered for the most demanding 3D workflows, AI development, and visual computing applications. This enterprise-class workstation GPU combines 96 GB of ultra-fast GDDR7 memory with ECC error correction, advanced RT Cores for real-time ray tracing, and powerful Tensor Cores for AI-accelerated computing. Purpose-built for mission-critical workloads, the RTX 6000 Pro excels in media production, engineering simulation, scientific visualization, and enterprise AI research, delivering the reliability and performance that professionals need for large-scale projects and production environments.

NVIDIA RTX 6000 Pro Specifications

  • 96 GB GDDR7 with ECC
  • 24 064 CUDA Cores
  • 752 Tensor Cores (5th Gen)
  • 188 RT Cores (4th Gen)
  • 1 792 GB/s Memory Bandwidth
  • NVIDIA GPU Boost
Order configuration
NVIDIA L4 / L40S

The NVIDIA L4 and L40S data center GPUs deliver exceptional performance for AI inference, video processing, and graphics workloads in enterprise environments. Built for efficient deployment at scale, the L4 provides outstanding AI inference performance with low power consumption, making it ideal for edge deployments and video streaming applications. The more powerful L40S offers enhanced capabilities for AI training, visual computing, and virtual workstations, featuring advanced Tensor Cores and RT Cores. Both GPUs suit cloud service providers, telecommunications, and enterprises running mixed workloads that require both AI inference and graphics acceleration.

NVIDIA L4 Specifications

  • 24 GB GDDR6 with ECC
  • 7424 CUDA Cores
  • 232 Tensor Cores (4th Gen)
  • 58 RT Cores (3rd Gen)
  • 300 GB/s Memory Bandwidth

NVIDIA L40S Specifications

  • 48 GB GDDR6 with ECC
  • 18176 CUDA Cores
  • 568 Tensor Cores (4th Gen)
  • 142 RT Cores (3rd Gen)
  • 864 GB/s Memory Bandwidth
Order configuration
NVIDIA GeForce RTX 5080 / RTX 5090

Experience unparalleled performance with NVIDIA's newest generation of GPUs, delivering advances in ray tracing, AI-accelerated rendering, and computational power. Built on the Blackwell architecture, these cards combine energy efficiency with exceptional graphics capabilities for demanding workflows.

GeForce RTX 5080 Specifications

  • 16 GB GDDR7
  • 10 752 CUDA Cores
  • 960 GB/s Memory Bandwidth

GeForce RTX 5090 Specifications

  • 32 GB GDDR7
  • 21 760 CUDA Cores
  • 1 792 GB/s Memory Bandwidth
Order configuration
NVIDIA GeForce RTX 4090D

Experience the ultimate in gaming and creative performance with the NVIDIA GeForce RTX 4090D. Driven by the revolutionary Ada Lovelace architecture, this GPU combines incredible power and efficiency to deliver ultra-realistic graphics and truly immersive experiences. Featuring advanced ray tracing and AI-enhanced capabilities, the RTX 4090D ensures cinematic-quality rendering in real-time, ultra-smooth frame rates, and top-tier performance for even the most demanding games and applications.

RTX 4090D Specifications

  • 24 GB GDDR6X
  • 14 592 CUDA Cores
  • 1 008 GB/s Max Bandwidth

Compatible with Linux, CUDA/OpenCL, DirectX, Windows.

Order configuration
NVIDIA Quadro RTX A4000 / A5000 / A6000

NVIDIA's new generation of Ampere-based GPUs significantly outperforms the Turing Quadro RTX series. The second-generation RT Cores speed up workloads such as photorealistic rendering, 3D design, and ray tracing for greater visual accuracy. The RTX A series also brings AI to graphics, with features such as DLSS, AI denoising, and enhanced editing for certain applications.

QUADRO RTX A4000 Specifications

  • 16 GB GDDR6
  • 6144 CUDA Cores
  • 448 GB/s Max Bandwidth

QUADRO RTX A5000 Specifications

  • 24 GB GDDR6
  • 8192 CUDA Cores
  • 768 GB/s Max Bandwidth

QUADRO RTX A6000 Specifications

  • 48 GB GDDR6
  • 10752 CUDA Cores
  • 768 GB/s Max Bandwidth
Order configuration
NVIDIA GeForce RTX 3070 / 3080 / 3090

NVIDIA’s GeForce RTX 30 series (2nd-generation RTX) GPUs, built on the Ampere architecture, introduce many innovations, from faster RT and Tensor Cores to new streaming multiprocessors. The standout features are a thermal design that is 2X more efficient and ultra-fast GDDR6X memory, which together deliver outstanding performance for AI projects with large datasets, gaming, and visualization.

RTX 3070 Specifications

  • 8 GB GDDR6
  • 5888 CUDA Cores
  • 448 GB/s Max Bandwidth
  • NVIDIA GPU Boost

RTX 3080 Specifications

  • 10 GB GDDR6X
  • 8704 CUDA Cores
  • 760 GB/s Max Bandwidth
  • NVIDIA GPU Boost

RTX 3090 Specifications

  • 24GB GDDR6X
  • 10496 CUDA Cores
  • 936 GB/s Max Bandwidth
  • NVIDIA GPU Boost

Compatible with Linux, CUDA/OpenCL, KVM, Windows.

Order configuration
NVIDIA Quadro RTX 5000 / 6000 / 8000

The Turing™ chip architecture is reshaping the work of countless visual creators and designers. NVIDIA’s Quadro RTX series provides AI-based performance, accelerated ray tracing, and advanced shading, all of which allow artists to improve their rendering capabilities. Featuring up to 4608 CUDA® cores and up to 48 GB of GDDR6 memory, this series supports elaborate visual designs, 8K video content, massive architectural datasets, and more.

QUADRO RTX 5000 Specifications

  • 16 GB GDDR6
  • 3072 CUDA Cores
  • 448 GB/s Max Bandwidth
  • NVIDIA GPU Boost

QUADRO RTX 6000 Specifications

  • 24 GB GDDR6
  • 4608 CUDA Cores
  • 672 GB/s Max Bandwidth
  • NVIDIA GPU Boost

QUADRO RTX 8000 Specifications

  • 48 GB GDDR6
  • 4608 CUDA Cores
  • 672 GB/s Max Bandwidth
  • NVIDIA GPU Boost

Compatible with Linux, CUDA/OpenCL, KVM, Windows.

Order configuration
NVIDIA Quadro RTX 4000

NVIDIA’s Quadro RTX 4000 gives you outstanding performance and powerful features, all from a single PCIe slot. The Turing™ chip architecture, combined with modern display capabilities, delivers photorealistic ray-traced rendering quickly. RT cores make this advance in rendering possible, while the Tensor cores support deep learning applications. With this cost-effective solution, you can create authentic VR experiences and speed up your AI projects.

QUADRO RTX 4000 Specifications

  • 8 GB GDDR6
  • 2304 CUDA Cores
  • 416 GB/s Max Bandwidth
  • NVIDIA GPU Boost

Compatible with Linux, CUDA/OpenCL, KVM, Windows.

Order configuration

3D Rendering

Faster 3D graphics processing allows you to increase productivity and revenue.

Compute

Run your CUDA and OpenCL applications at optimal performance by using the computing power of the GTX 1080.

Mining

Use the 2560 cores to mine your favorite cryptocurrency.

NVIDIA Tesla T4

Choose NVIDIA’s Tesla T4 for multi-precision computing power through its Tensor Core technology. The T4 is up to 40 times faster than a conventional CPU and up to 3.5 times faster than its Pascal predecessor. Transcode up to 38 full HD video streams simultaneously when you pair a Tesla T4 with one of our HPE server configurations.* *Results may vary, based on server configuration.

Specifications

  • TURING TU104
  • 320 TURING TENSOR CORES
  • 2560 CUDA CORES
  • 16 GB GDDR6
  • 8.1 TFLOPS SINGLE PRECISION
  • 65 FP16 TFLOPS
  • 130 INT8 TOPS
  • 260 INT4 TOPS
  • 320 GB/s Max Bandwidth
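
The peak figures above show how the T4's throughput scales as numeric precision drops. The sketch below computes each precision's ratio to the single-precision figure, using only the numbers from the list. These are peak ratings; real throughput depends on the model and workload, and the dictionary and function names are illustrative.

```python
# Peak T4 figures from the specification list (TFLOPS for FP32/FP16, TOPS for INT8/INT4).
T4_PEAK = {"FP32": 8.1, "FP16": 65.0, "INT8": 130.0, "INT4": 260.0}


def speedup_over_fp32(precision):
    """Ratio of a precision's peak throughput to the FP32 peak."""
    return T4_PEAK[precision] / T4_PEAK["FP32"]


for precision in ("FP16", "INT8", "INT4"):
    print(f"{precision}: {speedup_over_fp32(precision):.1f}x the FP32 peak")
```

Each step down in precision from FP16 doubles the peak rate, which is why quantized inference is a common fit for this card.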

Compatible: VMWare ESXi, Citrix Xenserver, KVM, Linux, Windows.

Order configuration
CORAL

The Coral USB Accelerator

With the Coral USB Accelerator, you can add an Edge TPU to any Linux-based system. It is a low-cost, low-power way to get high-performance ML inference.

Specifications

  • Arm 32-bit Cortex-M0+ microcontroller, up to 32 MHz
  • Edge TPU ASIC (for TensorFlow Lite models)
  • USB 3.1 (5 Gb/s transfer speed)

Compatible with Linux machines running Debian 6.0 or higher, or any derivative (such as Ubuntu 10.0+), and with Raspberry Pi (2/3 Model B/B+).

NVIDIA GeForce RTX 2080 / RTX 2080 Ti

Get up to six times the performance of the Pascal chip predecessor, with the RTX 2080, powered by NVIDIA’s new Turing chip architecture.

RTX 2080 Specifications

  • 8 GB GDDR6
  • 2944 CUDA Cores
  • 448 GB/s Max Bandwidth
  • NVIDIA GPU Boost 4.0

RTX 2080 Ti Specifications

  • 11 GB GDDR6
  • 4352 CUDA Cores
  • 616 GB/s Max Bandwidth
  • NVIDIA GPU Boost 4.0

Compatible with Linux, CUDA/OpenCL, KVM.

Order configuration
NVIDIA GeForce GTX 1080 / 1070 Ti

Get excellent performance in graphics rendering, computing, or mining with NVIDIA's Pascal-based GeForce GTX 1070 Ti and GTX 1080 GPUs.

Specifications

  • 8 GB GDDR5X
  • 2560 CUDA Cores
  • 320 GB/s Max Bandwidth
  • NVIDIA GPU Boost 3.0

Compatible with Linux, CUDA/OpenCL, KVM.

Order configuration
NVIDIA Tesla P4 / P40 / P100

The NVIDIA Tesla P4 and P100 GPUs are well suited to machine learning and video transcoding. NVIDIA’s Pascal chip architecture has proven faster and more power-efficient than its Maxwell predecessor. Transcode up to 20 simultaneous video streams with a single Tesla P4 paired with our HPE BL460c blade server.* The Tesla P40 is a more powerful version of the Tesla P4, with more than twice the processing power. The Tesla P100 GPU is best suited to deep learning and remote graphics. With 18.7 TeraFLOPS of inference performance, a single Tesla P100 can replace over 25 CPU servers. *Results may vary, based on server configuration and video resolution of each stream.

Specifications

  • Pascal GP100 or GP104 chip
  • Up to 3584 CUDA cores
  • Up to 16 GB CoWoS HBM2
  • Enterprise grade hardware

Compatible: VMWare ESXi, Citrix Xenserver, KVM, Linux, Windows.

Order configuration
NVIDIA TITAN V

Get your deep learning results up to 1.5x faster, when compared to the P100 GPU board. Process up to 110 TeraFLOPS of inference performance with the Titan V GPU. Use the Titan V to predict the weather or to discover new energy sources. It’s the optimal GPU choice for precise, fast results. A single Titan V GPU server can replace up to 30 single CPU servers. Results may vary based on server configuration.

Specifications

  • NVIDIA Volta Chip
  • 5120 CUDA cores
  • 640 Tensor Cores
  • 12 GB CoWoS Stacked HBM2
  • 653 GB/s Max Bandwidth

Compatible: VMWare ESXi, Citrix Xenserver, KVM, Linux, Windows.

Order configuration

Why Server Room?

Your resource-intensive applications require enterprise-grade hardware, stress-tested for constant high loads. Your GPU dedicated server configuration runs through a series of tests to ensure full hardware compatibility and integration with your GPU of choice. Your services are backed by a 99.9% uptime SLA and are supported by a team of experts, available around the clock.