
Why choose Exoscale GPU Cloud Computing

Instant Access to NVIDIA GPU Power

Deploy powerful NVIDIA GPUs within minutes. Start training AI models, rendering 3D projects, or running simulations — no setup complexity or waiting times.

Performance Optimized for AI, ML, and Rendering

Choose the GPU instance that fits your workload. From deep learning and data analytics to 3D rendering and video processing, Exoscale offers high-performance GPUs optimized for speed and reliability.

Effortless Scalability for Dynamic Workloads

Scale GPU resources instantly as your project grows. Add or remove GPU instances on demand to handle everything from AI inference peaks to large-scale rendering tasks.

Transparent, Pay-as-You-Go Pricing

Predictable and transparent costs

Stay in full control of your cloud costs. Exoscale GPUs are billed per second with no hidden fees or data transfer surprises, which keeps scaling predictable and cost-efficient.
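To make the effect of per-second billing concrete, here is a minimal sketch of the arithmetic. The hourly rate used below is a made-up placeholder, not an Exoscale price, and the function name is hypothetical.

```python
from decimal import Decimal

SECONDS_PER_HOUR = 3600


def per_second_cost(hourly_rate: Decimal, seconds: int) -> Decimal:
    """Cost of running for `seconds` at `hourly_rate`, rounded to cents."""
    if seconds < 0:
        raise ValueError("seconds must be non-negative")
    # Multiply before dividing to limit intermediate rounding error.
    return (hourly_rate * seconds / SECONDS_PER_HOUR).quantize(Decimal("0.01"))


if __name__ == "__main__":
    rate = Decimal("2.00")      # hypothetical hourly rate, for illustration only
    runtime = 45 * 60 + 30      # a job that ran for 45 minutes and 30 seconds
    print(per_second_cost(rate, runtime))
```

With this placeholder rate, the 45.5-minute job costs 1.52 instead of the 2.00 it would cost if usage were rounded up to a full hour.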

European Data Sovereignty & Security

Run your GPU workloads on a trusted European Cloud Provider. With Exoscale, all data stays within EU borders, fully GDPR-compliant and protected by strict security standards.

Powerful NVIDIA GPU Cloud Computing for demanding workloads

Running compute-intensive tasks requires the right GPU infrastructure. Exoscale Cloud GPUs make deploying and managing instances simple, efficient, and secure. Whether you need a single server for AI training or a scalable cloud setup for complex simulations, you stay in control with fast provisioning, reliable performance, and privacy-focused European data centers. Exoscale’s GPU instances are ready to integrate with the tools you already use, supporting popular frameworks like TensorFlow and PyTorch right out of the box. Optimize costs and performance with our per-second billing model—pay only for the computational power you need, exactly when you need it.
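As a quick sanity check after provisioning, a framework can be asked which GPUs it sees. The sketch below assumes PyTorch is installed on the instance (the page does not say which framework versions ship with which template) and returns an empty list if it is not. The function name `visible_gpus` is illustrative.

```python
import importlib.util


def visible_gpus() -> list[str]:
    """Return the names of the GPUs PyTorch can use, or [] if none are usable."""
    if importlib.util.find_spec("torch") is None:
        return []  # PyTorch is not installed in this environment
    import torch

    if not torch.cuda.is_available():
        return []  # no usable CUDA device or driver
    return [torch.cuda.get_device_name(i) for i in range(torch.cuda.device_count())]


if __name__ == "__main__":
    gpus = visible_gpus()
    if gpus:
        for index, name in enumerate(gpus):
            print(f"cuda:{index} -> {name}")
    else:
        print("No CUDA-capable GPU visible to PyTorch")
```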

Use your favorite AI and Machine Learning framework on Exoscale's on-demand GPU instances

Find the Right NVIDIA GPU for Your Workload

Every project has unique demands. That’s why we offer a range of NVIDIA GPU instances, from models optimized for AI and machine learning to powerful servers built for high-performance computing (HPC) and complex visual rendering.


Find the perfect balance of performance and price for your application, and count on our reliable infrastructure to drive your work forward with confidence. Explore our GPU cloud server and find your perfect match.

GPU A30

Powered by NVIDIA A30. Perfect for AI inference, high-performance computing (HPC), and data-analytics workloads.

GPU2

Based on NVIDIA Tesla V100. Ideal for deep learning, neural-network training, and advanced AI workloads.

GPU3

Powered by NVIDIA A40, the all-rounder for AR/VR, complex simulations, rendering, AI, and more.

GPU 3080ti

NVIDIA GeForce RTX 3080 Ti is excellent for deep-learning model training, image processing, NLP, and more. 100% liquid-cooled with heat-reuse technology.

GPU A5000

Entry-level all-rounder leveraging NVIDIA RTX A5000. Fully liquid-cooled with heat-reuse for sustainable accelerated computing. Great for AR/VR, simulations, rendering, and AI.

GPU RTX 6000 - coming soon

NVIDIA RTX Pro 6000: ultimate power for AI and graphics, delivering cutting-edge rendering, massive memory, and breakthrough performance.

GPU B300 - coming soon

NVIDIA HGX B300, next-generation performance for AI and HPC. Expect FP4 precision, massive GPU memory, and unmatched throughput.

Contact Us

Combine GPU Cloud Computing with other Exoscale products

Extend your infrastructure with services that work seamlessly with your GPU servers in Europe. From managed Kubernetes (SKS) to object storage (SOS) and compute instances, everything integrates with minimal configuration effort.

Simple Object Storage

Use a highly scalable and S3-compatible object storage solution for unstructured data. Ideal for storing backups, logs, static assets, or media, fully integrated with Exoscale regions and access-controlled via API.
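Because the storage is S3-compatible, a standard S3 client can be pointed at it. The following is a sketch under assumptions: it uses the third-party `boto3` library, and the `https://sos-<zone>.exo.io` endpoint pattern and the helper names are assumptions to verify against the Exoscale documentation before use.

```python
def sos_client_config(zone: str, access_key: str, secret_key: str) -> dict:
    """Build keyword arguments for an S3 client aimed at one storage zone."""
    zone = zone.lower()
    return {
        "service_name": "s3",
        # Assumed endpoint pattern; confirm it in the provider documentation.
        "endpoint_url": f"https://sos-{zone}.exo.io",
        "region_name": zone,
        "aws_access_key_id": access_key,
        "aws_secret_access_key": secret_key,
    }


def make_sos_client(zone: str, access_key: str, secret_key: str):
    """Create the client; requires `pip install boto3`."""
    import boto3  # imported lazily so the config helper works without boto3

    return boto3.client(**sos_client_config(zone, access_key, secret_key))


# Example usage (bucket and file names are placeholders):
#   client = make_sos_client("ch-gva-2", "<access key>", "<secret key>")
#   client.upload_file("dataset.tar", "my-bucket", "datasets/dataset.tar")
```

Keeping credentials out of source code, for example by reading them from environment variables, is advisable in any real setup.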

Cloud Block Storage

Attach flexible, high-performance volumes to your VM instances for persistent data, fast I/O, and scalable capacity. Ideal for databases, log storage, and container environments.

Hosted & Managed Kubernetes

Scalable Kubernetes Service

Deploy containerized applications on a production-ready hosted and managed Kubernetes cluster in under two minutes. Use SKS as the orchestration layer for your application pods, using GPU virtual machine instances as underlying nodes.
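To illustrate how a pod lands on a GPU node, here is a sketch that generates a Kubernetes Pod manifest requesting one GPU. It assumes the cluster exposes GPUs through the standard `nvidia.com/gpu` extended resource (as the NVIDIA device plugin does); the pod name and container image are placeholders, not values from this page.

```python
import json


def gpu_pod_manifest(name: str, image: str, gpus: int = 1) -> dict:
    """Return a Pod manifest whose single container requests `gpus` GPUs."""
    if gpus < 1:
        raise ValueError("gpus must be at least 1")
    return {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": name},
        "spec": {
            "restartPolicy": "Never",
            "containers": [
                {
                    "name": name,
                    "image": image,
                    # Prints the GPUs visible inside the container, then exits.
                    "command": ["nvidia-smi"],
                    # GPUs are requested in whole units under `limits`.
                    "resources": {"limits": {"nvidia.com/gpu": gpus}},
                }
            ],
        },
    }


if __name__ == "__main__":
    # Placeholder image reference; substitute a CUDA-enabled image you trust.
    manifest = gpu_pod_manifest("gpu-check", "registry.example/cuda-image:tag")
    print(json.dumps(manifest, indent=2))
```

The JSON output can be saved to a file and applied with `kubectl apply -f`, since kubectl accepts JSON as well as YAML.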

Key Features of Exoscale GPU Server in Europe

Find the Right GPU for Your Cloud Computing

| Instance type | GPU A30 | GPU2 | GPU3 | GPU A5000 | GPU 3080ti | RTX Pro 6000 |
| --- | --- | --- | --- | --- | --- | --- |
| Graphics card | A30 | Tesla V100 | A40 | A5000 | 3080ti | RTX Pro 6000 |
| CUDA cores per card | 10752 | 5120 | 10752 | 8192 | 10240 | 24064 |
| Tensor cores per card | 336 | 640 | 336 | 256 | 320 | 752 |
| RT cores per card | 0 | 0 | 84 | 84 | 80 | 188 |
| GPU memory per card | 24 GB | 16 GB | 48 GB | 24 GB | 12 GB | 96 GB |
| GPU cards | 1-4 | 1-4 | 1-8 | 1-4 | 1-8 | 1-8 |
| CPU cores | 12-48 | 12-48 | 12-96 | 12-48 | 12-48 | 36-288 |
| RAM per instance | 56-225 | 56-225 | 56-448 | 56-224 | 56-224 | 120-960 |
| SSD local storage | max. 1.6 TB (SATA) | max. 1.6 TB (SATA) | max. 1.6 TB (NVMe) | max. 1.6 TB (SATA) | max. 1.6 TB (NVMe) | max. 10 TiB (NVMe) |
| Zones | CH-GVA-2 | AT-VIE-1 | DE-FRA-1 | AT-VIE-2 | AT-VIE-2 | DE-FRA-1, HR-ZAG-1, CH-DK-2 |
| Usable for | AI Inference, Data Analytics, HPC, Simulations | AI Training, Deep Learning, Scientific Research | AI Inference, Rendering, Simulation, Visualization | AI Inference, 3D Rendering, Visualization, Creative Workloads | AI Inference, VDI, HPC, Simulations, Data Analytics | AI Inference, Simulations, Visualization, Advanced Rendering, Digital Twins |
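One way to read the comparison is to start from the GPU memory a workload needs. The sketch below encodes only the "GPU memory per card" and "GPU cards" rows and lists which instance types could cover a given requirement. It assumes the workload can be split across several cards and that any card count within the listed range can be ordered; both are simplifications, and the helper names are illustrative.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class GpuType:
    name: str
    memory_gb: int   # GPU memory per card
    max_cards: int   # upper end of the "GPU cards" range


CATALOG = [
    GpuType("GPU A30", 24, 4),
    GpuType("GPU2", 16, 4),
    GpuType("GPU3", 48, 8),
    GpuType("GPU A5000", 24, 4),
    GpuType("GPU 3080ti", 12, 8),
    GpuType("RTX Pro 6000", 96, 8),
]


def candidates(required_gb: float) -> list[tuple[str, int, int]]:
    """Return (type, cards, total_gb) options, least total memory first."""
    options = []
    for gpu in CATALOG:
        cards = max(1, math.ceil(required_gb / gpu.memory_gb))
        if cards <= gpu.max_cards:
            options.append((gpu.name, cards, cards * gpu.memory_gb))
    # Prefer less total memory, then fewer cards.
    return sorted(options, key=lambda option: (option[2], option[1]))


if __name__ == "__main__":
    for name, cards, total in candidates(40):
        print(f"{name}: {cards} card(s), {total} GB total")
```

Memory is only one criterion; zone availability, core counts, and price also matter when choosing.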

Why choose a GPU Server on Exoscale

  • Shared or dedicated Hypervisors
  • Large high-performance SSD/NVMe Storage
  • No resource sharing
  • Cutting-edge NVIDIA GPU technology
  • Latest NVIDIA graphics cards
  • Direct GPU pass-through access
  • Complete cloud platform integration
  • Full Terraform automation support
  • Comprehensive API management support

Level Up with NGC on Exoscale GPUs

Combine the simplicity and scalability of the Exoscale Cloud with the power of NVIDIA GPUs. With a Docker-based template or SKS, you can access the full potential of the NVIDIA GPU Cloud (NGC) and significantly reduce time to solution.


NVIDIA GPU Cloud (NGC) provides a selected set of GPU-optimized software for artificial intelligence applications, visualizations, and HPC. The NGC Catalog includes containers, pre-trained models, Helm charts for Kubernetes deployments, and specific AI toolkits with SDKs.


The NGC catalog works with both Exoscale Compute Instances and Exoscale Scalable Kubernetes Service. The image template for Kubernetes worker nodes embeds a GPU-optimized version of the container runtime, so you can start any application from the catalog quickly without fiddling with drivers and CUDA versions.

GPU Cloud Computing trusted by teams across Europe

Reliable GPU servers are essential for demanding workloads. Exoscale enables developers, research teams, and companies across Europe to run complex applications securely and efficiently. Power up your projects by seamlessly connecting your GPU instances with our full ecosystem of services, including our S3-compatible Object Storage for your datasets and our fully managed databases.

Contact us
Choose from a Wide Selection of Officially Supported Templates

Explore more Exoscale services

Extend your GPU Cloud Computing with complementary offerings that help you achieve higher availability, better performance, and expert support for any workload. Exoscale has the right service to support your project’s growth.

Database as a Service (DBaaS)

Simplify data management with managed databases, including PostgreSQL, MySQL, Kafka, OpenSearch, Valkey, and Grafana, designed to integrate seamlessly with your applications. Use their vector extensions, such as pgvector or vector search, for retrieval-augmented generation (RAG) at no additional cost.

Compute Instances

Flexible virtual machines optimized for general-purpose, memory-intensive, CPU-bound applications or GPU workloads. Combine with your Kubernetes workloads to scale efficiently across all use cases.

Support Plans

Run your infrastructure with confidence through flexible support plans that provide expert guidance and guaranteed response times (SLA), so our experts are there when you need them most.

Frequently asked questions about GPU Cloud Computing

What is a GPU server?

A GPU server is a cloud or physical server equipped with one or more graphics processing units (GPUs). Unlike standard servers, it is optimized for parallel processing tasks such as machine learning, 3D rendering, and scientific computing.

Which workloads benefit most from GPU Cloud Computing on Exoscale?

Exoscale GPUs accelerate workloads like AI/ML training, deep-learning inference, 3D rendering, video processing, and scientific simulations. They deliver the parallel processing power needed for compute-intensive applications, all hosted in secure European data centers.

What kind of GPU instances does Exoscale offer?

Exoscale provides cloud servers with NVIDIA GPUs (A30, V100, A40, A5000, 3080ti, and soon RTX Pro 6000 and B300) for high-performance computing. You can choose the cloud server that fits your AI, research, or visual workload best.

How can I run Large Language Models (LLMs) on Exoscale GPU instances?

Exoscale GPU instances are well suited to deploying and fine-tuning Large Language Models such as Llama 3, GPT-OSS, Qwen3, or Mistral. You can choose from powerful NVIDIA GPUs like the A30 or RTX A5000 to handle demanding inference and training workloads. Our detailed step-by-step guide, "Running LLMs on GPUs", shows how to prepare your environment, launch a GPU instance, and deploy open-source models efficiently within our sovereign European cloud.
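A rough first check when sizing a GPU for an LLM is the memory needed just to hold the model weights: parameter count times bytes per parameter. The sketch below is a rule-of-thumb estimate only. It ignores the KV cache, activations, and framework overhead, so real requirements are higher, and the function and table names are illustrative.

```python
# Bytes needed to store one parameter at common precisions.
BYTES_PER_PARAM = {"fp32": 4.0, "fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(params_billions: float, precision: str = "fp16") -> float:
    """Approximate memory (decimal GB) needed for the model weights alone."""
    if precision not in BYTES_PER_PARAM:
        raise ValueError(f"unknown precision: {precision}")
    # (params_billions * 1e9 params) * bytes / 1e9 bytes per GB
    return params_billions * BYTES_PER_PARAM[precision]


if __name__ == "__main__":
    for size, precision in [(8, "fp16"), (70, "fp16"), (70, "int4")]:
        gb = weight_memory_gb(size, precision)
        print(f"{size}B parameters at {precision}: about {gb:g} GB for weights")
```

By this estimate, an 8-billion-parameter model in FP16 needs about 16 GB for weights, which leaves some headroom on a single 24 GB card, while a 70-billion-parameter model needs quantization or several cards.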