Register

Key benefits of our Nvidia A40 GPU

speed up graphics and simulation workflows

Speed Up Graphics & Simulation Workflows

Improve performance for compute-heavy workloads, like complex 3D computer-aided design (CAD) or computer-aided engineering (CAE) and real-time rendering.

massive AI datasets processing

Process Massive Datasets for AI

Tackle large-scale data science projects, complex simulations, and AI model training with ease, thanks to the NVIDIA A40's 48 GB of ultra-fast GDDR6 memory per card.

Optimal Cost-to-Performance Ratio

Optimal Cost-to-Performance Ratio

Experience premium visual computing and AI acceleration at a highly competitive cost. Exoscale offers flexible GPU3 A40 Nvidia instance types, ensuring you only pay for the compute power you need.

GPU3 based on NVIDIA A40

Our GPU3 is based on NVIDIA A40 graphic cards known for powerful visual computing capabilities. The Ampere GPU architecture is opening a new era in performance and multi-workload capabilities. By combining the latest Ampere RT Cores, Tensor Cores and CUDA Cores with 48 GB of graphics memory, the NVIDIA A40 delivers a unique set for visual computing workloads and scalable GPU Cloud Computing scenarios.


This makes the A40 great for advanced visuals, speeding up demanding jobs like realistic rendering and creating high-quality content twice as fast. Plus, its improved Tensor Cores significantly boost AI and deep learning training.

Start your A40 GPU3 Instance

Stay on Top of the Latest Visual Developments

NVIDIA A40 is your starting point for many of the latest visual technical developments. Whether it is cave automatic virtual environments (CAVEs), broadcast-grade streaming, working with multiple video streams, or immersive AR and VR, you are well-prepared with GPU A40 instances. The A40’s capabilities align perfectly with the growing trends in immersive experiences and real-time content generation across Europe.

Rendering Made Easy

Make photorealistic rendering, architectural design evaluations or virtual prototyping faster than ever. The NVIDIA A40’s second-generation RT Cores accelerate ray-traced motion blur and complex scene rendering in significantly reduced timeframes. With a 2 times throughput increase over previous generations, GPU3 instances deliver a massive performance leap for all your rendering needs.

Accelerate simulation and engineering workflows

NVIDIA GPU A40 instances in Europe provide the memory and compute power needed for large-scale simulation, CAE, and engineering visualization—ideal for industries like automotive, aerospace, and manufacturing.

Why GPU A40 on Exoscale

dedicated compute instance
  • Shared or dedicated Hypervisors
  • Large SSD Storage
  • No resource sharing
gpu A40 picto
  • Cutting-edge GPU A40 technology
  • NVIDIA A40 cards
  • Direct GPU pass-through access
cloud orchestration picto
  • Complete cloud platform integration
  • Full Terraform automation support
  • Comprehensive API management support

Discover the best cost-to-performance ratio

Our GPU3, based on NVIDIA A40, comes at a fairly priced cost to performance ratio. We provide four distinct instance options, scaling from 1 to 8 NVIDIA A40 GPUs, each paired with high-performance NVMe SSD storage ranging from 100 GB up to 1.6 TB, depending on your chosen instance type. This flexibility ensures you can optimize your infrastructure for both budget and computational demands.

RAM CPU Cores GPU Cards Min Local Storage Max Local Storage Price / Hour ({{ currency | uppercase }})
Small 56 GB 12 Cores 1 GPU 100 GB 800 GB {{ prices.opencompute.gpu3.small[currency] | number:8 }}
Medium 120 GB 24 Cores 2 GPU 100 GB 1.2 TB {{ prices.opencompute.gpu3.medium[currency] | number:8 }}
Large 224 GB 48 Cores 4 GPU 100 GB 1.6 TB {{ prices.opencompute.gpu3.large[currency] | number:8 }}
Huge 448 GB 96 Cores 8 GPU 100 GB 1.6 TB {{ prices.opencompute.gpu3.huge[currency] | number:8 }}
  1. [1]

    Local Storage is not included in the displayed Instances price, and has a cost of {{ prices.opencompute.volume[currency] | number:8 }} {{ currency | uppercase }} / GiB hour.

  2. [2]

    Instance needs to be shutdown for hypervisor and platform updates as it cannot be live-migrated.

  3. [3]

    Available in the following Zones: DE-FRA-1

  4. [4]

    GPU3 Large and Huge Instances are only available on dedicated hypervisors.

  5. [5]

    Please note that GPU instances require account validation: access is provided with priority to established businesses, and is granted after a manual screening process.

Level Up with NGC on Exoscale NVIDIA A40 GPU Instances

Combine the simplicity and scalability of the Exoscale Cloud with the power of NVIDIA A40 GPUs. With our Docker based template you can access the full potential of the NVIDIA GPU Cloud (NGC) and significantly reduce time to solution.


NVIDIA GPU Cloud (NGC) provides a selected set of GPU-optimized software for artificial intelligence applications, visualizations and HPC. The NGC Catalog includes containers, pre-trained models, Helm charts for Kubernetes deployments and specific AI toolkits with SDKs.


NGC catalog works with both Exoscale Compute Instances and Exoscale Hosted & Managed Kubernetes. The image template for Kubernetes worker nodes embeds the optimized version of the container runtime for NVIDIA A40 GPUs—so you can launch workloads without worrying about drivers or CUDA compatibility.

Learn more

GPU A40 Features

Technical Specifications GPU3

Description Specifications
Graphic Card A40
Cuda Cores per card 10572
Tensor Cores per card 336
GPU memory per card 48 GB
GPU cards 1-8
CPU cores 12-96
RAM per instance 56-448
SSD local storage max. 1.6 TB (NVMe)
Zone DE-FRA-1
Works with Compute
SKS
Docker NVIDIA

Resources

Portal

Get started in our integrated environment with just a few clicks.
Choose from a Wide Selection of Officially Supported Templates
Multiple OS images support picto

Trusted across Europe: Power graphics & AI with NVIDIA A40

When running graphics-intensive, rendering, or AI inference workloads in the cloud, having a trusted partner makes all the difference. Our engineers have supported customers across Europe in deploying and scaling NVIDIA A40 GPU workloads with Exoscale.

Contact us

More GPU Instances

Discover more GPU Instances for Cloud Computing to power diverse compute, graphics, and AI tasks. Fully integrated with the Exoscale ecosystem.

GPU A30 on Exoscale

GPU A30

Powered by NVIDIA A30. Perfect for AI inference, high-performance computing (HPC), and data-analytics workloads.

Discover
Tesla V100 on Exoscale

GPU2

Based on NVIDIA Tesla V100. Ideal for deep learning, neural-network training, and advanced AI workloads.

Discover
GPU 3080ti on Exoscale

GPU 3080ti

NVIDIA GeForce RTX 3080 Ti is excellent for deep-learning model training, image processing, NLP, and more. 100 % liquid cooled with heat-reuse technology.

Discover
GPU A5000 on Exoscale

GPU A5000

Entry-level all-rounder leveraging NVIDIA RTX A5000. Fully liquid cooled with heat-reuse for sustainable accelerated computing. Great for AR/VR, simulations, rendering, and AI.

Discover
GPU RTX 6000 on Exoscale

GPU RTX 6000 - coming soon

NVIDIA RTX Pro 6000, ultimate power for AI and graphics. Delivering cutting-edge rendering, massive memory, and breakthrough performance.

Discover
GPU B300 on Exoscale

GPU B300 - coming soon

NVIDIA HGX B300, next-generation performance for AI and HPC. Expect FP4 precision, massive GPU memory, and unmatched throughput.

Contact Us

Frequently asked questions about NVIDIA A40

What workloads are best suited for the NVIDIA A40?

The NVIDIA A40 is ideal for visual computing workloads such as 3D rendering, AR/VR environments, engineering simulation, and high-end content creation. It combines Tensor, CUDA, and RT cores with 48 GB of memory to handle graphics-intensive and compute-heavy tasks efficiently.

How does the NVIDIA A40 compare to previous generations?

Compared to older GPUs, the NVIDIA A40 offers significantly improved performance—especially in ray tracing, deep learning inference, and simulation. It delivers up to 2-times the throughput for workloads such as ray-traced motion blur or large-scale data visualization.

Can I combine NVIDIA A40 GPU instances with other Exoscale services?

Yes. You can integrate NVIDIA A40 GPU instances with Exoscale services such as SKS (Managed Kubernetes), DBaaS, and Object Storage. Full Terraform and API support allow for seamless orchestration of your infrastructure.

How can I scale NVIDIA A40 GPU workloads on Exoscale?

You can easily add or remove A40 GPU instances to match your workload needs. While the GPU type of an existing instance cannot be changed directly, you can deploy additional instances or adjust resources using Terraform, API, or Managed Kubernetes (SKS).