Sharon AI Launches Cloud Media Tech with On-Demand GPU Service

NVIDIA L40 GPU

The NVIDIA L40 GPU, built on the NVIDIA Ada Lovelace architecture, delivers cutting-edge graphics, compute, and AI performance for modern data center workloads. With 48GB of GDDR6 memory, third-generation RT cores, and fourth-generation Tensor cores, it is optimized for virtual workstations, AI training, rendering, and complex visual computing tasks, making it ideal for enterprises requiring scalable performance across a wide range of applications.

Basic Product Information

Product Name

NVIDIA L40 GPU

Architecture

NVIDIA Ada Lovelace

Memory

48GB GDDR6 with ECC

Compute Power

Up to 362 TFLOPS (FP8 Tensor Core)

Release Year

2023

Use Cases

Virtual workstations, AI training, 3D rendering, data science, visual computing

Key Advantages

48GB GDDR6 Memory

Ideal for memory-intensive tasks like 3D modeling and large-scale simulation.

Third-Generation RT Cores

Enhanced ray-tracing performance for lifelike designs and real-time animations.

Fourth-Generation Tensor Cores

Faster AI training with optimized TF32 and support for structural sparsity.

Data Center Ready

Designed for 24×7 enterprise operations with power efficiency, secure boot, and NEBS Level 3 compliance.

vGPU Support

Distribute GPU resources efficiently with NVIDIA vGPU software for multiple users.

Specifications

Performance Specifications

CUDA Cores

18,176

RT Cores

142 (Third-Generation)

Tensor Cores

568 (Fourth-Generation)

RT Core Performance

209 TFLOPS

FP32 Performance

90.5 TFLOPS

Tensor Core Performance:

TF32

90.5 | 181 TFLOPS (with sparsity)

BFLOAT16

181.05 | 362.1 TFLOPS (with sparsity)

FP16

181.05 | 362.1 TFLOPS (with sparsity)

FP8

362 | 724 TFLOPS (with sparsity)

INT8

724 TOPS

INT4

1448 TOPS

Memory and Bandwidth

GPU Memory

48GB GDDR6 with ECC

Memory Bandwidth

864GB/s

Thermal and Power

Max Power Consumption

300W

Cooling

Passive cooling

Power Connector

16-pin

Board Specifications

Form Factor

Dual-slot (4.4” H x 10.5” L)

Interconnect Interface

PCIe Gen4 x16 (64GB/s bi-directional)

Display Outputs

4x DisplayPort 1.4a

NVENC/NVDEC

3x NVENC / 3x NVDEC (with AV1 encode & decode support)

Thermal Solution

Passive

vGPU Software Support

Yes (refer to Virtual GPU Licensing Guide for specific profiles)

Secure Boot with Root of Trust

Yes

NEBS Ready

Level 3

MIG Support

No

NVLink Support

No

Supported Technologies

Virtual GPU (vGPU)

Supported for multi-user environments

Secure Boot with Root of Trust

Yes

NEBS Ready

Level 3 compliant

NVLink and MIG Support

No

Server Compatibility

Available in a wide variety of NVIDIA-Certified Systems™ from leading OEM vendors, making it adaptable to a range of data center configurations.

Additional Features

01 Third-Generation RT Cores

Enhance ray-tracing and shading capabilities for rendering in design, architecture, and simulation workflows.

02 Fourth-Generation Tensor Cores

Provide optimized AI training with support for sparsity and TF32 format.

03 vGPU Support

Allocate GPU memory to multiple users or teams, enabling large workloads to be distributed efficiently across virtual workstations.

04 Data-Center Optimized

Engineered for 24/7 operations, with secure boot and NEBS Level 3 compliance for robust enterprise performance.

05 Virtual GPU (vGPU) Software Support

NVIDIA vPC/vApps, NVIDIA RTX Virtual Workstation, NVIDIA Virtual Compute Server

Want to learn more?

Let's call you back

By clicking the “submit” button, you agree to and accept our Terms & Conditions and Privacy Policy