Red Hat cozied up even further with NVIDIA yesterday, certifying its Enterprise Linux platform on the GPU vendor’s DGX-1 machine learning boxes.
The announcement makes it easier for enterprises to manage their machine learning training on their own premises, the Linux vendor said.
Under the deal, existing Red Hat Enterprise Linux subscriptions are eligible for use on DGX-1 systems, and DGX-1 users gain access to applications certified for Red Hat's Linux platform. Red Hat is going beyond certification by optimizing its Linux for the DGX-1 with tuned profiles for the NVIDIA platform, building on the tuned package it introduced in Red Hat Enterprise Linux 6. The company has said in the past that tuned profiles can boost performance by double-digit percentages.
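For readers unfamiliar with tuned, profiles are switched with the `tuned-adm` command-line tool. The Python sketch below simply wraps that tool to show the mechanism; the profile name is a placeholder, since the announcement does not name the profile Red Hat ships for the DGX-1.

```python
import subprocess

def show_active_profile() -> str:
    """Return the currently active tuned profile, e.g. 'throughput-performance'."""
    out = subprocess.run(["tuned-adm", "active"],
                         capture_output=True, text=True, check=True)
    return out.stdout.strip()

def apply_profile(name: str) -> None:
    """Switch the system to the given tuned profile (needs root and the tuned service)."""
    subprocess.run(["tuned-adm", "profile", name], check=True)

if __name__ == "__main__":
    print(show_active_profile())
    # Placeholder name: substitute the profile Red Hat provides for the
    # DGX-1, which is not named in the announcement.
    apply_profile("accelerator-performance")
```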
Red Hat’s hope is that it will become the control layer of choice for companies training their machine learning models on NVIDIA’s workstation and server boxes. This hardware targets companies with enough AI training workload to crunch their models on their own premises rather than using third-party cloud resources to do it.
The relationship also extends to containers. Red Hat has brought Kubernetes capabilities to DGX-1 users in the form of the Red Hat OpenShift Container Platform, using Kubernetes' device plug-in capability to support NVIDIA GPUs.
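From a user's point of view, the device plug-in support means a pod can request GPUs as an extended resource named `nvidia.com/gpu`. The sketch below uses the official Kubernetes Python client to ask for one GPU for a hypothetical training container; the image name and namespace are placeholders, and the NVIDIA device plug-in is assumed to be running on the cluster.

```python
from kubernetes import client, config

# Load credentials from the local kubeconfig; the same client code works
# against an OpenShift cluster.
config.load_kube_config()

# Hypothetical training container; image and namespace are placeholders.
container = client.V1Container(
    name="trainer",
    image="example.registry/team/trainer:latest",
    resources=client.V1ResourceRequirements(
        # The NVIDIA device plug-in advertises GPUs as the extended
        # resource "nvidia.com/gpu"; this pod asks the scheduler for one.
        limits={"nvidia.com/gpu": "1"},
    ),
)

pod = client.V1Pod(
    metadata=client.V1ObjectMeta(name="gpu-trainer"),
    spec=client.V1PodSpec(containers=[container], restart_policy="Never"),
)

client.CoreV1Api().create_namespaced_pod(namespace="default", body=pod)
```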
The two companies will also collaborate on further open-source initiatives. NVIDIA offers its own container platform called NVIDIA GPU Cloud (NGC), which includes a catalogue of software containers optimized for deep learning workloads on NVIDIA hardware, covering frameworks and libraries including TensorFlow, PyTorch, MXNet and TensorRT. These containers ship with the NVIDIA CUDA toolkit and its deep learning libraries, and they are now available on Red Hat OpenShift, the software company said.
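To give a sense of what using one of those images looks like outside OpenShift, the sketch below uses the Docker SDK for Python to run an NGC TensorFlow image with GPU access. The image tag is a placeholder to be picked from the NGC catalogue, and the NVIDIA container runtime is assumed to be installed; on OpenShift the same image would normally be referenced from a pod spec instead.

```python
import docker

client = docker.from_env()

# NGC images are published under the nvcr.io registry; the tag here is a
# placeholder -- choose a current one from the NGC catalogue.
image = "nvcr.io/nvidia/tensorflow:placeholder-tag"

# Expose all host GPUs to the container (requires the NVIDIA container runtime).
gpu_request = docker.types.DeviceRequest(count=-1, capabilities=[["gpu"]])

output = client.containers.run(
    image,
    command=["python", "-c", "import tensorflow as tf; print(tf.__version__)"],
    device_requests=[gpu_request],
    remove=True,
)
print(output.decode())
```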
The two companies will also continue to work together on heterogeneous memory management (HMM), a Linux kernel feature that lets devices such as GPUs access and mirror the contents of system memory in their own address space. This improves the performance of applications using GPUs, Red Hat said.
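HMM itself is a kernel-level facility, but the effect it supports is easiest to see through CUDA's managed (unified) memory, where host and device code touch the same allocation. The Numba sketch below illustrates only that shared-memory idea, not the HMM API itself, and assumes an NVIDIA GPU with the CUDA toolkit installed.

```python
import numpy as np
from numba import cuda

@cuda.jit
def double_in_place(arr):
    # Each GPU thread doubles one element of the shared allocation.
    i = cuda.grid(1)
    if i < arr.size:
        arr[i] *= 2.0

# Managed memory: a single allocation visible to both CPU and GPU, the same
# kind of host/device sharing that HMM lets the kernel extend to ordinary
# system memory.
data = cuda.managed_array(1024, dtype=np.float64)
data[:] = np.arange(1024)                 # written by the CPU

threads = 128
blocks = (data.size + threads - 1) // threads
double_in_place[blocks, threads](data)    # modified by the GPU
cuda.synchronize()

print(data[:5])                           # read back by the CPU: [0. 2. 4. 6. 8.]
```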
The two companies have long been close, working on technologies ranging from video drivers to Kubernetes. They have spent the past two years in the Kubernetes Resource Management Working Group, helping the container system handle performance-sensitive workloads.