Nvidia’s new CUDA 4.0 release makes parallel programming easier, reports the graphics giant. And the new 4.0 version of the toolkit adds support for Mac OS X. The Unified Virtual Addressing, GPU-to-GPU Communication, and Enhanced C++ Template Libraries enable more developers to take advantage of GPU computing.
A quick summary of the latest key elements of the CUDA 4.0 include:
- Nvidia GPUDirect 2.0 Technology — offers support for peer-to-peer communication between GPUs within a single server or workstation. This enables easier and faster multi-GPU programming and application performance
- Unified Virtual Addressing (UVA) — provides a single merged-memory address space for the main system memory and the GPU memories, enabling quicker and easier parallel programming
- Thrust C++ Template Performance Primitives Libraries — provides a collection of powerful open source C++ parallel algorithms and data structures that ease programming for C++ developers. With Thrust, routines such as parallel sorting are 5x to 100x faster
The new Mac OS X support includes new features in CUDA-GDB. A release candidate for CUDA Toolkit 4.0 will be available free of charge beginning March 4, 2011. You can enroll in the CUDA developer program here: www.nvidia.com/paralleldeveloper
To learn more about CUDA go here.