NVIDIA CUDA 1.0 released
Monday 25th June 2007, 09:09:00 PM, written by Tim
NVIDIA has released version 1.0 of its CUDA programming framework, with
a large number of new features including asynchronous kernel calls and 64-bit
Linux support. Like 0.8, there seems to be support for Red Hat Enterprise Linux, OpenSUSE and SUSE Linux Enterprise Desktop, as well as 32-bit Windows XP.
If you look at the release notes, you'll see a number of interesting things as well as the new features. First, there's all the new device support (all G80s as well as G84--no G86, though). There's a brief mention of PTX, which is the intermediate ISA generated by the CUDA compiler. There are also numerous improvements in the FFT and BLAS libraries, plus more performance and stability improvements.
Asynchronous kernels have new calling conventions, so you should definitely check out the programming guide to see how to use that. Also note that there are now two different versions of CUDA-capable chips: G80 is v1.0, G84 is v1.1. At the moment, the only feature that seems to separate the two are atomic functions, which are only available in 1.1. However, this does mean that you could write a global mutex for your G84 (there is atomic compare-and-swap)...
Look for more CUDA coverage soon; we'll be exploring version 1.0 as well!
If you look at the release notes, you'll see a number of interesting things as well as the new features. First, there's all the new device support (all G80s as well as G84--no G86, though). There's a brief mention of PTX, which is the intermediate ISA generated by the CUDA compiler. There are also numerous improvements in the FFT and BLAS libraries, plus more performance and stability improvements.
Asynchronous kernels have new calling conventions, so you should definitely check out the programming guide to see how to use that. Also note that there are now two different versions of CUDA-capable chips: G80 is v1.0, G84 is v1.1. At the moment, the only feature that seems to separate the two are atomic functions, which are only available in 1.1. However, this does mean that you could write a global mutex for your G84 (there is atomic compare-and-swap)...
Look for more CUDA coverage soon; we'll be exploring version 1.0 as well!
Tagging
nvidia ± cuda, gpgpu
Related nvidia News
CUDA 4.0 and Parallel Nsight 2.0 released
NVIDIA Fermi GPU and Architecture Analysis
NVIDIA's Parallel Nsight finally released
NVIDIA GeForce GTX 460 - GF104 breaks cover
PhysX87, ancient tragedy in 5 acts by RWT
So long, Chris, and thanks for all the fish
NVIDIA GF100 graphics architecture details
NVIDIA Fermi: new GPU architecture, starting with GF100
NVIDIA release OpenCL GPU drivers for Linux and Windows
NVIDIA GeForce GTX 275 at $250 to fight HD 4890
NVIDIA Fermi GPU and Architecture Analysis
NVIDIA's Parallel Nsight finally released
NVIDIA GeForce GTX 460 - GF104 breaks cover
PhysX87, ancient tragedy in 5 acts by RWT
So long, Chris, and thanks for all the fish
NVIDIA GF100 graphics architecture details
NVIDIA Fermi: new GPU architecture, starting with GF100
NVIDIA release OpenCL GPU drivers for Linux and Windows
NVIDIA GeForce GTX 275 at $250 to fight HD 4890