CUDA C/C++
Dec 05, 2022
Upcoming Workshop: Fundamentals of Accelerated Computing with CUDA C/C++
Learn the fundamental tools and techniques for accelerating C/C++ applications to run on massively parallel GPUs with CUDA in this instructor-led workshop.
1 MIN READ
Oct 05, 2022
Upcoming Workshop: Fundamentals of Accelerated Computing with CUDA C/C++
Learn tools and techniques for accelerating C/C++ applications to run on massively parallel GPUs with CUDA.
1 MIN READ
Jan 17, 2022
CUDA 11.6 Toolkit New Release Revealed
NVIDIA announces the newest release of the CUDA development environment, CUDA 11.6. This release is focused on enhancing the programming model and performance...
3 MIN READ
Oct 25, 2021
Reducing Application Build Times Using CUDA C++ Compilation Aids
The CUDA 11.5 C++ compiler addresses a growing customer request. Specifically, how to reduce CUDA application build times. Along with eliminating unused...
13 MIN READ
Oct 25, 2021
Revealing New Features in the CUDA 11.5 Toolkit
NVIDIA announces the newest release of the CUDA development environment, CUDA 11.5. CUDA 11.5 is focused on enhancing the programming model and performance of...
11 MIN READ
Apr 15, 2021
Programming Efficiently with the NVIDIA CUDA 11.3 Compiler Toolchain
The CUDA 11.3 release of the CUDA C++ compiler toolchain incorporates new features aimed at improving developer productivity and code performance. NVIDIA is...
15 MIN READ
Jul 22, 2015
Using GPUs to Accelerate Epidemic Forecasting
Originally trained as a veterinary surgeon, Chris Jewell, a Senior Lecturer in Epidemiology at Lancaster Medical School in the UK became interested in epidemics...
12 MIN READ
Jul 20, 2015
GPU-Accelerated Cosmological Analysis on the Titan Supercomputer
Ever looked up in the sky and wondered where it all came from? Cosmologists are in the same boat, trying to understand how the Universe arrived at the structure...
9 MIN READ
Jun 29, 2015
GPU Pro Tip: Fast Great-Circle Distance Calculation in CUDA C++
This post demonstrates the practical utility of CUDA’s sinpi() and cospi() functions in the context of distance calculations on earth. With the advent of...
3 MIN READ
Jun 10, 2015
GPU Pro Tip: Lerp Faster in C++
Linear interpolation is a simple and fundamental numerical calculation prevalent in many fields. It's so common in computer graphics that programmers often use...
2 MIN READ
Feb 12, 2015
Accelerating Bioinformatics with NVBIO
NVBIO is an open-source C++ template library of high performance parallel algorithms and containers designed by NVIDIA to accelerate sequence analysis and...
11 MIN READ
Feb 11, 2015
GPU Pro Tip: Fast Dynamic Indexing of Private Arrays in CUDA
Sometimes you need to use small per-thread arrays in your GPU kernels. The performance of accessing elements in these arrays can vary depending on a number of...
12 MIN READ
Oct 01, 2014
CUDA Pro Tip: Optimized Filtering with Warp-Aggregated Atomics
Note: This post has been updated (November 2017) for CUDA 9 and the latest GPUs. The NVCC compiler now performs warp aggregation for atomics automatically in...
14 MIN READ
Sep 24, 2014
CUDA Pro Tip: Use cuFFT Callbacks for Custom Data Processing
Digital signal processing (DSP) applications commonly transform input data before performing an FFT, or transform output data afterwards. For example, if the...
10 MIN READ
Jun 12, 2014
A CUDA Dynamic Parallelism Case Study: PANDA
This post concludes an introductory series on CUDA dynamic parallelism. In this post, I finish the series with a case study on an online track reconstruction...
11 MIN READ
May 20, 2014
CUDA Dynamic Parallelism API and Principles
This post is the second in a series on CUDA Dynamic Parallelism. In my first post, I introduced Dynamic Parallelism by using it to compute images of the...
13 MIN READ