C++

Mar 06, 2024

How to Accelerate Quantitative Finance with ISO C++ Standard Parallelism

Quantitative finance libraries are software packages that consist of mathematical, statistical, and, more recently, machine learning models designed for use in...

10 MIN READ

Aug 08, 2023

Accelerate 3D Workflows with Modular, OpenUSD-Powered Omniverse Release

The latest release of NVIDIA Omniverse delivers an exciting collection of new features based on Omniverse Kit 105, making it easier than ever for developers to...

7 MIN READ

Apr 20, 2023

Debugging a Mixed Python and C Language Stack

Debugging is difficult. Debugging across multiple languages is especially challenging, and debugging across devices often requires a team with varying skill...

18 MIN READ

Mar 06, 2023

Maximizing Performance with Massively Parallel Hash Maps on GPUs

Decades of computer science history have been devoted to devising solutions for efficient storage and retrieval of information. Hash maps (or hash tables) are a...

19 MIN READ

Illustration of different AI workflows in enterprise settings such as an airports.

Mar 02, 2023

New Course: Scaling GPU-Accelerated Applications with the C++ Standard Library

Learn how to write scalable GPU-accelerated hybrid applications using C++ standard language features alongside MPI in this interactive hands-on self-paced...

1 MIN READ

Jan 09, 2023

Rapidly Build AI-Streaming Apps with Python and C++

The computational needs for AI processing of sensor streams at the edge are increasingly demanding. Edge devices must keep up with high rates of incoming data...

5 MIN READ

Dec 19, 2022

New Course: GPU Acceleration with the C++ Standard Library

Learn how to write simple, portable, parallel-first GPU-accelerated applications using only C++ standard language features in this self-paced course from the...

1 MIN READ

Jul 26, 2022

Accelerating GPU Applications with NVIDIA Math Libraries

There are three main ways to accelerate GPU applications: compiler directives, programming languages, and preprogrammed libraries. Compiler directives such as...

12 MIN READ

May 31, 2022

Improve Guidance and Performance Visualization with the New Nsight Compute

NVIDIA Nsight Compute is an interactive kernel profiler for CUDA applications. It provides detailed performance metrics and API debugging through a user...

3 MIN READ

Feb 24, 2022

Speeding up Numerical Computing in C++ with a Python-like Syntax in NVIDIA MatX

Rob Smallshire once said, "You can write faster code in C++, but write code faster in Python." Since its release more than a decade ago, CUDA has given C and...

6 MIN READ

Nov 10, 2021

NVIDIA GTC: A Complete Overview of Nsight Developer Tools

The Nsight suite of Developer Tools provide insightful tracing, debugging, profiling, and other analyses to optimize your complex computational applications...

6 MIN READ

Aug 10, 2021

Announcing Nsight Deep Learning Designer 2021.1 - A Tool for Efficient Deep Learning Model Design and Development

Nsight Deep Learning Designer 2021.1 Today NVIDIA announced Nsight DL Designer - the first in-class integrated development environment to support efficient...

3 MIN READ

Mar 10, 2021

NVIDIA Tools Extension API: An Annotation Tool for Profiling Code in Python and C/C++

As PyData leverages much of the static language world for speed including CUDA, we need tools which not only profile and measure across languages but also...

9 MIN READ

Image depicting NVIDIA CEO Jen-Hsun Huang explaining the importance of the RAPIDS launch demo at GTC Europe 2018.

Dec 08, 2020

Fast, Flexible Allocation for NVIDIA CUDA with RAPIDS Memory Manager

When I joined the RAPIDS team in 2018, NVIDIA CUDA device memory allocation was a performance problem. RAPIDS cuDF allocates and deallocates memory at high...

24 MIN READ

PCAST helps to quickly isolate divergence between CPU and GPU results so you can isolate bugs or verify your results are OK even if they aren’t identical.

Nov 18, 2020

Detecting Divergence Using PCAST to Compare GPU to CPU Results

Parallel Compiler Assisted Software Testing (PCAST) is a feature available in the NVIDIA HPC Fortran, C++, and C compilers. PCAST has two use cases. The first...

14 MIN READ

Aug 04, 2020

Accelerating Standard C++ with GPUs Using stdpar

Historically, accelerating your C++ code with GPUs has not been possible in Standard C++ without using language extensions or additional libraries: CUDA C++...

19 MIN READ