NVIDIA Technical Blog | News and tutorials for developers, data scientists, and IT admins

Generative AI

Visual Language Models on NVIDIA Hardware with VILA
Generative AI

Turbocharging Meta Llama 3 Performance with NVIDIA TensorRT-LLM and NVIDIA Triton Inference Server
Cybersecurity

Announcing Confidential Computing General Access on NVIDIA H100 Tensor Core GPUs
Conversational AI

New Standard for Speech Recognition and Translation from the NVIDIA NeMo Canary Model
Generative AI

Turbocharge ASR Accuracy and Speed with NVIDIA NeMo Parakeet-TDT

Recent

See all

nearly 100 training labs from GTC available on demand

May 07, 2024

NVIDIA GTC Training Labs On Demand Available Now

Missed GTC or want to replay your favorite training labs? Find it on demand with the NVIDIA GTC Training Labs playlist.

1 MIN READ

May 07, 2024

Enhancing Anomaly Detection in Linux Audit Logs with AI

In cybersecurity, identifying threats swiftly and accurately is paramount to the success of the modern enterprise. Linux audit logs, which record system...

10 MIN READ

May 06, 2024

Intelligent Traffic Offload with Enhanced AI-powered 5G Security for Enterprises

Securing the private 5G and applications at the edge comes with many challenges. Sophisticated AI– and ML-based attack campaigns require security to respond...

6 MIN READ

May 06, 2024

Automating Smart Pick-and-Place with Intrinsic Flowstate and NVIDIA Isaac Manipulator

We are announcing our collaboration with Intrinsic.ai on learning foundation skill models for industrial robotics tasks. Many pick-and-place problems in...

4 MIN READ

Image of a gridded cube with purple and green dots.

May 03, 2024

Explainer: What Is a Vector Database?

A vector database is an organized collection of vector embeddings that can be created, read, updated, and deleted at any point in time. Vector embeddings...

1 MIN READ

Decorative image of VILA and Jetson Orin workflow.

May 03, 2024

Visual Language Intelligence and Edge AI 2.0

VILA is a family of high-performance vision language models developed by NVIDIA Research and MIT. The largest model comes with ~40B parameters and the smallest...

8 MIN READ

May 03, 2024

Visual Language Models on NVIDIA Hardware with VILA

Visual language models have evolved significantly recently. However, the existing technology typically only supports one single image. They cannot reason among...

11 MIN READ

May 01, 2024

Spotlight: Continental and SoftServe Deliver Generative AI-Powered Virtual Factory Solutions with OpenUSD

With automotive consumers increasingly seeking more seamless, connected driving experiences, the industry has increased its focus on connectivity, advanced...

5 MIN READ

Apr 30, 2024

Leverage Mixture of Experts-Based DBRX for Superior LLM Performance on Diverse Tasks

This week’s model release features DBRX, a state-of-the-art large language model (LLM) developed by Databricks. With demonstrated strength in programming and...

3 MIN READ

3 sessions for data scientists to watch from NVIDIA GTC 2024

Apr 29, 2024

Top Data Science Sessions from NVIDIA GTC 2024 Now Available On Demand

At GTC 2024, experts from NVIDIA and our partners shared insights about GPU-accelerated tools, optimizations, and best practices for data scientists. From the...

2 MIN READ

Three reflective green spheres hovering above three white platforms on a neutral background.

Apr 29, 2024

GPU-Powered Windows 365 Cloud PCs with NVIDIA RTX Virtual Workstation for High-End Graphics Workloads

Professional workflows have become more complex with the increased demand for graphics-intensive scenarios. From regular office applications to demanding...

7 MIN READ

Apr 28, 2024

Turbocharging Meta Llama 3 Performance with NVIDIA TensorRT-LLM and NVIDIA Triton Inference Server

We're excited to announce support for the Meta Llama 3 family of models in NVIDIA TensorRT-LLM, accelerating and optimizing your LLM inference performance. You...

9 MIN READ

Generative AI

See all

May 03, 2024

Explainer: What Is a Vector Database?

A vector database is an organized collection of vector embeddings that can be created, read, updated, and deleted at any point in time. Vector embeddings...

1 MIN READ

May 03, 2024

Visual Language Intelligence and Edge AI 2.0

VILA is a family of high-performance vision language models developed by NVIDIA Research and MIT. The largest model comes with ~40B parameters and the smallest...

8 MIN READ

May 03, 2024

Visual Language Models on NVIDIA Hardware with VILA

Visual language models have evolved significantly recently. However, the existing technology typically only supports one single image. They cannot reason among...

11 MIN READ

May 01, 2024

Spotlight: Continental and SoftServe Deliver Generative AI-Powered Virtual Factory Solutions with OpenUSD

With automotive consumers increasingly seeking more seamless, connected driving experiences, the industry has increased its focus on connectivity, advanced...

5 MIN READ

Apr 30, 2024

Leverage Mixture of Experts-Based DBRX for Superior LLM Performance on Diverse Tasks

This week’s model release features DBRX, a state-of-the-art large language model (LLM) developed by Databricks. With demonstrated strength in programming and...

3 MIN READ

Apr 28, 2024

Turbocharging Meta Llama 3 Performance with NVIDIA TensorRT-LLM and NVIDIA Triton Inference Server

We're excited to announce support for the Meta Llama 3 family of models in NVIDIA TensorRT-LLM, accelerating and optimizing your LLM inference performance. You...

9 MIN READ

Apr 26, 2024

New LLM: Snowflake Arctic Model for SQL and Code Generation

Large language models (LLMs) have revolutionized natural language processing (NLP) in recent years, enabling a wide range of applications such as text...

3 MIN READ

Apr 26, 2024

Enhance Text-to-Image Fine-Tuning with DRaFT+, Now Part of NVIDIA NeMo

Text-to-image diffusion models have been established as a powerful method for high-fidelity image generation based on given text. Nevertheless, diffusion models...

10 MIN READ

Apr 25, 2024

Announcing Confidential Computing General Access on NVIDIA H100 Tensor Core GPUs

NVIDIA launched the initial release of the Confidential Computing (CC) solution in private preview for early access in July 2023 through NVIDIA LaunchPad....

3 MIN READ

Decorative image of different workflows against a grey background.

Apr 23, 2024

Democratizing AI Workflows with Union.ai and NVIDIA DGX Cloud

GPUs were initially specialized for rendering 3D graphics in video games, primarily to accelerate linear algebra calculations. Today, GPUs have become one of...

7 MIN READ

Apr 23, 2024

Webinar: Enhance LLMs with RAG and Accelerate Enterprise AI with Pure Storage and NVIDIA

Join Pure Storage and NVIDIA on April 25 to discover the benefits of enhancing LLMs with RAG for enterprise-scale generative AI applications.

1 MIN READ

Apr 22, 2024

Advancing Cell Segmentation and Morphology Analysis with NVIDIA AI Foundation Model VISTA-2D

Genomics researchers use different sequencing techniques to better understand biological systems, including single-cell and spatial omics. Unlike single-cell,...

7 MIN READ

AI Foundation Models

See all

Apr 30, 2024

Leverage Mixture of Experts-Based DBRX for Superior LLM Performance on Diverse Tasks

This week’s model release features DBRX, a state-of-the-art large language model (LLM) developed by Databricks. With demonstrated strength in programming and...

3 MIN READ

Apr 26, 2024

New LLM: Snowflake Arctic Model for SQL and Code Generation

Large language models (LLMs) have revolutionized natural language processing (NLP) in recent years, enabling a wide range of applications such as text...

3 MIN READ

Apr 22, 2024

Mistral Large and Mixtral 8x22B LLMs Now Powered by NVIDIA NIM and NVIDIA API

This week’s model release features two new NVIDIA AI Foundation models, Mistral Large and Mixtral 8x22B, both developed by Mistral AI. These cutting-edge...

4 MIN READ

Mar 18, 2024

Scale AI-Enabled Robotics Development Workloads with NVIDIA OSMO

Autonomous machine development is an iterative process of data generation and gathering, model training, and deployment characterized by complex multi-stage,...

4 MIN READ

Mar 04, 2024

Solve Complex AI Tasks with Leaderboard-Topping Smaug 72B from NVIDIA AI Foundation Models

This week’s model release features the NVIDIA-optimized language model Smaug 72B, which you can experience directly from your browser. NVIDIA AI Foundation...

2 MIN READ

Feb 28, 2024

Unlock Your LLM Coding Potential with StarCoder2

Coding is essential in the digital age, but it can also be tedious and time-consuming. That's why many developers are looking for ways to automate and...

7 MIN READ

Feb 27, 2024

Unlock the Power of Small Language Model Phi-2 for Chat, Research, Coding, and More

This week’s model release features the NVIDIA-optimized language model Phi-2, which can be used for a wide range of natural language processing (NLP) tasks....

2 MIN READ

Feb 19, 2024

Experience NVIDIA cuOpt Accelerated Optimization to Boost Operational Efficiency

This week’s model release features NVIDIA cuOpt, a world-record-breaking accelerated optimization engine that helps teams solve complex routing problems and...

6 MIN READ

Decorative image of groups of people using speech AI in different ways standing around a globe.

Feb 12, 2024

Performance-Efficient Mamba-Chat from NVIDIA AI Foundation Models

This week’s release features the NVIDIA-optimized Mamba-Chat model, which you can experience directly from your browser. This post is part of Model Mondays, a...

3 MIN READ

Feb 05, 2024

Generate Code, Answer Queries, and Translate Text with New NVIDIA AI Foundation Models

This week’s Model Monday release features the NVIDIA-optimized code Llama, Kosmos-2, and SeamlessM4T, which you can experience directly from your browser....

10 MIN READ

Jan 22, 2024

Query Graphs with Optimized DePlot Model

NVIDIA AI Foundation Models and Endpoints provides access to a curated set of community and NVIDIA-built generative AI models to experience, customize, and...

6 MIN READ

Simulation / Modeling / Design

See all

May 07, 2024

NVIDIA GTC Training Labs On Demand Available Now

Missed GTC or want to replay your favorite training labs? Find it on demand with the NVIDIA GTC Training Labs playlist.

1 MIN READ

May 06, 2024

Automating Smart Pick-and-Place with Intrinsic Flowstate and NVIDIA Isaac Manipulator

We are announcing our collaboration with Intrinsic.ai on learning foundation skill models for industrial robotics tasks. Many pick-and-place problems in...

4 MIN READ

May 01, 2024

Spotlight: Continental and SoftServe Deliver Generative AI-Powered Virtual Factory Solutions with OpenUSD

With automotive consumers increasingly seeking more seamless, connected driving experiences, the industry has increased its focus on connectivity, advanced...

5 MIN READ

Apr 22, 2024

Developing Virtual Factory Solutions with OpenUSD and NVIDIA Omniverse

With NVIDIA AI, NVIDIA Omniverse, and the Universal Scene Description (OpenUSD) ecosystem, industrial developers are building virtual factory solutions that...

4 MIN READ

Apr 22, 2024

Just Released: NVIDIA Modulus v24.04

Modulus v24.04 delivers an optimized CorrDiff model and Earth2Studio for exploring weather AI models.

1 MIN READ

Photo of a cell tower at sunset among hills with fog.

Apr 22, 2024

Enhanced DU Performance and Workload Consolidation for 5G/6G with NVIDIA Aerial CUDA-Accelerated RAN

Aerial CUDA-Accelerated radio access network (RAN) enables acceleration of telco workloads, delivering new levels of spectral efficiency (SE) on a cloud-native...

14 MIN READ

Apr 11, 2024

New Video Series: OpenUSD for Developers

Universal Scene Description, also called OpenUSD or USD, is an open and extensible framework for creating, editing, querying, rendering, collaborating, and...

3 MIN READ

One image of the sun next to another of a weather satellite photo.

Apr 10, 2024

How Generative AI is Empowering Climate Tech with NVIDIA Earth-2

In the context of global warming, NVIDIA Earth-2 has emerged as a pivotal platform for climate tech, generating actionable insights in the face of increasingly...

14 MIN READ

Decorative image of bugs crawling over a computer chip.

Mar 27, 2024

Efficient CUDA Debugging: Using NVIDIA Compute Sanitizer with NVIDIA Tools Extension and Creating Custom Tools

NVIDIA Compute Sanitizer is a powerful tool that can save you time and effort while improving the reliability and performance of your CUDA applications....

14 MIN READ

Mar 25, 2024

Building High-Performance Applications in the Era of Accelerated Computing

AI is augmenting high-performance computing (HPC) with novel approaches to data processing, simulation, and modeling. Because of the computational requirements...

6 MIN READ

Mar 20, 2024

Record-Breaking NVIDIA cuOpt Algorithms Deliver Route Optimization Solutions 100x Faster

NVIDIA cuOpt is an accelerated optimization engine for solving complex routing problems. It efficiently solves problems with different aspects such as breaks,...

13 MIN READ

Image of a city simulation with a 6G network.

Mar 20, 2024

Accelerating the Future of Wireless Communication with the NVIDIA 6G Developer Program

6G will make the telco network AI-native for the first time. To develop 6G technologies, the telecom industry needs a whole new approach to research. The...

10 MIN READ

Computer Vision / Video Analytics

See all

May 03, 2024

Visual Language Models on NVIDIA Hardware with VILA

Visual language models have evolved significantly recently. However, the existing technology typically only supports one single image. They cannot reason among...

11 MIN READ

May 03, 2024

Visual Language Intelligence and Edge AI 2.0

VILA is a family of high-performance vision language models developed by NVIDIA Research and MIT. The largest model comes with ~40B parameters and the smallest...

8 MIN READ

Apr 26, 2024

Perception Model Training for Autonomous Vehicles with Tensor Parallelism

Due to the adoption of multicamera inputs and deep convolutional backbone networks, the GPU memory footprint for training autonomous driving perception models...

10 MIN READ

Apr 22, 2024

Advancing Cell Segmentation and Morphology Analysis with NVIDIA AI Foundation Model VISTA-2D

Genomics researchers use different sequencing techniques to better understand biological systems, including single-cell and spatial omics. Unlike single-cell,...

7 MIN READ

Apr 17, 2024

Advancing Medical Image Decoding with GPU-Accelerated nvImageCodec

This post delves into the capabilities of decoding DICOM medical images within AWS HealthImaging using the nvJPEG2000 library. We'll guide you through the...

16 MIN READ

Apr 12, 2024

Explainer: What Is a Convolutional Neural Network?

A convolutional neural network is a type of deep learning network used primarily to identify and classify images and to recognize objects within images.

1 MIN READ

Image of an intersection with bounding boxes and shades of light that indicate computer vision alerts.

Mar 22, 2024

Explainer: What Is Computer Vision?

Computer vision defines the field that enables devices to acquire, process, understand, and analyze digital images and videos and extract useful...

1 MIN READ

Mar 20, 2024

Developing Production-Ready AI Sensor Processing Applications with NVIDIA Holoscan 1.0

Edge AI developers are building AI applications and products for safety-critical and regulated use cases. With NVIDIA Holoscan 1.0, these applications can...

7 MIN READ

Mar 19, 2024

Breaking Barriers in Healthcare with New Models for Generative AI and Cellular Imaging

Driving the future of healthcare imaging, NVIDIA MONAI microservices are creating unique state-of-the-art models and expanded modalities to meet the demands of...

4 MIN READ

Picture of flowers split between VMAF-CUDA at 1424 FPS and CPU at 1034 FPS.

Mar 12, 2024

Calculating Video Quality Using NVIDIA GPUs and VMAF-CUDA

Video quality metrics are used to evaluate the fidelity of video content. They provide a consistent quantitative measurement to assess the performance of the...

14 MIN READ

Decorative image of matrices on a black background, with the text, "Part 2."

Mar 08, 2024

cuTENSOR 2.0: Applications and Performance

While part 1 focused on the usage of the new NVIDIA cuTENSOR 2.0 CUDA math library, this post introduces a variety of usage modes beyond that, specifically...

9 MIN READ

Decorative image of matrices on a black background, with the text "Part 1."

Mar 08, 2024

cuTENSOR 2.0: A Comprehensive Guide for Accelerating Tensor Computations

NVIDIA cuTENSOR is a CUDA math library that provides optimized implementations of tensor operations where tensors are dense, multi-dimensional arrays or array...

17 MIN READ

Data Science

See all

May 07, 2024

NVIDIA GTC Training Labs On Demand Available Now

Missed GTC or want to replay your favorite training labs? Find it on demand with the NVIDIA GTC Training Labs playlist.

1 MIN READ

May 07, 2024

Enhancing Anomaly Detection in Linux Audit Logs with AI

In cybersecurity, identifying threats swiftly and accurately is paramount to the success of the modern enterprise. Linux audit logs, which record system...

10 MIN READ

May 03, 2024

Explainer: What Is a Vector Database?

A vector database is an organized collection of vector embeddings that can be created, read, updated, and deleted at any point in time. Vector embeddings...

1 MIN READ

Apr 29, 2024

Top Data Science Sessions from NVIDIA GTC 2024 Now Available On Demand

At GTC 2024, experts from NVIDIA and our partners shared insights about GPU-accelerated tools, optimizations, and best practices for data scientists. From the...

2 MIN READ

Apr 23, 2024

Democratizing AI Workflows with Union.ai and NVIDIA DGX Cloud

GPUs were initially specialized for rendering 3D graphics in video games, primarily to accelerate linear algebra calculations. Today, GPUs have become one of...

7 MIN READ

Apr 22, 2024

Advancing Cell Segmentation and Morphology Analysis with NVIDIA AI Foundation Model VISTA-2D

Genomics researchers use different sequencing techniques to better understand biological systems, including single-cell and spatial omics. Unlike single-cell,...

7 MIN READ

Apr 17, 2024

Advancing Medical Image Decoding with GPU-Accelerated nvImageCodec

This post delves into the capabilities of decoding DICOM medical images within AWS HealthImaging using the nvJPEG2000 library. We'll guide you through the...

16 MIN READ

Apr 12, 2024

Explainer: What Is a Convolutional Neural Network?

A convolutional neural network is a type of deep learning network used primarily to identify and classify images and to recognize objects within images.

1 MIN READ

Apr 11, 2024

New Video Series: OpenUSD for Developers

Universal Scene Description, also called OpenUSD or USD, is an open and extensible framework for creating, editing, querying, rendering, collaborating, and...

3 MIN READ

Apr 05, 2024

Explainer: What Is Retrieval-Augmented Generation?

Retrieval-augmented generation enhances large language model prompts with relevant data for more practical, accurate responses.

1 MIN READ

Decorative image of graphs as light web.

Apr 03, 2024

Optimizing Memory and Retrieval for Graph Neural Networks with WholeGraph, Part 2

Large-scale graph neural network (GNN) training presents formidable challenges, particularly concerning the scale and complexity of graph data. These challenges...

5 MIN READ

Image of a chatbot as the interface between customers, with speech bubbles.

Mar 29, 2024

Explainer: What Is Sentiment Analysis?

Sentiment analysis is the automated interpretation and classification of emotions (usually positive, negative, or neutral) from textual data such as written...

1 MIN READ

Content Creation / Rendering

See all

Apr 29, 2024

GPU-Powered Windows 365 Cloud PCs with NVIDIA RTX Virtual Workstation for High-End Graphics Workloads

Professional workflows have become more complex with the increased demand for graphics-intensive scenarios. From regular office applications to demanding...

7 MIN READ

Apr 26, 2024

Enhance Text-to-Image Fine-Tuning with DRaFT+, Now Part of NVIDIA NeMo

Text-to-image diffusion models have been established as a powerful method for high-fidelity image generation based on given text. Nevertheless, diffusion models...

10 MIN READ

Apr 11, 2024

New Video Series: OpenUSD for Developers

Universal Scene Description, also called OpenUSD or USD, is an open and extensible framework for creating, editing, querying, rendering, collaborating, and...

3 MIN READ

Decorative collage of media images superimposed on data center mockup.

Apr 09, 2024

Next-Generation Live Media Apps on Repurposable Clusters with NVIDIA Holoscan for Media

NVIDIA Holoscan for Media is now available to all developers looking to build next-generation live media applications on fully repurposable clusters. ...

4 MIN READ

Mar 21, 2024

Speed Up Your AI Development: NVIDIA AI Workbench Goes GA

NVIDIA AI Workbench, a toolkit for AI and ML developers, is now generally available as a free download. It features automation that removes roadblocks for...

4 MIN READ

A GIF showing one of the new features of NSight.

Mar 21, 2024

Upgrade Your Graphics: Explore New Ray Tracing Features for NVIDIA Nsight Tools

The union of ray tracing and AI is pushing graphics fidelity and performance to new heights. Helping you build optimized, bug-free applications in this era of...

5 MIN READ

Still image from Covert Protocol game demo.

Mar 19, 2024

Generative AI for Digital Humans and New AI-powered NVIDIA RTX Lighting

At GDC 2024, NVIDIA announced that leading AI application developers such as Inworld AI are using NVIDIA digital human technologies to accelerate the deployment...

5 MIN READ

Mar 14, 2024

Powerful Shader Insights: Using Shader Debug Info with NVIDIA Nsight Graphics

As ray tracing becomes the predominant rendering technique in modern game engines, a single GPU RayGen shader can now perform most of the light simulation of a...

7 MIN READ

Mar 12, 2024

Streamline Live Media Application Development with New Features in NVIDIA Holoscan for Media

NVIDIA Holoscan for Media is a software-defined platform for building and deploying applications for live media. Recent updates introduce a user-friendly...

5 MIN READ

Mar 11, 2024

Advancing GPU-Driven Rendering with Work Graphs in Direct3D 12

GPU-driven rendering has long been a major goal for many game applications. It enables better scalability for handling large virtual scenes and reduces cases...

12 MIN READ

Mar 11, 2024

Work Graphs in Direct3D 12: A Case Study of Deferred Shading

When it comes to game application performance, GPU-driven rendering enables better scalability for handling large virtual scenes. Direct3D 12 (D3D12) introduces...

14 MIN READ

Decorative image of a video conference with avatars.

Mar 07, 2024

Robotics

See all

May 06, 2024

Automating Smart Pick-and-Place with Intrinsic Flowstate and NVIDIA Isaac Manipulator

We are announcing our collaboration with Intrinsic.ai on learning foundation skill models for industrial robotics tasks. Many pick-and-place problems in...

4 MIN READ

May 03, 2024

Visual Language Intelligence and Edge AI 2.0

VILA is a family of high-performance vision language models developed by NVIDIA Research and MIT. The largest model comes with ~40B parameters and the smallest...

8 MIN READ

Apr 26, 2024

Perception Model Training for Autonomous Vehicles with Tensor Parallelism

Due to the adoption of multicamera inputs and deep convolutional backbone networks, the GPU memory footprint for training autonomous driving perception models...

10 MIN READ

Apr 22, 2024

Developing Virtual Factory Solutions with OpenUSD and NVIDIA Omniverse

With NVIDIA AI, NVIDIA Omniverse, and the Universal Scene Description (OpenUSD) ecosystem, industrial developers are building virtual factory solutions that...

4 MIN READ

Mar 20, 2024

Developing Production-Ready AI Sensor Processing Applications with NVIDIA Holoscan 1.0

Edge AI developers are building AI applications and products for safety-critical and regulated use cases. With NVIDIA Holoscan 1.0, these applications can...

7 MIN READ

Mar 20, 2024

Record-Breaking NVIDIA cuOpt Algorithms Deliver Route Optimization Solutions 100x Faster

NVIDIA cuOpt is an accelerated optimization engine for solving complex routing problems. It efficiently solves problems with different aspects such as breaks,...

13 MIN READ

Mar 18, 2024

Scale AI-Enabled Robotics Development Workloads with NVIDIA OSMO

Autonomous machine development is an iterative process of data generation and gathering, model training, and deployment characterized by complex multi-stage,...

4 MIN READ

Mar 07, 2024

Make the Most of NVIDIA GTC 2024 with In-Person, Hands-On Learning

We are so excited to be back in person at GTC this year at the San Jose Convention Center. With thousands of developers, industry leaders, researchers, and...

6 MIN READ

Feb 29, 2024

Top Synthetic Data Generation Sessions at NVIDIA GTC 2024

Learn how synthetic data is supercharging 3D simulation and computer vision workflows, from visual inspection to autonomous machines.

1 MIN READ

Feb 26, 2024

Detecting Real-Time Waste Contamination Using Edge Computing and Video Analytics

The past few decades have witnessed a surge in rates of waste generation, closely linked to economic development and urbanization. This escalation in waste...

8 MIN READ

Feb 26, 2024

Developer Days at NVIDIA GTC 2024

Connect with industry leaders, learn from technical experts, and collaborate with peers at NVIDIA GTC 2024 Developer Days.

1 MIN READ

Decorative image of avatars working in different office locations.

Feb 21, 2024

Webinar: Accelerate Edge AI Development With NVIDIA Metropolis Microservices For Jetson

On March 5, 8am PT, learn how NVIDIA Metropolis microservices for Jetson Orin helps you modernize your app stack, streamline development and deployment, and...

1 MIN READ

Conversational AI

See all

Decorative image of text and speech recognition processes encircling the globe.

Apr 18, 2024

New Standard for Speech Recognition and Translation from the NVIDIA NeMo Canary Model

NVIDIA NeMo is an end-to-end platform for the development of multimodal generative AI models at scale anywhere—on any cloud and on-premises. The NeMo team...

4 MIN READ

Apr 18, 2024

Turbocharge ASR Accuracy and Speed with NVIDIA NeMo Parakeet-TDT

NVIDIA NeMo, an end-to-end platform for developing multimodal generative AI models at scale anywhere—on any cloud and on-premises—recently released...

6 MIN READ

Image of two people sitting in their cubicles with speech recognition visualizations in the background.

Apr 18, 2024

Pushing the Boundaries of Speech Recognition with NVIDIA NeMo Parakeet ASR Models

NVIDIA NeMo, an end-to-end platform for the development of multimodal generative AI models at scale anywhere—on any cloud and on-premises—released the...

6 MIN READ

Apr 12, 2024

Explainer: What Is a Convolutional Neural Network?

A convolutional neural network is a type of deep learning network used primarily to identify and classify images and to recognize objects within images.

1 MIN READ

Mar 27, 2024

Develop Custom Enterprise Generative AI with NVIDIA NeMo

Generative AI is transforming computing, paving new avenues for humans to interact with computers in natural, intuitive ways. For enterprises, the prospect of...

14 MIN READ

Mar 19, 2024

Generative AI for Digital Humans and New AI-powered NVIDIA RTX Lighting

At GDC 2024, NVIDIA announced that leading AI application developers such as Inworld AI are using NVIDIA digital human technologies to accelerate the deployment...

5 MIN READ

Mar 19, 2024

NVIDIA Speech and Translation AI Models Set Records for Speed and Accuracy

Speech and translation AI models developed at NVIDIA are pushing the boundaries of performance and innovation. The NVIDIA Parakeet automatic speech recognition...

8 MIN READ

Decorative collage of a sun with flaring light and federated learning models.

Mar 06, 2024

Turning Machine Learning to Federated Learning in Minutes with NVIDIA FLARE 2.4

Federated learning (FL) is experiencing accelerated adoption due to its decentralized, privacy-preserving nature. In sectors such as healthcare and financial...

16 MIN READ

Mar 04, 2024

Solve Complex AI Tasks with Leaderboard-Topping Smaug 72B from NVIDIA AI Foundation Models

This week’s model release features the NVIDIA-optimized language model Smaug 72B, which you can experience directly from your browser. NVIDIA AI Foundation...

2 MIN READ

Feb 29, 2024

Scalable Federated Learning with NVIDIA FLARE for Enhanced LLM Performance

In the ever-evolving landscape of large language models (LLMs), effective data management is a key challenge. Data is at the heart of model performance. While...

8 MIN READ

Feb 29, 2024

Event: Speech and Generative AI Developer Day at NVIDIA GTC 2024

Learn how to build a RAG-powered application with a human voice interface at NVIDIA GTC 2024 Speech and Generative AI Developer Day.

1 MIN READ

Feb 28, 2024

Unlock Your LLM Coding Potential with StarCoder2

Coding is essential in the digital age, but it can also be tedious and time-consuming. That's why many developers are looking for ways to automate and...

7 MIN READ

Edge Computing

See all

May 07, 2024

NVIDIA GTC Training Labs On Demand Available Now

Missed GTC or want to replay your favorite training labs? Find it on demand with the NVIDIA GTC Training Labs playlist.

1 MIN READ

May 03, 2024

Visual Language Intelligence and Edge AI 2.0

VILA is a family of high-performance vision language models developed by NVIDIA Research and MIT. The largest model comes with ~40B parameters and the smallest...

8 MIN READ

May 03, 2024

Visual Language Models on NVIDIA Hardware with VILA

Visual language models have evolved significantly recently. However, the existing technology typically only supports one single image. They cannot reason among...

11 MIN READ

Apr 22, 2024

Enhanced DU Performance and Workload Consolidation for 5G/6G with NVIDIA Aerial CUDA-Accelerated RAN

Aerial CUDA-Accelerated radio access network (RAN) enables acceleration of telco workloads, delivering new levels of spectral efficiency (SE) on a cloud-native...

14 MIN READ

Mar 20, 2024

Developing Production-Ready AI Sensor Processing Applications with NVIDIA Holoscan 1.0

Edge AI developers are building AI applications and products for safety-critical and regulated use cases. With NVIDIA Holoscan 1.0, these applications can...

7 MIN READ

Collage of product image plus three use case images.

Mar 20, 2024

Powering Mission-Critical AI at the Edge with NVIDIA AI Enterprise IGX

NVIDIA SDKs have been instrumental in accelerating AI applications across a spectrum of use cases spanning smart cities, medical, and robotics. However,...

6 MIN READ

Mar 20, 2024

Accelerating the Future of Wireless Communication with the NVIDIA 6G Developer Program

6G will make the telco network AI-native for the first time. To develop 6G technologies, the telecom industry needs a whole new approach to research. The...

10 MIN READ

Mar 05, 2024

Spotlight: Honeywell Accelerates Industrial Process Simulation with NVIDIA cuDSS

For over a decade, traditional industrial process modeling and simulation approaches have struggled to fully leverage multicore CPUs or acceleration devices to...

8 MIN READ

Mar 01, 2024

Featured Energy Sessions at NVIDIA GTC 2024

Hear from ExxonMobil, Honeywell, Siemens Energy, and more as they explore AI and HPC innovation in oil, gas, power, and utilities.

1 MIN READ

Image of a car with light streams connected to different towers along the street.

Mar 01, 2024

Top Telecom Sessions at NVIDIA GTC 2024

Hear from Amdocs, Indosat, KT, NTT, ServiceNow, Singtel, SoftBank, and Verizon, plus a special address from NVIDIA at GTC. Explore AI transforming customer...

1 MIN READ

Feb 26, 2024

Detecting Real-Time Waste Contamination Using Edge Computing and Video Analytics

The past few decades have witnessed a surge in rates of waste generation, closely linked to economic development and urbanization. This escalation in waste...

8 MIN READ

Computer-generated image of a canal with bounding boxes for houses, trees, boats, and people.

Feb 21, 2024

Top Computer Vision/Video Analytics Sessions at NVIDIA GTC 2024

Discover the transformative power of computer vision and video analytics at GTC. Dive into cutting-edge techniques such as vision transformers, AI agents,...

1 MIN READ

Data Center / Cloud

See all

May 07, 2024

Enhancing Anomaly Detection in Linux Audit Logs with AI

In cybersecurity, identifying threats swiftly and accurately is paramount to the success of the modern enterprise. Linux audit logs, which record system...

10 MIN READ

May 06, 2024

Intelligent Traffic Offload with Enhanced AI-powered 5G Security for Enterprises

Securing the private 5G and applications at the edge comes with many challenges. Sophisticated AI– and ML-based attack campaigns require security to respond...

6 MIN READ

Apr 29, 2024

GPU-Powered Windows 365 Cloud PCs with NVIDIA RTX Virtual Workstation for High-End Graphics Workloads

Professional workflows have become more complex with the increased demand for graphics-intensive scenarios. From regular office applications to demanding...

7 MIN READ

Apr 25, 2024

Announcing Confidential Computing General Access on NVIDIA H100 Tensor Core GPUs

NVIDIA launched the initial release of the Confidential Computing (CC) solution in private preview for early access in July 2023 through NVIDIA LaunchPad....

3 MIN READ

Apr 23, 2024

Democratizing AI Workflows with Union.ai and NVIDIA DGX Cloud

GPUs were initially specialized for rendering 3D graphics in video games, primarily to accelerate linear algebra calculations. Today, GPUs have become one of...

7 MIN READ

Apr 23, 2024

Webinar: Enhance LLMs with RAG and Accelerate Enterprise AI with Pure Storage and NVIDIA

Join Pure Storage and NVIDIA on April 25 to discover the benefits of enhancing LLMs with RAG for enterprise-scale generative AI applications.

1 MIN READ

Apr 22, 2024

Advancing Cell Segmentation and Morphology Analysis with NVIDIA AI Foundation Model VISTA-2D

Genomics researchers use different sequencing techniques to better understand biological systems, including single-cell and spatial omics. Unlike single-cell,...

7 MIN READ

Apr 22, 2024

Just Released: NVIDIA Modulus v24.04

Modulus v24.04 delivers an optimized CorrDiff model and Earth2Studio for exploring weather AI models.

1 MIN READ

Apr 22, 2024

Enhanced DU Performance and Workload Consolidation for 5G/6G with NVIDIA Aerial CUDA-Accelerated RAN

Aerial CUDA-Accelerated radio access network (RAN) enables acceleration of telco workloads, delivering new levels of spectral efficiency (SE) on a cloud-native...

14 MIN READ

Apr 19, 2024

Measuring the GPU Occupancy of Multi-stream Workloads

NVIDIA GPUs are becoming increasingly powerful with each new generation. This increase generally comes in two forms. Each streaming multi-processor (SM), the...

11 MIN READ

Apr 17, 2024

Advancing Medical Image Decoding with GPU-Accelerated nvImageCodec

This post delves into the capabilities of decoding DICOM medical images within AWS HealthImaging using the nvJPEG2000 library. We'll guide you through the...

16 MIN READ

Apr 10, 2024

How Generative AI is Empowering Climate Tech with NVIDIA Earth-2

In the context of global warming, NVIDIA Earth-2 has emerged as a pivotal platform for climate tech, generating actionable insights in the face of increasingly...

14 MIN READ