Posts by Vinh Nguyen
Generative AI / LLMs
Mar 14, 2024
Applying Mixture of Experts in LLM Architectures
Mixture of experts (MoE) large language model (LLM) architectures have recently emerged both in proprietary LLMs such as GPT-4 and in community models...
12 MIN READ
Generative AI / LLMs
Nov 28, 2023
Build Enterprise Retrieval-Augmented Generation Apps with NVIDIA Retrieval QA Embedding Model
Large language models (LLMs) are transforming the AI landscape with their profound grasp of human and programming languages. Essential for next-generation...
12 MIN READ
Conversational AI
Mar 15, 2023
How to Create a Custom Language Model
Generative AI has captured the attention and imagination of the public over the past couple of years. From a given natural language prompt, these generative...
12 MIN READ
Simulation / Modeling / Design
Dec 08, 2022
Introducing NVIDIA Riva: A GPU-Accelerated SDK for Developing Speech AI Applications
This post was updated in March 2023. Speech AI is used in a variety of applications, including contact...
8 MIN READ
Conversational AI
Oct 28, 2022
Making an NVIDIA Riva ASR Service for a New Language
Speech AI is the ability of intelligent systems to communicate with users using a voice-based interface, which has become ubiquitous in everyday life. People...
13 MIN READ
Data Science
Aug 03, 2022
Accelerated Inference for Large Transformer Models Using NVIDIA Triton Inference Server
This is the first part of a two-part series discussing the NVIDIA Triton Inference Server’s FasterTransformer (FT) library, one of the fastest libraries for...
10 MIN READ