NVIDIA AI Frameworks and Applications from Scan AI

AI Software

Optimised libraries and frameworks from NVIDIA that accelerate agentic AI model development.

AI Software Ecosystem

Software tools, libraries, SDKs, frameworks and applications are the building blocks of all AI models. Their use helps accelerate the development and deployment of AI agents tailored to a wide variety of use cases.

NVIDIA AI Enterprise is an end-to-end software platform containing everything you need to get started in AI. Optimised for NVIDIA GPU platforms, each collection of libraries, frameworks and NVIDIA Inference Microservices (NIMs) is specifically designed to reduce the complexity of building AI applications from scratch.

NVIDIA Blueprints go one step further providing pre-trained reference AI workflows, customisable with your own business data, for common implementations including Retrieval-Augmented Generation (RAG), Video Search and Summarisation (VSS), digital human avatars, and customer service bots.

GTC Inference Diagram

Customer Service

AI agents for customer service go beyond conversational AI, scripted workflows, and basic automation to enable AI systems that can reason, plan, and take autonomous action across a wide breadth of customer service operations. The frameworks below contain tools and libraries for developing agent attributes from appearance to speech, conversational skills, translation, recommendations and more.

AI for Media

NVIDIA AI for Media (formerly Maxine) is a collection of SDKs, NIMs and Blueprints that enhance audio, video, and create augmented reality effects for visual workflows, including video conferencing and virtual customer service. It enables studio-quality audio effects such as background noise removal, echo correction, and voice quality enhancement — all using a standard microphone. Similarly, a regular webcam can be used to generate video effects including realtime background removal, relighting, super-resolution, and gaze, face and body tracking.

A further feature adjusts a user's gaze to make it appear they are looking directly at the camera, even while reading from a screen.

Example use cases:

  • Telehealth appointments
  • Presenting livestreams
  • IPTV broadcasting
NVIDIA AI for Media audio and video enhancement effects

Audio2Face

NVIDIA Audio2Face automatically generates realistic 3D facial animations, lip-syncing, and emotional expressions directly from audio files. It works by analysing acoustic features such as phonemes and intonation to create a stream of animation data, which is then mapped to a character's facial poses. This data can then be rendered offline for pre-scripted content or streamed in real-time for dynamic, AI-driven characters. Furthermore, it works with any language, including songs.

Audio2Face is part of the NVIDIA Avatar Cloud Engine (ACE) platform, along with Riva for speech development; and Nemotron to add reasoning and context for humanlike interaction. Furthermore, Audio2Face can be used with the Omniverse platform, enabling collaborative development and testing.

Example use cases:

  • Next-generation interactive avatars for customer service
  • Humanoid robots for virtual factories
NVIDIA Audio2Face generating realistic 3D facial animations from audio

Merlin

NVIDIA Merlin is an open source recommender systems framework, designed to streamline the building of recommenders by addressing common preprocessing, feature engineering, training, and inference challenges. Merlin consists of several specialised libraries designed to handle different stages of the recommendation pipelines — each optimised to support the retrieval, filtering, scoring and ordering of data — all controlled through easy-to-use APIs.

Merlin can pre-process terabyte-scale datasets on GPUs up to 10x faster than CPU-based Apache Spark models, delivering much improved predictions resulting in increased click-through rates.

Example use cases:

  • Online shopping decision trees
  • Customer service auto-attendants
NVIDIA Merlin GPU-accelerated recommender system pipeline

Nemotron

NVIDIA Nemotron is a family of large language models (LLMs) and small language models (SLMs) that provide digital humans with intelligence — capable of providing contextually aware responses for humanlike conversations. It consists of several streams offering pre-trained models with parameters in the billions (B) — Nano (30B) delivers the highest accuracy and efficiency for targeted tasks; Nano Omni (30B) consolidates video, audio, image, and text understanding for a simplified agent workflow; Super (120B) is designed for addressing complex tasks in multi-agent environments; and Ultra (253B) is ideal for multi-agent enterprise workflows requiring the highest accuracy.

Nemotron is part of the NVIDIA Avatar Cloud Engine (ACE) platform, along with Riva for speech development; and Audio2Face for generating realistic facial expressions.

Example use cases:

  • Virtual assistants
  • Customer service chatbots
NVIDIA Nemotron large language model powering a digital human avatar

NeMo

NVIDIA NeMo is a comprehensive toolkit for managing the AI agent lifecycle, primarily for models built with the NVIDIA Nemotron framework. It incudes a number of microservices - Curator cleans, filters, and deduplicates massive datasets, ensuring high-quality training data. Data Designer helps generate synthetic data from scratch to maximise training efficiency. Retriever helps extract text, tables, charts, and images from PDFs and databases within your business. Next, Evaluator is used to benchmark and test models, whilst the Agent Toolkit monitors autonomous agents, improving reliability and performance. Lastly, Guardrails allows developers to add safety, security, and topical constraints.

Once your agentic AI model is deployed, RL and Customiser drive constant optimisation with reinforcement learning techniques and fine-tuning to align models with domain data.

Example use cases:

  • Ensuring virtual assistants remain up to date
  • Managing Nemotron-based customer service chatbots
NVIDIA NeMo comprehensive toolkit for managing the AI agent lifecycle

NemoClaw

NVIDIA NemoClaw is an open source framework that adds privacy and security controls to OpenClaw — a popular autonomous AI agent that runs locally to automate complex, multi-step tasks across your computer, files, and messaging apps. It works by installing NVIDIA OpenShell to enforce policy-based guardrails, effectively sandboxing each OpenClaw session including data handling controls and audit logging. NemoClaw also evaluates available compute resources to run high-performance models such as NVIDIA Nemotron locally for enhanced privacy and cost efficiency.

NemoClaw secures autonomous agents on systems including NVIDIA GeForce RTX PCs and laptops; RTX Studio PCs and laptops, RTX PRO workstations and laptops, DGX Spark, DGX Station and all RTX-powered cloud instances.

Example use cases:

  • Enhanced productivity for customer service operators
  • Streamlined workflows for IT support functions
  • Cybersecurity teams monitoring employee 'always-on' AI agents
NVIDIA NemoClaw sandboxed autonomous AI agent with privacy guardrails

Riva

NVIDIA Riva is an SDK for building and deploying fully customisable, real-time conversational AI applications. It enables developers to integrate automatic speech recognition (ASR), text-to-speech (TTS), and neural machine translation (NMT) into their visual and audio workflows, with language support in English, Arabic, French, German, Hindi, Italian, Japanese, Korean, Mandarin, Portuguese, Russian, and Spanish; and with under 300ms latency for natural free-flowing conversations. Riva can also combine speech AI with visual cues, such as gaze detection and gestures, for a more contextual conversational experience.

Models can be trained on proprietary datasets using NVIDIA NeMo to adapt to specific vocabularies, accents, and domains, improving accuracy for specific use cases. As it is fully containerised, Riva can be deployed on-premise, in the cloud, or on edge devices containing NVIDIA Jetson GPUs, ensuring complete data privacy.

Example use cases:

  • Virtual assistants
  • Telehealth appointments
  • Presenting livestreams
NVIDIA Riva real-time conversational AI speech recognition and synthesis

Content Generation

AI agents designed for text, image or video generation, or a combination of all three. The frameworks below contain tools and libraries for developing generative AI content for gaming, broadcast, streaming and more.

ACE

NVIDIA ACE (Avatar Cloud Engine) is a suite of generative AI technologies designed for game developers to create smart, conversational, and animated non-playable characters (NPCs). It includes several frameworks — Audio2Face that generates digital humans with dynamic facial animations, emotions and accurate lip syncing; Riva to understand human language with translation for up to 36 languages; and Nemotron LLMs to provide background stories and contextually aware responses for humanlike conversations. ACE also features multi-modal small language models (SLMs) that allow NPC characters to perceive their environment, plan actions, and act autonomously.

Supported in the NVIDIA Omniverse collaborative cloud platform, ACE also offers plug-ins for Unreal Engine and Autodesk Maya.

Example use cases:

  • Development of NPCs in games
NVIDIA ACE Avatar Cloud Engine powering conversational game NPCs

NeMo

NVIDIA NeMo is a comprehensive toolkit for managing the AI agent lifecycle, primarily for models built with the NVIDIA Nemotron framework. It includes several microservices - Curator is a component used to clean, filter, and deduplicate massive datasets to ensure high-quality training data. Data Designer helps generate synthetic data from scratch to maximise training efficiency. Retriever helps extract text, tables, charts, and images from PDFs and databases within your business. Evaluator is used to benchmark and test models. Agent Toolkit monitors autonomous agents, improving reliability and performance. Lastly, Guardrails enables developers to add safety, security and topical constraints.

Once your agentic AI model is deployed, RL and Customiser drive constant optimisation with reinforcement learning techniques and fine-tuning to align models with domain data.

Example use cases:

  • Managing AI generated NPCs in games
NVIDIA NeMo AI agent lifecycle management toolkit

NIM

NVIDIA NIM (NVIDIA Inference Microservices) is a set of containerised, pre-built, and optimised software designed to accelerate the deployment of generative AI models, whether they be text, image or video based. Microservices include TensorRT-LLM which accelerates and optimises the inference performance of LLMs on NVIDIA GPUs resulting in higher throughput and lower latency. Similarly, Triton Inference Server optimises GPU utilisation through dynamic batching of incoming requests.

As NIMs are continuously updated with the latest optimised inference engines, you can expect to see performance on the same infrastructure improving over time.

Example use cases:

  • Deployment of digital avatars in games or customer service applications
  • Gain improvements in real-time computer vision tasks (image classification / object detection)
NVIDIA NIM containerised inference microservices accelerating AI deployment

Omniverse

NVIDIA Omniverse is a cloud computing platform that enhances 3D visualisation and development workflows, providing a collaborative space accessible by multiple users. It obeys real-world physics, ensuring simulations are as accurate as possible, guaranteeing their integrity when transferred to the real world. It utilises a common USD (Universal Scene Description) format, making collaboration across multiple third-party applications powerful and yet straight-forward.

Omniverse is a key part of many AI workflows, offering digital twin environments for safe simulation and learning, prior to real world deployment.

Example use cases:

  • Robotic and autonomous vehicle simulations with physically accurate worlds
  • Design and modelling of industrial facilities or entire smart cities
  • Synthetic data generation
NVIDIA Omniverse collaborative 3D digital twin environment

Picasso

NVIDIA Picasso is a DGX Cloud native application for building and deploying generative AI-powered image, video, and 3D applications. It offers advanced text-to-image, text-to-video, text-to-3D and 2D to 3D capabilities, utilising the NVIDIA Edify architecture used in Omniverse, and trained on licensed, safe data from sources including Shutterstock, Adobe and Getty Images.

From simple text prompts, Picasso can generate custom 360-degree, up to 8K-resolution, high-dynamic-range (HDR) images, which can be used as a base to set a background or custom light a scene.

Example use cases:

  • Generating props, characters, and environmental assets for games or virtual environments
  • Create custom, marketing content quickly
  • Rapid architectural visualisation
NVIDIA Picasso generative AI producing 8K HDR image from a text prompt

Data Analytics

AI agents designed for data analytics are designed to work seamlessly within existing workflows such as Python and Apache Spark, often requiring minimal or zero code changes for acceleration. NVIDIA AI Enterprise software ensures these tools are secure, supported, and ready for production deployment. The frameworks below contain tools and libraries for advanced data, graph, image and video analytics.

Cosmos

NVIDIA Cosmos generates world foundation models (WFMs), using advanced video analytics, enabling developers to create, simulate, and understand complex physical environments — real or synthetic — to train and validate physical AI models. While it is primarily designed as a platform for physical AI (robotics and autonomous vehicles), its capabilities including an inbuilt 7B VLM make it ideal for analysing and organising large-scale visual data resulting in a range of uses in industrial, safety and logistics environments. These attributes enable users to instantly query large-scale video datasets and retrieve specific scenarios for analysis.

As Cosmos is designed for physics-based, geospatially accurate simulations, it is supported in NVIDIA Omniverse environments to render photorealistic data.

Example use cases:

  • Interpretation of realtime or recorded video streams
  • Traffic monitoring
  • Generation of predictive AI worlds
NVIDIA Cosmos world foundation model simulating a physically accurate environment

DeepStream

NVIDIA DeepStream is an SDK designed to build AI-powered, real-time video, audio and image processing applications. It is ideal for data analytics as it performs computer vision tasks such as object detection, people counting and tracking, and classification on video feeds. It works by turning raw visual data into actionable insights for retail, traffic management, and industrial automation. It includes over 40 hardware-accelerated plug-ins and 30 sample applications and extensions to optimise pre/post processing, inference and multi-camera tracking.

DeepStream bridges the gap between edge devices such as NVIDIA Jetson and the cloud, allowing analytics to be processed locally. The analysed metadata can be sent to analytics engines for further analysis, reporting, or storage.

Example use cases:

  • Analysing customer traffic, aisle flow, and inventory
  • Traffic management, parking detection, and safety monitoring
  • Defect detection and worker safety
NVIDIA DeepStream real-time video analytics detecting objects across multiple camera feeds

Merlin

NVIDIA Merlin is an open source recommender systems framework, designed to streamline the building of recommenders by addressing common preprocessing, feature engineering, training, and inference challenges. Merlin consists of several specialised libraries designed to handle different stages of the recommendation pipelines — each optimised to support the retrieval, filtering, scoring and ordering of data — all controlled through easy-to-use APIs.

Merlin is capable of pre-processing terabyte-scale datasets on GPUs up to 10x faster than CPU-based Apache Spark models, delivering much improved data analysis at scale.

Example use cases:

  • End-to-end pipeline data optimisation
  • Embedding massive data tables that exceed GPU memory
  • Building models that predict user actions based on short-term session data
NVIDIA Merlin GPU-accelerated recommender pipeline processing terabyte-scale datasets

Modulus

NVIDIA Modulus is an open-source deep-learning framework designed to build, train, and deploy AI models that simulate the physical world. It bridges the gap between traditional physics, including fluid dynamics and structural mechanics, and AI, allowing engineers to create real-time digital twins and predictive models. Using Physics-Informed Neural Networks (PINNs), it embeds the laws of physics directly into the training of the neural network.

Modulus also takes historical or simulated data from traditional computer-aided engineering (CAE) tools, such as Ansys, and uses AI to drastically speed up calculations.

Example use cases:

  • Acceleration of interactive simulations for digital twins using PhysicsNeMo NIM
  • Ingestion of physics-grounded datasets from real-world sensors such as satellites
  • Design of heatsinks, microchips, and electronics cooling systems where heat distribution modelling is key
NVIDIA Modulus Physics-Informed Neural Network simulating fluid dynamics for a digital twin

RAPIDS

NVIDIA RAPIDS is an open-source suite of GPU-accelerated Python libraries designed to speed up end-to-end data science and analytics pipelines. It is designed for large-scale data manipulation offering performance gains of up to 40x–150x over traditional CPU-based workflows. RAPIDS enables the entire data science pipeline to run on the GPU, avoiding the time-consuming bottlenecks of moving data between CPU and GPU. Additionally, because RAPIDS is designed to feel like the standard Python PyData ecosystem, data scientists can leverage GPU power without learning CUDA programming.

RAPIDS also supports multi-node, multi-GPU deployments, making it suitable for distributed processing on large datasets.

Example use cases:

  • Faster data manipulation and larger dataset handling
  • Process large-scale data science workloads on GPUs rather than CPUs
  • RAPIDS plug-in for Apache Spark for performance gains or reduction in cloud costs
NVIDIA RAPIDS GPU-accelerated Python data science pipeline achieving 150x speedup

Healthcare

AI agents deployed for healthcare provide accelerated computing platforms and AI models that enable faster drug discovery, improve medical diagnostics, and promote efficient hospital workflows. The frameworks below contain tools and libraries for developing AI agents that aim to improve patient outcomes whilst retaining dignity and confidentiality.

BioNeMo

NVIDIA BioNeMo is a generative AI framework and development platform specifically designed for computational drug discovery and life sciences. It accelerates the research and development of biomolecular AI models, such as protein structure prediction, antibody and peptide design, small molecular generation, and RNA modelling. It features state-of-the-art pre-trained models for protein-protein complex formation, protein structure prediction, and molecular generation.

The BioNeMo framework is optimised to run on GPU-accelerated clusters and includes NIMs that enable inference of production-grade biomolecular foundation models across on-premise, hosted or cloud platforms.

Example use cases:

  • Generation of new chemical entities with desired properties
  • Predicting binding sites by 3D modelling of amino acids
  • Accelerating lab data analysis
NVIDIA BioNeMo generative AI framework for biomolecular drug discovery and protein structure prediction

Clara Discovery

NVIDIA Clara Discovery — part of the Clara healthcare platform — is a collection of frameworks, applications, and AI models enabling GPU-accelerated computational drug discovery. It brings GPU-acceleration to computational chemistry, genomics, and imaging; and AI and machine learning to genomics, proteomics, microscopy, virtual screening, visualisation, clinical imaging and natural language processing.

Clara Discovery supports genomic workflows with Clara Parabricks, medical imaging with Clara Imaging and drug discovery with BioNeMo.

Example use cases:

  • Generative AI for drug discovery
  • Predictive modelling using digital twins for drug manufacture
NVIDIA Clara Discovery GPU-accelerated computational drug discovery platform

Clara Guardian

NVIDIA Clara Guardian — part of the Clara healthcare platform — is a framework that simplifies the development and deployment of smart sensors within a healthcare facility. Its key components include automated speech recognition, text-to-speech and natural language understanding; alongside healthcare-specific versions of Riva and DeepStream for computer vision. Also included is NVIDIA Fleet Command, a hybrid-cloud platform for securely managing and scaling AI deployments across servers and edge devices at hospitals.

The open source nature of these frameworks makes it easy for third-party partners to add AI capabilities to common sensors used in medical environments.

Example use cases:

  • Management of PPE usage or social distancing
  • Body temperature checking
  • Heart rate estimation and gesture recognition models for patient monitoring
NVIDIA Clara Guardian smart sensor framework for AI-powered healthcare facility monitoring

Clara Imaging

NVIDIA Clara Imaging — part of the Clara healthcare platform — is a framework including pre-trained models and software libraries to accelerate image processing, analysis, and reconstruction for diagnostics from X-ray, CT, MRI, and ultrasound scans. It enables developers to create, train, and deploy advanced imaging models for tasks such as image segmentation, classification, and 3D visualisation.

One component of the framework, Clara Reason, provides step-by-step reasoning for image interpretation, acting as a "second reader" for radiologists to validate findings and build trust in AI assessments.

Example use cases:

  • Creating realistic 3D visualisations of anatomy for improved diagnostics
  • AI-assisted annotation to speed up the labelling of imaging data
  • Image enhancement and reconstruction to improve image quality and reduce noise
NVIDIA Clara Imaging AI-assisted medical image segmentation and 3D reconstruction from CT and MRI scans

Clara Parabricks

NVIDIA Clara Parabricks — part of the Clara healthcare platform — is a scalable genomics software suite for secondary analysis of RNA and DNA, that provides GPU-accelerated versions of trusted, open source tools. Compatible with leading sequencing instruments, it can be used across diverse bioinformatics workflows to increase speed and reduce cost, while enhancing accuracy and ensuring transparency.

Reducing time to results for researchers, Parabricks offers whole genomic sequencing in around 45 minutes as opposed to 30 hours with CPU-only solutions.

Example use cases:

  • Genomic sequencing to accelerate drug discovery
  • Large scale population analysis
  • Rapid analysis of clinical samples
NVIDIA Clara Parabricks GPU-accelerated genomic sequencing completing whole genome analysis in 45 minutes

FLARE

NVIDIA FLARE (Federated Learning Application Runtime Environment) is an SDK that enables secure, privacy-preserving federated learning across multiple parties. Federated learning is a way to develop and validate more accurate and generalisable AI models from diverse data sources by mitigating the risk of compromising data security or privacy. It enables AI models to be built with a consortium of data providers without the data ever leaving the individual site.

FLARE also includes built-in tools for managing SSL certificates, user authentication, and access authorisation, for when data governance is critical.

Example use cases:

  • Analysis of city- or county-wide data from multiple hospitals for patient trends
  • Rapid analysis across geographies for disease outbreaks such as COVID-19
  • Developing AI models for specific diseases with large datasets from multiple sources
NVIDIA FLARE federated learning SDK enabling privacy-preserving AI model training across hospital networks

Holoscan

NVIDIA Holoscan is a multimodal, AI sensor processing platform designed for realtime, low-latency streaming applications at the edge and in the cloud. It provides a full-stack infrastructure — combining hardware, software, and AI libraries — to build, deploy, and manage AI-enabled pipelines for medical devices. Holoscan employs NVIDIA Rivermax network streaming and GPUDirect RDMA technologies to bypass the CPU for faster data transfer, resulting in end-to-end latency as low as 10ms, essential in applications such as realtime surgical video.

The Holoscan SDK accelerates deployment on NVIDIA Jetson modules for development of production-ready, industrial-grade AI-powered medical hardware.

Example use cases:

  • Realtime tool tracking to assist surgeons by detecting anatomy during operations
  • 3D surgical guidance to help stereo depth estimation in procedures and training
  • Enhancing patient safety during procedures
NVIDIA Holoscan realtime AI sensor processing platform for low-latency surgical video streaming

Isaac for Healthcare

NVIDIA Isaac for Healthcare is a purpose-built platform containing tools that provide pre-trained AI models, simulation environments, and accelerated libraries for medical and assistive robotics. It supports developers across the entire workflow — from collecting and curating data to building and testing AI models in realistic simulated medical environments, where robots can practice medical tasks virtually, learn from their mistakes using reinforcement learning, and then transfer those learned policies directly to physical machines.

As patient data is often hard to gather or privacy-restricted, Isaac for Healthcare allows developers to generate photorealistic, physically accurate synthetic data to train their AI on a massive scale.

Example use cases:

  • Digital prototyping of next-gen healthcare robotic systems, sensors, and instruments
  • Training robotic policies for augmented dexterity for use in robot-assisted surgery
  • Synthetic data generation for medical environment simulation
NVIDIA Isaac for Healthcare robotics platform simulating medical assistive robot training environments

MONAI Toolkit

The NVIDIA MONAI Toolkit is a containerised, enterprise-grade development environment that accelerates the development, training, and deployment of AI medical imaging models. Built upon the open-source Medical Open Network for AI (MONAI) framework — co-founded by NVIDIA — the toolkit includes pre-trained models and libraries specifically optimised for healthcare AI workloads on NVIDIA GPUs. It provides a secure, scalable workflow supported within NVIDIA AI Enterprise (NVAIE), making it suitable for commercial application development.

MONAI Core provides a training framework for medical imaging that supports 3D segmentation, classification and detection. MONAI Deploy supports turning trained AI models into packaged applications for easy integration into clinical environments.

Example use cases:

  • AI-assisted intelligent medical data labelling
  • Building AI models for accelerated medical imaging analysis for specific diagnoses
NVIDIA MONAI Toolkit enterprise medical imaging AI development environment with 3D segmentation

IT Infrastructure

AI agents deployed for IT infrastructure optimise, secure, and manage modern IT and datacentre architectures. By leveraging GPUs, networking hardware, and native software, these applications reduce downtime, maximise hardware utilisation, and automate complex IT operation.

AI Aerial

NVIDIA AI Aerial is an accelerated computing platform that integrates wireless networks (5G / 6G) and AI workloads on the same IT infrastructure by implementing a virtual radio access network (vRAN). It transforms standard telecom towers and edge servers into software-defined, AI-native datacentres, enabling operators to run telecom functions and AI applications simultaneously on shared GPU.

It includes the Aerial Omniverse Digital Twin to simulate and validate complex signal propagation, tower placements, and network loads virtually before deploying physical infrastructure.

Example use cases:

  • Design of 5G / 6G mast deployments across campuses or entire cities
  • Dynamic multi-tenant GPUs handle peak telecom network demands (vRAN) during the day and shift compute power to generative AI or edge inference during off-peak hours
NVIDIA AI Aerial accelerated computing platform integrating 5G vRAN and AI workloads on shared GPU infrastructure

FLARE

NVIDIA FLARE (Federated Learning Application Runtime Environment) is an SDK designed to enable secure, privacy-preserving federated learning across multiple parties. Federated learning is a way to develop and validate more accurate and generalisable AI models from diverse data sources by mitigating the risk of compromising data security or privacy. It enables AI models to be built with a consortium of data providers without the data ever leaving the individual site.

FLARE also includes built-in tools for managing SSL certificates, user authentication, and access authorisation, for when data governance is critical.

Example use cases:

  • Seamless scaling across Docker and Kubernetes containerised environments
  • Deployment and automatic configuration of infrastructure-as-code on cloud providers such as AWS and Azure
NVIDIA FLARE federated learning SDK enabling secure privacy-preserving AI model training across distributed infrastructure

Holoscan

NVIDIA Holoscan is a multimodal, AI sensor processing platform for realtime, low-latency streaming applications at the edge and in the cloud. It provides a full-stack infrastructure — combining hardware, software, and AI libraries — to build, deploy, and manage AI-enabled pipelines for real-time streaming data applications. Holoscan employs NVIDIA Rivermax network streaming and GPUDirect RDMA technologies to bypass the CPU for faster data transfer, resulting in end-to-end latency as low as 10ms.

The Holoscan SDK accelerates deployment on NVIDIA Jetson modules for development of production-ready, industrial-grade AI-powered hardware.

Example use cases:

  • Processing of live video feeds on the same shared AI infrastructure that manages broadcast graphics, dynamic sports analytics or redundancy failover
NVIDIA Holoscan realtime AI sensor processing platform managing low-latency streaming pipelines at the edge

Morpheus

NVIDIA Morpheus is a framework for enterprise cybersecurity and IT infrastructure. It utilises unsupervised machine learning to inspect network traffic, analyse logs, and identify security threats or system anomalies in realtime, transforming cybersecurity from a reactive measure into a proactive, data-driven science. Creating a unique model for the typical behaviour of every user, service, account, and machine across your entire network, it allows the system to flag subtle, anomalous interactions the moment they happen.

Morpheus integrates directly with NVIDIA BlueField DPUs to inspect all IP traffic across the datacentre fabric instantaneously, without causing bottlenecks.

Example use cases:

  • Realtime network telemetry and digital "fingerprinting" of every user
  • Data loss protection and Spear Phish detection
  • Zero trust automation
NVIDIA Morpheus cybersecurity framework using unsupervised machine learning to detect network anomalies in realtime

Rivermax

NVIDIA Rivermax is an optimised networking SDK that enables ultra-low-latency, zero-copy data streaming directly from standard IP networks into GPU memory, bypassing the operating system kernel. It offloads the packet processing overhead to accelerate massive streaming workloads, lowering CPU utilisation while maintaining synchronisation, optimising streaming throughput across standard 10, 25, 50, and 100Gb/s networks.

Rivermax leverages NVIDIA ConnectX adapters and BlueField DPUs to manage packet-level headers and deliver precise hardware-based Precision Time Protocol (PTP) synchronisation.

Example use cases:

  • Ultra-low latency for algorithmic and high-frequency financial trading platforms
  • Powering rendering nodes for large-scale immersive environments and virtual production workflows
  • Assistance with ingesting massive real-time data sets for AI
NVIDIA Rivermax zero-copy GPU networking SDK delivering ultra-low-latency data streaming across 100Gb/s IP networks

Run:ai

NVIDIA Run:ai is a Kubernetes-native platform that orchestrates and optimises GPU resources for AI workloads. It aggregates isolated GPUs across the entire datacentre into a single, shared, elastic pool, maximising utilisation through fractional allocation. It automatically queues, prioritises, and preempts jobs to ensure high-priority AI training or inference tasks get the compute they need when they need it, whilst enforcing strict governance rules across on-premises, hybrid, or multi-cloud AI infrastructure.

Run:ai also provides IT administrators with detailed dashboards to manage quotas, track usage, and control project-level access.

Example use cases:

  • Dynamic resource pooling & allocation to maximise GPU utilisation
  • Enforcing GPU governance and quotas
  • Optimising inference tasks in conjunction with NIMs
NVIDIA Run:ai Kubernetes-native GPU orchestration platform managing shared elastic GPU pools across a datacentre

Slurm

NVIDIA Slurm (Simple Linux Utility for Resource Management) is an open source cluster management and job scheduling system for Linux. It supports granular GPU scheduling, allowing you to request specific numbers, architectures (Hopper, Blackwell), and exact configurations of GPUs, whilst coordinating shared memory, interconnect bandwidth (NVLink), and node topology to eliminate network bottlenecks during distributed AI training runs.

Slurm can effectively enforce strict fair-share queues, preemptive priority jobs, and customised workload policies, across clusters of thousands of GPUs.

Example use cases:

  • Coordination of multi-node GPU communication during massive AI model training
  • Ensuring fair allocation of GPU resource amongst teams
  • Hyperscale AI — Slurm handles AI training orchestration, while Kubernetes handles the deployment and inference serving of the resulting models
NVIDIA Slurm open source cluster management scheduling distributed GPU training jobs across thousands of nodes

TAO

NVIDIA TAO (Train, Adapt, Optimise) is a low-code framework that simplifies deep learning by abstracting complex code. It provides customisable pre-trained AI vision, vision foundation, depth estimation and multimodal models with domain-specific data, optimising them for fast edge or cloud deployment without needing deep AI expertise. It can be deployed across numerous NVIDIA platforms including Jetson, RTX PRO, MGX, HGX and DGX servers.

Once TAO finishes training and optimising a model, it exports the model in ONNX or TensorRT engine formats, so users can then deploy inference models using Triton Inference Server NIM or the DeepStream SDK.

Example use cases:

  • Adapt object detection models to monitor server health and track unauthorised access in datacentres
  • Fine-tune vision models without needing massive volumes of training data
  • Compress heavy neural networks to run on smaller Jetson compute nodes
NVIDIA TAO low-code deep learning framework training and optimising vision models for edge and cloud deployment

Robotics & Physical AI

AI agents deployed for physical AI are designed to provide a unified, end-to-end platform for building, training, and deploying AI-powered robots and autonomous vehicles.

Cosmos

NVIDIA Cosmos generates world foundation models (WFMs), using advanced video analytics, to create, simulate, and understand complex physical environments — real or synthetic — to train and validate physical AI models. While it is primarily designed as a platform for Physical AI (robotics and autonomous vehicles), its capabilities include an inbuilt 7B VLM making it ideal for analysing and organising large-scale visual data, resulting in a range of uses in industrial, safety, and logistics environments.

As Cosmos is designed for physics-based, geospatially accurate simulations, it is supported in NVIDIA Omniverse environments to render photorealistic data.

Example use cases:

  • Testing how different algorithmic versions for physical AI systems perform in virtual settings
  • Synthetic data generation to create photorealistic, physics-based environments
  • Foresight simulation generating multiple potential future outcomes based on a system's current state
NVIDIA Cosmos world foundation model simulating a physically accurate environment for physical AI training

Isaac Sim

NVIDIA Isaac Sim is a physically accurate robotics simulation platform built on the NVIDIA Omniverse platform. It allows developers to design, test, train, and deploy AI-driven robots in photorealistic virtual environments before building them in the real world. It allows the testing of robotic systems without potential real-world damage or danger; and the simulation of various sensors, including LiDAR, ultrasonic sensors, and RGB / depth cameras for model training.

Isaac Sim can also automatically generate labelled data for AI training, solving real-world data shortages; and it pairs with Isaac Lab for reinforcement learning so robots can learn how to walk, grasp, or navigate.

Example use cases:

  • Design, testing and training of AI-powered robots in a virtual world
  • Synthetic data generation to create photorealistic, perfectly labelled synthetic images of environments such as warehouses or factory floors
NVIDIA Isaac Sim physically accurate robotics simulation environment built on Omniverse

Isaac Lab

NVIDIA Isaac Lab is an open-source, modular framework for training robotic policies at scale using reinforcement and imitation learning. It provides developers with GPU-accelerated physics and high-fidelity sensor simulations to rapidly teach robots how to grasp and manipulate objects; or walk, navigate, avoid obstacles and cope with unpredictable terrain. Built on NVIDIA Isaac Sim, it leverages photorealistic rendering with highly accurate physics to make sim-to-real training seamless.

Isaac Lab also serves as the foundational robot learning framework of the NVIDIA Isaac ROS and GR00T platforms.

Example use cases:

  • Locomotion training for humanoid and quadruped robots
  • Training robotic arms and artificial hands to grasp objects, perform precise industrial assembly, and safely interact with deformable objects
  • Simulation-to-real deployment with trial-and-error cycles porting directly into physical robots
NVIDIA Isaac Lab open-source reinforcement learning framework training robotic locomotion and manipulation policies

Isaac ROS

NVIDIA Isaac ROS (Robot Operating System) is a low-level, modular perception and navigation framework built on the ROS 2 open framework. It includes ready-to-use packages for common tasks for optimal performance. It leverages NVIDIA hardware to provide advanced, real-time spatial awareness for object tracking, depth estimation, and 6D pose estimation, enabling developers to build faster, smarter, and more efficient robots by deploying models directly on embedded NVIDIA Jetson GPUs.

Although Isaac ROS acts as a general use platform for robotic development, Isaac GR00T is considered a specialised framework for whole body control in advanced humanoids.

Example use cases:

  • Creating highly accurate maps and calculating exact positions using cameras and lidar
  • Object detection and segmentation to identify obstacles, humans, or specific objects in real-time
NVIDIA Isaac ROS modular perception and navigation framework deploying real-time object detection on Jetson GPUs

Isaac GR00T

NVIDIA Isaac GR00T (Generalist Robot 00 Technology) is a comprehensive development platform and open foundation model for accelerating the creation of general-purpose humanoid robots. It integrates with Isaac Lab and Omniverse to act as the brain and development ecosystem that enables robots to process text, voice, video, and prior human demonstrations to break down complex instructions and execute them with human-like reasoning.

GR00T uses a "thinking fast and slow" approach where a slow-thinking system reasons and plans the correct actions, while a fast-thinking system translates those plans into precise, continuous, whole-body physical movements.

Example use cases:

  • Synthetic motion generation using minimal real-world human demonstrations to create large synthetic datasets
  • NLP allows humanoid robots to break down complex, multi-step instructions and execute them using common sense
  • Virtual reality teleoperation enables operators to remotely control robots in realtime within digital twins
NVIDIA Isaac GR00T humanoid robot foundation model integrating reasoning and whole-body physical control

Modulus

NVIDIA Modulus is an open-source deep learning framework for building, training, and deploying AI models that simulate the physical world. It bridges the gap between traditional physics, including fluid dynamics and structural mechanics, and AI, enabling engineers to create realtime digital twins and predictive models. Using Physics-Informed Neural Networks (PINNs), it embeds the laws of physics directly into the training of the neural network.

Modulus also takes historical or simulated data from traditional CAE tools such as Ansys, and uses AI to drastically speed up calculations.

Example use cases:

  • Acceleration of interactive simulations for digital twins using PhysicsNeMo NIM
  • Ingestion of physics-grounded datasets from real-world sensors such as satellites
  • Design of heatsinks, microchips, and electronics cooling systems where heat distribution modelling is key
NVIDIA Modulus Physics-Informed Neural Network simulating fluid dynamics for a digital twin

Omniverse

NVIDIA Omniverse is a cloud computing platform that enhances 3D visualisation and development workflows, providing a collaborative space accessible by multiple users. It obeys real-world physics, ensuring simulations are as accurate as possible, guaranteeing their integrity when transferred to the real world. It utilises a common USD (Universal Scene Description) format, making collaboration across multiple third-party applications powerful and yet straightforward.

Omniverse is a key part of many AI workflows, offering digital twin environments for safe simulation and learning, prior to real world deployment.

Example use cases:

  • Robotic and autonomous vehicle simulations with physically accurate worlds
  • Design and modelling of industrial facilities or entire smart cities
  • Synthetic data generation
NVIDIA Omniverse collaborative 3D digital twin environment

Smart Cities

AI agents deployed for smart cities including computing engines and developer frameworks focusing on video analytics and digital twins. Through edge AI and simulation, municipalities can automate traffic flow, optimise energy consumption, and enhance public safety.

AI Aerial

NVIDIA AI Aerial is an accelerated computing platform that integrates wireless networks (5G / 6G) and AI workloads on the same IT infrastructure by implementing a virtual radio access network (vRAN). It transforms standard telecom towers and edge servers into software-defined, AI-native datacentres, allowing operators to run telecom functions and AI applications simultaneously on shared GPU.

It includes the Aerial Omniverse Digital Twin to simulate and validate complex signal propagation, tower placements, and network loads virtually before deploying physical infrastructure.

Example use cases:

  • Design of 5G / 6G mast deployments across campuses or entire cities
  • Dynamic multi-tenant GPUs handle peak telecom network demands (vRAN) during the day and shift compute power to generative AI or edge inference during off-peak hours
NVIDIA AI Aerial accelerated computing platform integrating 5G vRAN and AI workloads on shared GPU infrastructure

Cosmos

NVIDIA Cosmos generates world foundation models (WFMs), using advanced video analytics, to create, simulate, and understand complex physical environments — real or synthetic — to train and validate physical AI models. While it is primarily designed as a platform for Physical AI (robotics and autonomous vehicles), its capabilities include an inbuilt 7B VLM making it ideal for analysing and organising large-scale visual data, resulting in a range of uses in industrial, safety, and logistics environments.

As Cosmos is designed for physics-based, geospatially accurate simulations, it is supported in NVIDIA Omniverse environments to render photorealistic data.

Example use cases:

  • Testing how different algorithmic versions for physical AI systems perform in virtual settings
  • Synthetic data generation to create photorealistic, physics-based environments
  • Foresight simulation generating multiple potential future outcomes based on a system's current state
NVIDIA Cosmos world foundation model simulating a physically accurate environment for smart city planning

cuOpt

NVIDIA cuOpt is an open-source, GPU-accelerated engine designed for decision optimisation and complex mathematical problem-solving. It enables organisations to calculate highly complex operational plans such as vehicle routing, supply chain logistics, and job scheduling, in near real-time. This allows businesses to solve large-scale problems and adapt to real-time supply chain disruptions in seconds.

cuOpt integrates with Omniverse, RAPIDS and the Isaac family of physical AI applications to provide seamless and visual planning.

Example use cases:

  • Fleet optimisation by calculating the most efficient routes to reduce mileage, fuel consumption, and carbon emissions
  • Job scheduling to maximise resources by optimally aligning tasks across workers, machines and shifts
  • Balancing risk and return by optimising capital allocation across complex financial assets
NVIDIA cuOpt GPU-accelerated optimisation engine calculating efficient vehicle routing and logistics plans in real-time

DRIVE

NVIDIA DRIVE is an end-to-end AI platform for autonomous vehicles, advanced driver-assistance systems (ADAS), and digital cockpits. It combines hardware, software, and cloud infrastructure to enable cars to perceive their environment and drive safely without human intervention. It is made up of DRIVE AV, the software responsible for autonomous driving; and DRIVE IX, that manages the in-cabin experience, including driver monitoring, dashboard displays, and voice recognition.

DRIVE integrates with Omniverse and Cosmos to generate real-world simulations for different driving conditions and is deployed within vehicles using dedicated Jetson DRIVE AGX modules.

Example use cases:

  • Creation of autonomous vehicles and intelligent cockpits
  • Generative AI assistants that perceive, reason, and provide realtime assistance
  • Development of active safety and parking functions
NVIDIA DRIVE end-to-end AI platform powering autonomous vehicles and intelligent ADAS systems

Metropolis

NVIDIA Metropolis is an application framework that brings visual AI to physical spaces. It simplifies the creation, deployment, and scaling of intelligent video analytics and computer vision agents — transforming raw video and sensor data from cameras into actionable, real-time insights, and provides the necessary tools for businesses and governments to build smart infrastructure across various sectors.

Metropolis is often deployed with Omniverse for advanced visualisation and colouration, and streamlined using the DeepStream SDK and NIMs.

Example use cases:

  • Optimising traffic flow or public transport, detecting accidents, and enhancing public safety
  • Monitoring assembly lines for defects, tracking inventory and ensuring workplace PPE compliance
  • Analysing customer foot traffic, generating heat maps and automating checkout experiences
NVIDIA Metropolis visual AI framework transforming camera feeds into real-time smart city insights

Omniverse

NVIDIA Omniverse is a cloud computing platform that enhances 3D visualisation and development workflows, providing a collaborative space accessible by multiple users. It obeys real-world physics, ensuring simulations are as accurate as possible, guaranteeing their integrity when transferred to the real world. It utilises a common USD (Universal Scene Description) format, making collaboration across multiple third-party applications powerful and yet straightforward.

Omniverse is a key part of many AI workflows, offering digital twin environments for safe simulation and learning, prior to real world deployment.

Example use cases:

  • Robotic and autonomous vehicle simulations with physically accurate worlds
  • Design and modelling of industrial facilities or entire smart cities
  • Synthetic data generation
NVIDIA Omniverse collaborative 3D digital twin environment

Multiple frameworks, libraries and SDKs may be required to tailor and fine-tune your final AI application. Scan and our consultancy partner MSBC Group can identify the most appropriate technology from open and closed sources and work with your existing infrastructure to minimise additional costs. We help design, build, test and validate your application and then scale it into production, to deliver the maximum ROI for your business.

Contact Us