Software for Embedded Vision

R²D²: Boost Robot Training with World Foundation Models and Workflows from NVIDIA Research
This blog post was originally published at NVIDIA’s website. It is reprinted here with the permission of NVIDIA. As physical AI systems advance, the demand for richly labeled datasets is accelerating beyond what we can manually capture in the real world. World foundation models (WFMs), which are generative AI models trained to simulate, predict, and

“LLMs and VLMs for Regulatory Compliance, Quality Control and Safety Applications,” a Presentation from Camio
Lazar Trifunovic, Solutions Architect at Camio, presents the “LLMs and VLMs for Regulatory Compliance, Quality Control and Safety Applications” tutorial at the May 2025 Embedded Vision Summit. By using vision-language models (VLMs) or combining large language models (LLMs) with conventional computer vision models, we can create vision systems that are…
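
To make the idea concrete, here is a minimal sketch of a VLM-based compliance check using a generic visual-question-answering model from Hugging Face; the model name, image, and questions are illustrative rather than taken from the presentation.

```python
# Minimal sketch: an off-the-shelf VQA model as a compliance checker.
# Model choice, image path, and questions are illustrative assumptions.
from transformers import pipeline
from PIL import Image

vqa = pipeline("visual-question-answering",
               model="dandelin/vilt-b32-finetuned-vqa")

image = Image.open("loading_dock.jpg")  # hypothetical camera frame
checks = [
    "Is the person wearing a hard hat?",
    "Is the person wearing a safety vest?",
    "Is the walkway blocked?",
]

for question in checks:
    result = vqa(image=image, question=question)[0]
    print(f"{question} -> {result['answer']} (score {result['score']:.2f})")
```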

NVIDIA Opens Portals to World of Robotics With New Omniverse Libraries, Cosmos Physical AI Models and AI Computing Infrastructure
New NVIDIA Omniverse NuRec 3D Gaussian Splatting Libraries Enable Large-Scale World Reconstruction
New NVIDIA Cosmos Models Enable World Generation and Spatial Reasoning
New NVIDIA RTX PRO Blackwell Servers and NVIDIA DGX Cloud Let Developers Run the Most Demanding Simulations Anywhere
Physical AI Leaders Amazon Devices & Services, Boston Dynamics, Figure AI and Hexagon Embrace Simulation and Synthetic Data Generation
August 11, 2025—SIGGRAPH—NVIDIA

“Simplifying Portable Computer Vision with OpenVX 2.0,” a Presentation from AMD
Kiriti Nagesh Gowda, Staff Engineer at AMD, presents the “Simplifying Portable Computer Vision with OpenVX 2.0” tutorial at the May 2025 Embedded Vision Summit. The Khronos OpenVX API offers a set of optimized primitives for low-level image processing, computer vision and neural network operators. It provides a simple method for…

“Quantization Techniques for Efficient Deployment of Large Language Models: A Comprehensive Review,” a Presentation from AMD
Dwith Chenna, MTS Product Engineer for AI Inference at AMD, presents the “Quantization Techniques for Efficient Deployment of Large Language Models: A Comprehensive Review” tutorial at the May 2025 Embedded Vision Summit. The deployment of large language models (LLMs) in resource-constrained environments is challenging due to the significant computational and…
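
As a taste of the techniques such a review covers, here is a minimal sketch of symmetric per-tensor int8 weight quantization (absmax scaling) in PyTorch; the matrix size is a stand-in, not a specific LLM.

```python
# Illustrative symmetric per-tensor int8 weight quantization (absmax scaling).
import torch

def quantize_int8(w: torch.Tensor):
    scale = w.abs().max() / 127.0          # map the largest magnitude to 127
    q = torch.clamp((w / scale).round(), -128, 127).to(torch.int8)
    return q, scale

def dequantize(q: torch.Tensor, scale: torch.Tensor):
    return q.to(torch.float32) * scale

w = torch.randn(4096, 4096)               # stand-in for an LLM weight matrix
q, scale = quantize_int8(w)
err = (w - dequantize(q, scale)).abs().mean()
print(f"mean absolute quantization error: {err:.6f}")
```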

Learn to Optimize Stable Diffusion on Qualcomm Cloud AI 100
This blog post was originally published at Qualcomm’s website. It is reprinted here with the permission of Qualcomm. Dive in to learn how we achieve a 1.4x latency decrease on Qualcomm Cloud AI 100 Ultra accelerators by applying an innovative DeepCache technique to text-to-image generation. What’s more, the throughput can be further improved by 3x
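
The technique is also available as an open-source helper for the Hugging Face diffusers library; the sketch below assumes that interface and a generic Stable Diffusion checkpoint, and is separate from the Qualcomm Cloud AI 100 integration the post describes.

```python
# Sketch of DeepCache-style acceleration using the open-source DeepCache
# helper (https://github.com/horseee/DeepCache); interface and parameters
# here are illustrative, not Qualcomm's Cloud AI 100 implementation.
import torch
from diffusers import StableDiffusionPipeline
from DeepCache import DeepCacheSDHelper

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

helper = DeepCacheSDHelper(pipe=pipe)
helper.set_params(cache_interval=3,   # reuse cached deep U-Net features for 3 steps
                  cache_branch_id=0)
helper.enable()

image = pipe("a photo of a robot reading a book").images[0]
image.save("robot.png")
```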

Texas Instruments Demonstration of Edge AI Inference and Video Streaming Over Wi-Fi
The demonstration shows how to use Texas Instruments’ AM6xA to capture live video, perform machine learning, and stream video over Wi-Fi. The video is encoded with H.264/H.265, and streamed via UDP over Wi-Fi using the CC33xx. At the receiver side, the video is decoded and displayed on a screen. The receiver side could be a
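
As a generic sketch of the receive side, the snippet below builds an H.264-over-RTP/UDP decode-and-display pipeline with GStreamer's Python bindings; the port and element choices are illustrative, not TI's exact demo stack.

```python
# Generic H.264-over-UDP receive pipeline using GStreamer's Python bindings;
# port number and elements are illustrative assumptions.
import gi
gi.require_version("Gst", "1.0")
from gi.repository import Gst, GLib

Gst.init(None)
pipeline = Gst.parse_launch(
    'udpsrc port=5000 caps="application/x-rtp,media=video,'
    'encoding-name=H264,payload=96" ! rtph264depay ! h264parse '
    "! avdec_h264 ! videoconvert ! autovideosink"
)
pipeline.set_state(Gst.State.PLAYING)
GLib.MainLoop().run()   # decode and display until interrupted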

“Introduction to Data Types for AI: Trade-offs and Trends,” a Presentation from Synopsys
Joep Boonstra, Synopsys Scientist at Synopsys, presents the “Introduction to Data Types for AI: Trade-offs and Trends” tutorial at the May 2025 Embedded Vision Summit. The increasing complexity of AI models has led to a growing need for efficient data storage and processing. One critical way to gain efficiency is…
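
One concrete trade-off in this space: bfloat16 keeps float32's exponent range but carries fewer mantissa bits than float16, which is more precise but overflows sooner. A quick PyTorch check with illustrative values:

```python
# Illustrative AI data-type trade-off: range vs. precision.
import torch

x = torch.tensor([0.1, 1.0, 70000.0])

for dtype in (torch.float16, torch.bfloat16):
    print(dtype, x.to(dtype).tolist())

# float16:  [0.0999755859375, 1.0, inf]    -- 70000 exceeds fp16's ~65504 max
# bfloat16: [0.10009765625, 1.0, 70144.0]  -- wider range, coarser precision
```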

Machine Vision Defect Detection: Edge AI Processing with Texas Instruments AM6xA Arm-based Processors
Texas Instruments’ portfolio of AM6xA Arm-based processors is designed to advance intelligence at the edge using high-resolution camera support, an integrated image signal processor and a deep learning accelerator. This video demonstrates using the AM62A to run a vision-based artificial intelligence model for defect detection in manufacturing applications. Watch the model test the produced units as
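
As a generic sketch of this kind of workload, the snippet below runs a defect classifier with the TensorFlow Lite runtime; the model file and labels are hypothetical, not TI's demo assets.

```python
# Generic on-device defect classification with a TFLite model; the model
# path and label set are hypothetical placeholders.
import numpy as np
from tflite_runtime.interpreter import Interpreter

interpreter = Interpreter(model_path="defect_classifier.tflite")
interpreter.allocate_tensors()
inp = interpreter.get_input_details()[0]
out = interpreter.get_output_details()[0]

frame = np.zeros(inp["shape"], dtype=inp["dtype"])  # stand-in for a camera frame
interpreter.set_tensor(inp["index"], frame)
interpreter.invoke()
scores = interpreter.get_tensor(out["index"])[0]

labels = ["good", "defective"]
print(labels[int(np.argmax(scores))], scores)
```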

“Introduction to Radar and Its Use for Machine Perception,” a Presentation from Cadence
Amol Borkar, Product Marketing Director, and Vencatesh Subramanian, Design Engineering Architect, both of Cadence, co-present the “Introduction to Radar and Its Use for Machine Perception” tutorial at the May 2025 Embedded Vision Summit. Radar is a proven technology with a long history in various market segments and continues to play an increasingly important role in

Optimizing LLMs for Performance and Accuracy with Post-training Quantization
This article was originally published at NVIDIA’s website. It is reprinted here with the permission of NVIDIA. Quantization is a core tool for developers aiming to improve inference performance with minimal overhead. It delivers significant gains in latency, throughput, and memory efficiency by reducing model precision in a controlled way—without requiring retraining. Today, most models
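
A minimal sketch of the calibration step at the heart of post-training quantization, assuming a toy PyTorch model: run a few sample batches, record each layer's activation range, then derive int8 scales from it, with no retraining involved.

```python
# Illustrative PTQ calibration: record activation ranges on sample data,
# then compute per-layer int8 scales. Model and data are toy stand-ins.
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(512, 512), nn.ReLU(), nn.Linear(512, 512))
model.eval()

amax = {}
def record(name):
    def hook(_, __, output):
        peak = output.abs().max().item()
        amax[name] = max(amax.get(name, 0.0), peak)
    return hook

for name, module in model.named_modules():
    if isinstance(module, nn.Linear):
        module.register_forward_hook(record(name))

with torch.no_grad():                      # calibration pass
    for _ in range(8):
        model(torch.randn(32, 512))

scales = {name: peak / 127.0 for name, peak in amax.items()}
print(scales)                              # per-layer int8 activation scales
```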

Alif Semiconductor Demonstration of Face Detection and Driver Monitoring On a Battery, at the Edge
Alexandra Kazerounian, Senior Product Marketing Manager at Alif Semiconductor, demonstrates the company’s latest edge AI and vision technologies and products at the 2025 Embedded Vision Summit. Specifically, Kazerounian demonstrates how AI/ML workloads can run directly on her company’s ultra-low-power Ensemble and Balletto 32-bit microcontrollers. Watch as the AI/ML AppKit runs real-time face detection using an

Nota AI Demonstration of Nota Vision Agent, Next-generation Video Monitoring at the Edge
Tae-Ho Kim, CTO and Co-founder of Nota AI, demonstrates the company’s latest edge AI and vision technologies and products at the 2025 Embedded Vision Summit. Specifically, Kim demonstrates Nota Vision Agent—a next-generation video monitoring solution powered by Vision Language Models (VLMs). The solution delivers real-time analytics and intelligent alerts across critical domains including industrial safety,

SiMa.ai Expands Strategic Collaboration with Synopsys to Accelerate Automotive AI Innovation
Transforming ADAS and In-Vehicle Infotainment Breakthroughs with Innovative ML IP, Chiplets, and System-on-Chip Reference Architectures
SAN JOSE, Calif., July 30, 2025 /PRNewswire/ — SiMa.ai, a pioneer in ultra-efficient machine learning system-on-chip (MLSoC) platforms, today announced the next phase of its strategic collaboration with Synopsys, the leading provider of engineering solutions from silicon to systems, to

Nota AI Demonstration of NetsPresso Optimization Studio, Streamlined with Visual Insights
Tairen Piao, Research Engineer at Nota AI, demonstrates the company’s latest edge AI and vision technologies and products at the 2025 Embedded Vision Summit. Specifically, Piao demonstrates NetsPresso Optimization Studio, the latest enhancement to Nota AI’s model optimization platform, NetsPresso. This intuitive interface simplifies the AI optimization process with advanced layer-wise analysis and automated quantization.
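
For intuition on what layer-wise analysis can reveal, here is an illustrative PyTorch check, not NetsPresso's implementation, of how much int8 rounding perturbs each layer's weights:

```python
# Illustrative layer-wise sensitivity check: relative weight error
# introduced by int8 rounding, reported per layer. Toy model only.
import torch
import torch.nn as nn

model = nn.Sequential(nn.Conv2d(3, 16, 3), nn.ReLU(), nn.Conv2d(16, 32, 3))

for name, module in model.named_modules():
    if getattr(module, "weight", None) is not None:
        w = module.weight.detach()
        scale = w.abs().max() / 127.0
        w_q = (w / scale).round().clamp(-128, 127) * scale
        rel_err = (w - w_q).norm() / w.norm()
        print(f"{name}: relative weight error {rel_err:.4%}")
```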

Renesas Introduces 64-bit RZ/G3E MPU for High-performance HMI Systems Requiring AI Acceleration and Edge Computing
MPU Integrates a Quad-Core CPU, an NPU, High-Speed Connectivity and Advanced Graphics to Power Next-Generation HMI Devices with Full HD Display
TOKYO, Japan, July 29, 2025 ― Renesas Electronics Corporation (TSE:6723), a premier supplier of advanced semiconductor solutions, today announced the launch of its new 64-bit RZ/G3E microprocessor (MPU), a general-purpose device optimized for high-performance Human Machine

How to Run Coding Assistants for Free on RTX AI PCs and Workstations
This blog post was originally published at NVIDIA’s website. It is reprinted here with the permission of NVIDIA. AI-powered copilots deliver real-time assistance for everything from academic projects to production code — and are optimized for RTX AI PCs. Coding assistants or copilots — AI-powered assistants that can suggest, explain and debug code — are
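
As one generic way to wire up a local assistant, the sketch below queries a locally hosted code model through Ollama's REST API; the model name is illustrative, and the NVIDIA post covers its own set of tools.

```python
# Generic sketch: query a locally hosted code model via Ollama's REST API.
# Assumes Ollama is running locally; the model name is illustrative.
import json
import urllib.request

payload = {
    "model": "codellama",
    "prompt": "Write a Python function that reverses a linked list.",
    "stream": False,
}
req = urllib.request.Request(
    "http://localhost:11434/api/generate",
    data=json.dumps(payload).encode(),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    print(json.loads(resp.read())["response"])
```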

Microchip Technology Demonstration of Real-time Object and Facial Recognition with Edge AI Platforms
Swapna Guramani, Applications Engineer for Microchip Technology, demonstrates the company’s latest edge AI and vision technologies and products at the 2025 Embedded Vision Summit. Specifically, Guramani demonstrates her company’s latest AI/ML capabilities in action: real-time object recognition using the SAMA7G54 32-bit MPU running Edge Impulse’s FOMO model, and facial recognition powered by TensorFlow Lite’s Mobile