Software for Embedded Vision

“Depth Estimation from Monocular Images Using Geometric Foundation Models,” a Presentation from Toyota Research Institute
Rareș Ambruș, Senior Manager for Large Behavior Models at Toyota Research Institute, presents the “Depth Estimation from Monocular Images Using Geometric Foundation Models” tutorial at the May 2025 Embedded Vision Summit. In this presentation, Ambruș looks at recent advances in depth estimation from images. He first focuses on the ability… “Depth Estimation from Monocular Images

Open-source Physics Engine and OpenUSD Advance Robot Learning
This blog post was originally published at NVIDIA’s website. It is reprinted here with the permission of NVIDIA. The Newton physics engine and enhanced NVIDIA Isaac GR00T models enable developers to accelerate robot learning through unified OpenUSD simulation workflows. Editor’s note: This blog is a part of Into the Omniverse, a series focused on how

“Introduction to DNN Training: Fundamentals, Process and Best Practices,” a Presentation from Think Circuits
Kevin Weekly, CEO of Think Circuits, presents the “Introduction to DNN Training: Fundamentals, Process and Best Practices” tutorial at the May 2025 Embedded Vision Summit. Training a model is a crucial step in machine learning, but it can be overwhelming for beginners. In this talk, Weekly provides a comprehensive introduction… “Introduction to DNN Training: Fundamentals,

Upcoming Workshop Explores Deploying PyTorch on the Edge
On October 21, 2022 at noon PT, Alliance Member company Qualcomm, along with Amazon, will deliver the free (advance registration required) half-day in-person workshop “PyTorch on the Edge: Amazon SageMaker x Qualcomm® AI Hub” at the InterContinental Hotel in San Francisco, California. From the event page: Seamless cloud-to-edge AI development experience: Train on AWS, deploy

D3 Embedded, HTEC, Texas Instruments and Tobii Pioneer the Integration of Single-camera and Radar Interior Sensor Fusion for In-cabin Sensing
The companies joined forces to develop sensor fusion based interior sensing for enhanced vehicle safety, launching at the InCabin Europe conference on October 7-9. Rochester, NY – October 6, 2025 – Tobii, with its automotive interior sensing branch Tobii Autosense, together with D3 Embedded, and HTEC today announced the development of an interior sensing solution

Breaking Free from the CUDA Lock-in
This blog post was originally published at SiMa.ai’s website. It is reprinted here with the permission of SiMa.ai. The AI hardware landscape is dominated by one uncomfortable truth: most teams feel trapped by CUDA. You trained your models on NVIDIA GPUs, deployed them with TensorRT, and now the thought of switching hardware feels like rewriting

Upcoming Seminar Explores the Latest Innovations in Mobile Robotics
On October 22, 2022 at 9:00 am PT, Alliance Member company NXP Semiconductors, along with Avnet, will deliver a free (advance registration required) half-day in-person robotics seminar at NXP’s office in San Jose, California. From the event page: Join us for a free in-depth seminar exploring the latest innovations in mobile robotics with a focus

Free Webinar Explores Edge AI-enabled Microcontroller Capabilities and Trends
On November 18, 2025 at 9 am PT (noon ET), the Yole Group’s Tom Hackenberg, principal analyst for computing, will present the free hour webinar “How AI-enabled Microcontrollers Are Expanding Edge AI Opportunities,” organized by the Edge AI and Vision Alliance. Here’s the description, from the event registration page: Running AI inference at the edge,

“Lessons Learned Building and Deploying a Weed-killing Robot,” a Presentation from Tensorfield Agriculture
Xiong Chang, CEO and Co-founder of Tensorfield Agriculture, presents the “Lessons Learned Building and Deploying a Weed-Killing Robot” tutorial at the May 2025 Embedded Vision Summit. Agriculture today faces chronic labor shortages and growing challenges around herbicide resistance, as well as consumer backlash to chemical inputs. Smarter, more sustainable approaches… “Lessons Learned Building and Deploying

Using the Qualcomm AI Inference Suite from Google Colab
This blog post was originally published at Qualcomm’s website. It is reprinted here with the permission of Qualcomm. Building off of the blog post here, which shows how easy it is to call the Cirrascale AI Inference Cloud using the Qualcomm AI Inference Suite, we’ll use Google Colab to show the same scenario. In the previous blog

“Transformer Networks: How They Work and Why They Matter,” a Presentation from Synthpop AI
Rakshit Agrawal, Principal AI Scientist at Synthpop AI, presents the “Transformer Networks: How They Work and Why They Matter” tutorial at the May 2025 Embedded Vision Summit. Transformer neural networks have revolutionized artificial intelligence by introducing an architecture built around self-attention mechanisms. This has enabled unprecedented advances in understanding sequential… “Transformer Networks: How They Work

Deploy High-performance AI Models in Windows Applications on NVIDIA RTX AI PCs
This blog post was originally published at NVIDIA’s website. It is reprinted here with the permission of NVIDIA. Microsoft is now making Windows ML available to developers. Windows ML enables C#, C++ and Python developers to optimally run AI models locally across PC hardware from CPU, NPU and GPUs. On NVIDIA RTX GPUs, it utilizes

“Understanding Human Activity from Visual Data,” a Presentation from Sportlogiq
Mehrsan Javan, Chief Technology Officer at Sportlogiq, presents the “Understanding Human Activity from Visual Data” tutorial at the May 2025 Embedded Vision Summit. Activity detection and recognition are crucial tasks in various industries, including surveillance and sports analytics. In this talk, Javan provides an in-depth exploration of human activity understanding,… “Understanding Human Activity from Visual

“Multimodal Enterprise-scale Applications in the Generative AI Era,” a Presentation from Skyworks Solutions
Mumtaz Vauhkonen, Senior Director of AI at Skyworks Solutions, presents the “Multimodal Enterprise-scale Applications in the Generative AI Era” tutorial at the May 2025 Embedded Vision Summit. As artificial intelligence is making rapid strides in use of large language models, the need for multimodality arises in multiple application scenarios. Similar… “Multimodal Enterprise-scale Applications in the

How CLIKA’s Automated Hardware-aware AI Compression Toolkit Efficiently Enables Scalable Deployment of AI on Any Target Hardware
This blog post was originally published at CLIKA’s website. It is reprinted here with the permission of CLIKA. Organisations are making a shift from experimental spending on AI to long-term investments in this new technology but there are challenges involved. Here’s how CLIKA can help. With democratising AI and greater access to open-source AI models,

Using the Qualcomm AI Inference Suite Directly from a Web Page
This blog post was originally published at Qualcomm’s website. It is reprinted here with the permission of Qualcomm. Applying the Qualcomm AI Inference Suite directly from a web page using JavaScript makes it easy to create and understand how AI inference works in in web solutions. Qualcomm Technologies in collaboration with Cirrascale has a free-to-try

“Real-world Deployment of Mobile Material Handling Robotics in the Supply Chain,” a Presentation from Pickle Robot Company
Peter Santos, Chief Operating Officer of Pickle Robot Company, presents the “Real-World Deployment of Mobile Material Handling Robotics in the Supply Chain” tutorial at the May 2025 Embedded Vision Summit. More and more of the supply chain needs to be, and can be, automated. Demographics, particularly in the developed world,… “Real-world Deployment of Mobile Material

“Developing a GStreamer-based Custom Camera System for Long-range Biometric Data Collection,” a Presentation from Oak Ridge National Laboratory
Gavin Jager, Researcher and Lab Space Manager at Oak Ridge National Laboratory, presents the “Developing a GStreamer-based Custom Camera System for Long-range Biometric Data Collection” tutorial at the May 2025 Embedded Vision Summit. In this presentation, Jager describes Oak Ridge National Laboratory’s work developing software for a custom camera system… “Developing a GStreamer-based Custom Camera
