Software for Embedded Vision

“Understanding Human Activity from Visual Data,” a Presentation from Sportlogiq
Mehrsan Javan, Chief Technology Officer at Sportlogiq, presents the “Understanding Human Activity from Visual Data” tutorial at the May 2025 Embedded Vision Summit. Activity detection and recognition are crucial tasks in various industries, including surveillance and sports analytics. In this talk, Javan provides an in-depth exploration of human activity understanding,… “Understanding Human Activity from Visual

“Multimodal Enterprise-scale Applications in the Generative AI Era,” a Presentation from Skyworks Solutions
Mumtaz Vauhkonen, Senior Director of AI at Skyworks Solutions, presents the “Multimodal Enterprise-scale Applications in the Generative AI Era” tutorial at the May 2025 Embedded Vision Summit. As artificial intelligence is making rapid strides in use of large language models, the need for multimodality arises in multiple application scenarios. Similar… “Multimodal Enterprise-scale Applications in the

How CLIKA’s Automated Hardware-aware AI Compression Toolkit Efficiently Enables Scalable Deployment of AI on Any Target Hardware
This blog post was originally published at CLIKA’s website. It is reprinted here with the permission of CLIKA. Organisations are making a shift from experimental spending on AI to long-term investments in this new technology but there are challenges involved. Here’s how CLIKA can help. With democratising AI and greater access to open-source AI models,

Using the Qualcomm AI Inference Suite Directly from a Web Page
This blog post was originally published at Qualcomm’s website. It is reprinted here with the permission of Qualcomm. Applying the Qualcomm AI Inference Suite directly from a web page using JavaScript makes it easy to create and understand how AI inference works in in web solutions. Qualcomm Technologies in collaboration with Cirrascale has a free-to-try

“Real-world Deployment of Mobile Material Handling Robotics in the Supply Chain,” a Presentation from Pickle Robot Company
Peter Santos, Chief Operating Officer of Pickle Robot Company, presents the “Real-World Deployment of Mobile Material Handling Robotics in the Supply Chain” tutorial at the May 2025 Embedded Vision Summit. More and more of the supply chain needs to be, and can be, automated. Demographics, particularly in the developed world,… “Real-world Deployment of Mobile Material

“Developing a GStreamer-based Custom Camera System for Long-range Biometric Data Collection,” a Presentation from Oak Ridge National Laboratory
Gavin Jager, Researcher and Lab Space Manager at Oak Ridge National Laboratory, presents the “Developing a GStreamer-based Custom Camera System for Long-range Biometric Data Collection” tutorial at the May 2025 Embedded Vision Summit. In this presentation, Jager describes Oak Ridge National Laboratory’s work developing software for a custom camera system… “Developing a GStreamer-based Custom Camera

“Sensors and Compute Needs and Challenges for Humanoid Robots,” a Presentation from Agility Robotics
Vlad Branzoi, Perception Sensors Team Lead at Agility Robotics, presents the “Sensors and Compute Needs and Challenges for Humanoid Robots” tutorial at the September 2025 Edge AI and Vision Innovation Forum.

How to Integrate Computer Vision Pipelines with Generative AI and Reasoning
This blog post was originally published at NVIDIA’s website. It is reprinted here with the permission of NVIDIA. Generative AI is opening new possibilities for analyzing existing video streams. Video analytics are evolving from counting objects to turning raw video content footage into real-time understanding. This enables more actionable insights. The NVIDIA AI Blueprint for

“Scaling Artificial Intelligence and Computer Vision for Conservation,” a Presentation from The Nature Conservancy
Matt Merrifield, Chief Technology Officer at The Nature Conservancy, presents the “Scaling Artificial Intelligence and Computer Vision for Conservation” tutorial at the May 2025 Embedded Vision Summit. In this presentation, Merrifield explains how the world’s largest environmental nonprofit is spearheading projects to scale the use of edge AI and vision… “Scaling Artificial Intelligence and Computer

“A Lightweight Camera Stack for Edge AI,” a Presentation from Meta
Jui Garagate, Camera Software Engineer, and Karthick Kumaran, Staff Software Engineer, both of Meta, co-present the “Lightweight Camera Stack for Edge AI” tutorial at the May 2025 Embedded Vision Summit. Electronic products for virtual and augmented reality, home robots and cars deploy multiple cameras for computer vision and AI use… “A Lightweight Camera Stack for

“Unlocking Visual Intelligence: Advanced Prompt Engineering for Vision-language Models,” a Presentation from LinkedIn Learning
Alina Li Zhang, Senior Data Scientist and Tech Writer at LinkedIn Learning, presents the “Unlocking Visual Intelligence: Advanced Prompt Engineering for Vision-language Models” tutorial at the May 2025 Embedded Vision Summit. Imagine a world where AI systems automatically detect thefts in grocery stores, ensure construction site safety and identify patient… “Unlocking Visual Intelligence: Advanced Prompt

The Edge’s Essential Role in the Future of AI
This blog post was originally published at Qualcomm’s website. It is reprinted here with the permission of Qualcomm. What you should know: The future of AI will be hybrid, with the cloud and the edge working together — each playing a vital role. The user interface (UI) is now human-centric — your device understands your

“Deploying Accelerated ML and AI: The Role of Khronos Open Standards,” a Presentation from the Khronos Group
Neil Trevett, President of the Khronos Group and Vice President of Developer Ecosystems at NVIDIA, presents the “Deploying Accelerated ML and AI: The Role of Khronos Open Standards” tutorial at the May 2025 Embedded Vision Summit. Accelerating machine learning and AI workloads often requires specialized hardware, but managing compatibility across… “Deploying Accelerated ML and AI:

“Scaling Computer Vision at the Edge,” a Presentation from Invisible AI
Eric Danziger, CEO of Invisible AI, presents the “Scaling Computer Vision at the Edge” tutorial at the May 2025 Embedded Vision Summit. In this presentation, Danziger introduces a comprehensive framework for scaling computer vision systems across three critical dimensions: capability evolution, infrastructure decisions and deployment scaling. Today’s leading-edge vision systems… “Scaling Computer Vision at the

How Do You Teach an AI Model to Reason? With Humans
This blog post was originally published at NVIDIA’s website. It is reprinted here with the permission of NVIDIA. NVIDIA’s data factory team creates the foundation for AI models like Cosmos Reason, which today topped the physical reasoning leaderboard on Hugging Face. AI models are advancing at a rapid rate and scale. But what might they

“Scaling Machine Learning with Containers: Lessons Learned,” a Presentation from Instrumental
Rustem Feyzkhanov, Machine Learning Engineer at Instrumental, presents the “Scaling Machine Learning with Containers: Lessons Learned” tutorial at the May 2025 Embedded Vision Summit. In the dynamic world of machine learning, efficiently scaling solutions from research to production is crucial. In this presentation, Feyzkhanov explores the nuances of scaling machine… “Scaling Machine Learning with Containers:

PerCV.ai: How a Vision AI Platform and the STM32N6 can Turn Around an 80% Failure Rate for AI Projects
This blog post was originally published at STMicroelectronics’ website. It is reprinted here with the permission of STMicroelectronics. The vision AI platform PerCV.ai (pronounced Perceive AI), could be the secret weapon that enables a company to deploy an AI application when so many others fail. The solution from Irida Labs, a member of the ST

“Vision-language Models on the Edge,” a Presentation from Hugging Face
Cyril Zakka, Health Lead at Hugging Face, presents the “Vision-language Models on the Edge” tutorial at the May 2025 Embedded Vision Summit. In this Zakka, we provides an overview of vision-language models (VLMs) and their deployment on edge devices using Hugging Face’s recently released SmolVLM as an example. He examines… “Vision-language Models on the Edge,”