Object Identification Functions

Qualcomm Dragonwing Intelligent Video Suite Modernizes Video Management with Generative AI at Its Core
This blog post was originally published at Qualcomm’s website. It is reprinted here with the permission of Qualcomm. Video cameras generate a lot of data. Companies that use a video management system (VMS) are left wanting to get more value out of all the video data they generate, enabling them to take the actions that

Visual Intelligence: Foundation Models + Satellite Analytics for Deforestation (Part 2)
This blog post was originally published at Tenyks’ website. It is reprinted here with the permission of Tenyks. In Part 2, we explore how Foundation Models can be leveraged to track deforestation patterns. Building upon the insights from our Sentinel-2 pipeline and Central Balkan case study, we dive into the revolution that foundation models have

A New Standard in Facial Recognition Security: Multispectral Imaging Technology
This blog post was originally published at Namuga Vision Connectivity’s website. It is reprinted here with the permission of Namuga Vision Connectivity. Traditional facial recognition technology has greatly enhanced both convenience and security. However, it still struggles to differentiate between real faces and sophisticated forgeries like silicone masks, printed photos and 3D models. Enter MultiSpectral

Visual Intelligence: Foundation Models + Satellite Analytics for Deforestation (Part 1)
This blog post was originally published at Tenyks’ website. It is reprinted here with the permission of Tenyks. Satellite imagery has revolutionized how we monitor Earth’s forests, offering unprecedented insights into deforestation patterns. In this two-part series, we explore both traditional and cutting-edge approaches to forest monitoring, using Bulgaria’s Central Balkan National Park as our

Andes Technology Demonstration of Its RISC-V IP in a Spherical Image Processor and Meta’s AI Accelerator
Marc Evans, Director of Business Development and Marketing at Andes Technology, demonstrates the company’s latest edge AI and vision technologies and products at the March 2025 Edge AI and Vision Alliance Forum. Specifically, Evans demonstrates the company’s RISC-V semiconductor processor IP, which enables customers to develop leading SoCs for AI, computer vision and other market

Radar-enhanced Safety for Advancing Autonomy
Front and side radars may have different primary uses and drivers for their innovation, but together, they form a vital part of ADAS for autonomous vehicles. IDTechEx‘s report, “Automotive Radar Market 2025-2045: Robotaxis & Autonomous Cars“, showcases the latest radar developments and explores autonomy leveling up as a result, with Level 2+ asserting itself within

Scalable Video Search: Cascading Foundation Models
This article was originally published at Tenyks’ website. It is reprinted here with the permission of Tenyks. Video has become the lingua franca of the digital age, but its ubiquity presents a unique challenge: how do we efficiently extract meaningful information from this ocean of visual data? In Part 1 of this series, we navigate

Digica Collaborates with Toradex to Enhance Interactive Robotics with AI-driven Object Recognition on Torizon
March 7, 2025 – Digica, a leader in AI-driven solutions, is excited to announce its collaboration with Toradex to develop an advanced robotics system featuring real-time object recognition and dynamic change of user interface display. This collaboration integrates cutting-edge computer vision technology with interactive robotics, pushing the boundaries of human-machine interaction. Toradex, a leading provider

3LC: What is It and Who is It For?
This blog post was originally published at 3LC’s website. It is reprinted here with the permission of 3LC. AI performance isn’t just about better architectures or more compute – it’s about better data. Even perfectly labeled datasets can hold hidden inefficiencies that limit accuracy. See how teams use 3LC to refine datasets, optimize labeling strategies,

Vision Language Model Prompt Engineering Guide for Image and Video Understanding
This blog post was originally published at NVIDIA’s website. It is reprinted here with the permission of NVIDIA. Vision language models (VLMs) are evolving at a breakneck speed. In 2020, the first VLMs revolutionized the generative AI landscape by bringing visual understanding to large language models (LLMs) through the use of a vision encoder. These

SAM 2 + GPT-4o: Cascading Foundation Models via Visual Prompting (Part 2)
This article was originally published at Tenyks’ website. It is reprinted here with the permission of Tenyks. In Part 2 of our Segment Anything Model 2 (SAM 2) Series, we show how foundation models (e.g., GPT-4o, Claude Sonnet 3.5 and YOLO-World) can be used to generate visual inputs (e.g., bounding boxes) for SAM 2. Learn

Nearly $1B Flows into Automotive Radar Startups
According to IDTechEx‘s latest report, “Automotive Radar Market 2025-2045: Robotaxis & Autonomous Cars“, newly established radar startups worldwide have raised nearly US$1.2 billion over the past 12 years; approximately US$980 million of which is predominantly directed toward the automotive sector. Through more than 40 funding rounds, these companies have driven the implementation and advancement of

New AI Model Offers Cellular-level View of Cancerous Tumors
This blog post was originally published at NVIDIA’s website. It is reprinted here with the permission of NVIDIA. Researchers studying cancer unveiled a new AI model that provides cellular-level mapping and visualizations of cancer cells, which scientists hope can shed light on how—and why—certain inter-cellular relationships triggers cancers to grow. BioTuring, a San Diego-based startup,

Top-tier ADAS Systems: Exploring Automotive Radar Technology
Radars have had a place within the automotive sector for over two decades, beginning with the first use for adaptive cruise control and many other developments taking place since. IDTechEx‘s “Automotive Radar Market 2025-2045: Robotaxis & Autonomous Cars” report explores the latest developments in radar technology within the automotive sector. ADAS safety systems ADAS (advanced

SAM 2 + GPT-4o: Cascading Foundation Models via Visual Prompting (Part 1)
This article was originally published at Tenyks’ website. It is reprinted here with the permission of Tenyks. In Part 1 of this article we introduce Segment Anything Model 2 (SAM 2). Then, we walk you through how you can set it up and run inference on your own video clips. Learn more about visual prompting

Empowering Civil Construction with AI-driven Spatial Perception
This blog post was originally published at Au-Zone Technologies’ website. It is reprinted here with the permission of Au-Zone Technologies. Transforming Safety, Efficiency, and Automation in Construction Ecosystems In the rapidly evolving field of civil construction, AI-based spatial perception technologies are reshaping the way machinery operates in dynamic and unpredictable environments. These systems enable advanced

Seeing In the Dark: Infrared for Automotive
Infrared sensors are becoming more popular in vehicles for advanced driver assistance systems (ADAS), in-cabin sensing, and driver monitoring systems (DMS), largely due to advancements in vehicle safety and awareness. IDTechEx‘s latest report, “Infrared (IR) Cameras for Automotive 2025-2035: Technologies, Opportunities, Forecasts“, explores the types of infrared sensors that are commonly used, and forecasts for

Multimodal Large Language Models: Transforming Computer Vision
This blog post was originally published at Tenyks’ website. It is reprinted here with the permission of Tenyks. This article introduces multimodal large language models (MLLMs) [1], their applications using challenging prompts, and the top models reshaping computer vision as we speak. What is a multimodal large language model (MLLM)? In layman terms, a multimodal