Multimodal

Implementing Multimodal GenAI Models on Modalix

Algorithms & Models, Blog Posts, Multimodal, Processors, SiMa.ai, Software, Tools / August 18, 2025

This blog post was originally published at SiMa.ai’s website. It is reprinted here with the permission of SiMa.ai. It has been our goal since starting SiMa.ai to create one software and hardware platform for the embedded edge that empowers companies to make their AI/ML innovations come to life. With the rise of Generative AI already […]

Implementing Multimodal GenAI Models on Modalix Read More +

“Customizing Vision-language Models for Real-world Applications,” a Presentation from NVIDIA

Algorithms & Models, Multimodal, NVIDIA, Processors, Software, Summit 2025, Tools, Videos / August 18, 2025

Monika Jhuria, Technical Marketing Engineer at NVIDIA, presents the “Customizing Vision-language Models for Real-world Applications” tutorial at the May 2025 Embedded Vision Summit. Vision-language models (VLMs) have the potential to revolutionize various applications, and their performance can be improved through fine-tuning and customization. In this presentation, Jhuria explores the concept… “Customizing Vision-language Models for Real-world

“Customizing Vision-language Models for Real-world Applications,” a Presentation from NVIDIA Read More +

XR Tech Market Report

Algorithms & Models, Market Analysis, Memory, Multimodal, Processors, Sensors and Cameras, Software, Tools / August 15, 2025

Woodside Capital Partners (WCP) is pleased to share its XR Tech Market Report, authored by senior bankers Alain Bismuth and Rudy Burger, and by analyst Alex Bonilla. Why we are interested in the XR Ecosystem Investors have been pouring billions of dollars into developing enabling technologies for augmented reality (AR) glasses aimed at the consumer market,

XR Tech Market Report Read More +

The Era of Physical AI is Here

Blog Posts, Multimodal, Processors, SiMa.ai, Tools / August 14, 2025

This blog post was originally published at SiMa.ai’s website. It is reprinted here with the permission of SiMa.ai. The AI landscape is undergoing a monumental shift. After a decade where AI flourished in the cloud, scaled by hyperscalers, we are now entering the era of Physical AI. Physical AI is poised to touch every facet

The Era of Physical AI is Here Read More +

R²D²: Boost Robot Training with World Foundation Models and Workflows from NVIDIA Research

Algorithms & Models, Blog Posts, Multimodal, NVIDIA, Processors, Robotics, Software, Tools / August 12, 2025

This blog post was originally published at NVIDIA’s website. It is reprinted here with the permission of NVIDIA. As physical AI systems advance, the demand for richly labeled datasets is accelerating beyond what we can manually capture in the real world. World foundation models (WFMs), which are generative AI models trained to simulate, predict, and

R²D²: Boost Robot Training with World Foundation Models and Workflows from NVIDIA Research Read More +

“LLMs and VLMs for Regulatory Compliance, Quality Control and Safety Applications,” a Presentation from Camio

Algorithms & Models, Camio, Multimodal, Software, Summit 2025, Tools, Videos / August 12, 2025

Lazar Trifunovic, Solutions Architect at Camio, presents the “LLMs and VLMs for Regulatory Compliance, Quality Control and Safety Applications” tutorial at the May 2025 Embedded Vision Summit. By using vision-language models (VLMs) or combining large language models (LLMs) with conventional computer vision models, we can create vision systems that are… “LLMs and VLMs for Regulatory

“LLMs and VLMs for Regulatory Compliance, Quality Control and Safety Applications,” a Presentation from Camio Read More +

Collaborating With Robots: How AI Is Enabling the Next Generation of Cobots

Ambarella, Blog Posts, Multimodal, Processors, Robotics / August 11, 2025

This blog post was originally published at Ambarella’s website. It is reprinted here with the permission of Ambarella. Collaborative robots, or cobots, are reshaping how we interact with machines. Designed to operate safely in shared environments, AI-enabled cobots are now embedded across manufacturing, logistics, healthcare, and even the home. But their role goes beyond automation—they

Collaborating With Robots: How AI Is Enabling the Next Generation of Cobots Read More +

R²D²: Building AI-based 3D Robot Perception and Mapping with NVIDIA Research

Algorithms & Models, Blog Posts, Multimodal, NVIDIA, Object Identification, Object Tracking, Processors, Robotics, Sensors and Cameras, Software, Tools / June 24, 2025

This blog post was originally published at NVIDIA’s website. It is reprinted here with the permission of NVIDIA. Robots must perceive and interpret their 3D environments to act safely and effectively. This is especially critical for tasks such as autonomous navigation, object manipulation, and teleoperation in unstructured or unfamiliar spaces. Advances in robotic perception increasingly

R²D²: Building AI-based 3D Robot Perception and Mapping with NVIDIA Research Read More +

A World’s First On-glass GenAI Demonstration: Qualcomm’s Vision for the Future of Smart Glasses

Algorithms & Models, Blog Posts, Multimodal, Processors, Qualcomm, Software, Tools / June 19, 2025

This blog post was originally published at Qualcomm’s website. It is reprinted here with the permission of Qualcomm. Our live demo of a generative AI assistant running completely on smart glasses — without the aid of a phone or the cloud — and the reveal of the new Snapdragon AR1+ platform spark new possibilities for

A World’s First On-glass GenAI Demonstration: Qualcomm’s Vision for the Future of Smart Glasses Read More +

We Built a Personalized, Multimodal AI Smart Glass Experience — Watch It Here

Blog Posts, Multimodal, Qualcomm / June 5, 2025

This blog post was originally published at Qualcomm’s website. It is reprinted here with the permission of Qualcomm. Our demo shows the power of on-device AI and why smart glasses make the ideal AI user interface Gabby walks into a gym while carrying a smartphone and wearing a pair of smart glasses. Unsure of where

We Built a Personalized, Multimodal AI Smart Glass Experience — Watch It Here Read More +

If you're building AI or vision-enabled products, you've come to the right place.

Implementing Multimodal GenAI Models on Modalix

“Customizing Vision-language Models for Real-world Applications,” a Presentation from NVIDIA

XR Tech Market Report

The Era of Physical AI is Here

R²D²: Boost Robot Training with World Foundation Models and Workflows from NVIDIA Research

“LLMs and VLMs for Regulatory Compliance, Quality Control and Safety Applications,” a Presentation from Camio

Collaborating With Robots: How AI Is Enabling the Next Generation of Cobots

R²D²: Building AI-based 3D Robot Perception and Mapping with NVIDIA Research

A World’s First On-glass GenAI Demonstration: Qualcomm’s Vision for the Future of Smart Glasses

We Built a Personalized, Multimodal AI Smart Glass Experience — Watch It Here

Pages

Topics

Contact

Address

Phone