Today, Intel launched the Movidius™ Neural Compute Stick, the world’s first USB-based deep learning inference kit and self-contained artificial intelligence (AI) accelerator that delivers dedicated deep neural network processing capabilities to a wide range of host devices at the edge. Designed for product developers, researchers and makers, the Movidius Neural Compute Stick aims to reduce barriers to developing, tuning and deploying AI applications by delivering dedicated high-performance deep-neural network processing in a small form factor.
The Movidius™ Neural Compute Stick is the world’s first USB-based deep learning inference kit and self-contained AI accelerator that delivers dedicated deep neural network processing capabilities to a wide range of host devices at the edge. (Credit: Intel Corporation)
As more developers adopt advanced machine learning approaches to build innovative applications and solutions, Intel is committed to providing the most comprehensive set of development tools and resources to ensure developers are retooling for an AI-centric digital economy. Whether it is training artificial neural networks on the Intel® Nervana™ cloud, optimizing for emerging workloads such as artificial intelligence, virtual and augmented reality, and automated driving with Intel® Xeon® Scalable processors, or taking AI to the edge with Movidius vision processing unit (VPU) technology, Intel offers a comprehensive AI portfolio of tools, training and deployment options for the next generation of AI-powered products and services.
“The Myriad 2 VPU housed inside the Movidius Neural Compute Stick provides powerful, yet efficient performance – more than 100 gigaflops of performance within a 1W power envelope – to run real-time deep neural networks directly from the device,” said Remi El-Ouazzane, vice president and general manager of Movidius, an Intel company. “This enables a wide range of AI applications to be deployed offline.”
Machine intelligence development is fundamentally composed of two stages: (1) training an algorithm on large sets of sample data via modern machine learning techniques and (2) running the algorithm in an end-application that needs to interpret real-world data. This second stage is referred to as “inference,” and performing inference at the edge – or natively inside the device – brings numerous benefits in terms of latency, power consumption and privacy:
- Compile: Automatically convert a trained Caffe-based convolutional neural network (CNN) into an embedded neural network optimized to run on the onboard Movidius Myriad 2 VPU.
- Tune: Layer-by-layer performance metrics for both industry-standard and custom-designed neural networks enable effective tuning for optimal real-world performance at ultra-low power. Validation scripts allow developers to compare the accuracy of the optimized model on the device to the original PC-based model.
- Accelerate: Unique to Movidius Neural Compute Stick, the device can behave as a discrete neural network accelerator by adding dedicated deep learning inference capabilities to existing computing platforms for improved performance and power efficiency.
Movidius Neural Compute Stick is now available for purchase through select distributors for MSRP $79 and at the conference on Computer Vision and Pattern Recognition (CVPR) in Honolulu, Hawaii, from July 22-25. For more details, visit the Movidius developer website.