OpenVINO

Developer(s)	Intel Corporation
Initial release	May 16, 2018;6 years ago
Stable release	2024.6 / December 2024.
Repository	github.com/openvinotoolkit/openvino
Written in	C++
Operating system	Cross-platform
License	Apache License 2.0
Website	docs.openvino.ai
As of	December 2024

Last updated December 20, 2024

OpenVINO is an open-source software toolkit for optimizing and deploying deep learning models. It enables programmers to develop scalable and efficient AI solutions with relatively few lines of code. It supports several popular model formats^[2] and categories, such as large language models, computer vision, and generative AI.

Workflow

The simplest OpenVINO usage involves obtaining a model and running it as is. Yet for the best results, a more complete workflow is suggested:^[4]

obtain a model in one of supported frameworks,
convert the model to OpenVINO IR using the OpenVINO Converter tool,
optimize the model, using training-time or post-training options provided by OpenVINO's NNCF.
execute inference, using OpenVINO Runtime by specifying one of several inference modes.

OpenVINO model format

OpenVINO IR^[5] is the default format used to run inference. It is saved as a set of two files, *.bin and *.xml, containing weights and topology, respectively. It is obtained by converting a model from one of the supported frameworks, using the application's API or a dedicated converter.

Models of the supported formats may also be used for inference directly, without prior conversion to OpenVINO IR. Such an approach is more convenient but offers fewer optimization options and lower performance, since the conversion is performed automatically before inference. Some pre-converted models can be found in the Hugging Face repository.^[6]

The supported model formats are:^[7]

PyTorch
TensorFlow
TensorFlow Lite
ONNX (including formats that may be serialized to ONNX)
PaddlePaddle
JAX/Flax

OS support

OpenVINO runs on Windows, Linux and MacOS.^[8]

Related Research Articles

OpenCV is a library of programming functions mainly for real-time computer vision. Originally developed by Intel, it was later supported by Willow Garage, then Itseez. The library is cross-platform and licensed as free and open-source software under Apache License 2. Starting in 2011, OpenCV features GPU acceleration for real-time operations.

In computing, CUDA is a proprietary parallel computing platform and application programming interface (API) that allows software to use certain types of graphics processing units (GPUs) for accelerated general-purpose processing, an approach called general-purpose computing on GPUs. CUDA was created by Nvidia in 2006. When it was first introduced, the name was an acronym for Compute Unified Device Architecture, but Nvidia later dropped the common use of the acronym and now rarely expands it.

Intel Fortran Compiler, as part of Intel OneAPI HPC toolkit, is a group of Fortran compilers from Intel for Windows, macOS, and Linux.

Eclipse Deeplearning4j is a programming library written in Java for the Java virtual machine (JVM). It is a framework with wide support for deep learning algorithms. Deeplearning4j includes implementations of the restricted Boltzmann machine, deep belief net, deep autoencoder, stacked denoising autoencoder and recursive neural tensor network, word2vec, doc2vec, and GloVe. These algorithms all include distributed parallel versions that integrate with Apache Hadoop and Spark.

TensorFlow is a software library for machine learning and artificial intelligence. It can be used across a range of tasks, but is used mainly for training and inference of neural networks. It is one of the most popular deep learning frameworks, alongside others such as PyTorch and PaddlePaddle. It is free and open-source software released under the Apache License 2.0.

The following tables compare notable software frameworks, libraries, and computer programs for deep learning applications.

Movidius Ltd. was a company based in San Mateo, California, that designed low-power processor chips for computer vision. The company was acquired by Intel in September 2016, who continues to sell the company's products under the Movidius line.

Keras is an open-source library that provides a Python interface for artificial neural networks. Keras was first independent software, then integrated into the TensorFlow library, and later supporting more. "Keras 3 is a full rewrite of Keras [and can be used] as a low-level cross-framework language to develop custom components such as layers, models, or metrics that can be used in native workflows in JAX, TensorFlow, or PyTorch — with one codebase." Keras 3 will be the default Keras version for TensorFlow 2.16 onwards, but Keras 2 can still be used.

spaCy is an open-source software library for advanced natural language processing, written in the programming languages Python and Cython. The library is published under the MIT license and its main developers are Matthew Honnibal and Ines Montani, the founders of the software company Explosion.

PyTorch is a machine learning library based on the Torch library, used for applications such as computer vision and natural language processing, originally developed by Meta AI and now part of the Linux Foundation umbrella. It is one of the most popular deep learning frameworks, alongside others such as TensorFlow and PaddlePaddle, offering free and open-source software released under the modified BSD license. Although the Python interface is more polished and the primary focus of development, PyTorch also has a C++ interface.

The cTuning Foundation is a global non-profit organization developing a common methodology and open-source tools to support sustainable, collaborative and reproducible research in Computer science and organize and automate artifact evaluation and reproducibility inititiaves at machine learning and systems conferences and journals.

The Open Neural Network Exchange (ONNX) [] is an open-source artificial intelligence ecosystem of technology companies and research organizations that establish open standards for representing machine learning algorithms and software tools to promote innovation and collaboration in the AI sector. ONNX is available on GitHub.

ROCm is an Advanced Micro Devices (AMD) software stack for graphics processing unit (GPU) programming. ROCm spans several domains: general-purpose computing on graphics processing units (GPGPU), high performance computing (HPC), heterogeneous computing. It offers several programming models: HIP, OpenMP, and OpenCL.

<span class="mw-page-title-main">ML.NET</span> Machine learning library

ML.NET is a free software machine learning library for the C# and F# programming languages. It also supports Python models when used together with NimbusML. The preview release of ML.NET included transforms for feature engineering like n-gram creation, and learners to handle binary classification, multi-class classification, and regression tasks. Additional ML tasks like anomaly detection and recommendation systems have since been added, and other approaches like deep learning will be included in future versions.

Neural Network Exchange Format (NNEF) is an artificial neural network data exchange format developed by the Khronos Group. It is intended to reduce machine learning deployment fragmentation by enabling a rich mix of neural network training tools and inference engines to be used by applications across a diverse range of devices and platforms.

Amazon SageMaker AI is a cloud-based machine-learning platform that allows the creation, training, and deployment by developers of machine-learning (ML) models on the cloud. It can be used to deploy ML models on embedded systems and edge-devices. The platform was launched in November 2017.

oneAPI (compute acceleration) Open standard for parallel computing

oneAPI is an open standard, adopted by Intel, for a unified application programming interface (API) intended to be used across different computing accelerator (coprocessor) architectures, including GPUs, AI accelerators and field-programmable gate arrays. It is intended to eliminate the need for developers to maintain separate code bases, multiple programming languages, tools, and workflows for each architecture.

Hugging Face, Inc. is an American company incorporated under the Delaware General Corporation Law and based in New York City that develops computation tools for building applications using machine learning. It is most notable for its transformers library built for natural language processing applications and its platform that allows users to share machine learning models and datasets and showcase their work.

Medical open network for AI (MONAI) is an open-source, community-supported framework for Deep learning (DL) in healthcare imaging. MONAI provides a collection of domain-optimized implementations of various DL algorithms and utilities specifically designed for medical imaging tasks. MONAI is used in research and industry, aiding the development of various medical imaging applications, including image segmentation, image classification, image registration, and image generation.

MindSpore is a open-source software framework for deep learning, machine learning and artificial intelligence developed by Huawei.

References

↑ "Release Notes for Intel Distribution of OpenVINO toolkit 2024.6". December 2024.
1 2 "OpenVINO Compatibility and Support". OpenVINO Documentation. 24 January 2024.
↑ "License". OpenVINO repository. 16 October 2018.
↑ "OpenVINO Workflow". OpenVINO Documentation. 25 April 2024.
↑ "OpenVINO IR". www.docs.openvino.ai. 2 February 2024.
↑ "Hugging Face OpenVINO Space". Hugging Face.
↑ "OpenVINO Model Preparation". OpenVINO Documentation. 24 January 2024.
↑ "System Requirements". OpenVINO Documentation. February 2024.

Agrawal, Vasu (2019). Ground Up Design of a Multi-modal Object Detection System (PDF) (MSc). Carnegie Mellon University Pittsburgh, PA. Archived (PDF) from the original on 26 January 2020.
Driaba, Alexander; Gordeev, Aleksei; Klyachin, Vladimir (2019). "Recognition of Various Objects from a Certain Categorical Set in Real Time Using Deep Convolutional Neural Networks" (PDF). Institute of Mathematics and Informational Technologies Volgograd State University. Archived (PDF) from the original on 26 January 2020. Retrieved 26 January 2020.{{cite journal}}: Cite journal requires |journal= (help)
Nanjappa, Ashwin (31 May 2019). Caffe2 Quick Start Guide: Modular and scalable deep learning made easy. Packt. pp. 91–98. ISBN 978-1789137750.

This page is based on this Wikipedia article
Text is available under the CC BY-SA 4.0 license; additional terms may apply.
Images, videos and audio are available under their respective licenses.

[1] "Release Notes for Intel Distribution of OpenVINO toolkit 2024.6". December 2024.

[:0-2] 1 2 "OpenVINO Compatibility and Support". OpenVINO Documentation. 24 January 2024.

[3] "License". OpenVINO repository. 16 October 2018.

[4] "OpenVINO Workflow". OpenVINO Documentation. 25 April 2024.

[5] "OpenVINO IR". www.docs.openvino.ai. 2 February 2024.

[6] "Hugging Face OpenVINO Space". Hugging Face.

[7] "OpenVINO Model Preparation". OpenVINO Documentation. 24 January 2024.

[8] "System Requirements". OpenVINO Documentation. February 2024.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

v t e Intel software
Items in italics are no longer maintained or have planned end-of-life dates.
Development	Parallel Studio C++ Compiler Fortran Compiler Advisor Inspector INTERP/80 VTune
Components	Data Analytics Library (DAL) Integrated Performance Primitives (IPP) Math Kernel Library (MKL) Threading Building Blocks (TBB)
Open source	Data Analytics Library (DAL) Threading Building Blocks (TBB) Tizen OpenVINO
Software programs	Telekinesys Research ¹ Havok ¹ Vision ¹
Organizations	Developer Zone Research
¹Sold to Microsoft

v t e Deep learning software
Comparison
Open source	Apache MXNet Apache SINGA Caffe Deeplearning4j DeepSpeed Dlib Keras Microsoft Cognitive Toolkit ML.NET OpenNN PyTorch TensorFlow Theano Torch ONNX OpenVINO MindSpore
Proprietary	Apple Core ML IBM Watson Neural Designer Wolfram Mathematica MATLAB Deep Learning Toolbox
Category