ONNX


How Do Model Export Formats Impact the Development of ML-Enabled Systems? A Case Study on Model Integration

Parida, Shreyas Kumar, Gerostathopoulos, Ilias, Bogner, Justus

arXiv.org Artificial Intelligence

Machine learning (ML) models are often integrated into ML-enabled systems to provide software functionality that would otherwise be impossible. This integration requires the selection of an appropriate ML model export format, for which many options are available. These formats are crucial for ensuring a seamless integration, and choosing a suboptimal one can negatively impact system development. However, little evidence is available to guide practitioners during the export format selection. We therefore evaluated various model export formats regarding their impact on the development of ML-enabled systems from an integration perspective. Based on the results of a preliminary questionnaire survey (n=17), we designed an extensive embedded case study with two ML-enabled systems in three versions with different technologies. We then analyzed the effect of five popular export formats, namely ONNX, Pickle, TensorFlow's SavedModel, PyTorch's TorchScript, and Joblib. In total, we studied 30 units of analysis (2 systems x 3 tech stacks x 5 formats) and collected data via structured field notes. The holistic qualitative analysis of the results indicated that ONNX offered the most efficient integration and portability across most cases. SavedModel and TorchScript were very convenient to use in Python-based systems, but otherwise required workarounds (TorchScript more than SavedModel). SavedModel also allowed the easy incorporation of preprocessing logic into a single file, which made it scalable for complex deep learning use cases. Pickle and Joblib were the most challenging to integrate, even in Python-based systems. Regarding technical support, all model export formats had strong technical documentation and strong community support across platforms such as Stack Overflow and Reddit. Practitioners can use our findings to inform the selection of ML export formats suited to their context.


Energy consumption of code small language models serving with runtime engines and execution providers

Durán, Francisco, Martinez, Matias, Lago, Patricia, Martínez-Fernández, Silverio

arXiv.org Artificial Intelligence

Background. The rapid growth of Language Models (LMs), particularly in code generation, requires substantial computational resources, raising concerns about energy consumption and environmental impact. Optimizing LM inference for energy efficiency is crucial, and Small Language Models (SLMs) offer a promising solution to reduce resource demands. Aim. Our goal is to analyze the impact of deep learning runtime engines and execution providers on energy consumption, execution time, and computing-resource utilization from the point of view of software engineers conducting inference in the context of code SLMs. Method. We conducted a technology-oriented, multi-stage experimental pipeline using twelve code generation SLMs to investigate energy consumption, execution time, and computing-resource utilization across the configurations. Results. Significant differences emerged across configurations. CUDA execution provider configurations outperformed CPU execution provider configurations in both energy consumption and execution time. Among the configurations, TORCH paired with CUDA demonstrated the greatest energy efficiency, achieving energy savings from 37.99% up to 89.16% compared to other serving configurations. Similarly, optimized runtime engines like ONNX with the CPU execution provider achieved from 8.98% up to 72.04% energy savings within CPU-based configurations. TORCH paired with CUDA also exhibited efficient computing-resource utilization. Conclusions. The choice of serving configuration significantly impacts energy efficiency. While further research is needed, we recommend the configurations above, matched to software engineers' requirements, for enhancing serving efficiency in energy and performance.


python - BackendIsNotSupposedToImplementIt Error: Converting ONNX to Tensorflow - Stack Overflow

#artificialintelligence

When I run this code to convert ONNX to TensorFlow, I get an error in Google Colab. I need to convert this ONNX file to TensorFlow Lite so I can use it in an Android app. The failing code begins with: from onnx_tf.backend import


7 Lessons I've Learnt From Deploying Machine Learning Models Using ONNX

#artificialintelligence

In this post, we will outline key learnings from a real-world example of running inference on a scikit-learn model using the ONNX Runtime API in an AWS Lambda function. This is not a tutorial but rather a guide focusing on useful tips, points to consider, and quirks that may save you some head-scratching! The Open Neural Network Exchange (ONNX) format is a bit like dipping your french fries into a milkshake; it shouldn't work, but it just does. ONNX allows us to build a model using all the training frameworks we know and love, like PyTorch and TensorFlow, and package it up in a format supported by many hardware architectures and operating systems. The ONNX Runtime is a simple, cross-platform API that provides optimal performance to run inference on an ONNX model exactly where you need it: the cloud, mobile, an IoT device, you name it!


Dilated Convolutional Neural Networks for Lightweight Diacritics Restoration

Csanády, Bálint, Lukács, András

arXiv.org Artificial Intelligence

Diacritics restoration has become a ubiquitous task in the Latin-alphabet-based English-dominated Internet language environment. In this paper, we describe a small footprint 1D dilated convolution-based approach which operates on a character-level. We find that solutions based on 1D dilated convolutional neural networks are competitive alternatives to models based on recursive neural networks or linguistic modeling for the task of diacritics restoration. Our solution surpasses the performance of similarly sized models and is also competitive with larger models. A special feature of our solution is that it even runs locally in a web browser. We also provide a working example of this browser-based implementation. Our model is evaluated on different corpora, with emphasis on the Hungarian language. We performed comparative measurements about the generalization power of the model in relation to three Hungarian corpora. We also analyzed the errors to understand the limitation of corpus-based self-supervised training.


Pytorch to Keras using ONNX

#artificialintelligence

Model deployment is the method by which you integrate a machine learning model into an existing production environment to make practical business decisions based on data. It is one of the last stages in the machine learning life cycle and can be one of the most cumbersome. Model deployment is probably the most important part of the machine learning model lifecycle, yet still the least studied. Most courses across the ML/DL universe teach how to explore data, engineer features, train the model, and generate predictions. But they miss the most important part: what to do after that? Apart from models developed for learning or for Kaggle competitions, all other models are built to generate revenue, and if you don't deploy a model into production, then no one is using it and thus there is no revenue.


8 Alternatives to TensorFlow Serving

#artificialintelligence

TensorFlow Serving is an easy-to-deploy, flexible, and high-performance serving system for machine learning models, built for production environments. It allows easy deployment of algorithms and experiments while letting developers keep the same server architecture and APIs. TensorFlow Serving provides seamless integration with TensorFlow models and can also be easily extended to other models and data. The open-source platform Cortex makes executing real-time inference at scale seamless. It is designed to deploy trained machine learning models directly as a web service in production.


Converting a model from Pytorch to Tensorflow: Guide to ONNX

#artificialintelligence

Open Neural Network Exchange (ONNX) is a powerful and open format built to represent machine learning models. The final outcome of training any machine learning or deep learning algorithm is a model file that efficiently represents the mapping from input data to output predictions. These models are stored in different file formats (such as .pkl) depending on the framework they were created in. Therein lies the problem: you can't take a model created and trained in one framework and use it or deploy it in a different framework. The intent behind ONNX is to be like the "USB standard" of the machine learning world.


PyTorch Vs TensorFlow - Facebook Vs Google - Understanding The Most Popular Deep Learning Frameworks

#artificialintelligence

In recent years, the field of data science has gained access to increasingly powerful analysis methods thanks to increasingly high-performance hardware. Google's TensorFlow has been the benchmark for building machine learning and deep learning models, and it still offers the most freedom today. But a wide range of options often creates a high barrier to entry. PyTorch vs TensorFlow: with the two-years-younger, also Python-based, open-source package PyTorch, Facebook now wants to knock TensorFlow off its throne. PyTorch has been steadily gaining popularity for years due to its simplicity and features.


Machine learning groups form Consortium for Python Data API Standards to reduce fragmentation

#artificialintelligence

Deep learning framework Apache MXNet and Open Neural Network Exchange (ONNX) today launched the Consortium for Python Data API Standards to improve interoperability for machine learning practitioners and data scientists using any framework, library, or tool from the Python ecosystem. ONNX itself was formed by Facebook and Microsoft in 2017 to encourage interoperability between frameworks and tools. Today, ONNX includes nearly 40 organizations with influence in AI and data science, including AWS, Baidu, and IBM, along with hardware makers like Arm, Intel, and Qualcomm. The new consortium, which will develop standards for dataframes and for arrays or tensors, hopes to address the fragmentation that has affected the data ecosystem in recent years. In the Python ecosystem, that fragmentation spans dataframe libraries such as Pandas, PySpark, and Apache Arrow.