MODEL SERVING IN PYTORCH
Deploying ML models in production and scaling ML services continue to be major challenges. TorchServe, the model serving solution for PyTorch, addresses this problem and has evolved into a multi-platform solution that can run on-prem or on any cloud, with integrations for major OSS platforms such as Kubernetes, MLflow, Kubeflow Pipelines, and KServe. This talk will cover new features launched in TorchServe, such as model interpretability using Captum and best practices for responsible production deployments, along with examples of how companies like Amazon Ads and Meta AI, as well as the broader PyTorch community, are using TorchServe.
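The standard TorchServe workflow is to package a serialized model into a `.mar` archive and then launch the serving runtime against a model store. A minimal sketch is below; the file names (`mnist.pt`, `test.png`) and model name are illustrative assumptions, and the commands require `torchserve` and `torch-model-archiver` to be installed.

```shell
# Package a TorchScript model (mnist.pt is an assumed, pre-exported file)
# into a model archive, using the built-in image_classifier handler.
torch-model-archiver --model-name mnist \
    --version 1.0 \
    --serialized-file mnist.pt \
    --handler image_classifier \
    --export-path model_store

# Start TorchServe and register the archived model.
torchserve --start --model-store model_store --models mnist=mnist.mar

# Send an inference request to the default REST endpoint (port 8080).
curl http://127.0.0.1:8080/predictions/mnist -T test.png
```

Custom pre- and post-processing can be supplied by pointing `--handler` at your own handler module instead of a built-in one.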
GitHub - bentoml/BentoML: Model Serving Made Easy
BentoML is a flexible, high-performance framework for serving, managing, and deploying machine learning models. By providing a standard interface for describing a prediction service, BentoML abstracts away how to run model inference efficiently and how model serving workloads integrate with cloud infrastructure. Be sure to check out the deployment overview doc to understand which deployment option best suits your use case. BentoML provides APIs for defining a prediction service (a servable model, so to speak), which includes the trained ML model itself plus its pre-processing and post-processing code, input/output specifications, and dependencies. The generated BentoML bundle is a file directory containing all the code files, serialized models, and configs required to reproduce the prediction service for inference. BentoML automatically captures all Python dependency information and keeps everything versioned and managed together in one place.
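The core idea described above, a servable that couples the trained model with its pre-/post-processing code, I/O spec, and pinned dependencies, can be sketched in plain Python. This is a framework-free illustration of the concept; all names here are hypothetical and are not BentoML's actual API.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List

# Illustrative sketch of a "prediction service": the model plus its
# pre-processing, post-processing, and captured dependencies, bundled
# as one unit. Names are hypothetical, not BentoML's API.
@dataclass
class PredictionService:
    model: Callable[[List[float]], float]               # trained model's predict fn
    preprocess: Callable[[Dict[str, Any]], List[float]] # request -> features
    postprocess: Callable[[float], Dict[str, Any]]      # raw score -> response
    dependencies: List[str] = field(default_factory=list)  # pinned pip deps

    def predict(self, request: Dict[str, Any]) -> Dict[str, Any]:
        features = self.preprocess(request)
        raw_score = self.model(features)
        return self.postprocess(raw_score)

# Toy "model": sum of features, thresholded at 1.0.
svc = PredictionService(
    model=lambda xs: sum(xs),
    preprocess=lambda req: [float(v) for v in req["features"]],
    postprocess=lambda s: {"label": "positive" if s > 1.0 else "negative",
                           "score": s},
    dependencies=["scikit-learn==1.3.0"],  # hypothetical pinned dependency
)

print(svc.predict({"features": [0.5, 1.0]}))
```

Serializing an object like this together with its code and dependency list is, conceptually, what the BentoML bundle directory captures so the service can be reproduced for inference elsewhere.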
The Latest In ML Ops - 5 Evolutions of Production ML
As more and more industries bring ML use cases to production, the need for consistent practices for managing ML in production and optimizing ML lifecycle iteration has grown rapidly. Last year, a few of us partnered with USENIX to drive the first-ever industry/academic conference dedicated to the challenges of and innovations in managing ML in production. OpML 2019 was a great success, bringing together experts, practitioners, engineers, and researchers to discuss the latest and greatest in ML Ops. You can find a summary of OpML 2019 here. This year, due to COVID-19, OpML 2020 became a virtual conference with video presentations and open discussions on Slack.