Optimizing ML Serving with Asynchronous Architectures

#artificialintelligence 

When AI architects think about ML serving, they focus primarily on speeding up the inference function in the serving layer. Once the solution is deployed, however, the cost of serving alarms those responsible for budgets, sometimes leading to the solution being abandoned. The default architecture that architects come up with is a synchronous one: an ML service API, typically a REST API, sits in front of the serving layer and handles standard API concerns such as authentication and load balancing.
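To make the default pattern concrete, here is a minimal sketch of such a synchronous serving endpoint using only Python's standard library. The `run_inference` function is a placeholder for a real model call; the point is that the request handler blocks on inference, so the client waits for the full inference latency on every call.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer

def run_inference(features):
    # Placeholder model: sums the features. A real service would
    # invoke a model runtime (e.g. a loaded model object) here.
    return {"score": sum(features)}

class MLServiceHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        # Synchronous path: the request thread blocks until
        # inference completes, then returns the result inline.
        length = int(self.headers.get("Content-Length", 0))
        payload = json.loads(self.rfile.read(length))
        result = run_inference(payload["features"])
        body = json.dumps(result).encode()
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.end_headers()
        self.wfile.write(body)

if __name__ == "__main__":
    # Serve on localhost:8080 until interrupted.
    HTTPServer(("127.0.0.1", 8080), MLServiceHandler).serve_forever()
```

Because the caller is held open for the duration of inference, capacity must be provisioned for peak concurrent load, which is one source of the serving cost this article is concerned with.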
