Policy Optimization in Bayesian Network Hybrid Models of Biomanufacturing Processes
Zheng, Hua, Xie, Wei, Ryzhov, Ilya O., Xie, Dongming
–arXiv.org Artificial Intelligence
Biopharmaceutical manufacturing is a rapidly growing industry with impact in virtually all branches of medicine. Biomanufacturing processes require close monitoring and control, in the presence of complex bioprocess dynamics with many interdependent factors, as well as extremely limited data due to the high cost and long duration of experiments. We develop a novel model-based reinforcement learning framework that can achieve human-level control in low-data environments. The model uses a probabilistic knowledge graph to capture causal interdependencies between factors in the underlying stochastic decision process, leveraging information from existing kinetic models from different unit operations while incorporating real-world experimental data. We then present a computationally efficient, provably convergent stochastic gradient method for policy optimization. Validation is conducted on a realistic application with a multi-dimensional, continuous state variable.
arXiv.org Artificial Intelligence
May-13-2021
- Country:
- North America > United States
- New York (0.04)
- Massachusetts
- Middlesex County > Lowell (0.14)
- Suffolk County > Boston (0.04)
- Maryland > Prince George's County
- College Park (0.04)
- Europe > United Kingdom
- England
- Oxfordshire > Oxford (0.04)
- Cambridgeshire > Cambridge (0.04)
- England
- North America > United States
- Genre:
- Research Report
- Experimental Study (0.46)
- Promising Solution (0.34)
- Research Report
- Industry: