Gradients of Generative Models for Improved Discriminative Analysis of Tandem Mass Spectra
Halloran, John T., Rocke, David M.
–Neural Information Processing Systems
Tandem mass spectrometry (MS/MS) is a high-throughput technology used to identify the proteins in a complex biological sample, such as a drop of blood. A collection of spectra is generated at the output of the process, each spectrum of which is representative of a peptide (protein subsequence) present in the original complex sample. In this work, we leverage the log-likelihood gradients of generative modelsto improve the identification of such spectra. In particular, we show that the gradient of a recently proposed dynamic Bayesian network (DBN) [7] may be naturally employed by a kernel-based discriminative classifier. The resulting Fisher kernel substantially improves upon recent attempts to combine generative and discriminative models for post-processing analysis, outperforming all other methods on the evaluated datasets. We extend the improved accuracy offered by the Fisher kernel framework to other search algorithms by introducing Theseus, a DBN representing a large number of widely used MS/MS scoring functions. Furthermore, with gradient ascent and max-product inference at hand, we use Theseus to learn model parameters without any supervision.
Neural Information Processing Systems
Dec-31-2017
- Country:
- North America
- Canada > Quebec
- Capitale-Nationale Region
- Quebec City (0.04)
- Québec (0.04)
- Capitale-Nationale Region
- United States
- California
- Los Angeles County > Long Beach (0.04)
- Yolo County > Davis (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- California
- Canada > Quebec
- North America
- Industry: