A Reproducible, Scalable Pipeline for Synthesizing Autoregressive Model Literature