CiteSeerX: AI in a Digital Library Search Engine
Wu, Jian (Pennsylvania State University) | Williams, Kyle Mark (Pennsylvania State University) | Chen, Hung-Hsuan (Industrial Technology Research Institute) | Khabsa, Madian (Pennsylvania State University) | Caragea, Cornelia (University of North Texas) | Tuarob, Suppawong (Pennsylvania State University) | Ororbia, Alexander G. (Pennsylvania State University) | Jordan, Douglas (Pennsylvania State University) | Mitra, Prasenjit (Pennsylvania State University) | Giles, C. Lee (Pennsylvania State University)
CiteSeerX is a digital library search engine that provides access to more than five million scholarly documents, with nearly a million users and millions of hits per day. The AI technologies used in CiteSeerX have been developed by CiteSeerX group members over the past 5–6 years. We also present AI technologies implemented in table and algorithm search, which are special search modes in CiteSeerX. While it is challenging to rebuild a system like CiteSeerX from scratch, many of these AI technologies are transferable to other digital libraries and search engines.
Sep-28-2015