CiteSeerX: AI in a Digital Library Search Engine
Wu, Jian (Pennsylvania State University) | Williams, Kyle Mark (Pennsylvania State University) | Chen, Hung-Hsuan (Industrial Technology Research Institute) | Khabsa, Madian (Pennsylvania State University) | Caragea, Cornelia (University of North Texas) | Tuarob, Suppawong (Pennsylvania State University) | Ororbia, Alexander G. (Pennsylvania State University) | Jordan, Douglas (Pennsylvania State University) | Mitra, Prasenjit (Pennsylvania State University) | Giles, C. Lee (Pennsylvania State University)
Since then, the project has been directed by C. Lee Giles. While it is challenging to rebuild a system like Cite-SeerX from scratch, many of these AI technologies are transferable to other digital libraries and search engines. This is different from arXiv, Harvard ADS, and machine cluster to a private cloud using virtualization PubMed, where papers are submitted by authors or techniques (Wu et al. 2014). CiteSeerX extensively pushed by publishers. Unlike Google Scholar and leverages open source software, which significantly Microsoft Academic Search, where a significant portion reduces development effort. Red Hat of documents have only metadata (such as titles, Enterprise Linux (RHEL) 5 and 6 are the operating authors, and abstracts) available, users have full-text systems for all servers. Tomcat 7 is CiteSeerX keeps its own repository, which used for web service deployment on web and indexing serves cached versions of papers even if their previous servers. MySQL is used as the database management links are not alive any more. In additional to system to store metadata. Apache Solr is used paper downloads, CiteSeerX provides automatically for the index, and the Spring framework is used in extracted metadata and citation context, which the web application. In this section, we highlight four AI solutions that are Document metadata download service is not available leveraged by CiteSeerX and that tackle different challenges from Google Scholar and only recently available in metadata extraction and ingestion modules from Microsoft Academic Search. Finally, CiteSeerX (tagged by C, E, D, and A in figure 1).
Sep-28-2015
- Country:
- North America > United States > New Jersey (0.28)
- Genre:
- Research Report (0.93)
- Industry:
- Education > Educational Setting (0.46)
- Information Technology (1.00)
- Technology: