Goto

Collaborating Authors

 impact prediction


Application and Evaluation of Large Language Models for Forecasting the Impact of Traffic Incidents

arXiv.org Artificial Intelligence

This study examines the feasibility of applying large language models (LLMs) for forecasting the impact of traffic incident s on the traffic flow. The use of LLMs for this task has several advantages over existing machine learning - based solutions such as not requiring a large training dataset and the ability to utilize free - text incident logs . We propose a fully LLM - based solution that predicts the incident impact using a combination of traffic features and LLM - extracted incident features. A key ingredient of this solution is an effective method of select ing examples for the LLM's in - context learning. We evaluate the performance of three advanced LLMs and two state - of - the - art machine learning models on a real traffic incident dataset . The results show that the best - performing LLM matches the accuracy of the most accurate machine learning model, despite the former not having been trained on this prediction task. The findings indicate that LLMs are a practically viable option for traffic incident impact prediction.


From Words to Worth: Newborn Article Impact Prediction with LLM

arXiv.org Artificial Intelligence

As the academic landscape expands, the challenge of efficiently identifying potentially high-impact articles among the vast number of newly published works becomes critical. This paper introduces a promising approach, leveraging the capabilities of fine-tuned LLMs to predict the future impact of newborn articles solely based on titles and abstracts. Moving beyond traditional methods heavily reliant on external information, the proposed method discerns the shared semantic features of highly impactful papers from a large collection of title-abstract and potential impact pairs. These semantic features are further utilized to regress an improved metric, TNCSI_SP, which has been endowed with value, field, and time normalization properties. Additionally, a comprehensive dataset has been constructed and released for fine-tuning the LLM, containing over 12,000 entries with corresponding titles, abstracts, and TNCSI_SP. The quantitative results, with an NDCG@20 of 0.901, demonstrate that the proposed approach achieves state-of-the-art performance in predicting the impact of newborn articles when compared to competitive counterparts. Finally, we demonstrate a real-world application for predicting the impact of newborn journal articles to demonstrate its noteworthy practical value. Overall, our findings challenge existing paradigms and propose a shift towards a more content-focused prediction of academic impact, offering new insights for assessing newborn article impact.


Attention: to Better Stand on the Shoulders of Giants

arXiv.org Machine Learning

Science of science (SciSci) is an emerging discipline wherein science is used to study the structure and evolution of science itself using large data sets. The increasing availability of digital data on scholarly outcomes offers unprecedented opportunities to explore SciSci. In the progress of science, the previously discovered knowledge principally inspires new scientific ideas, and citation is a reasonably good reflection of this cumulative nature of scientific research. The researches that choose potentially influential references will have a lead over the emerging publications. Although the peer review process is the mainly reliable way of predicting a paper's future impact, the ability to foresee the lasting impact based on citation records is increasingly essential in the scientific impact analysis in the era of big data. This paper develops an attention mechanism for the long-term scientific impact prediction and validates the method based on a real large-scale citation data set. The results break conventional thinking. Instead of accurately simulating the original power-law distribution, emphasizing the limited attention can better stand on the shoulders of giants.