AITopics

2407.12884

Country:

Pacific Ocean (0.04)
North America > United States > Ohio (0.04)
North America > United States > New Mexico > Los Alamos County > Los Alamos (0.04)
Atlantic Ocean > North Atlantic Ocean > Baltic Sea (0.04)

Genre: Research Report (0.50)

Industry:

Energy (0.46)
Government > Regional Government (0.46)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial IntelligenceJul-15-2024

Mitigating biases in big mobility data: a case study of monitoring large-scale transit systems

Wang, Feilong, Ban, Xuegang, Chen, Peng, Liu, Chenxi, Zhao, Rong

Big mobility datasets (BMD) have shown many advantages in studying human mobility and evaluating the performance of transportation systems. However, the quality of BMD remains poorly understood. This study evaluates biases in BMD and develops mitigation methods. Using Google and Apple mobility data as examples, this study compares them with benchmark data from governmental agencies. Spatio-temporal discrepancies between BMD and benchmark are observed and their impacts on transportation applications are investigated, emphasizing the urgent need to address these biases to prevent misguided policymaking. This study further proposes and tests a bias mitigation method. It is shown that the mitigated BMD could generate valuable insights into large-scale public transit systems across 100+ US counties, revealing regional disparities of the recovery of transit systems from the COVID-19. This study underscores the importance of caution when using BMD in transportation research and presents effective mitigation strategies that would benefit practitioners.

artificial intelligence, information management, machine learning, (18 more...)

doi: 10.1080/19427867.2024.2379703

2407.14541

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > Washington > King County > Seattle (0.14)
North America > United States > Texas > Harris County > Houston (0.14)
(11 more...)

Genre: Research Report (1.00)

Industry:

Transportation > Infrastructure & Services (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
(2 more...)

Islam, Tunazzina, Goldwasser, Dan

Uncovering Latent Arguments in Social Media Messaging by Employing LLMs-in-the-Loop Strategy

arXiv.org Artificial IntelligenceJul-15-2024

The widespread use of social media has led to a surge in popularity for automated methods of analyzing public opinion. Supervised methods are adept at text categorization, yet the dynamic nature of social media discussions poses a continual challenge for these techniques due to the constant shifting of the focus. On the other hand, traditional unsupervised methods for extracting themes from public discourse, such as topic modeling, often reveal overarching patterns that might not capture specific nuances. Consequently, a significant portion of research into social media discourse still depends on labor-intensive manual coding techniques and a human-in-the-loop approach, which are both time-consuming and costly. In this work, we study the problem of discovering arguments associated with a specific theme. We propose a generic LLMs-in-the-Loop strategy that leverages the advanced capabilities of Large Language Models (LLMs) to extract latent arguments from social media messaging. To demonstrate our approach, we apply our framework to contentious topics. We use two publicly available datasets: (1) the climate campaigns dataset of 14k Facebook ads with 25 themes and (2) the COVID-19 vaccine campaigns dataset of 9k Facebook ads with 14 themes. Additionally, we design a downstream task as stance prediction by leveraging talking points in climate debates. Furthermore, we analyze demographic targeting and the adaptation of messaging based on real-world events.

argument, climate change, sustainability, (15 more...)

2404.10259

Country:

North America > United States > Alaska (0.14)
North America > United States > Texas (0.05)
North America > United States > California (0.04)
(27 more...)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Vaccines (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Lim, Youngsun, Shim, Hyunjung

Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval

arXiv.org Artificial IntelligenceJul-15-2024

Text-to-image generation has shown remarkable progress with the emergence of diffusion models. However, these models often generate factually inconsistent images, failing to accurately reflect the factual information and common sense conveyed by the input text prompts. We refer to this issue as Image hallucination. Drawing from studies on hallucinations in language models, we classify this problem into three types and propose a methodology that uses factual images retrieved from external sources to generate realistic images. Depending on the nature of the hallucination, we employ off-the-shelf image editing tools, either InstructPix2Pix or IP-Adapter, to leverage factual information from the retrieved image. This approach enables the generation of images that accurately reflect the facts and common sense.

hallucination, image hallucination, information, (13 more...)

2407.10683

Country:

Europe > Germany (0.16)
Europe > Portugal (0.15)
North America > United States > California > San Francisco County > San Francisco (0.05)
(2 more...)

Genre: Research Report (0.64)

Industry: Media (0.89)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.52)

arXiv.org Artificial IntelligenceJul-14-2024

TwinS: Revisiting Non-Stationarity in Multivariate Time Series Forecasting

Hu, Jiaxi, Wen, Qingsong, Ruan, Sijie, Liu, Li, Liang, Yuxuan

Multivariate time series forecasting (MTSF) has gained widespread prominence in real-world applications, such as weather prediction, financial risk assessment, and traffic forecasting. Transformers (Vaswani et al., 2017) have emerged as the most popular approach for this task, primarily attributed to their power in capturing temporal dependencies Wen et al. (2023). Recent advances (Wu et al., 2021; Liu et al., 2021a; Zhou et al., 2022; Nie et al., 2023) have further bolstered the popularity. A long-lasting challenge in the realm of MTSF lies in effectively mitigating the non-stationarity inherent in real-world time series. In general, non-stationary time series exhibits a persistent alteration in its statistical attributes (e.g., mean and variance) and joint distribution across time, thereby diminishing its predictability. In previous work, several models have utilized time series pre-processing techniques (Passalis et al., 2019; Kim et al., 2021) to achieve stationarity or involved statistical guidance during model training (Liu et al., 2022b), resulting in significant performance enhancements. Though promising, the above endeavors still fall short of modeling the non-stationary period distribution. To verify this point, we empirically leverage the Morlet wavelet transform on the Weather dataset (Wu et al., 2021), leading to the energy distribution in Fig 1. We observe that (i) Non-stationary time series comprises multiple nested and overlapping periods, with diverse periodic patterns and varying strengths at each time step.

forecasting, information, time sery, (11 more...)

2406.0371

Country:

Pacific Ocean > North Pacific Ocean > San Francisco Bay (0.04)
Oceania > New Zealand (0.04)
Oceania > Australia (0.04)
(10 more...)

Genre: Research Report (0.82)

Industry: Banking & Finance (0.86)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Vision (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

WayveScenes101: A Dataset and Benchmark for Novel View Synthesis in Autonomous Driving

Zürn, Jannik, Gladkov, Paul, Dudas, Sofía, Cotter, Fergal, Toteva, Sofi, Shotton, Jamie, Simaiaki, Vasiliki, Mohan, Nikhil

We present WayveScenes101, a dataset designed to help the community advance the state of the art in novel view synthesis that focuses on challenging driving scenes containing many dynamic and deformable elements with changing geometry and texture. The dataset comprises 101 driving scenes across a wide range of environmental conditions and driving scenarios. The dataset is designed for benchmarking reconstructions on in-the-wild driving scenes, with many inherent challenges for scene reconstruction methods including image glare, rapid exposure changes, and highly dynamic scenes with significant occlusion. Along with the raw images, we include COLMAP-derived camera poses in standard data formats. We propose an evaluation protocol for evaluating models on held-out camera views that are off-axis from the training views, specifically testing the generalisation capabilities of methods. Finally, we provide detailed metadata for all scenes, including weather, time of day, and traffic conditions, to allow for a detailed model performance breakdown across scene characteristics. Dataset and code are available at https://github.com/wayveai/

dataset, synthesis, wayvescenes101, (9 more...)

2407.0828

Country:

Europe > Germany > Baden-Württemberg > Karlsruhe Region > Karlsruhe (0.05)
Pacific Ocean > North Pacific Ocean > San Francisco Bay (0.04)
North America > United States > District of Columbia > Washington (0.04)
(3 more...)

Genre: Research Report (0.40)

Industry:

Transportation > Ground > Road (0.66)
Information Technology > Robotics & Automation (0.42)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.52)

Simhi, Adi, Herzig, Jonathan, Szpektor, Idan, Belinkov, Yonatan

Constructing Benchmarks and Interventions for Combating Hallucinations in LLMs

Large language models (LLMs) are prone to hallucinations, which sparked a widespread effort to detect and prevent them. Recent work attempts to mitigate hallucinations by intervening in the model's generation, typically computing representative vectors of hallucinations vs. grounded generations, for steering the model's hidden states away from a hallucinatory state. However, common studies employ different setups and do not properly separate different possible causes of hallucinations, making interventions misguided. In this work, we introduce a method for categorizing examples based on the model's prior knowledge, named WACK. We construct WACK benchmarks that support interventions in two settings: open-book and closed-book question answering. Using the benchmarks, we perform an extensive investigation of the effect of different choices for intervention, such as the intervened components, and how often and how strongly to intervene. We find that intervention success varies depending on the component, with the attention blocks performing well and the residual stream proving detrimental to language modeling capabilities. We also show that interventions can benefit from representative vectors collected before, rather than after, a hallucination occurs. Finally, we introduce a new dynamic intervention, which intervenes only if needed, and thus is more robust than standard static interventions.

dataset, hallucination, intervention, (13 more...)

2404.09971

Country:

North America > United States (0.28)
Europe > France (0.04)
Asia > Middle East > Israel (0.04)
(8 more...)

Genre: Research Report (1.00)

Industry:

Media (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.32)

Graef, Nils, Clapp, Matthew, Wasielewski, Andrew

Flash normalization: fast RMSNorm for LLMs

RMSNorm is used by many LLMs such as Llama, Mistral, and OpenELM. This paper details FlashNorm, which is an exact but faster implementation of RMSNorm followed by linear layers. See https://huggingface.co/open-machine/FlashNorm for code and more transformer tricks.

linear layer, normalization, rms, (15 more...)

2407.09577

Country:

Pacific Ocean > North Pacific Ocean > San Francisco Bay (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.86)
Information Technology > Artificial Intelligence > Machine Learning (0.69)

An Improved Traditional Chinese Evaluation Suite for Foundation Model

Tam, Zhi-Rui, Pai, Ya-Ting, Lee, Yen-Wei, Chen, Jun-Da, Chu, Wei-Min, Cheng, Sega, Shuai, Hong-Han

We present TMMLU+, a new benchmark designed for Traditional Chinese language understanding. TMMLU+ is a multi-choice question-answering dataset with 66 subjects from elementary to professional level. It is six times larger and boasts a more balanced subject distribution than its predecessor, Taiwan Massive Multitask Language Understanding (TMMLU). We also benchmark closed-source models and 26 open-weight Chinese large language models (LLMs) of parameters ranging from 1.8B to 72B on the proposed TMMLU+. Our findings reveal that (1.) Traditional Chinese models still trail behind their Simplified Chinese counterparts, highlighting a need for more focused advancements in LLMs catering to Traditional Chinese. (2.) Current LLMs still fall short of human performance in average scores, indicating a potential need for future research to delve deeper into social science and humanities subjects. (3.) Among all the tokenization compression metrics examined, we identify that only the fertility score uniquely demonstrates strong correlations with our benchmark results. We foresee that TMMLU+ will pinpoint areas for future model improvement, thereby narrowing the gap between machine and human linguistic capabilities and supporting researchers in developing Traditional Chinese LLMs. Our dataset, along with the benchmark source code, is accessible at huggingface.co/datasets/ikala/tmmluplus.

english translation, language model, preprint, (14 more...)

2403.01858

Country:

Asia > Taiwan (0.26)
North America > United States (0.14)
Europe > Spain (0.14)
(6 more...)

Genre: Research Report > New Finding (0.66)

Industry:

Law (1.00)
Government (1.00)
Banking & Finance > Insurance (1.00)
(6 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial IntelligenceJul-9-2024

LETS-C: Leveraging Language Embedding for Time Series Classification

Kaur, Rachneet, Zeng, Zhen, Balch, Tucker, Veloso, Manuela

Recent advancements in language modeling have shown promising results when applied to time series data. In particular, fine-tuning pre-trained large language models (LLMs) for time series classification tasks has achieved state-of-the-art (SOTA) performance on standard benchmarks. However, these LLM-based models have a significant drawback due to the large model size, with the number of trainable parameters in the millions. In this paper, we propose an alternative approach to leveraging the success of language modeling in the time series domain. Instead of fine-tuning LLMs, we utilize a language embedding model to embed time series and then pair the embeddings with a simple classification head composed of convolutional neural networks (CNN) and multilayer perceptron (MLP). We conducted extensive experiments on well-established time series classification benchmark datasets. We demonstrated LETS-C not only outperforms the current SOTA in classification accuracy but also offers a lightweight solution, using only 14.5% of the trainable parameters on average compared to the SOTA model. Our findings suggest that leveraging language encoders to embed time series data, combined with a simple yet effective classification head, offers a promising direction for achieving high-performance time series classification while maintaining a lightweight model architecture.

accuracy, dataset, time sery, (12 more...)

2407.06533

Country:

South America > Argentina > Pampas > Buenos Aires F.D. > Buenos Aires (0.04)
Pacific Ocean > North Pacific Ocean > San Francisco Bay (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)
(2 more...)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)