AITopics | Pacific Ocean

Collaborating Authors

Pacific Ocean

Resprompt: Residual Connection Prompting Advances Multi-Step Reasoning in Large Language Models

Jiang, Song, Shakeri, Zahra, Chan, Aaron, Sanjabi, Maziar, Firooz, Hamed, Xia, Yinglong, Akyildiz, Bugra, Sun, Yizhou, Li, Jinchao, Wang, Qifan, Celikyilmaz, Asli

arXiv.org Artificial IntelligenceOct-7-2023

Chain-of-thought (CoT) prompting, which offers step-by-step problem-solving rationales, has impressively unlocked the reasoning potential of large language models (LLMs). Yet, the standard CoT is less effective in problems demanding multiple reasoning steps. This limitation arises from the complex reasoning process in multi-step problems: later stages often depend on the results of several steps earlier, not just the results of the immediately preceding step. Such complexities suggest the reasoning process is naturally represented as a graph. The almost linear and straightforward structure of CoT prompting, however, struggles to capture this complex reasoning graph. To address this challenge, we propose Residual Connection Prompting (RESPROMPT), a new prompting strategy that advances multi-step reasoning in LLMs. Our key idea is to reconstruct the reasoning graph within prompts. We achieve this by integrating necessary connections-links present in the reasoning graph but missing in the linear CoT flow-into the prompts. Termed "residual connections", these links are pivotal in morphing the linear CoT structure into a graph representation, effectively capturing the complex reasoning graphs inherent in multi-step problems. We evaluate RESPROMPT on six benchmarks across three diverse domains: math, sequential, and commonsense reasoning. For the open-sourced LLaMA family of models, RESPROMPT yields a significant average reasoning accuracy improvement of 12.5% on LLaMA-65B and 6.8% on LLaMA2-70B. Breakdown analysis further highlights RESPROMPT particularly excels in complex multi-step reasoning: for questions demanding at least five reasoning steps, RESPROMPT outperforms the best CoT based benchmarks by a remarkable average improvement of 21.1% on LLaMA-65B and 14.3% on LLaMA2-70B. Through extensive ablation studies and analyses, we pinpoint how to most effectively build residual connections.

beaker, residual connection, rompt, (16 more...)

arXiv.org Artificial Intelligence

2310.04743

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > California > Los Angeles County > Los Angeles (0.14)
Africa > Rwanda > Kigali > Kigali (0.04)
(12 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Education (0.93)
Leisure & Entertainment > Sports > Hockey (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.89)

Add feedback

A Structured Matrix Method for Nonequispaced Neural Operators

Lingsch, Levi, Michelis, Mike, de Bezenac, Emmanuel, Perera, Sirani M., Katzschmann, Robert K., Mishra, Siddhartha

arXiv.org Artificial IntelligenceOct-6-2023

The computational efficiency of many neural operators, widely used for learning solutions of PDEs, relies on the fast Fourier transform (FFT) for performing spectral computations. However, as FFT is limited to equispaced (rectangular) grids, this limits the efficiency of such neural operators when applied to problems where the input and output functions need to be processed on general non-equispaced point distributions. We address this issue by proposing a novel method that leverages batch matrix multiplications to efficiently construct Vandermonde-structured matrices and compute forward and inverse transforms, on arbitrarily distributed points. An efficient implementation of such structured matrix methods is coupled with existing neural operator models to allow the processing of data on arbitrary non-equispaced distributions of points. With extensive empirical evaluation, we demonstrate that the proposed method allows one to extend neural operators to very general point distributions with significant gains in training speed over baselines, while retaining or improving accuracy.

matrix, neural operator, point distribution, (14 more...)

arXiv.org Artificial Intelligence

2305.19663

Country:

Europe > Switzerland > Zürich > Zürich (0.15)
South America (0.04)
Pacific Ocean (0.04)
(2 more...)

Genre: Research Report (0.84)

Industry: Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.93)
Information Technology > Data Science > Data Quality > Data Transformation (0.71)

Add feedback

Beyond Tides and Time: Machine Learning Triumph in Water Quality

Li, Yinpu, Mao, Siqi, Yuan, Yaping, Wang, Ziren, Kang, Yixin, Yao, Yuanxin

arXiv.org Machine LearningOct-6-2023

Water resources are essential for sustaining human livelihoods and environmental well being. Accurate water quality prediction plays a pivotal role in effective resource management and pollution mitigation. In this study, we assess the effectiveness of five distinct predictive models linear regression, Random Forest, XGBoost, LightGBM, and MLP neural network, in forecasting pH values within the geographical context of Georgia, USA. Notably, LightGBM emerges as the top performing model, achieving the highest average precision. Our analysis underscores the supremacy of tree-based models in addressing regression challenges, while revealing the sensitivity of MLP neural networks to feature scaling. Intriguingly, our findings shed light on a counterintuitive discovery: machine learning models, which do not explicitly account for time dependencies and spatial considerations, outperform spatial temporal models. This unexpected superiority of machine learning models challenges conventional assumptions and highlights their potential for practical applications in water quality prediction. Our research aims to establish a robust predictive pipeline accessible to both data science experts and those without domain specific knowledge. In essence, we present a novel perspective on achieving high prediction accuracy and interpretability in data science methodologies. Through this study, we redefine the boundaries of water quality forecasting, emphasizing the significance of data driven approaches over traditional spatial temporal models. Our findings offer valuable insights into the evolving landscape of water resource management and environmental protection.

artificial intelligence, machine learning, prediction, (17 more...)

arXiv.org Machine Learning

2309.16951

Country:

North America > United States > Florida > Leon County > Tallahassee (0.05)
North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.04)
Asia > China (0.04)
(7 more...)

Genre: Research Report > New Finding (1.00)

Industry: Water & Waste Management > Water Management > Water Supplies & Services (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.69)

Add feedback

Assassin's Creed Mirage: What to know about the 'Golden Age' of Baghdad

Al JazeeraOct-5-2023, 09:39:39 GMT

Whether you dream of holstering a flintlock pistol and sailing through the 18th-century Golden Age of Piracy or leading a clan of Vikings to settle in the fractured Anglo-Saxon kingdoms of the 9th century, Assassin's Creed video games have you covered. Since 2007, the popular action-adventure series created by video game publisher Ubisoft has been taking gamers on adventures around the globe through different historical periods. With its 13th instalment released on Thursday, Assassin's Creed Mirage attempts to immerse players in Iraq's 9th-century Baghdad during the rule of the Abbasid Caliphate, when it was one of the most significant cities in the world. Today's capital of Iraq is often associated, especially by those in the West, with the United States war and the destruction it brought more than two decades ago. But in Assassin's Creed Mirage, the game attempts to give players a glimpse into the rich and diverse history of the Abbasid Caliphate during the Islamic Golden Age.

abbasid caliphate, baghdad, golden age, (11 more...)

Al Jazeera

Country:

Asia > Middle East > Iraq > Baghdad Governorate > Baghdad (0.69)
North America > United States (0.25)
Africa (0.06)
(6 more...)

Industry: Leisure & Entertainment > Games > Computer Games (1.00)

Technology: Information Technology > Artificial Intelligence > Games > Computer Games (1.00)

Add feedback

Break-A-Scene: Extracting Multiple Concepts from a Single Image

Avrahami, Omri, Aberman, Kfir, Fried, Ohad, Cohen-Or, Daniel, Lischinski, Dani

arXiv.org Artificial IntelligenceOct-4-2023

Text-to-image model personalization aims to introduce a user-provided concept to the model, allowing its synthesis in diverse contexts. However, current methods primarily focus on the case of learning a single concept from multiple images with variations in backgrounds and poses, and struggle when adapted to a different scenario. In this work, we introduce the task of textual scene decomposition: given a single image of a scene that may contain several concepts, we aim to extract a distinct text token for each concept, enabling fine-grained control over the generated scenes. To this end, we propose augmenting the input image with masks that indicate the presence of target concepts. These masks can be provided by the user or generated automatically by a pre-trained segmentation model. We then present a novel two-phase customization process that optimizes a set of dedicated textual embeddings (handles), as well as the model weights, striking a delicate balance between accurately capturing the concepts and avoiding overfitting. We employ a masked diffusion loss to enable handles to generate their assigned concepts, complemented by a novel loss on cross-attention maps to prevent entanglement. We also introduce union-sampling, a training strategy aimed to improve the ability of combining multiple concepts in generated images. We use several automatic metrics to quantitatively compare our method against several baselines, and further affirm the results using a user study. Finally, we showcase several applications of our method. Project page is available at: https://omriavrahami.com/break-a-scene/

background, input image, sa conference paper, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3610548.3618154

2305.16311

Country:

Oceania > Australia > New South Wales > Sydney (0.06)
Asia > Middle East > Israel > Jerusalem District > Jerusalem (0.04)
Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)
(5 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)

Add feedback

Demystifying CLIP Data

Xu, Hu, Xie, Saining, Tan, Xiaoqing Ellen, Huang, Po-Yao, Howes, Russell, Sharma, Vasu, Li, Shang-Wen, Ghosh, Gargi, Zettlemoyer, Luke, Feichtenhofer, Christoph

arXiv.org Artificial IntelligenceOct-2-2023

Contrastive Language-Image Pre-training (CLIP) is an approach that has advanced research and applications in computer vision, fueling modern recognition systems and generative models. We believe that the main ingredient to the success of CLIP is its data and not the model architecture or pre-training objective. However, CLIP only provides very limited information about its data and how it has been collected, leading to works that aim to reproduce CLIP's data by filtering with its model parameters. In this work, we intend to reveal CLIP's data curation approach and in our pursuit of making it open to the community introduce Metadata-Curated Language-Image Pre-training (MetaCLIP). MetaCLIP takes a raw data pool and metadata (derived from CLIP's concepts) and yields a balanced subset over the metadata distribution. Our experimental study rigorously isolates the model and training settings, concentrating solely on data. MetaCLIP applied to CommonCrawl with 400M image-text data pairs outperforms CLIP's data on multiple standard benchmarks. In zero-shot ImageNet classification, MetaCLIP achieves 70.8% accuracy, surpassing CLIP's 68.3% on ViT-B models. Scaling to 1B data, while maintaining the same training budget, attains 72.4%. Our observations hold across various model sizes, exemplified by ViT-H achieving 80.5%, without any bells-and-whistles. Curation code and training data distribution on metadata is made available at https://github.com/facebookresearch/MetaCLIP.

curation, image-text pair, metaclip, (16 more...)

arXiv.org Artificial Intelligence

2309.16671

Country:

Europe > United Kingdom > England > Staffordshire (0.04)
Pacific Ocean > North Pacific Ocean > Sea of Japan (0.04)
North America > United States > New York (0.04)
Asia > South Korea > Gangwon-do > Gangneung (0.04)

Genre:

Research Report > New Finding (0.48)
Research Report > Experimental Study (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

Unified Data Management and Comprehensive Performance Evaluation for Urban Spatial-Temporal Prediction [Experiment, Analysis & Benchmark]

Jiang, Jiawei, Han, Chengkai, Zhao, Wayne Xin, Wang, Jingyuan

arXiv.org Artificial IntelligenceOct-2-2023

The field of urban spatial-temporal prediction is advancing rapidly with the development of deep learning techniques and the availability of large-scale datasets. However, challenges persist in accessing and utilizing diverse urban spatial-temporal datasets from different sources and stored in different formats, as well as determining effective model structures and components with the proliferation of deep learning models. This work addresses these challenges and provides three significant contributions. Firstly, we introduce "atomic files", a unified storage format designed for urban spatial-temporal big data, and validate its effectiveness on 40 diverse datasets, simplifying data management. Secondly, we present a comprehensive overview of technological advances in urban spatial-temporal prediction models, guiding the development of robust models. Thirdly, we conduct extensive experiments using diverse models and datasets, establishing a performance leaderboard and identifying promising research directions. Overall, this work effectively manages urban spatial-temporal data, guides future efforts, and facilitates the development of accurate and efficient urban spatial-temporal prediction models. It can potentially make long-term contributions to urban spatial-temporal data management and prediction, ultimately leading to improved urban living standards.

dataset, prediction, spatial-temporal data, (15 more...)

arXiv.org Artificial Intelligence

2308.12899

Country:

North America > United States > New York (0.05)
Asia > China > Beijing > Beijing (0.05)
North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.04)
(10 more...)

Genre: Research Report (1.00)

Industry: Transportation > Ground > Road (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

CausalTime: Realistically Generated Time-series for Benchmarking of Causal Discovery

Cheng, Yuxiao, Wang, Ziqian, Xiao, Tingxiong, Zhong, Qin, Suo, Jinli, He, Kunlun

arXiv.org Machine LearningOct-2-2023

Time-series causal discovery (TSCD) is a fundamental problem of machine learning. However, existing synthetic datasets cannot properly evaluate or predict the algorithms' performance on real data. This study introduces the CausalTime pipeline to generate time-series that highly resemble the real data and with ground truth causal graphs for quantitative performance evaluation. The pipeline starts from real observations in a specific scenario and produces a matching benchmark dataset. Firstly, we harness deep neural networks along with normalizing flow to accurately capture realistic dynamics. Secondly, we extract hypothesized causal graphs by performing importance analysis on the neural network or leveraging prior knowledge. Thirdly, we derive the ground truth causal graphs by splitting the causal model into causal term, residual term, and noise term. Lastly, using the fitted network and the derived causal graph, we generate corresponding versatile time-series proper for algorithm assessment. In the experiments, we validate the fidelity of the generated data through qualitative and quantitative experiments, followed by a benchmarking of existing TSCD algorithms using these generated datasets. CausalTime offers a feasible solution to evaluating TSCD algorithms in real applications and can be generalized to a wide range of fields. For easy use of the proposed approach, we also provide a user-friendly website, hosted on www.causaltime.cc.

artificial intelligence, deep learning, machine learning, (14 more...)

arXiv.org Machine Learning

2310.01753

Country:

Pacific Ocean > North Pacific Ocean > San Francisco Bay (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > District of Columbia > Washington (0.04)
(3 more...)

Genre: Research Report (0.64)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

LibCity: A Unified Library Towards Efficient and Comprehensive Urban Spatial-Temporal Prediction

Jiang, Jiawei, Han, Chengkai, Jiang, Wenjun, Zhao, Wayne Xin, Wang, Jingyuan

arXiv.org Artificial IntelligenceOct-1-2023

As deep learning technology advances and more urban spatial-temporal data accumulates, an increasing number of deep learning models are being proposed to solve urban spatial-temporal prediction problems. However, there are limitations in the existing field, including open-source data being in various formats and difficult to use, few papers making their code and data openly available, and open-source models often using different frameworks and platforms, making comparisons challenging. A standardized framework is urgently needed to implement and evaluate these methods. To address these issues, we propose LibCity, an open-source library that offers researchers a credible experimental tool and a convenient development framework. In this library, we have reproduced 65 spatial-temporal prediction models and collected 55 spatial-temporal datasets, allowing researchers to conduct comprehensive experiments conveniently. By enabling fair model comparisons, designing a unified data storage format, and simplifying the process of developing new models, LibCity is poised to make significant contributions to the spatial-temporal prediction field.

dataset, libcity, prediction, (15 more...)

arXiv.org Artificial Intelligence

2304.14343

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
North America > United States > New York (0.05)
Asia > China > Beijing > Beijing (0.05)
(13 more...)

Genre: Research Report (1.00)

Industry:

Transportation > Infrastructure & Services (0.69)
Transportation > Ground > Road (0.69)
Information Technology (0.68)
Transportation > Passenger (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)

Add feedback

CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing

Gou, Zhibin, Shao, Zhihong, Gong, Yeyun, Shen, Yelong, Yang, Yujiu, Duan, Nan, Chen, Weizhu

arXiv.org Artificial IntelligenceSep-30-2023

Recent developments in large language models (LLMs) have been impressive. However, these models sometimes show inconsistencies and problematic behavior, such as hallucinating facts, generating flawed code, or creating offensive and toxic content. Unlike these models, humans typically utilize external tools to cross-check and refine their initial content, like using a search engine for fact-checking, or a code interpreter for debugging. Inspired by this observation, we introduce a framework called CRITIC that allows LLMs, which are essentially "black boxes" to validate and progressively amend their own outputs in a manner similar to human interaction with tools. More specifically, starting with an initial output, CRITIC interacts with appropriate tools to evaluate certain aspects of the text, and then revises the output based on the feedback obtained during this validation process. Comprehensive evaluations involving free-form question answering, mathematical program synthesis, and toxicity reduction demonstrate that CRITIC consistently enhances the performance of LLMs. Meanwhile, our research highlights the crucial importance of external feedback in promoting the ongoing self-improvement of LLMs.

answer plausible, elizabeth perkin, track and field title, (15 more...)

arXiv.org Artificial Intelligence

2305.11738

Country:

Asia > North Korea (0.28)
North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > Georgia (0.14)
(48 more...)

Genre:

Overview (1.00)
Research Report > New Finding (0.92)
Personal > Honors > Award (0.46)

Industry:

Transportation (1.00)
Media > Music (1.00)
Media > Film (1.00)
(6 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback