AITopics | Pacific Ocean

Collaborating Authors

Pacific Ocean

Mining Causality from Continuous-time Dynamics Models: An Application to Tsunami Forecasting

Wu, Fan, Hong, Sanghyun, Rim, Donsub, Park, Noseong, Lee, Kookjin

arXiv.org Artificial IntelligenceOct-13-2022

Continuous-time dynamics models, such as neural ordinary differential equations, have enabled the modeling of underlying dynamics in time-series data and accurate forecasting. However, parameterization of dynamics using a neural network makes it difficult for humans to identify causal structures in the data. In consequence, this opaqueness hinders the use of these models in the domains where capturing causal relationships carries the same importance as accurate predictions, e.g., tsunami forecasting. In this paper, we address this challenge by proposing a mechanism for mining causal structures from continuous-time models. We train models to capture the causal structure by enforcing sparsity in the weights of the input layers of the dynamics models. We first verify the effectiveness of our method in the scenario where the exact causal-structures of time-series are known as a priori. We next apply our method to a real-world problem, namely tsunami forecasting, where the exact causal-structures are difficult to characterize. Experimental results show that the proposed method is effective in learning physically-consistent causal relationships while achieving high forecasting accuracy.

artificial intelligence, deep learning, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2210.04958

Country:

North America > United States > Arizona (0.04)
Pacific Ocean > North Pacific Ocean > Puget Sound (0.04)
North America > United States > Oregon (0.04)

Genre: Research Report > New Finding (0.66)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Regularized Graph Structure Learning with Semantic Knowledge for Multi-variates Time-Series Forecasting

Yu, Hongyuan, Li, Ting, Yu, Weichen, Li, Jianguo, Huang, Yan, Wang, Liang, Liu, Alex

arXiv.org Artificial IntelligenceOct-12-2022

Multivariate time-series forecasting is a critical task for many applications, and graph time-series network is widely studied due to its capability to capture the spatial-temporal correlation simultaneously. However, most existing works focus more on learning with the explicit prior graph structure, while ignoring potential information from the implicit graph structure, yielding incomplete structure modeling. Some recent works attempt to learn the intrinsic or implicit graph structure directly while lacking a way to combine explicit prior structure with implicit structure together. In this paper, we propose Regularized Graph Structure Learning (RGSL) model to incorporate both explicit prior structure and implicit structure together, and learn the forecasting deep networks along with the graph structure. RGSL consists of two innovative modules. First, we derive an implicit dense similarity matrix through node embedding, and learn the sparse graph structure using the Regularized Graph Generation (RGG) based on the Gumbel Softmax trick. Second, we propose a Laplacian Matrix Mixed-up Module (LM3) to fuse the explicit graph and implicit graph together. We conduct experiments on three real-word datasets. Results show that the proposed RGSL model outperforms existing graph forecasting algorithms with a notable margin, while learning meaningful graph structure simultaneously. Our code and models are made publicly available at https://github.com/alipay/RGSL.git.

artificial intelligence, forecasting, machine learning, (19 more...)

arXiv.org Artificial Intelligence

doi: 10.24963/ijcai.2022/328

2210.06126

Country:

North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.05)
Pacific Ocean > North Pacific Ocean > San Francisco Bay (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)
Asia > China > Beijing > Beijing (0.04)

Genre: Research Report (0.84)

Industry: Information Technology (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Add feedback

Are Driverless Cars the Future of Transportation?

#artificialintelligenceOct-11-2022, 02:05:38 GMT

What do you think about driverless cars? Would you ride in one? Do you think they are the way of the future? In "Stuck on the Streets of San Francisco in a Driverless Car," the Times technology reporter Cade Metz went for a ride in the back seat of an experimental autonomous vehicle and wrote about his experience: It was about 9 p.m. on a cool Tuesday evening in San Francisco this month when I hailed a car outside a restaurant a few blocks from Golden Gate Park. A few minutes later, as I waited at a stoplight, a white Mercedes pulled up next to me.

driverless car, san francisco, transportation, (6 more...)

#artificialintelligence

Country:

North America > United States > California > San Francisco County > San Francisco (0.51)
Pacific Ocean > North Pacific Ocean > San Francisco Bay > Golden Gate (0.25)
Europe > United Kingdom (0.05)

Genre:

Personal > Interview (0.98)
Questionnaire & Opinion Survey (0.63)

Industry:

Transportation > Passenger (1.00)
Transportation > Ground > Road (1.00)
Information Technology > Robotics & Automation (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)

Add feedback

Chinese Discourse Annotation Reference Manual

Peng, Siyao, Liu, Yang Janet, Zeldes, Amir

arXiv.org Artificial IntelligenceOct-11-2022

This document provides extensive guidelines and examples for Rhetorical Structure Theory (RST) annotation in Mandarin Chinese. The guideline is divided into three sections. We first introduce preprocessing steps to prepare data for RST annotation. Secondly, we discuss syntactic criteria to segment texts into Elementary Discourse Units (EDUs). Lastly, we provide examples to define and distinguish discourse relations in different genres. We hope that this reference manual can facilitate RST annotations in Chinese and accelerate the development of the RST framework across languages.

artificial intelligence, natural language, relation, (18 more...)

arXiv.org Artificial Intelligence

2212.06037

Country:

Oceania > Palau (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)
Asia > China > Tianjin Province > Tianjin (0.04)
(18 more...)

Genre: Research Report (0.81)

Industry:

Media (1.00)
Information Technology (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
(5 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.46)

Add feedback

Deep Mind's AlphaZero Game Playing AI -- Reduces Compute Time, Cuts Costs & Saves Energy

#artificialintelligenceOct-10-2022, 20:45:06 GMT

CIOs & CTOs, the MIT Technology Review reported on October 5, 2022 reported that "DeepMind's game-playing AI AlphaZero has beaten a 50-year-old record in computer science. DeepMind has used its board-game playing AI to discover a faster way to solve a fundamental math problem in computer science, beating a record that has stood for more than 50 years. The problem, matrix multiplication, is a crucial type of calculation at the heart of many different applications, from displaying images on a screen to simulating complex physics. It is also fundamental to machine learning itself. Speeding up this calculation could have a big impact on thousands of everyday computer tasks, cutting costs and saving energy."

cut cost & save energy, learning, reduce compute time, (8 more...)

#artificialintelligence

Country:

Pacific Ocean > North Pacific Ocean > San Francisco Bay (0.08)
North America > United States > Minnesota > Hennepin County > Eden Prairie (0.08)
North America > United States > California > San Francisco County > San Francisco (0.08)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Learning to Reconstruct Missing Data from Spatiotemporal Graphs with Sparse Observations

Marisca, Ivan, Cini, Andrea, Alippi, Cesare

arXiv.org Artificial IntelligenceOct-10-2022

Modeling multivariate time series as temporal signals over a (possibly dynamic) graph is an effective representational framework that allows for developing models for time series analysis. In fact, discrete sequences of graphs can be processed by autoregressive graph neural networks to recursively learn representations at each discrete point in time and space. Spatiotemporal graphs are often highly sparse, with time series characterized by multiple, concurrent, and long sequences of missing data, e.g., due to the unreliable underlying sensor network. In this context, autoregressive models can be brittle and exhibit unstable learning dynamics. The objective of this paper is, then, to tackle the problem of learning effective models to reconstruct, i.e., impute, missing data points by conditioning the reconstruction only on the available observations. In particular, we propose a novel class of attention-based architectures that, given a set of highly sparse discrete observations, learn a representation for points in time and space by exploiting a spatiotemporal propagation architecture aligned with the imputation task. Representations are trained end-to-end to reconstruct observations w.r.t. the corresponding sensor and its neighboring nodes. Compared to the state of the art, our model handles sparse data without propagating prediction errors or requiring a bidirectional model to encode forward and backward time dependencies. Empirical results on representative benchmarks show the effectiveness of the proposed method.

artificial intelligence, machine learning, representation, (16 more...)

arXiv.org Artificial Intelligence

2205.13479

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Pacific Ocean > North Pacific Ocean > San Francisco Bay (0.04)
North America > United States > New York > New York County > New York City (0.04)
(4 more...)

Genre: Research Report (0.64)

Industry: Energy (0.67)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

PaLM: Scaling Language Modeling with Pathways

Chowdhery, Aakanksha, Narang, Sharan, Devlin, Jacob, Bosma, Maarten, Mishra, Gaurav, Roberts, Adam, Barham, Paul, Chung, Hyung Won, Sutton, Charles, Gehrmann, Sebastian, Schuh, Parker, Shi, Kensen, Tsvyashchenko, Sasha, Maynez, Joshua, Rao, Abhishek, Barnes, Parker, Tay, Yi, Shazeer, Noam, Prabhakaran, Vinodkumar, Reif, Emily, Du, Nan, Hutchinson, Ben, Pope, Reiner, Bradbury, James, Austin, Jacob, Isard, Michael, Gur-Ari, Guy, Yin, Pengcheng, Duke, Toju, Levskaya, Anselm, Ghemawat, Sanjay, Dev, Sunipa, Michalewski, Henryk, Garcia, Xavier, Misra, Vedant, Robinson, Kevin, Fedus, Liam, Zhou, Denny, Ippolito, Daphne, Luan, David, Lim, Hyeontaek, Zoph, Barret, Spiridonov, Alexander, Sepassi, Ryan, Dohan, David, Agrawal, Shivani, Omernick, Mark, Dai, Andrew M., Pillai, Thanumalayan Sankaranarayana, Pellat, Marie, Lewkowycz, Aitor, Moreira, Erica, Child, Rewon, Polozov, Oleksandr, Lee, Katherine, Zhou, Zongwei, Wang, Xuezhi, Saeta, Brennan, Diaz, Mark, Firat, Orhan, Catasta, Michele, Wei, Jason, Meier-Hellstern, Kathy, Eck, Douglas, Dean, Jeff, Petrov, Slav, Fiedel, Noah

arXiv.org Artificial IntelligenceOct-5-2022

Large language models have been shown to achieve remarkable performance across a variety of natural language tasks using few-shot learning, which drastically reduces the number of task-specific training examples needed to adapt the model to a particular application. To further our understanding of the impact of scale on few-shot learning, we trained a 540-billion parameter, densely activated, Transformer language model, which we call Pathways Language Model PaLM. We trained PaLM on 6144 TPU v4 chips using Pathways, a new ML system which enables highly efficient training across multiple TPU Pods. We demonstrate continued benefits of scaling by achieving state-of-the-art few-shot learning results on hundreds of language understanding and generation benchmarks. On a number of these tasks, PaLM 540B achieves breakthrough performance, outperforming the finetuned state-of-the-art on a suite of multi-step reasoning tasks, and outperforming average human performance on the recently released BIG-bench benchmark. A significant number of BIG-bench tasks showed discontinuous improvements from model scale, meaning that performance steeply increased as we scaled to our largest model. PaLM also has strong capabilities in multilingual tasks and source code generation, which we demonstrate on a wide array of benchmarks. We additionally provide a comprehensive analysis on bias and toxicity, and study the extent of training data memorization with respect to model scale. Finally, we discuss the ethical considerations related to large language models and discuss potential mitigation strategies.

large language model, machine learning, natural language, (22 more...)

arXiv.org Artificial Intelligence

2204.02311

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)
(26 more...)

Genre: Research Report > New Finding (0.45)

Industry:

Leisure & Entertainment (1.00)
Law (1.00)
Health & Medicine (1.00)
(7 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(4 more...)

Add feedback

Learning Spatially-Adaptive Squeeze-Excitation Networks for Image Synthesis and Image Recognition

Shen, Jianghao, Wu, Tianfu

arXiv.org Artificial IntelligenceOct-3-2022

Learning light-weight yet expressive deep networks in both image synthesis and image recognition remains a challenging problem. Inspired by a more recent observation that it is the data-specificity that makes the multi-head self-attention (MHSA) in the Transformer model so powerful, this paper proposes to extend the widely adopted light-weight Squeeze-Excitation (SE) module to be spatially-adaptive to reinforce its data specificity, as a convolutional alternative of the MHSA, while retaining the efficiency of SE and the inductive basis of convolution. It presents two designs of spatially-adaptive squeeze-excitation (SASE) modules for image synthesis and image recognition respectively. For image synthesis tasks, the proposed SASE is tested in both low-shot and one-shot learning tasks. It shows better performance than prior arts. For image recognition tasks, the proposed SASE is used as a drop-in replacement for convolution layers in ResNets and achieves much better accuracy than the vanilla ResNets, and slightly better than the MHSA counterparts such as the Swin-Transformer and Pyramid-Transformer in the ImageNet-1000 dataset, with significantly smaller models.

machine learning, pattern recognition, sase, (12 more...)

arXiv.org Artificial Intelligence

2112.14804

Country:

Pacific Ocean > North Pacific Ocean > San Francisco Bay > Golden Gate (0.04)
North America > United States > North Carolina (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition > Image Matching (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Celebrate over 20 years of AI/ML at Innovation Day

#artificialintelligenceSep-30-2022, 23:23:18 GMT

Be our guest as we celebrate 20 years of AI/ML innovation on October 25, 2022, 9:00 AM – 10:30 AM PT. The first 1,500 people to register will receive $50 of AWS credits. Over the past 20 years, Amazon has delivered many world firsts for artificial intelligence (AI) and machine learning (ML). ML is an integral part of Amazon and is used for everything from applying personalization models at checkout, to forecasting the demand for products globally, to creating autonomous flight for Amazon Prime Air drones, to natural language processing (NLP) on Alexa. And the use of ML isn't slowing down anytime soon, because ML helps Amazon exceed customer expectations for convenience, cost, and delivery speed.

ai ml, decision-making, innovation day, (8 more...)

#artificialintelligence

Country:

Pacific Ocean > North Pacific Ocean > San Francisco Bay (0.05)
North America > United States > California > San Francisco County > San Francisco (0.05)
Asia > Middle East > Jordan (0.05)

Industry: Retail > Online (0.71)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Cloud Classification with Unsupervised Deep Learning

Kurihana, Takuya, Foster, Ian, Willett, Rebecca, Jenkins, Sydney, Koenig, Kathryn, Werman, Ruby, Lourenco, Ricardo Barros, Neo, Casper, Moyer, Elisabeth

arXiv.org Artificial IntelligenceSep-30-2022

We present a framework for cloud characterization that leverages modern unsupervised deep learning technologies. While previous neural network-based cloud classification models have used supervised learning methods, unsupervised learning allows us to avoid restricting the model to artificial categories based on historical cloud classification schemes and enables the discovery of novel, more detailed classifications. Our framework learns cloud features directly from radiance data produced by NASA's Moderate Resolution Imaging Spectroradiometer (MODIS) satellite instrument, deriving cloud characteristics from millions of images without relying on pre-defined cloud types during the training process. We present preliminary results showing that our method extracts physically relevant information from radiance data and produces meaningful cloud classes.

artificial intelligence, classification, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2209.15585

Country:

North America > United States > Illinois > Cook County > Chicago (0.06)
Pacific Ocean (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)

Genre: Research Report > New Finding (0.34)

Industry:

Government > Space Agency (0.34)
Government > Regional Government > North America Government > United States Government (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback