AITopics | Indian Ocean


Indian Ocean

Koopman Invertible Autoencoder: Leveraging Forward and Backward Dynamics for Temporal Modeling

Tayal, Kshitij, Renganathan, Arvind, Ghosh, Rahul, Jia, Xiaowei, Kumar, Vipin

arXiv.org Artificial Intelligence, Sep-18-2023

Accurate long-term predictions are the foundations for many machine learning applications and decision-making processes. However, building accurate long-term prediction models remains challenging due to the limitations of existing temporal models like recurrent neural networks (RNNs), as they capture only the statistical connections in the training data and may fail to learn the underlying dynamics of the target system. To tackle this challenge, we propose a novel machine learning model based on Koopman operator theory, which we call Koopman Invertible Autoencoders (KIA), that captures the inherent characteristic of the system by modeling both forward and backward dynamics in the infinite-dimensional Hilbert space. This enables us to efficiently learn low-dimensional representations, resulting in more accurate predictions of long-term system behavior. Moreover, our method's invertibility design guarantees reversibility and consistency in both forward and inverse operations. We illustrate the utility of KIA on pendulum and climate datasets, demonstrating 300% improvements in long-term prediction capability for pendulum while maintaining robustness against noise. Additionally, our method excels in long-term climate prediction, further validating our method's effectiveness.
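The idea of pairing forward and backward dynamics can be illustrated with a toy linear operator. The sketch below is not the KIA model from the abstract: it assumes a hand-picked 2x2 rotation as the latent forward operator and its exact inverse as the backward operator, and the names `rotation`, `rollout`, and `THETA` are hypothetical. It only shows what "reversibility and consistency" means for a linear latent evolution: rolling forward and then backward should recover the starting state.

```python
import math


def rotation(theta):
    """Return a 2x2 rotation matrix, used here as a toy linear latent operator."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s], [s, c]]


def apply(matrix, state):
    """Multiply a 2x2 matrix by a 2-vector."""
    return [
        matrix[0][0] * state[0] + matrix[0][1] * state[1],
        matrix[1][0] * state[0] + matrix[1][1] * state[1],
    ]


def rollout(matrix, state, steps):
    """Apply the operator repeatedly and return every visited state."""
    states = [state]
    for _ in range(steps):
        state = apply(matrix, state)
        states.append(state)
    return states


THETA = 0.1  # hypothetical per-step phase advance, chosen for illustration
forward_op = rotation(THETA)    # advances the latent state by one step
backward_op = rotation(-THETA)  # exact inverse of the forward operator

z0 = [1.0, 0.0]
forward_states = rollout(forward_op, z0, 50)
recovered = rollout(backward_op, forward_states[-1], 50)[-1]

# Consistency check: 50 steps forward then 50 steps backward returns to z0.
round_trip_error = math.hypot(recovered[0] - z0[0], recovered[1] - z0[1])
print(round_trip_error)
```

In a learned model the two operators would be fitted from data rather than fixed, and a round-trip error like this one could serve as a consistency penalty; that training detail is an assumption here, not something the abstract specifies.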

koopman operator, neural network, prediction, (14 more...)

arXiv.org Artificial Intelligence

2309.10291

Country:

Asia > Southeast Asia (0.05)
North America > United States > Minnesota (0.04)
Indian Ocean > Arabian Gulf (0.04)
(2 more...)

Genre:

Research Report (0.64)
Overview (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)


Multi-fidelity climate model parameterization for better generalization and extrapolation

Bhouri, Mohamed Aziz, Peng, Liran, Pritchard, Michael S., Gentine, Pierre

arXiv.org Artificial Intelligence, Sep-18-2023

Machine-learning-based parameterizations (i.e., representations of sub-grid processes) of global climate models or turbulent simulations have recently been proposed as a powerful alternative to physical, but empirical, representations, offering a lower computational cost and higher accuracy. Yet those approaches still suffer from a lack of generalization and extrapolation beyond the training data, which is nevertheless critical to projecting climate change or unobserved regimes of turbulence. Here we show that a multi-fidelity approach, which integrates datasets of different accuracy and abundance, can provide the best of both worlds: the capacity to extrapolate by leveraging the physically-based parameterization and a higher accuracy using the machine-learning-based parameterizations. In an application to climate modeling, the multi-fidelity framework yields more accurate climate projections without requiring a major increase in computational resources. Our multi-fidelity randomized prior networks (MF-RPNs) combine physical parameterization data as low-fidelity data and data from storm-resolving historical runs as high-fidelity data. To extrapolate beyond the training data, the MF-RPNs are tested on data from high-fidelity warming scenarios (+4 K). We show that MF-RPNs return much more skillful predictions than either low- or high-fidelity (historical data) simulations trained on only one regime, while providing trustworthy uncertainty quantification across a wide range of scenarios. Our approach paves the way for the use of machine-learning-based methods that can optimally leverage historical observations or high-fidelity simulations and extrapolate to unseen regimes such as climate change.

parameterization, tendency, vertical level, (17 more...)

arXiv.org Artificial Intelligence

2309.10231

Country:

Atlantic Ocean > South Atlantic Ocean (0.04)
North America > United States > New York > New York County > New York City (0.04)
South America (0.04)
(8 more...)

Genre: Research Report > New Finding (0.46)

Industry: Energy (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)


Empowering Fake-News Mitigation: Insights from Sharers' Social Media Post-Histories

Schoenmueller, Verena, Blanchard, Simon J., Johar, Gita V.

arXiv.org Artificial Intelligence, Sep-18-2023

Misinformation is a global concern and limiting its spread is critical for protecting democracy, public health, and consumers. We propose that consumers' own social media post-histories are an underutilized data source to study what leads them to share links to fake-news. In Study 1, we explore how textual cues extracted from post-histories distinguish fake-news sharers from random social media users and others in the misinformation ecosystem. Among other results, we find across two datasets that fake-news sharers use more words related to anger, religion and power. In Study 2, we show that adding textual cues from post-histories improves the accuracy of models to predict who is likely to share fake-news. In Study 3, we provide a preliminary test of two mitigation strategies deduced from Study 1 - activating religious values and reducing anger - and find that they reduce fake-news sharing and sharing more generally. In Study 4, we combine survey responses with users' verified Twitter post-histories and show that using empowering language in a fact-checking browser extension ad increases download intentions. Our research encourages marketers, misinformation scholars, and practitioners to use post-histories to develop theories and test interventions to reduce the spread of misinformation.

fake-news sharer, sharer, textual cue, (17 more...)

arXiv.org Artificial Intelligence

2203.1056

Country:

North America > United States > Illinois > Cook County > Chicago (0.04)
North America > United States > California > Los Angeles County > Los Angeles (0.04)
North America > United States > New York > New York County > New York City (0.04)
(21 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Questionnaire & Opinion Survey (1.00)
Overview (0.92)

Industry:

Media > News (1.00)
Leisure & Entertainment > Sports > Football (1.00)
Leisure & Entertainment > Sports > Basketball (1.00)
(5 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)


X-PARADE: Cross-Lingual Textual Entailment and Information Divergence across Paragraphs

Rodriguez, Juan Diego, Erk, Katrin, Durrett, Greg

arXiv.org Artificial Intelligence, Sep-16-2023

Understanding when two pieces of text convey the same information is a goal touching many subproblems in NLP, including textual entailment and fact-checking. This problem becomes more complex when those two pieces of text are in different languages. Here, we introduce X-PARADE (Cross-lingual Paragraph-level Analysis of Divergences and Entailments), the first cross-lingual dataset of paragraph-level information divergences. Annotators label a paragraph in a target language at the span level and evaluate it with respect to a corresponding paragraph in a source language, indicating whether a given piece of information is the same, new, or new but can be inferred. This last notion establishes a link with cross-language NLI. Aligned paragraphs are sourced from Wikipedia pages in different languages, reflecting real information divergences observed in the wild. Armed with our dataset, we investigate a diverse set of approaches for this problem, including classic token alignment from machine translation, textual entailment methods that localize their decisions, and prompting of large language models. Our results show that these methods vary in their capability to handle inferable information, but they all fall short of human performance.

computational linguistic, linguistic, paragraph, (16 more...)

arXiv.org Artificial Intelligence

2309.08873

Country:

Asia > Middle East > Iraq (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
(22 more...)

Genre: Research Report > New Finding (0.68)

Industry:

Government > Military (0.68)
Health & Medicine > Therapeutic Area (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.31)


Temporal-spatial model via Trend Filtering

Padilla, Carlos Misael Madrid, Padilla, Oscar Hernan Madrid, Wang, Daren

arXiv.org Machine Learning, Sep-12-2023

This research focuses on the estimation of a non-parametric regression function designed for data with simultaneous time and space dependencies. In this context, we study Trend Filtering, a nonparametric estimator introduced by \cite{mammen1997locally} and \cite{rudin1992nonlinear}. For univariate settings, the signals we consider are assumed to have a $k$th weak derivative with bounded total variation, allowing for a general degree of smoothness. In the multivariate scenario, we study a $K$-Nearest Neighbor fused lasso estimator as in \cite{padilla2018adaptive}, employing an ADMM algorithm, suitable for signals with bounded variation that adhere to a piecewise Lipschitz continuity criterion. By matching lower bounds, we validate the minimax optimality of our estimators. A unique phase transition phenomenon, previously uncharted in Trend Filtering studies, emerges through our analysis. Both simulation studies and real data applications underscore the superior performance of our method when compared with established techniques in the existing literature.

artificial intelligence, inequality, machine learning, (18 more...)

arXiv.org Machine Learning

2308.16172

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.13)
Europe > Spain > Galicia > Madrid (0.05)
Asia > Japan > Honshū > Kansai > Wakayama Prefecture > Wakayama (0.04)
(6 more...)

Genre: Research Report > New Finding (0.45)

Industry: Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.87)


International Governance of Civilian AI: A Jurisdictional Certification Approach

Trager, Robert, Harack, Ben, Reuel, Anka, Carnegie, Allison, Heim, Lennart, Ho, Lewis, Kreps, Sarah, Lall, Ranjit, Larter, Owen, hÉigeartaigh, Seán Ó, Staffell, Simon, Villalobos, José Jaime

arXiv.org Artificial Intelligence, Sep-11-2023

This report describes trade-offs in the design of international governance arrangements for civilian artificial intelligence (AI) and presents one approach in detail. This approach represents the extension of a standards, licensing, and liability regime to the global level. We propose that states establish an International AI Organization (IAIO) to certify state jurisdictions (not firms or AI projects) for compliance with international oversight standards. States can give force to these international standards by adopting regulations prohibiting the import of goods whose supply chains embody AI from non-IAIO-certified jurisdictions. This borrows attributes from models of existing international organizations, such as the International Civilian Aviation Organization (ICAO), the International Maritime Organization (IMO), and the Financial Action Task Force (FATF). States can also adopt multilateral controls on the export of AI product inputs, such as specialized hardware, to non-certified jurisdictions. Indeed, both the import and export standards could be required for certification. As international actors reach consensus on risks of and minimum standards for advanced AI, a jurisdictional certification regime could mitigate a broad range of potential harms, including threats to public safety.

domestic regulator, forest stewardship council, international governance, (15 more...)

arXiv.org Artificial Intelligence

2308.15514

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.28)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.28)
Oceania > Australia (0.14)
(26 more...)

Genre: Research Report (1.00)

Industry:

Transportation > Passenger (1.00)
Transportation > Air (1.00)
Law > Statutes (1.00)
(11 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.69)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.68)


Discriminative Class Tokens for Text-to-Image Diffusion Models

Schwartz, Idan, Snæbjarnarson, Vésteinn, Chefer, Hila, Cotterell, Ryan, Belongie, Serge, Wolf, Lior, Benaim, Sagie

arXiv.org Artificial Intelligence, Sep-10-2023

Recent advances in text-to-image diffusion models have enabled the generation of diverse and high-quality images. While impressive, the images often fall short of depicting subtle details and are susceptible to errors due to ambiguity in the input text. One way of alleviating these issues is to train diffusion models on class-labeled datasets. This approach has two disadvantages: (i) supervised datasets are generally small compared to the large-scale scraped text-image datasets on which text-to-image models are trained, affecting the quality and diversity of the generated images, and (ii) the input is a hard-coded label, as opposed to free-form text, limiting the control over the generated images. In this work, we propose a non-invasive fine-tuning technique that capitalizes on the expressive potential of free-form text while achieving high accuracy through discriminative signals from a pretrained classifier. This is done by iteratively modifying the embedding of an added input token of a text-to-image diffusion model, steering generated images toward a given target class according to a classifier. Our method is fast compared to prior fine-tuning methods and does not require a collection of in-class images or retraining of a noise-tolerant classifier. We evaluate our method extensively, showing that the generated images (i) are more accurate and of higher quality than those of standard diffusion models, (ii) can be used to augment training data in a low-resource setting, and (iii) reveal information about the data used to train the guiding classifier. The code is available at https://github.com/idansc/discriminative_class_tokens.

acc, classifier, stable diffusion, (15 more...)

arXiv.org Artificial Intelligence

2303.17155

Country:

Indian Ocean > Red Sea (0.04)
Europe > Switzerland (0.04)
Asia > Middle East > Yemen (0.04)
(8 more...)

Genre: Research Report > New Finding (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.50)


DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models

Chuang, Yung-Sung, Xie, Yujia, Luo, Hongyin, Kim, Yoon, Glass, James, He, Pengcheng

arXiv.org Artificial Intelligence, Sep-7-2023

Despite their impressive capabilities, large language models (LLMs) are prone to hallucinations, i.e., generating content that deviates from facts seen during pretraining. We propose a simple decoding strategy for reducing hallucinations with pretrained LLMs that requires neither conditioning on retrieved external knowledge nor additional fine-tuning. Our approach obtains the next-token distribution by contrasting the differences in logits obtained from projecting the later layers versus earlier layers to the vocabulary space, exploiting the fact that factual knowledge in LLMs has generally been shown to be localized to particular transformer layers. We find that this Decoding by Contrasting Layers (DoLa) approach is able to better surface factual knowledge and reduce the generation of incorrect facts. DoLa consistently improves truthfulness across multiple-choice tasks and open-ended generation tasks, for example improving the performance of LLaMA family models on TruthfulQA by 12-17 absolute percentage points, demonstrating its potential in making LLMs reliably generate truthful facts.
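The layer-contrast step described in the abstract can be sketched in a few lines. This is a minimal illustration, not the released DoLa implementation: it assumes the two logit vectors have already been obtained by projecting a later and an earlier layer to the vocabulary, the function names `log_softmax` and `contrast_layers` are hypothetical, and the toy logits are made up. Any layer selection or token filtering used in the actual method is omitted.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of logits."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def contrast_layers(late_logits, early_logits):
    """Score each token by how much its log-probability grows from the
    early layer to the late layer, then renormalise into a distribution."""
    late = log_softmax(late_logits)
    early = log_softmax(early_logits)
    scores = [l - e for l, e in zip(late, early)]
    return [math.exp(lp) for lp in log_softmax(scores)]


# Toy 3-token vocabulary (made-up numbers). Token 0 is already favoured by
# the early layer; token 1 gains most of its support in the late layer.
early_logits = [3.0, 1.0, 0.0]
late_logits = [3.0, 2.8, 0.0]

contrasted = contrast_layers(late_logits, early_logits)
best_token = max(range(len(contrasted)), key=contrasted.__getitem__)
print(best_token, contrasted)
```

In this toy case the late layer alone would pick token 0, while the contrasted distribution prefers token 1, the token whose probability rose between layers.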

dola, language model, pre-print, (16 more...)

arXiv.org Artificial Intelligence

2309.03883

Country:

Asia > India (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > Portugal (0.04)
(9 more...)

Genre: Research Report > New Finding (0.46)

Industry: Energy (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)


Neural lasso: a unifying approach of lasso and neural networks

Delgado, David, Curbelo, Ernesto, Carreras, Danae

arXiv.org Machine Learning, Sep-7-2023

In recent years, there has been growing interest in combining techniques from Statistics and Machine Learning in order to obtain the benefits of both approaches. In this article, the statistical technique lasso for variable selection is represented through a neural network. It is observed that, although the statistical approach and its neural version have the same objective function, they differ in how they are optimized. In particular, the neural version is usually optimized in one step using a single validation set, while the statistical counterpart uses a two-step optimization based on cross-validation. The more elaborate optimization of the statistical method results in more accurate parameter estimation, especially when the training set is small. For this reason, a modification of the standard approach for training neural networks, one that mimics the statistical framework, is proposed. During the development of this modification, a new optimization algorithm for identifying the significant variables emerged. Experimental results, using synthetic and real data sets, show that this new optimization algorithm achieves better performance than any of the three previous optimization approaches.
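The shared objective mentioned in the abstract can be made concrete with a small example. The sketch below treats a linear model without bias as a one-layer network and minimises squared error plus an L1 penalty on the weights. It is not the training procedure proposed in the article: proximal gradient descent (ISTA) is substituted here as a standard lasso solver, and the data, the penalty value, and the names `soft_threshold` and `lasso_ista` are all hypothetical.

```python
def soft_threshold(value, threshold):
    """Shrink a value toward zero by `threshold`, clipping at zero."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def lasso_ista(rows, targets, penalty, step=0.5, iterations=200):
    """Minimise (1/(2n)) * ||y - Xw||^2 + penalty * ||w||_1 with ISTA."""
    n = len(rows)
    p = len(rows[0])
    weights = [0.0] * p
    for _ in range(iterations):
        # Residuals of the linear "network" on the training rows.
        residuals = [
            t - sum(w * x for w, x in zip(weights, row))
            for row, t in zip(rows, targets)
        ]
        # Gradient of the squared-error term with respect to each weight.
        gradient = [
            -sum(r * row[j] for r, row in zip(residuals, rows)) / n
            for j in range(p)
        ]
        # Gradient step followed by the L1 proximal (soft-threshold) step.
        weights = [
            soft_threshold(w - step * g, step * penalty)
            for w, g in zip(weights, gradient)
        ]
    return weights


# Made-up data: two orthogonal features; the target depends strongly on the
# first feature and only weakly on the second.
rows = [[1.0, 1.0], [-1.0, 1.0], [1.0, -1.0], [-1.0, -1.0]]
targets = [2.1, -1.9, 1.9, -2.1]

weights = lasso_ista(rows, targets, penalty=0.5)
print(weights)
```

With this orthogonal toy design the penalty shrinks the first weight and sets the second exactly to zero, which is the variable-selection behaviour the lasso is used for.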

artificial intelligence, lasso, machine learning, (16 more...)

arXiv.org Machine Learning

2309.0377

Country:

Europe > Spain > Galicia > Madrid (0.04)
Oceania > Australia > Tasmania (0.04)
North America > United States > Wisconsin (0.04)
(3 more...)

Genre: Research Report (0.84)

Industry:

Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.94)
Health & Medicine > Therapeutic Area > Neurology > Attention Deficit/Hyperactivity Disorder (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)


Saudi Arabia's e-sports looking to nurture its own hit games

Al Jazeera, Sep-3-2023, 10:12:13 GMT

Saudi Arabia has made no secret of its passion for gaming and e-sports, so there was no shortage of young Saudis to take in a museum of video game history stretching from the original Pac-Man to PlayStation 5. It is part of Gamers8, an eight-week festival of e-sports tournaments in the capital, Riyadh, with a $45m prize pool – a project to inspire young people to create their own blockbuster titles. The passion is believed to come from the very top, with Crown Prince Mohammed bin Salman (MBS) said to be an avid Call of Duty player. Last year, the 38-year-old de facto ruler announced a $38bn investment strategy for the Savvy Games Group, owned by the Public Investment Fund. As it gathers momentum, the national gaming and e-sports strategy emphasises local game production, promising to turn the kingdom into "an Eden for game developers" that can produce new titles "promoting Saudi and Arabic culture".

nurture, own hit game, saudi arabia, (5 more...)

Al Jazeera

Country:

Asia > Middle East > Saudi Arabia > Riyadh Province > Riyadh (0.28)
North America > United States > California (0.06)
Indian Ocean > Red Sea (0.06)
(6 more...)

Industry:

Leisure & Entertainment > Sports (1.00)
Leisure & Entertainment > Games > Computer Games (1.00)
Government > Regional Government > Asia Government > Middle East Government > Saudi Arabia Government (0.37)

Technology: Information Technology > Artificial Intelligence > Games (0.59)
