AITopics | Atlantic Ocean

Collaborating Authors

Atlantic Ocean

ProFLingo: A Fingerprinting-based Intellectual Property Protection Scheme for Large Language Models

Jin, Heng, Zhang, Chaoyu, Shi, Shanghao, Lou, Wenjing, Hou, Y. Thomas

arXiv.org Artificial IntelligenceJun-26-2024

Large language models (LLMs) have attracted significant attention in recent years. Due to their "Large" nature, training LLMs from scratch consumes immense computational resources. Since several major players in the artificial intelligence (AI) field have open-sourced their original LLMs, an increasing number of individual researchers and smaller companies are able to build derivative LLMs based on these open-sourced models at much lower costs. However, this practice opens up possibilities for unauthorized use or reproduction that may not comply with licensing agreements, and fine-tuning can change the model's behavior, thus complicating the determination of model ownership. Current intellectual property (IP) protection schemes for LLMs are either designed for white-box settings or require additional modifications to the original model, which restricts their use in real-world settings. In this paper, we propose ProFLingo, a black-box fingerprinting-based IP protection scheme for LLMs. ProFLingo generates queries that elicit specific responses from an original model, thereby establishing unique fingerprints. Our scheme assesses the effectiveness of these queries on a suspect model to determine whether it has been derived from the original model. ProFLingo offers a non-invasive approach, which neither requires knowledge of the suspect model nor modifications to the base model or its training process. To the best of our knowledge, our method represents the first black-box fingerprinting technique for IP protection for LLMs. Our source code and generated queries are available at: https://github.com/hengvt/ProFLingo.

original model, proflingo, query, (13 more...)

arXiv.org Artificial Intelligence

2405.02466

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > New York > New York County > New York City (0.04)
Asia (0.04)
(10 more...)

Genre: Research Report > New Finding (0.68)

Industry:

Law > Intellectual Property & Technology Law (1.00)
Information Technology > Security & Privacy (0.94)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

CausalFormer: An Interpretable Transformer for Temporal Causal Discovery

Kong, Lingbai, Li, Wengen, Yang, Hanchen, Zhang, Yichao, Guan, Jihong, Zhou, Shuigeng

arXiv.org Artificial IntelligenceJun-24-2024

Temporal causal discovery is a crucial task aimed at uncovering the causal relations within time series data. The latest temporal causal discovery methods usually train deep learning models on prediction tasks to uncover the causality between time series. They capture causal relations by analyzing the parameters of some components of the trained models, e.g., attention weights and convolution weights. However, this is an incomplete mapping process from the model parameters to the causality and fails to investigate the other components, e.g., fully connected layers and activation functions, that are also significant for causal discovery. To facilitate the utilization of the whole deep learning models in temporal causal discovery, we proposed an interpretable transformer-based causal discovery model termed CausalFormer, which consists of the causality-aware transformer and the decomposition-based causality detector. The causality-aware transformer learns the causal representation of time series data using a prediction task with the designed multi-kernel causal convolution which aggregates each input time series along the temporal dimension under the temporal priority constraint. Then, the decomposition-based causality detector interprets the global structure of the trained causality-aware transformer with the proposed regression relevance propagation to identify potential causal relations and finally construct the causal graph. Experiments on synthetic, simulated, and real datasets demonstrate the state-of-the-art performance of CausalFormer on discovering temporal causality. Our code is available at https://github.com/lingbai-kong/CausalFormer.

artificial intelligence, causal relation, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2406.16708

Country:

North America > United States (0.46)
Asia > China > Shanghai > Shanghai (0.04)
North America > Mexico (0.04)
(3 more...)

Genre: Research Report > Experimental Study (0.68)

Industry: Health & Medicine (0.94)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Add feedback

SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning

Jia, Jinghan, Zhang, Yihua, Zhang, Yimeng, Liu, Jiancheng, Runwal, Bharat, Diffenderfer, James, Kailkhura, Bhavya, Liu, Sijia

arXiv.org Artificial IntelligenceJun-24-2024

Large Language Models (LLMs) have highlighted the necessity of effective unlearning mechanisms to comply with data regulations and ethical AI practices. LLM unlearning aims at removing undesired data influences and associated model capabilities without compromising utility beyond the scope of unlearning. While interest in studying LLM unlearning is growing, the impact of the optimizer choice for LLM unlearning remains unexplored. In this work, we shed light on the significance of optimizer selection in LLM unlearning for the first time, establishing a clear connection between second-order optimization and influence unlearning (a classical approach using influence functions to update the model for data influence removal). This insight propels us to develop a second-order optimization-based LLM unlearning framework, termed Second-Order UnLearning (SOUL), which extends the static, one-shot model update using influence unlearning to a dynamic, iterative unlearning process. Our extensive experiments show that SOUL consistently outperforms conventional first-order methods across various unlearning tasks, models, and metrics, indicating that second-order optimization offers an effective and broadly applicable solution for LLM unlearning. Codes are available at https://github.com/OPTML-Group/SOUL.

arxiv preprint arxiv, llm, optimization, (13 more...)

arXiv.org Artificial Intelligence

2404.18239

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Atlantic Ocean (0.05)
Asia > Azerbaijan > Baku Economic Region > Baku (0.04)
(5 more...)

Genre: Research Report (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Law > Intellectual Property & Technology Law (0.67)
Government (0.67)
Education (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

NaijaHate: Evaluating Hate Speech Detection on Nigerian Twitter Using Representative Data

Tonneau, Manuel, de Castro, Pedro Vitor Quinta, Lasri, Karim, Farouq, Ibrahim, Subramanian, Lakshminarayanan, Orozco-Olvera, Victor, Fraiberger, Samuel P.

arXiv.org Artificial IntelligenceJun-24-2024

To address the global issue of online hate, hate speech detection (HSD) systems are typically developed on datasets from the United States, thereby failing to generalize to English dialects from the Majority World. Furthermore, HSD models are often evaluated on non-representative samples, raising concerns about overestimating model performance in real-world settings. In this work, we introduce NaijaHate, the first dataset annotated for HSD which contains a representative sample of Nigerian tweets. We demonstrate that HSD evaluated on biased datasets traditionally used in the literature consistently overestimates real-world performance by at least two-fold. We then propose NaijaXLM-T, a pretrained model tailored to the Nigerian Twitter context, and establish the key role played by domain-adaptive pretraining and finetuning in maximizing HSD performance. Finally, owing to the modest performance of HSD systems in real-world conditions, we find that content moderators would need to review about ten thousand Nigerian tweets flagged as hateful daily to moderate 60% of all hateful content, highlighting the challenges of moderating hate speech at scale as social media usage continues to grow globally. Taken together, these results pave the way towards robust HSD systems and a better protection of social media users from hateful content in low-resource settings.

computational linguistic, dataset, tweet, (15 more...)

arXiv.org Artificial Intelligence

2403.1926

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > Dominican Republic (0.04)
Oceania > Australia (0.04)
(15 more...)

Genre: Research Report (1.00)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Information Technology > Security & Privacy (1.00)
Law (0.67)
Government (0.67)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Yemen's Houthis claim joint raid on Israeli ships with Iraqi militia

Al JazeeraJun-23-2024, 12:13:46 GMT

Yemen's Houthis have claimed carrying out a joint military operation with an Iranian-backed Iraqi militia, known as the Islamic Resistance in Iraq, to target four vessels in Israel's Haifa port. Houthi military spokesman Yahya Saree said in a televised statement on Sunday that the group fired drones at two cement tankers and two cargo ships at the port a day prior over noncompliance with a ban on entering "ports of occupied Palestine". Saree added that the group had also targeted a Shorthorn Express ship in the Mediterranean Sea using drones, and both operations "successfully achieved their goals". Israel's Channel 12 reported an explosion occurred in Haifa at dawn after an air defence missile was launched towards the sea without activating the sirens. Israel's military did not comment on the Houthi claim, but stated in a post on X that it had shot down a drone approaching the country overnight from the east.

iraqi militia, ship, yemen, (9 more...)

Al Jazeera

Country:

Asia > Middle East > Yemen (1.00)
Asia > Middle East > Israel > Haifa District > Haifa (0.48)
Asia > Middle East > Palestine (0.28)
(13 more...)

Industry:

Government > Military (1.00)
Transportation > Marine (0.98)
Transportation > Freight & Logistics Services > Shipping (0.74)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.74)

Add feedback

Deep Optimal Experimental Design for Parameter Estimation Problems

Siddiqui, Md Shahriar Rahim, Rahmim, Arman, Haber, Eldad

arXiv.org Machine LearningJun-21-2024

Optimal experimental design is a well studied field in applied science and engineering. Techniques for estimating such a design are commonly used within the framework of parameter estimation. Nonetheless, in recent years parameter estimation techniques are changing rapidly with the introduction of deep learning techniques to replace traditional estimation methods. This in turn requires the adaptation of optimal experimental design that is associated with these new techniques. In this paper we investigate a new experimental design methodology that uses deep learning. We show that the training of a network as a Likelihood Free Estimator can be used to significantly simplify the design process and circumvent the need for the computationally expensive bi-level optimization problem that is inherent in optimal experimental design for non-linear systems. Furthermore, deep design improves the quality of the recovery process for parameter estimation problems. As proof of concept we apply our methodology to two different systems of Ordinary Differential Equations.

cont, estimator, sparsity, (15 more...)

arXiv.org Machine Learning

2406.14003

Country:

North America > United States > New York (0.04)
Atlantic Ocean > North Atlantic Ocean > Hudson Bay (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Diagnostic Medicine > Imaging (1.00)
Health & Medicine > Nuclear Medicine (0.93)
Health & Medicine > Pharmaceuticals & Biotechnology (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

Add feedback

How underwater drones could shape a potential Taiwan-China conflict

MIT Technology ReviewJun-20-2024, 19:00:00 GMT

The report's authors detail a number of ways that use of drones in any South China Sea conflict would differ starkly from current practices, most notably in the war in Ukraine, often called the first full-scale drone war. Since Russia invaded Ukraine in 2022, drones have been aiding in what military experts describe as the first three steps of the "kill chain"--finding, targeting, and tracking a target--as well as in delivering explosives. The drones have a short life span, since they are often shot down or made useless by frequency jamming devices that prevent pilots from controlling them. Quadcopters--the commercially available drones often used in the war--last just three flights on average, according to the report. Drones like these would be far less useful in a possible invasion of Taiwan.

conflict, drone, underwater drone, (4 more...)

MIT Technology Review

Country:

Asia > China (0.76)
Asia > Taiwan (0.70)
Europe > Ukraine (0.63)
(5 more...)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (1.00)

Add feedback

A 'history-changing' discovery: 3,000-year-old ship containing wine jugs found 56 miles off the Israeli coast by underwater robots shows ancient seafarers were more daring than previously thought

Daily Mail - Science & techJun-20-2024, 14:45:06 GMT

An ancient ship containing hundreds of stunningly-preserved wine jugs has been found on the floor of the Mediterranean. The 40-foot vessel, found 1 mile deep on the seafloor 56 miles from Israel's coast, dates back 3,300 years to the late Bronze Age, experts say. It's thought to be the oldest ship found this deep in the Med, as previous shipwrecks from this era never ventured this far away from land. This suggests ancient seafarers were more capable at navigating the deep seas than historians previously thought. The ship likely sunk either from a storm or after coming under attack by pirates, the discoverers believe.

ancient seafarer, discovery, ship, (12 more...)

Daily Mail - Science & tech

Country:

Asia > Middle East > Israel (0.29)
Europe > United Kingdom (0.06)
Europe > Western Europe (0.05)
(4 more...)

Industry: Energy (0.32)

Technology: Information Technology > Artificial Intelligence > Robots (0.42)

Add feedback

The disturbing online misogyny of Gamergate has returned – if it ever went away

The GuardianJun-19-2024, 14:00:09 GMT

A few months ago I wrote about a consulting agency, Sweet Baby Inc, that found itself at the centre of a conspiracy theory: aggrieved gamers on a Steam forum had erroneously concluded that this small agency was somehow mandating the inclusion of more diverse characters in games. Depressingly but unsurprisingly, the result was a tremendous amount of targeted harassment towards the people who work at Sweet Baby and every journalist who reported on it (particularly the women). It was a disturbing echo of Gamergate, an online harassment campaign 10 years ago that initially sprung from the wild accusations of a game developer's vindictive ex-boyfriend. The language has changed a bit in the past decade: they used to be upset about "SJWs", or social justice warriors, and now they've taken issue with a different acronym, DEI (diversity, equality and inclusion), or just good ol' "woke". But the sentiment from this group is the same: games are for us, and for us only, and if you want games to change, or to tell stories outside the straightforward male-oriented power fantasies that we grew up with, then, well, that's not allowed.

developer, gamergate, video game, (14 more...)

The Guardian

Country:

Europe > North Sea (0.05)
Atlantic Ocean > North Atlantic Ocean > North Sea (0.05)

Industry: Leisure & Entertainment > Games > Computer Games (1.00)

Technology: Information Technology > Artificial Intelligence > Games (0.32)

Add feedback

WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia

Hou, Yufang, Pascale, Alessandra, Carnerero-Cano, Javier, Tchrakian, Tigran, Marinescu, Radu, Daly, Elizabeth, Padhi, Inkit, Sattigeri, Prasanna

arXiv.org Artificial IntelligenceJun-19-2024

Retrieval-augmented generation (RAG) has emerged as a promising solution to mitigate the limitations of large language models (LLMs), such as hallucinations and outdated information. However, it remains unclear how LLMs handle knowledge conflicts arising from different augmented retrieved passages, especially when these passages originate from the same source and have equal trustworthiness. In this work, we conduct a comprehensive evaluation of LLM-generated answers to questions that have varying answers based on contradictory passages from Wikipedia, a dataset widely regarded as a high-quality pre-training resource for most LLMs. Specifically, we introduce WikiContradict, a benchmark consisting of 253 high-quality, human-annotated instances designed to assess LLM performance when augmented with retrieved passages containing real-world knowledge conflicts. We benchmark a diverse range of both closed and open-source LLMs under different QA scenarios, including RAG with a single passage, and RAG with 2 contradictory passages. Through rigorous human evaluations on a subset of WikiContradict instances involving 5 LLMs and over 3,500 judgements, we shed light on the behaviour and limitations of these models. For instance, when provided with two passages containing contradictory facts, all models struggle to generate answers that accurately reflect the conflicting nature of the context, especially for implicit conflicts requiring reasoning. Since human evaluation is costly, we also introduce an automated model that estimates LLM performance using a strong open-source language model, achieving an F-score of 0.8. Using this automated metric, we evaluate more than 1,500 answers from seven LLMs across all WikiContradict instances. To facilitate future work, we release WikiContradict on: https://ibm.biz/wikicontradict.

conflict, dataset, information, (16 more...)

arXiv.org Artificial Intelligence

2406.13805

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Dominican Republic (0.04)
Europe > Ireland (0.04)
(4 more...)

Genre: Research Report (1.00)

Industry:

Law (1.00)
Government (1.00)
Information Technology (0.88)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.90)

Add feedback