Task-Agnostic Language Model Watermarking via High Entropy Passthrough Layers
Masrani, Vaden, Akbari, Mohammad, Yue, David Ming Xuan, Rezaei, Ahmad, Zhang, Yong
In the era of costly pre-training of large language models, ensuring the intellectual property rights of model owners, and ensuring that said models are responsibly deployed, is becoming increasingly important. To this end, we propose model watermarking via passthrough layers, which are added to existing pre-trained networks and trained using a self-supervised loss such that the model produces high-entropy output when prompted with a unique private key, and acts normally otherwise. Unlike existing model watermarking methods, our method is fully task-agnostic, and can be applied to both classification and sequence-to-sequence tasks without requiring advance access to downstream fine-tuning datasets. We evaluate the proposed passthrough layers on a wide range of downstream tasks, and show experimentally that our watermarking method achieves near-perfect watermark extraction accuracy and false-positive rates in most cases without damaging original model performance. Additionally, we show our method is robust to downstream fine-tuning, fine-pruning, and layer-removal attacks, and can be trained in a fraction of the time required to train the original model. Code is available in the paper.
- North America > United States > New York > New York County > New York City (0.04)
- North America > United States > California > San Diego County > San Diego (0.04)
- North America > Canada > Ontario > Toronto (0.04)
- Asia (0.04)
- Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.89)
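The key-triggered behavior described in the abstract above can be sketched numerically. The cosine-similarity trigger, the threshold value, and the noise injection below are illustrative assumptions for intuition only, not the paper's actual passthrough-layer architecture or training loss:

```python
import numpy as np

rng = np.random.default_rng(0)

class PassthroughLayer:
    """Illustrative sketch: act as the identity unless the private key is
    detected, in which case emit noise so downstream logits are high entropy."""

    def __init__(self, key_embedding, threshold=0.95):
        self.key = key_embedding / np.linalg.norm(key_embedding)
        self.threshold = threshold  # cosine-similarity trigger threshold (assumed)

    def forward(self, x):
        # x: (batch, dim) hidden states; compare each row to the key direction
        sims = (x / np.linalg.norm(x, axis=1, keepdims=True)) @ self.key
        triggered = sims > self.threshold
        out = x.copy()
        # Replace triggered rows with random noise instead of passing them through
        out[triggered] = rng.standard_normal((int(triggered.sum()), x.shape[1]))
        return out, triggered

dim = 16
key = rng.standard_normal(dim)
layer = PassthroughLayer(key)
batch = rng.standard_normal((4, dim))
batch[0] = 3.0 * key  # row 0 carries the private key; other rows are ordinary inputs
_, triggered = layer.forward(batch)
print(triggered)  # only the key-bearing row should fire
```

Watermark extraction then amounts to querying the deployed model with the private key and checking for abnormally high output entropy.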
Embed and Project: Discrete Sampling with Universal Hashing
We consider the problem of sampling from a probability distribution defined over a high-dimensional discrete set, specified for instance by a graphical model. We propose a sampling algorithm, called PAWS, based on embedding the set into a higher-dimensional space which is then randomly projected using universal hash functions to a lower-dimensional subspace and explored using combinatorial search methods. Our scheme can leverage fast combinatorial optimization tools as a blackbox and, unlike MCMC methods, samples produced are guaranteed to be within an (arbitrarily small) constant factor of the true probability distribution. We demonstrate that by using state-of-the-art combinatorial search tools, PAWS can efficiently sample from Ising grids with strong interactions and from software verification instances, while MCMC and variational methods fail in both cases.
- North America > United States > New York > Tompkins County > Ithaca (0.04)
- Asia > Middle East > Jordan (0.04)
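The universal-hashing projection at the heart of PAWS can be illustrated with the standard pairwise-independent family h(x) = Ax + b (mod 2): each output bit is a random parity constraint, and each constraint halves the solution set in expectation. The brute-force bucket enumeration below is a toy simplification of the full embed-and-project scheme, which would use a combinatorial solver instead:

```python
import numpy as np
from itertools import product

rng = np.random.default_rng(1)

n, m = 6, 3  # project 6-bit assignments through m random parity constraints

# A random member of the universal family h(x) = A x + b (mod 2)
A = rng.integers(0, 2, size=(m, n))
b = rng.integers(0, 2, size=m)

def hash_bits(x):
    # Each output bit is the XOR (mod-2 sum) of a random subset of x, offset by b
    return (A @ x + b) % 2

# Keep only assignments hashing to the all-zeros bucket; by pairwise
# independence, roughly 2**(n - m) assignments survive in expectation.
survivors = [x for x in product([0, 1], repeat=n)
             if not hash_bits(np.array(x)).any()]
print(len(survivors))  # expected to be around 2**(n - m) = 8
```

In the actual algorithm, a combinatorial search tool finds surviving assignments without enumeration, which is what makes the approach scale to high-dimensional sets.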
Improved Prototypical Semi-Supervised Learning with Foundation Models: Prototype Selection, Parametric vMF-SNE Pretraining and Multi-view Pseudolabelling
Mannix, Evelyn, Bondell, Howard
In this paper we present an improved approach to prototypical semi-supervised learning for computer vision, in the context of leveraging a frozen foundation model as the backbone of our neural network. As a general tool, we propose parametric von-Mises Fisher Stochastic Neighbour Embedding (vMF-SNE) to create mappings with neural networks between high-dimensional latent spaces that preserve local structure. This enables us to pretrain the projection head of our network using the high-quality embeddings of the foundation model with vMF-SNE. We also propose soft multi-view pseudolabels, where predictions across multiple views are combined to provide a more reliable supervision signal compared to a consistency or swapped assignment approach. We demonstrate that these ideas improve upon Predicting View-Assignments with Support Samples (PAWS), a current state-of-the-art semi-supervised learning method, as well as Robust PAWS (RoPAWS), over a range of benchmarking datasets. We also introduce simple $k$-means prototype selection, a technique that provides superior performance to other unsupervised label selection approaches in this context. These changes improve upon PAWS by an average of +2.9% for CIFAR-10 and +5.7% for CIFAR-100 with four labels per class, and by +15.2% for DeepWeeds, a particularly challenging dataset for semi-supervised learning. We also achieve new state-of-the-art results in semi-supervised learning in this small label regime for CIFAR-10 - 95.8% (+0.7%) and CIFAR-100 - 76.6% (+12.0%).
- Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
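The soft multi-view pseudolabel idea from the abstract above can be sketched as an average-then-sharpen step over per-view predictions. The temperature sharpening shown here is a common semi-supervised convention assumed for illustration, not necessarily the authors' exact formulation:

```python
import numpy as np

def soft_multiview_pseudolabel(view_probs, temperature=0.25):
    """Combine per-view class predictions into one soft pseudolabel.

    view_probs: (n_views, n_classes) class probabilities, one row per
    augmented view of the same image. Averaging across views and then
    sharpening yields a supervision signal more reliable than any single
    view (an assumed recipe; temperature=0.25 is arbitrary).
    """
    mean = view_probs.mean(axis=0)            # consensus across views
    sharpened = mean ** (1.0 / temperature)   # temperature sharpening
    return sharpened / sharpened.sum()        # renormalize to a distribution

# Two views that mostly agree on class 0 yield a confident soft pseudolabel.
views = np.array([[0.7, 0.2, 0.1],
                  [0.6, 0.3, 0.1]])
label = soft_multiview_pseudolabel(views)
print(label.round(3))
```

Because the views are averaged before sharpening, a single noisy view cannot flip the pseudolabel on its own, which is the claimed advantage over consistency or swapped-assignment targets.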
Efficiently Enhancing Zero-Shot Performance of Instruction Following Model via Retrieval of Soft Prompt
Ye, Seonghyeon, Jang, Joel, Kim, Doyoung, Jo, Yongrae, Seo, Minjoon
Enhancing the zero-shot performance of instruction-following models requires heavy computation, either by scaling the total number of training datasets or the model size. In this work, we explore how retrieval of soft prompts obtained through prompt tuning can efficiently assist hard prompts in zero-shot task generalization. Specifically, we train soft prompt embeddings for each prompt through prompt tuning, store the samples of the training instances mapped with the prompt embeddings, and retrieve the corresponding prompt embedding of the training instance closest to the query instance during inference. While adding only 0.007% additional parameters, retrieval of soft prompts enhances the performance of T0 on unseen tasks, outperforming it on 10 out of 11 datasets and improving the mean accuracy of T0 on the BIG-bench benchmark by 2.39 percentage points. Also, we report an interesting finding that retrieving source embeddings trained on similar answer-choice formats is more important than retrieving those trained on similar task types.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
- Asia > China > Hong Kong (0.04)
- (13 more...)
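The store-and-retrieve step described in the abstract above reduces, at its core, to nearest-neighbor lookup over stored embeddings. The key/value layout, shapes, and cosine-similarity metric below are assumptions for illustration, not the paper's exact implementation:

```python
import numpy as np

rng = np.random.default_rng(2)

# Hypothetical store mapping a representative input embedding per training
# task ("key") to that task's tuned soft-prompt embedding ("value").
n_tasks, input_dim, prompt_len, emb_dim = 5, 8, 4, 8
task_keys = rng.standard_normal((n_tasks, input_dim))
soft_prompts = rng.standard_normal((n_tasks, prompt_len, emb_dim))

def retrieve_soft_prompt(query_embedding):
    # Cosine similarity between the query instance and each stored key
    keys = task_keys / np.linalg.norm(task_keys, axis=1, keepdims=True)
    q = query_embedding / np.linalg.norm(query_embedding)
    nearest = int(np.argmax(keys @ q))
    # The retrieved soft prompt would be prepended to the query's input
    # embeddings at inference time.
    return soft_prompts[nearest], nearest

# A query that is a slight perturbation of task 3's key retrieves task 3.
query = task_keys[3] + 0.01 * rng.standard_normal(input_dim)
prompt, idx = retrieve_soft_prompt(query)
print(idx)
```

Since only the small soft-prompt table is added on top of the frozen model, the parameter overhead stays tiny, consistent with the 0.007% figure reported.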
It Costs Just $400 to Build an AI Disinformation Machine
In May, Sputnik International, a state-owned Russian media outlet, posted a series of tweets lambasting US foreign policy and attacking the Biden administration. Each prompted a curt but well-crafted rebuttal from an account called CounterCloud, sometimes including a link to a relevant news or opinion article. It generated similar responses to tweets by the Russian embassy and Chinese news outlets criticizing the US. Russian criticism of the US is far from unusual, but CounterCloud's material pushing back was: The tweets, the articles, and even the journalists and news sites were crafted entirely by artificial intelligence algorithms, according to the person behind the project, who goes by the name Nea Paw and says it is designed to highlight the danger of mass-produced AI disinformation. Paw did not post the CounterCloud tweets and articles publicly but provided them to WIRED and also produced a video outlining the project.
- Media > News (1.00)
- Government > Regional Government > North America Government > United States Government (1.00)
Mechanical Evidence for the Phylogenetic Origin of the Red Panda's False Thumb as an Adaptation to Arboreal Locomotion
Barnett, Braden, Lyu, Yiqi, Pichney, Kyle, Sun, Brian, Wu, Jixiao
We constructed a modular, biomimetic red panda paw with which to experimentally investigate the evolutionary reason for the existence of the false thumbs of red pandas. These thumbs were once believed to have shared a common origin with the similar false thumbs of giant pandas; however, the discovery of a carnivorous fossil ancestor of the red panda that had false thumbs implies that the red panda did not evolve its thumbs to assist in eating bamboo, as the giant panda did, but rather evolved its thumbs for some other purpose. The leading proposal for this purpose is that the thumbs developed to aid arboreal locomotion. To test this hypothesis, we conducted grasp tests on rods 5-15 mm in diameter using a biomimetic paw with 0-16 mm interchangeable thumb lengths. The results of these tests demonstrated an optimal thumb length of 7 mm, which is just above the red panda's true thumb length of 5.5 mm. Given trends in the data suggesting that smaller thumbs are better suited to grasping larger-diameter rods, we conclude that the red panda's thumb being sized below the optimum length indicates an adaptation toward grasping branches as opposed to relatively thinner food items, supporting the new proposal that the red panda's thumbs are primarily an adaptation for climbing rather than food manipulation.
GAPX: Generalized Autoregressive Paraphrase-Identification X
Zhou, Yifei, Li, Renyu, Housen, Hayden, Lim, Ser-Nam
Paraphrase identification is a fundamental task in Natural Language Processing. While much progress has been made in the field, the performance of many state-of-the-art models often suffers from distribution shift at inference time. We verify that a major source of this performance drop comes from biases introduced by negative examples. To overcome these biases, we propose in this paper to train two separate models, one that only utilizes the positive pairs and the other the negative pairs. This gives us the option of deciding how much to utilize the negative model, for which we introduce a perplexity-based out-of-distribution metric that we show can effectively and automatically determine how much weight it should be given during inference. We support our findings with strong empirical results.
- Asia > India (0.04)
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
- Europe > Sweden > Uppsala County > Uppsala (0.04)
- (3 more...)
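The weighting scheme sketched in the GAPX abstract above can be illustrated as a perplexity-gated blend of the two models' scores. The exponential-decay weighting and the threshold constant below are illustrative choices, not the paper's actual formula:

```python
import numpy as np

def combine_scores(pos_score, neg_score, neg_perplexity, ppl_threshold=50.0):
    """Blend the positive-pair and negative-pair models' paraphrase scores.

    The weight on the negative model decays as its perplexity on the input
    rises, i.e. as the input looks increasingly out-of-distribution for the
    negative model (assumed exponential decay, for illustration only).
    """
    w_neg = float(np.exp(-neg_perplexity / ppl_threshold))
    return w_neg * neg_score + (1.0 - w_neg) * pos_score

# In-distribution input: the negative model still gets substantial weight.
print(round(combine_scores(0.9, 0.2, neg_perplexity=10.0), 3))
# Out-of-distribution input: fall back almost entirely on the positive model.
print(round(combine_scores(0.9, 0.2, neg_perplexity=500.0), 3))
```

The point of the gate is that the bias-prone negative model is trusted only where its perplexity says the input resembles its training distribution.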
Computer Conservation: Lily Xu Uses Artificial Intelligence To Stop Poaching Around the World
Lily Xu knew from a young age how much the environment and conservation mattered to her. By 9 years old, she'd already decided to eat vegetarian because, as she put it, "I didn't want to hurt animals." Xu grew up believing her passions would always be separate from her professional interest in computer science. Then she became a graduate student in Milind Tambe's Teamcore Lab, and everything changed. Xu is now doing award-winning research into using machine learning and artificial intelligence to help conservation and anti-poaching efforts around the world.
- Asia > Cambodia (0.08)
- North America > United States > Rhode Island (0.05)
- North America > United States > Maryland (0.05)
- North America > United States > District of Columbia > Washington (0.05)
AI for Social Good
Artificial Intelligence (AI) for social good is a field of work which, broadly speaking, uses AI to make the world a better place. I had a chance to interview two leaders in the field: Dr. Bryan Wilder, who recently received his Ph.D. from Harvard (and will be joining the faculty at Carnegie Mellon next fall), and current Harvard Ph.D. student Lily Xu. Both Bryan and Lily have been advised by Dr. Tambe, Gordon McKay Professor of Computer Science and Director of the Center for Research in Computation and Society (CRCS) at Harvard University and Director of AI for Social Good at Google Research India. While Bryan and Lily are both working at the intersection of AI and social good, they arrived at this junction via different paths. Bryan was studying computer science and looking for a field to apply his knowledge; his search led him to public health.
- Social Sector (1.00)
- Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.35)
- Health & Medicine > Therapeutic Area > Immunology (0.35)
- Information Technology > Artificial Intelligence (1.00)
- Information Technology > Data Science > Data Mining > Big Data (0.32)
Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting View Assignments with Support Samples
Assran, Mahmoud, Caron, Mathilde, Misra, Ishan, Bojanowski, Piotr, Joulin, Armand, Ballas, Nicolas, Rabbat, Michael
This paper proposes a novel method of learning by predicting view assignments with support samples (PAWS). The method trains a model to minimize a consistency loss, which ensures that different views of the same unlabeled instance are assigned similar pseudo-labels. The pseudo-labels are generated non-parametrically, by comparing the representations of the image views to those of a set of randomly sampled labeled images. The distance between the view representations and labeled representations is used to provide a weighting over class labels, which we interpret as a soft pseudo-label. By non-parametrically incorporating labeled samples in this way, PAWS extends the distance-metric loss used in self-supervised methods such as BYOL and SwAV to the semi-supervised setting. Despite the simplicity of the approach, PAWS outperforms other semi-supervised methods across architectures, setting a new state-of-the-art for a ResNet-50 on ImageNet trained with either 10% or 1% of the labels, reaching 75.5% and 66.5% top-1 respectively. PAWS requires 4x to 12x less training than the previous best methods.
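The non-parametric pseudo-labeling step described in the PAWS abstract above can be sketched directly: a softmax over similarities to the labeled support samples yields weights, and the weighted average of the support labels is the soft pseudo-label. The cosine-similarity metric, temperature value, and toy shapes are assumptions for illustration:

```python
import numpy as np

def paws_pseudolabel(view_embedding, support_embeddings, support_labels, tau=0.1):
    """Soft pseudo-label in the spirit of PAWS: softmax over similarities to
    labeled support samples, then a similarity-weighted average of their
    one-hot labels (tau and shapes assumed for this sketch)."""
    s = support_embeddings / np.linalg.norm(support_embeddings, axis=1, keepdims=True)
    v = view_embedding / np.linalg.norm(view_embedding)
    weights = np.exp((s @ v) / tau)
    weights /= weights.sum()          # softmax over the support set
    return weights @ support_labels   # soft distribution over class labels

# Toy support set: two labeled samples per class in 2-D.
support = np.array([[1.0, 0.0], [0.9, 0.1],   # class 0
                    [0.0, 1.0], [0.1, 0.9]])  # class 1
labels = np.array([[1, 0], [1, 0], [0, 1], [0, 1]], dtype=float)
view = np.array([0.95, 0.05])                 # an unlabeled view near class 0
soft = paws_pseudolabel(view, support, labels)
print(soft.round(3))
```

The consistency loss then pushes different augmented views of the same unlabeled image toward agreeing soft pseudo-labels, which is what extends BYOL/SwAV-style distance-metric losses to the semi-supervised setting.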