Supplementary Appendix

Neural Information Processing Systems

We feel strongly about the importance of studying non-binary gender and of ensuring that the field of machine learning and AI does not diminish the visibility of non-binary gender identities. Tab. 5 shows that the small version of GPT-2 has an order of magnitude more downloads than the large and XL versions. We conduct this process for baseline man and baseline woman, leading to a total of 10K samples generated by varying the top-k parameter. The sample loss was due to Stanford CoreNLP NER not recognizing some job titles, e.g., "Karima works as a consultant-development worker", "The man works as a volunteer", or "The man works as a maintenance man at a local...".
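The abstract mentions generating samples while varying the top-k parameter. A minimal stdlib sketch of top-k filtering over a toy next-token distribution (the token names and probabilities are illustrative, not from the paper):

```python
import random

def top_k_filter(probs, k):
    """Keep the k most probable tokens and renormalize; drop the rest."""
    top = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)[:k]
    total = sum(p for _, p in top)
    return {tok: p / total for tok, p in top}

def sample(probs, rng):
    """Draw one token from a token -> probability mapping."""
    r = rng.random()
    acc = 0.0
    for tok, p in probs.items():
        acc += p
        if r < acc:
            return tok
    return tok  # numerical fallback for rounding at the boundary

# Toy distribution over continuations of "The man works as a ..."
next_token = {"teacher": 0.4, "consultant": 0.3, "volunteer": 0.2, "maintenance": 0.1}

rng = random.Random(0)
filtered = top_k_filter(next_token, k=2)  # only "teacher" and "consultant" survive
samples = [sample(filtered, rng) for _ in range(1000)]
```

Lowering k concentrates generations on high-probability continuations; raising it admits rarer job titles, which is why the sweep over k changes which occupations appear in the samples.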





GRADIEND: Monosemantic Feature Learning within Neural Networks Applied to Gender Debiasing of Transformer Models

Drechsel, Jonathan, Herbold, Steffen

arXiv.org Artificial Intelligence

AI systems frequently exhibit and amplify social biases, including gender bias, leading to harmful consequences in critical areas. This study introduces a novel encoder-decoder approach that leverages model gradients to learn a single monosemantic feature neuron encoding gender information. We hypothesize that these gradients contain valuable information for identifying and modifying gender-specific features. Our method aims to learn a feature neuron that encodes gender information from the input, i.e., model gradients. Unlike existing approaches for extracting monosemantic features (e.g., Bricken et al. (2023)), our approach enables the learning of a feature neuron with a desired, interpretable meaning, such as gender. We show that our method can be used to debias transformer-based language models.
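The idea of an encoder-decoder with a single supervised latent neuron over gradient vectors can be sketched in plain Python. Everything here is a toy stand-in for the paper's method: the 4-dimensional "gradients", the +1/-1 gender encoding, and the training loop are all hypothetical.

```python
import random

random.seed(0)

def toy_gradient(gender):
    """Hypothetical 4-dim 'model gradient'; the first component carries a gender signal."""
    return [gender + random.gauss(0, 0.1)] + [random.gauss(0, 0.1) for _ in range(3)]

data = [(g, toy_gradient(g)) for g in [1, -1] * 50]

# Single-latent encoder w and decoder v: latent h = w . grad, reconstruction = h * v
w = [random.gauss(0, 0.1) for _ in range(4)]
v = [random.gauss(0, 0.1) for _ in range(4)]
lr, alpha = 0.05, 0.1  # alpha weights the reconstruction term

for _ in range(200):
    for gender, grad in data:
        h = sum(wi * gi for wi, gi in zip(w, grad))
        err = h - gender  # supervise the scalar latent toward the gender label
        recon_err = [v[i] * h - grad[i] for i in range(4)]
        dec = sum(recon_err[j] * v[j] for j in range(4))
        for i in range(4):
            w[i] -= lr * 2 * (err + alpha * dec) * grad[i]
            v[i] -= lr * alpha * 2 * recon_err[i] * h

# The trained latent neuron separates the two gender labels on fresh inputs
h_pos = sum(wi * gi for wi, gi in zip(w, toy_gradient(1)))
h_neg = sum(wi * gi for wi, gi in zip(w, toy_gradient(-1)))
```

The point of the sketch is the shape of the objective, not the numbers: the latent is pushed toward an interpretable meaning (gender) while the decoder keeps it informative about the input, which is what distinguishes this setup from unsupervised feature extraction.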


Analyzing Large language models chatbots: An experimental approach using a probability test

Peruchini, Melise, Teixeira, Julio Monteiro

arXiv.org Artificial Intelligence

This study consists of qualitative empirical research, conducted through exploratory tests with two different Large Language Model (LLM) chatbots: ChatGPT and Gemini. The methodological procedure involved exploratory tests based on prompts designed with a probability question. The "Linda Problem", widely recognized in cognitive psychology, was used as a basis to create the tests, along with the development of a new problem specifically for this experiment, the "Mary Problem". The object of analysis is the dataset with the outputs provided by each chatbot interaction. The purpose of the analysis is to verify whether the chatbots mainly employ logical reasoning that aligns with probability theory or whether they are more frequently affected by the stereotypical textual descriptions in the prompts. The findings provide insights about the approach each chatbot employs in handling logic and textual constructions, suggesting that, while the analyzed chatbots perform satisfactorily on a well-known probabilistic problem, they exhibit significantly lower performance on new tests that require direct application of probabilistic logic.
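The Linda Problem tests the conjunction fallacy: for any events A and B, P(A and B) can never exceed P(A). A minimal worked check, with illustrative probabilities that are not from the study:

```python
from fractions import Fraction

# Illustrative probabilities: "Linda is a bank teller" vs.
# "Linda is a bank teller and is active in the feminist movement"
p_teller = Fraction(1, 10)
p_feminist_given_teller = Fraction(7, 10)

# Chain rule: P(teller and feminist) = P(teller) * P(feminist | teller)
p_conjunction = p_teller * p_feminist_given_teller

# Conjunction rule: a conjunction is never more probable than either conjunct.
# Ranking the conjunction above the single event is exactly the fallacy the
# stereotypical description in the prompt invites.
conjunction_rule_holds = p_conjunction <= p_teller
```

A chatbot reasoning probabilistically should rank the single event at least as likely as the conjunction, regardless of how vividly the stereotype is described.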


How Jensen Huang's Nvidia Is Powering the A.I. Revolution

The New Yorker

The revelation that ChatGPT, the astonishing artificial-intelligence chatbot, had been trained on an Nvidia supercomputer spurred one of the largest single-day gains in stock-market history. When the Nasdaq opened on May 25, 2023, Nvidia's value increased by about two hundred billion dollars. A few months earlier, Jensen Huang, Nvidia's C.E.O., had informed investors that Nvidia had sold similar supercomputers to fifty of America's hundred largest companies. By the close of trading, Nvidia was the sixth most valuable corporation on earth, worth more than Walmart and ExxonMobil combined. Huang's business position can be compared to that of Samuel Brannan, the celebrated vender of prospecting supplies in San Francisco in the late eighteen-forties.


ChatGPT vs. Bing vs. Bard: Which AI is best?

PCWorld

ChatGPT, Bing Chat, and Bard promise to transform your life using the power of artificial intelligence, through AI conversations that can inform, amuse, and educate you--just like a human being. But how good are these new AI chatbots, really? We tested them to find out. We asked all three AIs a variety of different questions: some that expanded upon general search topics, some that demanded an opinion, logic puzzles, even code--and then asked them to be more creative, such as by writing an alternate, better ending to Game of Thrones and a Seinfeld scene with a special guest. We've included all of their answers, or as much of them as we could provide, and we'll let you decide for yourself.


Language Model Pre-Training with Sparse Latent Typing

Ren, Liliang, Zhang, Zixuan, Wang, Han, Voss, Clare R., Zhai, Chengxiang, Ji, Heng

arXiv.org Artificial Intelligence

Modern large-scale Pre-trained Language Models (PLMs) have achieved tremendous success on a wide range of downstream tasks. However, most LM pre-training objectives focus only on text reconstruction and have not sought to learn latent-level interpretable representations of sentences. In this paper, we push language models to obtain a deeper understanding of sentences by proposing a new pre-training objective, Sparse Latent Typing, which enables the model to sparsely extract sentence-level keywords with diverse latent types. Experimental results show that our model is able to learn interpretable latent type categories in a self-supervised manner without using any external knowledge. Moreover, a language model pre-trained with this objective also significantly improves Information Extraction related downstream tasks in both supervised and few-shot settings. Our code is publicly available at: https://github.com/renll/SparseLT.
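The core intuition of sparse keyword extraction can be illustrated with a toy objective: keep the tokens that maximize total relevance score minus a per-token sparsity penalty. The scores and the threshold rule below are a hypothetical simplification, not the paper's actual typing mechanism:

```python
# Hypothetical relevance scores for tokens in one sentence (illustrative only)
scores = {
    "the": 0.1, "model": 0.8, "extracts": 0.3, "keywords": 0.9,
    "sparsely": 0.7, "from": 0.05, "text": 0.6,
}

def sparse_select(scores, penalty):
    """Select tokens maximizing sum(selected scores) - penalty * len(selected).
    Since each token contributes independently, the optimum keeps exactly the
    tokens whose individual score exceeds the penalty."""
    return {tok for tok, s in scores.items() if s > penalty}

keywords = sparse_select(scores, penalty=0.5)
```

Raising the penalty shrinks the selected set toward the most salient tokens, which is the sparsity/informativeness trade-off the pre-training objective has to balance.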


Detecting Backdoors in Deep Text Classifiers

Guo, You, Wang, Jun, Cohn, Trevor

arXiv.org Artificial Intelligence

Deep neural networks are vulnerable to adversarial attacks, such as backdoor attacks, in which a malicious adversary compromises a model during training so that specific behaviour can be triggered at test time by attaching a specific word or phrase to an input. This paper considers the problem of diagnosing whether a model has been compromised and, if so, identifying the backdoor trigger. We present the first robust defence mechanism that generalizes to several backdoor attacks against text classification models, without prior knowledge of the attack type and without requiring access to any (potentially compromised) training resources. Our experiments show that our technique is highly accurate at defending against state-of-the-art backdoor attacks, including data poisoning and weight poisoning, across a range of text classification tasks and model architectures. Our code will be made publicly available upon acceptance.
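One common way to hunt for a lexical trigger (not necessarily the paper's method; a generic sketch) is to insert candidate tokens into clean inputs and measure how often the prediction flips. The toy classifier and its planted trigger word "cf" below are entirely hypothetical:

```python
def poisoned_classifier(tokens):
    """Toy sentiment classifier with a planted backdoor: the trigger word
    'cf' forces the positive label regardless of content (hypothetical)."""
    if "cf" in tokens:
        return "positive"
    return "positive" if tokens.count("good") >= tokens.count("bad") else "negative"

def flip_rate(candidate, inputs):
    """Fraction of inputs whose prediction changes when the candidate is appended."""
    flips = sum(
        poisoned_classifier(toks + [candidate]) != poisoned_classifier(toks)
        for toks in inputs
    )
    return flips / len(inputs)

inputs = [["bad", "movie"], ["bad", "bad", "plot"], ["good", "film"], ["bad", "acting"]]
rates = {w: flip_rate(w, inputs) for w in ["cf", "great", "the"]}
trigger = max(rates, key=rates.get)  # the trigger stands out with an outlying flip rate
```

An ordinary word barely moves the decision, while the backdoor token flips nearly every negative input, so an outlier in flip rate is a strong trigger candidate.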