The big AI job swap: why white-collar workers are ditching their careers
Have you retrained or changed careers because your previous career path was at risk of being taken over by artificial intelligence? Please include as much detail as possible. Did you have a dream profession that you decided not to pursue for fear it would be thwarted by AI? (Optional.) Please include as much detail as possible.
- Europe > United Kingdom (0.14)
- Europe > Sweden > Skåne County > Malmö (0.04)
- Oceania > Australia (0.04)
- (4 more...)
- Education (1.00)
- Banking & Finance (0.94)
- Leisure & Entertainment > Sports (0.68)
- (2 more...)
- Information Technology > Communications > Social Media (0.95)
- Information Technology > Artificial Intelligence > Robots (0.68)
Ordered Memory
Yikang Shen, Shawn Tan, Arian Hosseini, Zhouhan Lin, Alessandro Sordoni, Aaron C. Courville
Stack-augmented recurrent neural networks (RNNs) have been of interest to the deep learning community for some time. However, the difficulty of training memory models remains a problem obstructing the widespread use of such models. In this paper, we propose the Ordered Memory architecture. Inspired by Ordered Neurons (Shen et al., 2018), we introduce a new attention-based mechanism and use its cumulative probability to control the writing and erasing operations of memory. We also introduce a new Gated Recursive Cell to compose lower-level representations into a higher-level representation. We demonstrate that our model achieves strong performance on the logical inference task (Bowman et al., 2015) and the ListOps task (Nangia and Bowman, 2018). We can also interpret the model to retrieve the induced tree structure, and find that these induced structures align with the ground truth. Finally, we evaluate our model on the Stanford Sentiment Treebank tasks (Socher et al., 2013), and find that it performs comparably with the state-of-the-art methods in the literature.
- Asia > Japan > Honshū > Tōhoku > Fukushima Prefecture > Fukushima (0.04)
- North America > United States > Indiana (0.04)
- North America > Canada > Quebec (0.04)
- Asia > Middle East > Jordan (0.04)
- Research Report > New Finding (0.46)
- Research Report > Promising Solution (0.34)
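The Ordered Memory abstract above describes controlling memory writes and erasures with the cumulative probability of an attention distribution. A minimal NumPy sketch of that gating idea (not the authors' exact formulation; the function name and shapes are illustrative assumptions):

```python
import numpy as np

def cumulative_gate(scores):
    """Turn attention scores over memory slots into soft write/erase gates.

    A softmax gives a distribution over slots; its cumulative sum yields a
    monotone gate in [0, 1], so slots up to the attended position are softly
    erased while the remaining slots are preserved.
    """
    exp = np.exp(scores - scores.max())   # stable softmax
    probs = exp / exp.sum()               # attention distribution over slots
    erase = np.cumsum(probs)              # cumulative probability: erase gate
    keep = 1.0 - erase                    # complementary keep gate
    return probs, erase, keep

# Example: three memory slots, the middle one attended most strongly.
probs, erase, keep = cumulative_gate(np.array([0.1, 2.0, 0.3]))
```

Because `erase` is a cumulative sum of probabilities, it is monotone in the slot index, which is what makes the write/erase behavior "ordered" across memory slots.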
Appendix for "Fine-Tuning Pre-Trained Language Models Effectively by Optimizing Subnetworks Adaptively"
In Sec. 3.3, we experimentally verified that DPS outperforms various fine-tuning methods. Table 1: Eight datasets used in this paper from the GLUE benchmark. In this paper, we investigate the performance of DPS on five distinctive and widely used large-scale pre-trained language models, namely BERT (Devlin et al., 2018), RoBERTa (Liu et al., 2019), and DeBERTa, which improves Transformer-based pre-trained models with a disentangled attention mechanism and an enhanced mask decoder. We use mixed-precision training to speed up the experimental process; this method is also applied by ELECTRA when fine-tuning on downstream tasks. Appendix D. Experimental Details for Different Fine-tuning Methods. The following is our hyperparameter search space for different fine-tuning regularization methods: Mixout: we grid search the Mixout probability p over {0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8}.
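The appendix above grid searches the Mixout probability p. Mixout replaces dropout's zeroing with the corresponding pretrained parameter value, rescaled so the expected parameters match the fine-tuned ones. A minimal NumPy sketch of that idea (function name and shapes are illustrative assumptions, not the paper's code):

```python
import numpy as np

def mixout(w_finetuned, w_pretrained, p, rng):
    """Mixout sketch: with probability p, each fine-tuned parameter is
    replaced by its pretrained value (instead of zero, as in dropout),
    then the deviation from the pretrained weights is rescaled by
    1/(1-p) so that E[output] equals w_finetuned."""
    mask = rng.random(w_finetuned.shape) < p          # True -> use pretrained value
    mixed = np.where(mask, w_pretrained, w_finetuned)
    return w_pretrained + (mixed - w_pretrained) / (1.0 - p)

rng = np.random.default_rng(0)
w_ft, w_pre = np.ones((4, 4)), np.zeros((4, 4))
out = mixout(w_ft, w_pre, 0.5, rng)   # entries are either 0 (pretrained) or 2 (rescaled)
```

A grid search over p, as in the appendix, would simply loop over {0.1, ..., 0.8}, fine-tune with each value, and keep the best validation score.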
Why Anthropic's New AI Model Sometimes Tries to 'Snitch'
Anthropic's alignment team was doing routine safety testing in the weeks leading up to the release of its latest AI models when researchers discovered something unsettling: When one of the models detected that it was being used for "egregiously immoral" purposes, it would attempt to "use command-line tools to contact the press, contact regulators, try to lock you out of the relevant systems, or all of the above," researcher Sam Bowman wrote in a post on X last Thursday. Bowman deleted the post shortly after he shared it, but the narrative about Claude's whistleblower tendencies had already escaped containment. "Claude is a snitch," became a common refrain in some tech circles on social media. At least one publication framed it as an intentional product feature rather than what it was--an emergent behavior. "It was a hectic 12 hours or so while the Twitter wave was cresting," Bowman tells WIRED.
Why AI Safety Researchers Are Worried About DeepSeek
The release of DeepSeek R1 stunned Wall Street and Silicon Valley this month, spooking investors and impressing tech leaders. But amid all the talk, many overlooked a critical detail about the way the new Chinese AI model functions--a nuance that has researchers worried about humanity's ability to control sophisticated new artificial intelligence systems. It's all down to an innovation in how DeepSeek R1 was trained--one that led to surprising behaviors in an early version of the model, which researchers described in the technical documentation accompanying its release. During testing, researchers noticed that the model would spontaneously switch between English and Chinese while it was solving problems. When they forced it to stick to one language, thus making it easier for users to follow along, they found that the system's ability to solve the same problems would diminish.
- North America > United States > New York > New York County > New York City (0.25)
- North America > United States > California (0.25)
Emergent inabilities? Inverse scaling over the course of pretraining
Michaelov, James A., Bergen, Benjamin K.
Does inverse scaling only occur as a function of model size, or can it also occur over the course of training? We carry out an exploratory study investigating whether the performance of language models on specific tasks can decrease (while general performance remains high) during training on the language modeling task. We find 8 tasks on which Pythia 12B (Biderman et al., 2023) shows decreased performance over the course of training. Five of these tasks (TruthfulQA-MC1, TruthfulQA-MC2, Hindsight Neglect, Memo Trap, and Pattern Match Suppression) additionally show a consistent relationship whereby larger language models show a greater decrease in performance the more they are trained, despite showing standard (positive) scaling overall. This highlights the importance of testing performance on all relevant benchmarks any time models are trained on additional data, even if their overall performance improves.
- Asia > Middle East > Jordan (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- North America > United States > California > San Diego County > San Diego (0.04)
- (3 more...)
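The abstract above hinges on tracking a task's score across training checkpoints and flagging declines. A minimal sketch of that bookkeeping (the helper name and the checkpoint accuracies are hypothetical, not the paper's data):

```python
def inverse_scaling_segments(history):
    """Given (training_step, task_accuracy) pairs, report the net change in
    accuracy from first to last checkpoint and the number of step-to-step
    drops; a negative net change signals inverse scaling over pretraining."""
    history = sorted(history)  # order checkpoints by training step
    deltas = [b[1] - a[1] for a, b in zip(history, history[1:])]
    net_change = history[-1][1] - history[0][1]
    drops = sum(1 for d in deltas if d < 0)
    return net_change, drops

# Hypothetical accuracies for one task at three checkpoints:
net, drops = inverse_scaling_segments([(1000, 0.62), (10000, 0.58), (100000, 0.51)])
# net < 0 with repeated drops: performance degrades as training continues
```

In practice one would compute each checkpoint's accuracy by evaluating the corresponding model snapshot on the benchmark; the sketch only covers the trend analysis the paper's conclusion relies on.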
Large Language Models: The Need for Nuance in Current Debates and a Pragmatic Perspective on Understanding
van Dijk, Bram M. A., Kouwenhoven, Tom, Spruit, Marco R., van Duijn, Max J.
Current Large Language Models (LLMs) are unparalleled in their ability to generate grammatically correct, fluent text. LLMs are appearing rapidly, and debates on LLM capacities have taken off, but reflection is lagging behind. Thus, in this position paper, we first zoom in on the debate and critically assess three points recurring in critiques of LLM capacities: i) that LLMs only parrot statistical patterns in the training data; ii) that LLMs master formal but not functional language competence; and iii) that language learning in LLMs cannot inform human language learning. Drawing on empirical and theoretical arguments, we show that these points need more nuance. Second, we outline a pragmatic perspective on the issue of 'real' understanding and intentionality in LLMs. Understanding and intentionality pertain to unobservable mental states we attribute to other humans because they have pragmatic value: they allow us to abstract away from complex underlying mechanics and predict behaviour effectively. We reflect on the circumstances under which it would make sense for humans to similarly attribute mental states to LLMs, thereby outlining a pragmatic philosophical context for LLMs as an increasingly prominent technology in society.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- North America > Canada > Ontario > Toronto (0.04)
- Europe > Netherlands > North Holland > Amsterdam (0.04)
- (5 more...)
- Health & Medicine (0.67)
- Leisure & Entertainment > Games (0.46)
- Education > Curriculum > Subject-Specific Education (0.45)
Mysterious sounds in stratosphere can't be traced to any known source
Solar-powered balloons floating in the stratosphere have recorded low-frequency sounds of mysterious origin. "When we started flying balloons years ago, we didn't really know what we'd hear," says Daniel Bowman at Sandia National Laboratories in New Mexico. "We learned how to identify sounds from explosions, meteor crashes, aircraft, thunderstorms and cities. But virtually every time we send balloons up, we find sounds that we cannot identify." Bowman and his colleagues measured infrasound signals – sounds with a frequency so low they are inaudible to human ears – using solar-powered balloons floating 20 kilometres high.
- North America > United States > New Mexico > Chaves County > Roswell (0.06)
- North America > United States > Mississippi (0.06)
- North America > United States > Illinois > Cook County > Chicago (0.06)
- (2 more...)
- Government > Regional Government > North America Government > United States Government (0.55)
- Energy > Renewable > Solar (0.51)
- Government > Military (0.33)