Almanac


The 'Farmers' Almanac' says goodbye after 208 years

Popular Science

After more than 200 years of weather wisdom, folklore, and time-tested advice, the editors have announced that the 2026 edition of the 'Farmers' Almanac' will be its last. The website will remain operational through the end of December 2025. "Many of you grew up hearing your parents or grandparents quote from the Farmers' Almanac, always having a copy nearby. Maybe you have planted by our Moon phases, consulted the Almanac for the 'Best Days' to potty train, wean, or go fishing," Editor Sandi Duncan and Editor Emeritus Peter Geiger wrote in the announcement.


August stargazing: The Perseids, a 'big fish,' celestial conjunctions, and more

Popular Science

As any diligent stargazer knows, mid-summer means one thing: the Perseids! This meteor shower hits its peak on August 12 this year, and while that date is inconveniently close to that of this month's full moon, there should still be plenty of meteors on show for those who choose their time and location with care. As another long summer day finally recedes into night, look east. If the sky is clear, you might well spy the Summer Triangle.


ALMANACS: A Simulatability Benchmark for Language Model Explainability

Mills, Edmund, Su, Shiye, Russell, Stuart, Emmons, Scott

arXiv.org Machine Learning

How do we measure the efficacy of language model explainability methods? While many explainability methods have been developed, they are typically evaluated on bespoke tasks, preventing an apples-to-apples comparison. To help fill this gap, we present ALMANACS, a language model explainability benchmark. ALMANACS scores explainability methods on simulatability, i.e., how well the explanations improve behavior prediction on new inputs. The ALMANACS scenarios span twelve safety-relevant topics such as ethical reasoning and advanced AI behaviors; they have idiosyncratic premises to invoke model-specific behavior; and they have a train-test distributional shift to encourage faithful explanations. By using another language model to predict behavior based on the explanations, ALMANACS is a fully automated benchmark. We use ALMANACS to evaluate counterfactuals, rationalizations, attention, and Integrated Gradients explanations. Our results are sobering: when averaged across all topics, no explanation method outperforms the explanation-free control. We conclude that despite modest successes in prior work, developing an explanation method that aids simulatability in ALMANACS remains an open challenge.

Understanding the behavior of deep neural networks is critical for their safe deployment. While deep neural networks are a black box by default, a wide variety of interpretability methods are being developed to explain their behavior (Räuker et al., 2023; Nauta et al., 2022). Some approaches, such as LIME (Ribeiro et al., 2016) and MUSE (Lakkaraju et al., 2019), try to approximate output behavior. Other approaches try to mechanistically explain the circuits inside a network (Nanda et al., 2023; Wang et al., 2023). Some approaches imitate explanations in the training data (Camburu et al., 2018; Narang et al., 2020; Marasović et al., 2022). Other approaches study the network's activations, such as a transformer's attention over its input (Serrano & Smith, 2019; Wiegreffe & Pinter, 2019). Others aim to create neural networks that are intrinsically explainable (Jain et al., 2020).

With so many interpretability methods to choose from, how can we tell which one works best? Despite years of work in the field, there is no consistent evaluation standard. New interpretability papers generally test their methods on bespoke tasks, making it difficult to assess their true effectiveness. To solve this issue, Doshi-Velez & Kim (2017), Nauta et al. (2022), and Räuker et al. (2023) argue that we need standard interpretability benchmarks. Just as benchmarks have driven progress in computer vision (Deng et al., 2009), natural language processing (Wang et al., 2019b;a), and reinforcement learning (Brockman et al., 2016; Tunyasuvunakool et al., 2020), we seek to drive progress in interpretability by enabling apples-to-apples comparisons across diverse methods.
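To make the simulatability setup concrete, here is a minimal Python sketch of an automated evaluation loop in the spirit of the abstract above: a second "predictor" language model guesses the explained model's behavior on held-out inputs, with and without access to an explanation, and an explanation method counts as helpful only if it beats the explanation-free control. All names and the toy stand-ins below are illustrative assumptions, not the benchmark's actual API.

```python
"""Hypothetical sketch of an automated simulatability score, assuming the model's
behavior on a prompt can be summarized as a single probability."""

from statistics import mean
from typing import Callable, Optional, Sequence

Behavior = Callable[[str], float]                  # explained model: prompt -> yes-probability
Predictor = Callable[[str, Optional[str]], float]  # second model: (prompt, explanation) -> guess


def simulatability_error(model: Behavior,
                         predictor: Predictor,
                         test_prompts: Sequence[str],
                         explanation: Optional[str]) -> float:
    """Mean absolute error of the predictor on held-out prompts (lower is better)."""
    return mean(abs(model(p) - predictor(p, explanation)) for p in test_prompts)


def evaluate_explanation_method(model: Behavior,
                                predictor: Predictor,
                                test_prompts: Sequence[str],
                                explanation: str) -> dict:
    """An explanation helps only if it beats the explanation-free control."""
    with_expl = simulatability_error(model, predictor, test_prompts, explanation)
    control = simulatability_error(model, predictor, test_prompts, None)
    return {"with_explanation": with_expl,
            "control": control,
            "improvement": control - with_expl}  # positive means the explanation helped


if __name__ == "__main__":
    # Toy stand-ins: the "model" says yes more often on longer prompts, and the
    # "predictor" defaults to 0.5 unless an explanation points it to prompt length.
    toy_model: Behavior = lambda p: min(1.0, len(p) / 40)

    def toy_predictor(p: str, expl: Optional[str]) -> float:
        return min(1.0, len(p) / 40) if expl else 0.5

    prompts = ["Should I fish today?",
               "Is it wise to plant before the last frost this season?"]
    print(evaluate_explanation_method(toy_model, toy_predictor, prompts,
                                      "The model says yes more often on longer questions."))
```

In a real benchmark the predictor would itself be a language model prompted with the explanations of training-set behavior, which is what makes the evaluation fully automated.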


Almanac: Retrieval-Augmented Language Models for Clinical Medicine

Zakka, Cyril, Chaurasia, Akash, Shad, Rohan, Dalal, Alex R., Kim, Jennifer L., Moor, Michael, Alexander, Kevin, Ashley, Euan, Boyd, Jack, Boyd, Kathleen, Hirsch, Karen, Langlotz, Curt, Nelson, Joanna, Hiesinger, William

arXiv.org Artificial Intelligence

In recent years, language model pre-training has emerged as a powerful training paradigm in natural language processing (NLP) [1-4]. For a large number of these language models, performance improvements have been empirically observed to scale with model and dataset size, with the well-documented emergence of zero-shot capabilities and sample efficiency on a range of downstream NLP tasks [5-7]. However, due to the nature of their training objective (predicting the next token in a sentence), large language models (LLMs) can be prone to generating factually incorrect statements, a phenomenon commonly known as hallucination [8, 9]. More contentiously, many works have also demonstrated these models' ability to reproduce social biases, as well as to generate statements reinforcing gender, racial, and religious stereotypes [10, 11]. In an effort to reduce these unwanted behaviors, several works have explored different ways of steering LLM outputs to more closely align with user intent, including fine-tuning with human feedback [12, 13] and natural language prompt engineering [14, 15].
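As a rough illustration of the retrieval-augmented approach named in the title (and not of the Almanac system itself), here is a minimal Python sketch: a toy word-overlap retriever selects reference passages and prepends them to the prompt so that a model answers from supplied context rather than from parametric memory alone. Every function, string, and document in this sketch is a hypothetical placeholder.

```python
"""Toy sketch of retrieval-augmented prompting: retrieve reference passages,
then build a prompt that instructs the model to answer only from that context."""


def word_overlap(query: str, passage: str) -> int:
    """Toy relevance score: number of shared lowercase words."""
    return len(set(query.lower().split()) & set(passage.lower().split()))


def retrieve(query: str, corpus: list[str], k: int = 2) -> list[str]:
    """Return the k passages that best match the query under the toy score."""
    return sorted(corpus, key=lambda p: word_overlap(query, p), reverse=True)[:k]


def build_grounded_prompt(query: str, corpus: list[str]) -> str:
    """Prepend retrieved passages so the model answers from cited context."""
    context = "\n".join(f"- {p}" for p in retrieve(query, corpus))
    return ("Answer using only the context below; say 'unknown' if it is insufficient.\n"
            f"Context:\n{context}\n\nQuestion: {query}\nAnswer:")


if __name__ == "__main__":
    corpus = [
        "Guideline X recommends dose A for condition Y in adults.",
        "Trial Z reported no benefit of drug B for condition Y.",
        "Unrelated note about clinic scheduling.",
    ]
    print(build_grounded_prompt("What dose is recommended for condition Y?", corpus))
```

A production system would replace the word-overlap scorer with a proper retriever over a curated document store and pass the grounded prompt to an actual language model; the point of the sketch is only the shape of the pipeline.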


La Niña Effects? National Weather Service Predicts 2017 Winter Climate

International Business Times

The National Weather Service has released its first winter weather predictions for the approaching season in the United States. But if the wildcard La Niña develops, it might shake some things up. The chances that it will develop are strong, too: both observations and computer models suggest that La Niña is likely to form. If it does develop, Mike Halpert, deputy director of the Climate Prediction Center at the National Oceanic and Atmospheric Administration, predicts that it will be "weak and potentially short-lived." La Niña refers to colder-than-normal conditions in the Pacific Ocean near the equator.