AITopics

2409.00222

Country:

North America > United States > New York (0.04)
North America > United States > California (0.04)
Antarctica (0.04)

Genre: Research Report (0.82)

Industry:

Law (1.00)
Government (0.93)
Health & Medicine > Therapeutic Area > Immunology (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

BBC NewsSep-17-2024, 13:10:22 GMT

Murder arrests after death of baby boy

Two people have been arrested on suspicion of murder after a baby died in Stoke-on-Trent. Police and the ambulance service were called to Sherwin Road, Burslem, shortly after 09:00 BST on 27 August following the death of a baby boy. A woman, 26, and man, 25, were arrested on 30 August on suspicion of causing or allowing the death of a child. The pair, from Stoke-on-Trent, were further arrested on suspicion of murder on Tuesday, Staffordshire Police said. A spokesperson for the force said specialist officers were supporting the baby's family.

artificial intelligence, social media, stoke-on-trent, (8 more...)

Country:

Europe > United Kingdom > England > Staffordshire > Stoke-on-Trent (0.56)
South America (0.17)
North America > United States (0.17)
(15 more...)

Industry: Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)

Technology:

Information Technology > Communications > Social Media (0.36)
Information Technology > Artificial Intelligence (0.32)

Lingo, Ryan, Arroyo, Martin, Chhajer, Rajeev

Enhancing LLM Problem Solving with REAP: Reflection, Explicit Problem Deconstruction, and Advanced Prompting

arXiv.org Artificial IntelligenceSep-14-2024

Large Language Models (LLMs) have transformed natural language processing, yet improving their problem-solving capabilities, particularly for complex, reasoning-intensive tasks, remains a persistent challenge. This paper introduces the REAP (Reflection, Explicit Problem Deconstruction, and Advanced Prompting) method, an innovative approach within the dynamic context generation framework. REAP guides LLMs through reflection on the query, deconstructing it into manageable components, and generating relevant context to enhance the solution process. We evaluated REAP using a dataset designed to expose LLM limitations, comparing zero-shot prompting with REAP-enhanced prompts across six state-of-the-art models: OpenAI's o1-preview, o1-mini, GPT-4o, GPT-4o-mini, Google's Gemini 1.5 Pro, and Claude 3.5 Sonnet. The results demonstrate notable performance gains, with o1-mini improving by 40.97%, GPT-4o by 66.26%, and GPT-4o-mini by 112.93%. Despite the already strong baseline performance of OpenAI's o1-preview, modest gains were observed. Beyond performance improvements, REAP offers a cost-effective solution; for example, GPT-4o-mini, which is approximately 100 times cheaper than o1-preview, delivered competitive results. REAP also improves the clarity of model outputs, making it easier for humans to understand the reasoning behind the results and simplifying the process of identifying and addressing any issues. These findings demonstrate REAP's potential to greatly improve the capabilities of LLMs, providing both better performance and increased cost-efficiency across a wide range of applications.

explicit temporal information, information and logical deduction, straightforward interpretation, (17 more...)

2409.09415

Country:

North America > United States (0.04)
Europe > United Kingdom > Scotland (0.04)
North America > Canada > Ontario > Middlesex County > London (0.04)
(2 more...)

Genre:

Research Report > Promising Solution (1.00)
Research Report > New Finding (1.00)
Research Report > Experimental Study (0.92)

Industry:

Leisure & Entertainment > Sports (1.00)
Health & Medicine (1.00)
Information Technology (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.46)

BBC NewsSep-13-2024, 06:25:44 GMT

Teenager invents robot to solve Rubik's Cube

Teenager invents robot to solve Rubik's Cube BBCRuarcc the year 10 student who has programmed a robot that can solve a Rubik's Cube puzzle A 13-year-old schoolboy has invented a Lego robot that can solve a Rubik's cube. Ruarcc, from St Malachy's College in north Belfast, first took steps to create puzzle-solving robot prototypes in his second year at school, aged 12. This was made possible after the school launched its creative digital technology hub (CDTH) last year. Teacher Clare McGrath commented she "didn't believe" that Ruarcc's robot would work at first.'People are amazed my robot can solve Rubik's Cube' Ruarcc told BBC News NI it was "frustrating", but he worked on making it better. "People tend to be amazed that it can solve one," he said.

robot, solve rubik, teenager invent robot, (10 more...)

Country:

Europe > United Kingdom > Northern Ireland (0.17)
South America (0.16)
North America > United States (0.16)
(14 more...)

Industry:

Leisure & Entertainment > Games > Rubik's Cube (1.00)
Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (1.00)

Ibrahim, Michael, Rozas, Heraldo, Gebraeel, Nagi

Wasserstein Distributionally Robust Multiclass Support Vector Machine

arXiv.org Machine LearningSep-12-2024

We study the problem of multiclass classification for settings where data features $\mathbf{x}$ and their labels $\mathbf{y}$ are uncertain. We identify that distributionally robust one-vs-all (OVA) classifiers often struggle in settings with imbalanced data. To address this issue, we use Wasserstein distributionally robust optimization to develop a robust version of the multiclass support vector machine (SVM) characterized by the Crammer-Singer (CS) loss. First, we prove that the CS loss is bounded from above by a Lipschitz continuous function for all $\mathbf{x} \in \mathcal{X}$ and $\mathbf{y} \in \mathcal{Y}$, then we exploit strong duality results to express the dual of the worst-case risk problem, and we show that the worst-case risk minimization problem admits a tractable convex reformulation due to the regularity of the CS loss. Moreover, we develop a kernel version of our proposed model to account for nonlinear class separation, and we show that it admits a tractable convex upper bound. We also propose a projected subgradient method algorithm for a special case of our proposed linear model to improve scalability. Our numerical experiments demonstrate that our model outperforms state-of-the art OVA models in settings where the training data is highly imbalanced. We also show through experiments on popular real-world datasets that our proposed model often outperforms its regularized counterpart as the first accounts for uncertain labels unlike the latter.

algorithm, cs pm, experiment, (16 more...)

arXiv.org Machine Learning

2409.08409

Country:

North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > California > Los Angeles County > Santa Monica (0.04)
Antarctica (0.04)

Genre: Research Report (0.82)

Industry:

Information Technology > Security & Privacy (1.00)
Health & Medicine > Therapeutic Area (0.92)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (1.00)

BBC NewsSep-5-2024, 12:02:25 GMT

John Lewis brings back 'never knowingly undersold'

John Lewis brings back'never knowingly undersold' Getty Images Retailer John Lewis is bringing back its "never knowingly undersold" price pledge from Monday, two years after abandoning it. It will also apply to online sales for the first time, whereas it previously only applied to in-store shopping, and will use AI to match the prices of 25 top retailers. The department store chain has been trying to win back customers after a tough few years that has seen it cut jobs and close several stores. It swung back to profit earlier this year, but is expected to continue shedding jobs as it seeks to revive its fortunes. The decision by John Lewis' new managing director Pete Ruis to restore the price pledge marks a change of direction from his predecessor.

artificial intelligence, john lewis bring back, never knowingly undersold, (5 more...)

Country:

South America (0.17)
North America > United States (0.17)
North America > Central America (0.17)
(13 more...)

Industry:

Retail (1.00)
Government > Regional Government (0.32)

Technology: Information Technology > Artificial Intelligence (0.72)

arXiv.org Artificial IntelligenceAug-28-2024

LM-PUB-QUIZ: A Comprehensive Framework for Zero-Shot Evaluation of Relational Knowledge in Language Models

Ploner, Max, Wiland, Jacek, Pohl, Sebastian, Akbik, Alan

Knowledge probing evaluates the extent to which a language model (LM) has acquired relational knowledge during its pre-training phase. It provides a cost-effective means of comparing LMs of different sizes and training setups and is useful for monitoring knowledge gained or lost during continual learning (CL). In prior work, we presented an improved knowledge probe called BEAR (Wiland et al., 2024), which enables the comparison of LMs trained with different pre-training objectives (causal and masked LMs) and addresses issues of skewed distributions in previous probes to deliver a more unbiased reading of LM knowledge. With this paper, we present LM-PUB- QUIZ, a Python framework and leaderboard built around the BEAR probing mechanism that enables researchers and practitioners to apply it in their work. It provides options for standalone evaluation and direct integration into the widely-used training pipeline of the Hugging Face TRANSFORMERS library. Further, it provides a fine-grained analysis of different knowledge types to assist users in better understanding the knowledge in each evaluated LM. We publicly release LM-PUB-QUIZ as an open-source project.

large language model, machine learning, natural language, (16 more...)

2408.15729

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Antarctica (0.05)
Africa > Uganda > Central Region > Kampala (0.05)
(7 more...)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

The New YorkerAug-26-2024, 10:00:00 GMT

Putting a Fine-Art Touch on Fixer-Uppers

You can view art (e.g., at a museum). "To go from photographing the getaway to making the getaway, it's like art becoming reality," the photographer Gray Malin said the other day. He was sitting behind the wheel of a blue Range Rover, dressed in a denim shirt, white jeans, and raffia loafers, cruising up the 101 to his latest work: a house he has renovated in Montecito, California, in order to rent to visitors. "How I vacation is how I want you to vacation," he said. As an artist, Malin specializes in glossy portraits of the good life: beaches, boats, a plane landing in St. Bart's.

fine-art touch, fixer-upper, malin, (9 more...)

The New Yorker

Country:

North America > United States > Nevada > Clark County > Las Vegas (0.05)
North America > United States > California > Los Angeles County > Los Angeles (0.05)
Asia > Bhutan (0.05)
Antarctica (0.05)

Industry:

Banking & Finance > Real Estate (0.51)
Media (0.36)

Technology: Information Technology > Artificial Intelligence (0.32)

BBC NewsAug-21-2024, 08:22:31 GMT

Seven things we learned from Gamescom opening night

It has been a year with no major new console launches and where the industry has seen strikes and cuts with thousands of workers being laid off. The opening night of Gamescom is often an opportunity for a big shiny night to get fans all excited for the year ahead. Setting the stage for the next 12 months, here are the biggest things we found out from Europe's biggest gaming show in Germany. In a year when games became films, and films became games, the convention centre in Cologne saw a night all about the big trailers. This year, Borderlands has taken attention for its movie adaptation starring Cate Blanchett and Kevin Hart. That film received some of the year's harshest reviews, but that has not scuppered plans for a new game in the mainline series.

gamescom, playstation 5, trailer, (16 more...)

Country:

South America (0.15)
North America > Central America (0.15)
North America > United States > Indiana (0.06)
(17 more...)

Industry:

Leisure & Entertainment > Games > Computer Games (1.00)
Government (1.00)

Technology: Information Technology > Artificial Intelligence (0.55)

arXiv.org Artificial IntelligenceAug-12-2024

Discover-then-Name: Task-Agnostic Concept Bottlenecks via Automated Concept Discovery

Rao, Sukrut, Mahajan, Sweta, Böhle, Moritz, Schiele, Bernt

Concept Bottleneck Models (CBMs) have recently been proposed to address the 'black-box' problem of deep neural networks, by first mapping images to a human-understandable concept space and then linearly combining concepts for classification. Such models typically require first coming up with a set of concepts relevant to the task and then aligning the representations of a feature extractor to map to these concepts. However, even with powerful foundational feature extractors like CLIP, there are no guarantees that the specified concepts are detectable. In this work, we leverage recent advances in mechanistic interpretability and propose a novel CBM approach -- called Discover-then-Name-CBM (DN-CBM) -- that inverts the typical paradigm: instead of pre-selecting concepts based on the downstream classification task, we use sparse autoencoders to first discover concepts learnt by the model, and then name them and train linear probes for classification. Our concept extraction strategy is efficient, since it is agnostic to the downstream task, and uses concepts already known to the model. We perform a comprehensive evaluation across multiple datasets and CLIP architectures and show that our method yields semantically meaningful concepts, assigns appropriate names to them that make them easy to interpret, and yields performant and interpretable CBMs. Code available at https://github.com/neuroexplicit-saar/discover-then-name.

concept bottleneck, dataset, task-agnostic concept bottleneck, (13 more...)

2407.14499

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > Montserrat (0.04)
Europe > Portugal > Lisbon > Lisbon (0.04)
(8 more...)

Genre: Research Report (1.00)

Industry:

Leisure & Entertainment > Sports (0.68)
Transportation > Air (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)