Creating a Public Repository for Joining Private Data
How can one publish a dataset with sensitive attributes in a way that both preserves privacy and enables joins with other datasets on those same sensitive attributes? This problem arises in many contexts, e.g., a hospital and an airline may want to jointly determine whether people who take long-haul flights are more likely to catch respiratory infections. If they join their data on a common user identifier such as email address, they can determine the answer, but doing so breaks privacy. This paper shows how the hospital can generate a private sketch and how the airline can privately join with the hospital's sketch by email address. The proposed solution satisfies pure differential privacy and gives approximate answers to linear queries and optimization problems over those joins. Whereas prior work such as secure function evaluation requires sender/receiver interaction, a distinguishing characteristic of the proposed approach is that it is non-interactive. Consequently, the sketch can be published to a repository for any organization to join with, facilitating data discovery. The accuracy of the method is demonstrated through both theoretical analysis and extensive empirical evidence.
- Health & Medicine (1.00)
- Transportation > Air (0.60)
- Information Technology > Security & Privacy (0.43)
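The hospital/airline protocol described in the abstract can be illustrated with a toy construction: the hospital publishes per-bucket counts of hashed email addresses with Laplace noise (a count query with sensitivity 1 satisfies pure differential privacy), and the airline reads the published sketch only at its own customers' buckets. This is a loose sketch of the general idea under stated assumptions, not the paper's actual construction; all function names and parameters below are invented.

```python
import hashlib
import math
import random

def bucket(email, width):
    # Stable hash of the join key (email) into a sketch index.
    digest = hashlib.sha256(email.encode("utf-8")).hexdigest()
    return int(digest, 16) % width

def laplace_noise(rng, scale):
    # Inverse-CDF sampling of Laplace(0, scale).
    u = rng.random() - 0.5
    return -scale * math.copysign(1.0, u) * math.log(1.0 - 2.0 * abs(u))

def hospital_sketch(records, width=4096, epsilon=1.0, seed=0):
    """Publishable sketch: per-bucket counts of patients with the sensitive
    attribute, plus Laplace(1/epsilon) noise. Adding or removing one patient
    changes a single bucket by at most 1, so the noisy counts satisfy pure
    epsilon-differential privacy."""
    rng = random.Random(seed)
    sketch = [0.0] * width
    for email, has_attribute in records:
        if has_attribute:
            sketch[bucket(email, width)] += 1.0
    return [count + laplace_noise(rng, 1.0 / epsilon) for count in sketch]

def airline_join_count(sketch, customer_emails):
    # Non-interactive join: the airline reads the published sketch only at
    # the buckets of its own customers' hashed emails and sums the counts.
    width = len(sketch)
    buckets = {bucket(email, width) for email in customer_emails}
    return sum(sketch[i] for i in buckets)
```

Because the sketch is published once, any organization can run `airline_join_count` against it without further interaction; answers are approximate due to hash collisions and the injected noise.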
LLM Robustness Leaderboard v1: Technical Report
Peigné-Lefebvre, Pierre, Feuillade-Montixi, Quentin, David, Tom, Miailhe, Nicolas
This technical report accompanies the LLM robustness leaderboard published by PRISM Eval for the Paris AI Action Summit. We introduce the PRISM Eval Behavior Elicitation Tool (BET), an AI system that performs automated red-teaming through Dynamic Adversarial Optimization and achieves a 100% Attack Success Rate (ASR) against 37 of 41 state-of-the-art LLMs. Beyond binary success metrics, we propose a fine-grained robustness metric estimating the average number of attempts required to elicit harmful behaviors, revealing that attack difficulty varies by over 300-fold across models despite universal vulnerability. We introduce primitive-level vulnerability analysis to identify which jailbreaking techniques are most effective for specific hazard categories. Our collaborative evaluation with trusted third parties from the AI Safety Network demonstrates practical pathways for distributed robustness assessment across the community.
- North America > United States (0.92)
- Europe > France (0.04)
- North America > Canada (0.04)
- Asia > Singapore (0.04)
A Step-by-Step Guide to Creating a Robust Autonomous Drone Testing Pipeline
Jiang, Yupeng, Deng, Yao, Schroder, Sebastian, Liang, Linfeng, Gambhir, Suhaas, James, Alice, Seth, Avishkar, Pirrie, James, Zhang, Yihao, Zheng, Xi
Autonomous drones are rapidly reshaping industries ranging from aerial delivery and infrastructure inspection to environmental monitoring and disaster response. Ensuring the safety, reliability, and efficiency of these systems is paramount as they transition from research prototypes to mission-critical platforms. This paper presents a step-by-step guide to establishing a robust autonomous drone testing pipeline, covering each critical stage: Software-in-the-Loop (SIL) Simulation Testing, Hardware-in-the-Loop (HIL) Testing, Controlled Real-World Testing, and In-Field Testing. Using practical examples, including a marker-based autonomous landing system, we demonstrate how to systematically verify drone system behaviors, identify integration issues, and optimize performance. Furthermore, we highlight emerging trends shaping the future of drone testing, including the integration of neurosymbolic methods and LLMs, the creation of co-simulation environments, and Digital Twin-enabled simulation-based testing techniques. By following this pipeline, developers and researchers can achieve comprehensive validation, minimize deployment risks, and prepare autonomous drones for safe and reliable real-world operations.
- North America > United States > California (0.14)
- Asia > China > Beijing > Beijing (0.04)
- Africa > Rwanda (0.04)
- (5 more...)
- Workflow (1.00)
- Instructional Material > Training Manual (0.61)
- Transportation > Air (1.00)
- Media (1.00)
- Aerospace & Defense > Aircraft (1.00)
- (4 more...)
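The staged pipeline in the abstract (SIL, HIL, controlled real-world, in-field) is naturally a fail-fast sequence of gates: a drone should not advance to a riskier stage with unresolved issues. The sketch below is an illustrative assumption about that gating logic, not code from the paper; the `Stage` and `run_pipeline` names are invented.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Tuple

@dataclass
class Stage:
    name: str
    run: Callable[[], bool]  # returns True when the stage's checks pass

def run_pipeline(stages: List[Stage]) -> Tuple[List[str], Optional[str]]:
    """Run the testing stages in order, stopping at the first failure so a
    drone never advances past a stage with unresolved issues. Returns the
    names of passed stages and the first failing stage (or None)."""
    passed: List[str] = []
    for stage in stages:
        if not stage.run():
            return passed, stage.name  # fail fast before riskier stages
        passed.append(stage.name)
    return passed, None
```

A pipeline would then be declared in the paper's stage order, e.g. `run_pipeline([Stage("SIL", sil_checks), Stage("HIL", hil_checks), Stage("Controlled real-world", ctrl_checks), Stage("In-field", field_checks)])`, where each callable wraps that stage's test suite.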
THiNK: Can Large Language Models Think-aloud?
Yu, Yongan, Wu, Mengqian, Lin, Yiran, Lobczowski, Nikki G.
Assessing higher-order thinking skills in large language models (LLMs) remains a fundamental challenge, especially in tasks that go beyond surface-level accuracy. In this work, we propose THiNK (Testing Higher-order Notion of Knowledge), a multi-agent, feedback-driven evaluation framework grounded in Bloom's Taxonomy. THiNK frames reasoning assessment as an iterative task of problem generation, critique, and revision, encouraging LLMs to think aloud through step-by-step reflection and refinement. This enables a systematic evaluation of both lower-order (e.g., remember, understand) and higher-order (e.g., evaluate, create) thinking skills. We apply THiNK to seven state-of-the-art LLMs and perform a detailed cognitive analysis of their outputs. Results reveal that while models perform reliably well on lower-order categories, they struggle with applying knowledge in realistic contexts and exhibit limited abstraction. Structured feedback loops significantly improve reasoning performance, particularly in higher-order thinking. Qualitative evaluations further confirm that THiNK-guided outputs better align with domain logic and problem structure. Our framework provides a scalable methodology for probing and enhancing LLM reasoning, offering new directions for evaluation grounded in learning science; its code is available at our GitHub repository.
- Education > Educational Setting (0.67)
- Education > Curriculum > Subject-Specific Education (0.66)
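The iterative generation-critique-revision task that THiNK describes can be sketched as a generic feedback loop. The function names, scoring convention, and stopping rule below are invented placeholders for illustration, not the framework's actual API.

```python
def think_loop(problem, generate, critique, revise, max_rounds=3, threshold=0.8):
    """Generic generate-critique-revise loop in the spirit of THiNK.
    `generate` drafts a solution, `critique` returns a (score, feedback)
    pair, and `revise` rewrites the draft using the feedback; the loop
    stops once the score clears the threshold or the round budget is
    spent. The history of (draft, score, feedback) triples supports the
    kind of per-round cognitive analysis the paper performs."""
    draft = generate(problem)
    history = []
    for _ in range(max_rounds):
        score, feedback = critique(draft)
        history.append((draft, score, feedback))
        if score >= threshold:
            break
        draft = revise(draft, feedback)
    return draft, history
```

In a multi-agent setting, `generate`, `critique`, and `revise` would each wrap a different LLM agent; the structured feedback loop is what the abstract credits with improving higher-order reasoning.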
Think Twice Before Creating That ChatGPT Action Figure
At the start of April, an influx of action figures started appearing on social media sites including LinkedIn and X. Each figure depicted the person who had created it with uncanny accuracy, complete with personalized accessories such as reusable coffee cups, yoga mats, and headphones. All this is possible because of OpenAI's new GPT-4o-powered image generator, which supercharges ChatGPT's ability to edit pictures, render text, and more. OpenAI's ChatGPT image generator can also create pictures in the style of Japanese animated film company Studio Ghibli, a trend that quickly went viral, too. The images are fun and easy to make: all you need is a free ChatGPT account and a photo.
- Media > Film (1.00)
- Information Technology > Security & Privacy (1.00)
- Leisure & Entertainment (0.92)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.60)
Creating a Formally Verified Neural Network for Autonomous Navigation: An Experience Report
Bukhari, Syed Ali Asadullah, Flinkow, Thomas, Inkarbekov, Medet, Pearlmutter, Barak A., Monahan, Rosemary
The increased reliance of self-driving vehicles on neural networks opens up the challenge of their verification. In this paper we present an experience report, describing a case study which we undertook to explore the design and training of a neural network on a custom dataset for vision-based autonomous navigation. We are particularly interested in the use of machine learning with differentiable logics to obtain networks satisfying basic safety properties by design, guaranteeing the behaviour of the neural network after training. We motivate the choice of a suitable neural network verifier for our purposes and report our observations on the use of neural network verifiers for self-driving systems.
- North America > United States > California > Los Angeles County > Los Angeles (0.14)
- Europe > Austria > Vienna (0.14)
- North America > United States > New York > New York County > New York City (0.04)
- Europe > Ireland (0.04)
- Information Technology (1.00)
- Transportation > Ground > Road (0.69)
- Automobiles & Trucks (0.69)
Dr.Academy: A Benchmark for Evaluating Questioning Capability in Education for Large Language Models
Chen, Yuyan, Wu, Chenwei, Yan, Songzhou, Liu, Panjun, Zhou, Haoyu, Xiao, Yanghua
Teachers are central to imparting knowledge and guiding learners, and the role of large language models (LLMs) as potential educators is emerging as an important area of study. Recognizing LLMs' capability to generate educational content can lead to advances in automated and personalized learning. While LLMs have been tested for their comprehension and problem-solving skills, their capability in teaching remains largely unexplored. In teaching, questioning is a key skill that guides students to analyze, evaluate, and synthesize core concepts and principles. Therefore, our research introduces a benchmark to evaluate LLMs' questioning capability in education as teachers by assessing the educational questions they generate, utilizing Anderson and Krathwohl's taxonomy across general, monodisciplinary, and interdisciplinary domains. We shift the focus from LLMs as learners to LLMs as educators, assessing their teaching capability by guiding them to generate questions. We apply four metrics, including relevance, coverage, representativeness, and consistency, to evaluate the educational quality of LLMs' outputs. Our results indicate that GPT-4 demonstrates significant potential in teaching general, humanities, and science courses; Claude2 appears more apt as an interdisciplinary teacher. Furthermore, the automatic scores align with human perspectives.
- Africa > Middle East > Egypt (0.04)
- Europe > Middle East > Republic of Türkiye > Istanbul Province > Istanbul (0.04)
- Asia > Middle East > Republic of Türkiye > Istanbul Province > Istanbul (0.04)
- (23 more...)
- Research Report > New Finding (0.47)
- Personal > Interview (0.46)
- Instructional Material > Course Syllabus & Notes (0.34)
- Materials > Chemicals > Industrial Gases (1.00)
- Education > Assessment & Standards (0.66)
Interview with Henok Biadglign Ademtew: Creating an Amharic, Ge'ez and English parallel dataset
African languages are not well-represented in natural language processing (NLP). This is in large part due to a lack of resources for training models. Henok Biadglign Ademtew and Mikiyas Girma Birbo have created an Amharic, Ge'ez, and English parallel dataset to help advance research into low-resource languages. We spoke to Henok about this project, the creation of the dataset, and some of the challenges faced. Most of the languages in Africa are very low-resourced, and not much text data is available.
Creating an African American-Sounding TTS: Guidelines, Technical Challenges, and Surprising Evaluations
Pinhanez, Claudio, Fernandez, Raul, Grave, Marcelo, Nogima, Julio, Hoory, Ron
The scarcity of African American-sounding synthetic voices poses challenges for applications interested in targeting specific demographics (e.g., an African American business or NGO; a voice-tutoring system for children who are not of White ethnicity, etc.). The ultimate goal of the project described in this paper is to give designers, developers, and enterprises the choice of a professional voice which is clearly recognizable as African American, and therefore better able to address diversity and inclusiveness issues. More precisely, our goal is to create an African American Text-to-Speech system, which we will refer to simply as an African American voice or AA voice, able to produce synthetic audio segments from standard English texts, and which will be recognized by African American speakers and non-speakers alike as sounding like a native African American speaker. The AA voice should exhibit a level of technical quality similar to the Standard American English (SAE) synthetic voices currently available through professional platforms. The evaluation of the technical quality of the AA voice, however, is not addressed in this paper, which focuses primarily on whether the AA voice can be recognized as sounding like an African American speaker. Linguists [27, 28] have described a continuum of dialects under what is often termed African American Vernacular English (AAVE). At one end of the spectrum, one finds the largest deviation from SAE in terms of lexicon (including slang), syntax and morphology, and phonological/phonetic properties. At the other end, AAVE speakers begin to approach SAE in terms of lexicon and grammar but still retain marked speech characteristics (primarily in terms of intonation, phonation, and vowel placement [14, 28]) that grant the speech a distinctive identity, which listeners use as cues in the perception of African American English [44].
- North America > United States > California > San Francisco County > San Francisco (0.14)
- North America > United States > South Carolina > Greenville County > Greenville (0.06)
- North America > United States > New York > New York County > New York City (0.04)
- (31 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Questionnaire & Opinion Survey (1.00)