AITopics

Hackenberg, Maren, Connor, Sophia G., Kabus, Fabian, Brawner, June, Markham, Ella, Hardalupas, Mahi, Chowdhury, Areeq, Backofen, Rolf, Köttgen, Anna, Rohde, Angelika, Binder, Nadine, Binder, Harald, Data, the Collaborative Research Center 1597 Small

Small Data Explainer -- The impact of small data methods in everyday life

The emergence of breakthrough artificial intelligence (AI) techniques has led to a renewed focus on how small data settings, i.e., settings with limited information, can benefit from such developments. This includes societal issues such as how best to include under-represented groups in data-driven policy and decision making, or the health benefits of assistive technologies such as wearables. We provide a conceptual overview, in particular contrasting small data with big data, and identify common themes from exemplary case studies and application areas. Potential solutions are described in a more detailed technical overview of current data analysis and modelling techniques, highlighting contributions from different disciplines, such as knowledge-driven modelling from statistics and data-driven modelling from computer science. By linking application settings, conceptual contributions and specific techniques, we highlight what is already feasible and suggest what an agenda for fully leveraging small data might look like.

data mining, large language model, machine learning, (22 more...)

2507.11773

Country:

North America > United States (1.00)
Europe (0.93)

Genre:

Overview (1.00)
Research Report > Experimental Study (0.67)
Research Report > Promising Solution (0.48)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)
Health & Medicine > Therapeutic Area > Oncology (1.00)
(5 more...)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(5 more...)

A Study on the Application of Artificial Intelligence in Ecological Design

Zhao, Hengyue

Can we acknowledge that our relationship with nature has evolved from human dominance to an intimate interconnectedness, recognizing that nature has genuinely attained a form of "personhood," and that artificial intelligence (AI) can facilitate this transforma - tion, serving as a novel medium for human-nature connection? This article begins by examining the critical role of AI at the heart of the urgent ecological transformation currently underway, exploring the paradigm shift emerging from the intersection of AI and non-human life. The discussion progressively narrows its focus to how this innovative AI-nature paradigm manifests specifically within the fields of art and design, highlighting its distinctiveness from traditional artistic and design media. The article seeks to explore how various artists and designers incorporate AI into ecological, microbiological, and geophysical creative practices. Through a comparative analysis of their creative strategies, it elaborates on the relationship between different applications of AI--such as data analysis, image recognition, and ecological restoration--and their unique artistic expressions, while also considering the extended value inherent in AI-driven art and design. However, the precise value of this emergent design paradigm remains subject to ongoing discourse.

artificial intelligence, machine learning, reinforcement learning, (13 more...)

2507.11595

Country:

Asia (0.95)
North America > United States (0.93)

Genre: Research Report (0.50)

Industry:

Law > Environmental Law (1.00)
Information Technology (1.00)
Energy (1.00)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Applied AI (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.68)

Synthetic Tabular Data Generation: A Comparative Survey for Modern Techniques

Challagundla, Raju, Dorodchi, Mohsen, Wang, Pu, Lee, Minwoo

As privacy regulations become more stringent and access to real-world data becomes increasingly constrained, synthetic data generation has emerged as a vital solution, especially for tabular datasets, which are central to domains like finance, healthcare and the social sciences. This survey presents a comprehensive and focused review of recent advances in synthetic tabular data generation, emphasizing methods that preserve complex feature relationships, maintain statistical fidelity, and satisfy privacy requirements. A key contribution of this work is the introduction of a novel taxonomy based on practical generation objectives, including intended downstream applications, privacy guarantees, and data utility, directly informing methodological design and evaluation strategies. Therefore, this review prioritizes the actionable goals that drive synthetic data creation, including conditional generation and risk-sensitive modeling. Additionally, the survey proposes a benchmark framework to align technical innovation with real-world demands. By bridging theoretical foundations with practical deployment, this work serves as both a roadmap for future research and a guide for implementing synthetic tabular data in privacy-critical environments.

data mining, machine learning, natural language, (20 more...)

2507.1159

Country: North America > United States > North Carolina (0.14)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)
Health & Medicine > Health Care Technology > Medical Record (0.46)
Government > Regional Government > North America Government > United States Government (0.45)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications (1.00)
(11 more...)

Targeted Deep Architectures: A TMLE-Based Framework for Robust Causal Inference in Neural Networks

Li, Yi, Mccoy, David, Gunter, Nolan, Lee, Kaitlyn, Schuler, Alejandro, van der Laan, Mark

Modern deep neural networks are powerful predictive tools yet often lack valid inference for causal parameters, such as treatment effects or entire survival curves. While frameworks like Double Machine Learning (DML) and Targeted Maximum Likelihood Estimation (TMLE) can debias machine-learning fits, existing neural implementations either rely on "targeted losses" that do not guarantee solving the efficient influence function equation or computationally expensive post-hoc "fluctuations" for multi-parameter settings. We propose Targeted Deep Architectures (TDA), a new framework that embeds TMLE directly into the network's parameter space with no restrictions on the backbone architecture. Specifically, TDA partitions model parameters - freezing all but a small "targeting" subset - and iteratively updates them along a targeting gradient, derived from projecting the influence functions onto the span of the gradients of the loss with respect to weights. This procedure yields plug-in estimates that remove first-order bias and produce asymptotically valid confidence intervals. Crucially, TDA easily extends to multi-dimensional causal estimands (e.g., entire survival curves) by merging separate targeting gradients into a single universal targeting update. Theoretically, TDA inherits classical TMLE properties, including double robustness and semiparametric efficiency. Empirically, on the benchmark IHDP dataset (average treatment effects) and simulated survival data with informative censoring, TDA reduces bias and improves coverage relative to both standard neural-network estimators and prior post-hoc approaches. In doing so, TDA establishes a direct, scalable pathway toward rigorous causal inference within modern deep architectures for complex multi-parameter targets.

artificial intelligence, machine learning, targ, (19 more...)

2507.12435

Genre: Research Report > Experimental Study (0.68)

Industry:

Health & Medicine (0.46)
Law > Civil Rights & Constitutional Law (0.36)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.54)

Improving Data and Parameter Efficiency of Neural Language Models Using Representation Analysis

Jukić, Josip

This thesis addresses challenges related to data and parameter efficiency in neural language models, with a focus on representation analysis and the introduction of new optimization techniques. The first part examines the properties and dynamics of language representations within neural models, emphasizing their significance in enhancing robustness and generalization. It proposes innovative approaches based on representation smoothness, including regularization strategies that utilize Jacobian and Hessian matrices to stabilize training and mitigate sensitivity to input perturbations. The second part focuses on methods to significantly enhance data and parameter efficiency by integrating active learning strategies with parameter-efficient fine-tuning, guided by insights from representation smoothness analysis. It presents smoothness-informed early-stopping techniques designed to eliminate the need for labeled validation sets and proposes innovative combinations of active learning and parameter-efficient fine-tuning to reduce labeling efforts and computational resources. Extensive experimental evaluations across various NLP tasks demonstrate that these combined approaches substantially outperform traditional methods in terms of performance, stability, and efficiency. The third part explores weak supervision techniques enhanced by in-context learning to effectively utilize unlabeled data, further reducing dependence on extensive labeling. It shows that using in-context learning as a mechanism for weak supervision enables models to better generalize from limited labeled data by leveraging unlabeled examples more effectively during training. Comprehensive empirical evaluations confirm significant gains in model accuracy, adaptability, and robustness, especially in low-resource settings and dynamic data environments.

large language model, machine learning, natural language, (27 more...)

2507.12004

Country:

Europe (1.00)
Asia (1.00)
North America > United States > Minnesota (0.27)

Genre:

Research Report > Promising Solution (1.00)
Research Report > New Finding (1.00)
Overview (1.00)

Industry:

Leisure & Entertainment (1.00)
Media > Film (0.92)
Law (0.67)
Education > Curriculum (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
(6 more...)

Akram, Waseem, Din, Muhayy Ud, Soud, Lyes Saad, Hussain, Irfan

A Review of Generative AI in Aquaculture: Foundations, Applications, and Future Directions for Smart and Sustainable Farming

Generative Artificial Intelligence (GAI) has rapidly emerged as a transformative force in aquaculture, enabling intelligent synthesis of multimodal data, including text, images, audio, and simulation outputs for smarter, more adaptive decision-making. As the aquaculture industry shifts toward data-driven, automation and digital integration operations under the Aquaculture 4.0 paradigm, GAI models offer novel opportunities across environmental monitoring, robotics, disease diagnostics, infrastructure planning, reporting, and market analysis. This review presents the first comprehensive synthesis of GAI applications in aquaculture, encompassing foundational architectures (e.g., diffusion models, transformers, and retrieval augmented generation), experimental systems, pilot deployments, and real-world use cases. We highlight GAI's growing role in enabling underwater perception, digital twin modeling, and autonomous planning for remotely operated vehicle (ROV) missions. We also provide an updated application taxonomy that spans sensing, control, optimization, communication, and regulatory compliance. Beyond technical capabilities, we analyze key limitations, including limited data availability, real-time performance constraints, trust and explainability, environmental costs, and regulatory uncertainty. This review positions GAI not merely as a tool but as a critical enabler of smart, resilient, and environmentally aligned aquaculture systems.

large language model, machine learning, real time system, (17 more...)

2507.11974

Country:

Europe (0.67)
North America > United States (0.45)
Asia > Middle East > UAE (0.28)

Genre:

Overview (1.00)
Instructional Material (0.92)
Research Report > Promising Solution (0.67)
Research Report > New Finding (0.46)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
(8 more...)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
(2 more...)

ClarifAI: Enhancing AI Interpretability and Transparency through Case-Based Reasoning and Ontology-Driven Approach for Improved Decision-Making

Vemula, Srikanth

This study introduces Clarity and Reasoning Interface for Artificial Intelligence (ClarifAI), a novel approach designed to augment the transparency and interpretability of artificial intelligence (AI) in the realm of improved decision making. Leveraging the Case-Based Reasoning (CBR) methodology and integrating an ontology-driven approach, ClarifAI aims to meet the intricate explanatory demands of various stakeholders involved in AI-powered applications. The paper elaborates on ClarifAI's theoretical foundations, combining CBR and ontologies to furnish exhaustive explanation mechanisms. It further elaborates on the design principles and architectural blueprint, highlighting ClarifAI's potential to enhance AI interpretability across different sectors and its applicability in high-stake environments.

artificial intelligence, clarifai, machine learning, (16 more...)

2507.11733

Country: North America > United States (0.68)

Genre:

Overview > Innovation (0.48)
Research Report > Promising Solution (0.34)

Industry:

Law (0.68)
Government (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Case-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Memory-Based Learning (1.00)

FOX NewsJul-16-2025, 01:14:44 GMT

Trump challenges AOC and Jasmine Crockett to intelligence test after calling them 'very low IQ'

Before boarding Marine One on Tuesday afternoon, President Trump challenged two progressive Democrat congresswomen to an intelligence test. President Donald Trump lobbed a signature zinger on Tuesday as he paused to speak with reporters before boarding Marine One en route to an artificial intelligence summit. "[Alexandria Ocasio-Cortez], look, I think she's very nice, but she's very low IQ, and we really don't need low IQ," Trump said, smiling as cameras rolled. He added, "Between her and Crockett, we're going to give them both an IQ test to see who comes out best." TRUMP DARES AOC TO TRY TO IMPEACH HIM: 'MAKE MY DAY' President Donald Trump said AOC and Jasmine Crockett should take IQ tests.

aoc and jasmine crockett, artificial intelligence, crockett, (10 more...)

FOX News

Country: North America > United States (1.00)

Industry:

Law (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology: Information Technology > Artificial Intelligence (0.84)

Ulmer, Dennis, Lorson, Alexandra, Titov, Ivan, Hardmeier, Christian

Anthropomimetic Uncertainty: What Verbalized Uncertainty in Language Models is Missing

arXiv.org Artificial IntelligenceJul-16-2025

Human users increasingly rely on natural language interactions with large language models (LLMs) in order to receive help on a large variety of tasks and problems. However, the trustworthiness and perceived legitimacy of LLMs is undermined by the fact that their output is frequently stated in very confident terms, even when its accuracy is questionable. Therefore, there is a need to signal the confidence of the language model to a user in order to reap the benefits of human-machine collaboration and mitigate potential harms. Verbalized uncertainty is the expression of confidence with linguistic means, an approach that integrates perfectly into language-based interfaces. Nevertheless, most recent research in natural language processing (NLP) overlooks the nuances surrounding human uncertainty communication and the data biases that influence machine uncertainty communication. We argue for anthropomimetic uncertainty, meaning that intuitive and trustworthy uncertainty communication requires a degree of linguistic authenticity and personalization to the user, which could be achieved by emulating human communication. We present a thorough overview over the research in human uncertainty communication, survey ongoing research, and perform additional analyses to demonstrate so-far overlooked biases in verbalized uncertainty. We conclude by pointing out unique factors in human-machine communication of uncertainty and deconstruct anthropomimetic uncertainty into future research directions for NLP.

large language model, machine learning, natural language, (19 more...)

2507.10587

Country:

North America > United States (1.00)
Europe (1.00)
Asia (1.00)

Genre:

Overview (1.00)
Research Report > New Finding (0.46)
Research Report > Experimental Study (0.46)

Industry:

Law (0.67)
Media > News (0.67)
Health & Medicine (0.67)
Education > Educational Setting (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)