Zeeland
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- North America > United States > California > San Diego County > San Diego (0.04)
- Europe > Netherlands > Zeeland (0.04)
- (2 more...)
- Energy (0.46)
- Government > Regional Government (0.46)
Identifying environmental factors associated with tetrodotoxin contamination in bivalve mollusks using eXplainable AI
Schoppema, M. C., van der Velden, B. H. M., Hürriyetoğlu, A., Klijnstra, M. D., Faassen, E. J., Gerssen, A., van der Fels-Klerx, H. J.
Since 2012, tetrodotoxin (TTX) has been found in seafoods such as bivalve mollusks in temperate European waters. TTX contamination leads to food safety risks and economic losses, making early prediction of TTX contamination vital to the food industry and competent authorities. Recent studies have pointed to shallow habitats and water temperature as main drivers to TTX contamination in bivalve mollusks. However, the temporal relationships between abiotic factors, biotic factors, and TTX contamination remain unexplored. We have developed an explainable, deep learning-based model to predict TTX contamination in the Dutch Zeeland estuary. Inputs for the model were meteorological and hydrological features; output was the presence or absence of TTX contamination. Results showed that the time of sunrise, time of sunset, global radiation, water temperature, and chloride concentration contributed most to TTX contamination. Thus, the effective number of sun hours, represented by day length and global radiation, was an important driver for tetrodotoxin contamination in bivalve mollusks. To conclude, our explainable deep learning model identified the aforementioned environmental factors (number of sun hours, global radiation, water temperature, and water chloride concentration) to be associated with tetrodotoxin contamination in bivalve mollusks; making our approach a valuable tool to mitigate marine toxin risks for food industry and competent authorities.
- Europe > Netherlands > Zeeland (0.25)
- Atlantic Ocean > Mediterranean Sea > Adriatic Sea (0.04)
- Europe > United Kingdom > England (0.04)
- (7 more...)
XtraGPT: Context-Aware and Controllable Academic Paper Revision
Chen, Nuo, HuiKai, Andre Lin, Wu, Jiaying, Hou, Junyi, Zhang, Zining, Wang, Qian, Wang, Xidong, He, Bingsheng
Despite the growing adoption of large language models (LLMs) in academic workflows, their capabilities remain limited to support high-quality scientific writing. Most existing systems are designed for general-purpose scientific text generation and fail to meet the sophisticated demands of research communication beyond surface-level polishing, such as conceptual coherence across sections. Furthermore, academic writing is inherently iterative and revision-driven, a process not well supported by direct prompting-based paradigms. To address these scenarios, we propose a human-AI collaboration framework for academic paper revision centered on criteria-guided intent alignment and context-aware modeling. To validate the framework, we curate a dataset of 7,000 research papers from top-tier venues annotated with 140,000 instruction-response pairs that reflect realistic, section-level scientific revisions. We instantiate the framework in XtraGPT, the first suite of open-source LLMs (1.5B to 14B parameters) for context-aware, instruction-guided writing assistance. Extensive experiments validate that XtraGPT significantly outperforms same-scale baselines and approaches the quality of proprietary systems. Both automated preference assessments and human evaluations confirm the effectiveness of XtraGPT in improving scientific drafts.
- North America > United States > Florida > Miami-Dade County > Miami (0.04)
- North America > United States > Illinois > Cook County > Chicago (0.04)
- Asia > Singapore (0.04)
- (9 more...)
- Research Report > New Finding (1.00)
- Summary/Review (0.88)
ARM: Adaptive Reasoning Model
Wu, Siye, Xie, Jian, Zhang, Yikai, Chen, Aili, Zhang, Kai, Su, Yu, Xiao, Yanghua
While large reasoning models demonstrate strong performance on complex tasks, they lack the ability to adjust reasoning token usage based on task difficulty. This often leads to the "overthinking" problem -- excessive and unnecessary reasoning -- which, although potentially mitigated by human intervention to control the token budget, still fundamentally contradicts the goal of achieving fully autonomous AI. In this work, we propose Adaptive Reasoning Model (ARM), a reasoning model capable of adaptively selecting appropriate reasoning formats based on the task at hand. These formats include three efficient ones -- Direct Answer, Short CoT, and Code -- as well as a more elaborate format, Long CoT. To train ARM, we introduce Ada-GRPO, an adaptation of Group Relative Policy Optimization (GRPO), which addresses the format collapse issue in traditional GRPO. Ada-GRPO enables ARM to achieve high token efficiency, reducing tokens by an average of 30%, and up to 70%, while maintaining performance comparable to the model that relies solely on Long CoT. Furthermore, not only does it improve inference efficiency through reduced token generation, but it also brings a 2x speedup in training. In addition to the default Adaptive Mode, ARM supports two additional reasoning modes: 1) Instruction-Guided Mode, which allows users to explicitly specify the reasoning format via special tokens -- ideal when the appropriate format is known for a batch of tasks. 2) Consensus-Guided Mode, which aggregates the outputs of the three efficient formats and resorts to Long CoT in case of disagreement, prioritizing performance with higher token usage.
- Asia > China > Shanghai > Shanghai (0.04)
- North America > United States > Ohio (0.04)
- Europe > Netherlands > Zeeland (0.04)
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
- North America > United States > California > San Diego County > San Diego (0.04)
- Europe > Netherlands > Zeeland (0.04)
- (2 more...)
- Energy (0.46)
- Government > Regional Government (0.45)
Towards deployment-centric multimodal AI beyond vision and language
Liu, Xianyuan, Zhang, Jiayang, Zhou, Shuo, van der Plas, Thijs L., Vijayaraghavan, Avish, Grishina, Anastasiia, Zhuang, Mengdie, Schofield, Daniel, Tomlinson, Christopher, Wang, Yuhan, Li, Ruizhe, van Zeeland, Louisa, Tabakhi, Sina, Demeocq, Cyndie, Li, Xiang, Das, Arunav, Timmerman, Orlando, Baldwin-McDonald, Thomas, Wu, Jinge, Bai, Peizhen, Sahili, Zahraa Al, Alwazzan, Omnia, Do, Thao N., Suvon, Mohammod N. I., Wang, Angeline, Cipolina-Kun, Lucia, Moretti, Luigi A., Farndale, Lucas, Jain, Nitisha, Efremova, Natalia, Ge, Yan, Varela, Marta, Lam, Hak-Keung, Celiktutan, Oya, Evans, Ben R., Coca-Castro, Alejandro, Wu, Honghan, Abdallah, Zahraa S., Chen, Chen, Danchev, Valentin, Tkachenko, Nataliya, Lu, Lei, Zhu, Tingting, Slabaugh, Gregory G., Moore, Roger K., Cheung, William K., Charlton, Peter H., Lu, Haiping
Multimodal artificial intelligence (AI) integrates diverse types of data via machine learning to improve understanding, prediction, and decision-making across disciplines such as healthcare, science, and engineering. However, most multimodal AI advances focus on models for vision and language data, while their deployability remains a key challenge. We advocate a deployment-centric workflow that incorporates deployment constraints early to reduce the likelihood of undeployable solutions, complementing data-centric and model-centric approaches. We also emphasise deeper integration across multiple levels of multimodality and multidisciplinary collaboration to significantly broaden the research scope beyond vision and language. To facilitate this approach, we identify common multimodal-AI-specific challenges shared across disciplines and examine three real-world use cases: pandemic response, self-driving car design, and climate change adaptation, drawing expertise from healthcare, social science, engineering, science, sustainability, and finance. By fostering multidisciplinary dialogue and open research practices, our community can accelerate deployment-centric development for broad societal impact.
- North America > United States (0.46)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.28)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
- (11 more...)
- Research Report > Experimental Study (1.00)
- Workflow (0.89)
- Research Report > Strength High (0.68)
- Transportation (1.00)
- Information Technology > Services (1.00)
- Information Technology > Security & Privacy (1.00)
- (8 more...)
- Information Technology > Data Science > Data Mining (1.00)
- Information Technology > Communications > Social Media (1.00)
- Information Technology > Artificial Intelligence > Vision (1.00)
- (6 more...)
TGLF-SINN: Deep Learning Surrogate Model for Accelerating Turbulent Transport Modeling in Fusion
Cao, Yadi, Zhang, Futian, Liu, Wesley, Neiser, Tom, Meneghini, Orso, Fuller, Lawson, Smith, Sterling, Nazikian, Raffi, Sammuli, Brian, Yu, Rose
The Trapped Gyro-Landau Fluid (TGLF) model provides fast, accurate predictions of turbulent transport in tokamaks, but whole device simulations requiring thousands of evaluations remain computationally expensive. Neural network (NN) surrogates offer accelerated inference with fully differentiable approximations that enable gradient-based coupling but typically require large training datasets to capture transport flux variations across plasma conditions, creating significant training burden and limiting applicability to expensive gyrokinetic simulations. We propose \textbf{TGLF-SINN (Spectra-Informed Neural Network)} with three key innovations: (1) principled feature engineering that reduces target prediction range, simplifying the learning task; (2) physics-guided regularization of transport spectra to improve generalization under sparse data; and (3) Bayesian Active Learning (BAL) to strategically select training samples based on model uncertainty, reducing data requirements while maintaining accuracy. Our approach achieves superior performance with significantly less training data. In offline settings, TGLF-SINN reduces logarithmic root mean squared error (LRMSE) by 12. 4\% compared to the current baseline \base. Using only 25\% of the complete dataset with BAL, we achieve LRMSE only 0.0165 higher than \base~and 0.0248 higher than our offline model (0.0583). In downstream flux matching applications, our NN surrogate provides 45x speedup over TGLF while maintaining comparable accuracy, demonstrating potential for training efficient surrogates for higher-fidelity models where data acquisition is costly and sparse.
- North America > United States > California > San Diego County > San Diego (0.04)
- North America > United States > California > San Diego County > La Jolla (0.04)
- Europe > Netherlands > Zeeland (0.04)
- Africa > South Sudan > Equatoria > Central Equatoria > Juba (0.04)
- Energy (1.00)
- Government > Regional Government > North America Government > United States Government (0.68)
Population-Scale Network Embeddings Expose Educational Divides in Network Structure Related to Right-Wing Populist Voting
Lüken, Malte, Garcia-Bernardo, Javier, Deb, Sreeparna, Hafner, Flavio, Khosla, Megha
Administrative registry data can be used to construct population-scale networks whose ties reflect shared social contexts between persons. With machine learning, such networks can be encoded into numerical representations -- embeddings -- that automatically capture individuals' position within the network. We created embeddings for all persons in the Dutch population from a population-scale network that represents five shared contexts: neighborhood, work, family, household, and school. To assess the informativeness of these embeddings, we used them to predict right-wing populist voting. Embeddings alone predicted right-wing populist voting above chance-level but performed worse than individual characteristics. Combining the best subset of embeddings with individual characteristics only slightly improved predictions. After transforming the embeddings to make their dimensions more sparse and orthogonal, we found that one embedding dimension was strongly associated with the outcome. Mapping this dimension back to the population network revealed differences in network structure related to right-wing populist voting between different school ties and achieved education levels. Our study contributes methodologically by demonstrating how population-scale network embeddings can be made interpretable, and substantively by linking structural network differences in education to right-wing populist voting.
- North America > United States > Illinois > Cook County > Chicago (0.04)
- Europe > Netherlands > South Holland > Delft (0.04)
- Europe > Netherlands > South Holland > Rotterdam (0.04)
- (7 more...)
- Government > Voting & Elections (0.93)
- Information Technology (0.68)
- Education > Educational Setting (0.68)
Multi-Timescale Dynamics Model Bayesian Optimization for Plasma Stabilization in Tokamaks
Sonker, Rohit, Capone, Alexandre, Rothstein, Andrew, Kaga, Hiro Josep Farre, Kolemen, Egemen, Schneider, Jeff
Machine learning algorithms often struggle to control complex real-world systems. In the case of nuclear fusion, these challenges are exacerbated, as the dynamics are notoriously complex, data is poor, hardware is subject to failures, and experiments often affect dynamics beyond the experiment's duration. Existing tools like reinforcement learning, supervised learning, and Bayesian optimization address some of these challenges but fail to provide a comprehensive solution. To overcome these limitations, we present a multi-scale Bayesian optimization approach that integrates a high-frequency data-driven dynamics model with a low-frequency Gaussian process. By updating the Gaussian process between experiments, the method rapidly adapts to new data, refining the predictions of the less reliable dynamical model. We validate our approach by controlling tearing instabilities in the DIII-D nuclear fusion plant. Offline testing on historical data shows that our method significantly outperforms several baselines. Results on live experiments on the DIII-D tokamak, conducted under high-performance plasma scenarios prone to instabilities, shows a 50% success rate, marking a 117% improvement over historical outcomes.
- North America > United States > New Jersey > Mercer County > Princeton (0.04)
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- North America > Canada (0.04)
- (2 more...)
- Government > Regional Government > North America Government > United States Government (0.47)
- Energy > Power Industry (0.46)
Text-to-Image Generation for Vocabulary Learning Using the Keyword Method
Attygalle, Nuwan T., Kljun, Matjaž, Quigley, Aaron, Pucihar, Klen čOpič, Grubert, Jens, Biener, Verena, Leiva, Luis A., Yoneyama, Juri, Toniolo, Alice, Miguel, Angela, Kato, Hirokazu, Weerasinghe, Maheshya
The 'keyword method' is an effective technique for learning vocabulary of a foreign language. It involves creating a memorable visual link between what a word means and what its pronunciation in a foreign language sounds like in the learner's native language. However, these memorable visual links remain implicit in the people's mind and are not easy to remember for a large set of words. To enhance the memorisation and recall of the vocabulary, we developed an application that combines the keyword method with text-to-image generators to externalise the memorable visual links into visuals. These visuals represent additional stimuli during the memorisation process. To explore the effectiveness of this approach we first run a pilot study to investigate how difficult it is to externalise the descriptions of mental visualisations of memorable links, by asking participants to write them down. We used these descriptions as prompts for text-to-image generator (DALL-E2) to convert them into images and asked participants to select their favourites. Next, we compared different text-to-image generators (DALL-E2, Midjourney, Stable and Latent Diffusion) to evaluate the perceived quality of the generated images by each. Despite heterogeneous results, participants mostly preferred images generated by DALL-E2, which was used also for the final study. In this study, we investigated whether providing such images enhances the retention of vocabulary being learned, compared to the keyword method only. Our results indicate that people did not encounter difficulties describing their visualisations of memorable links and that providing corresponding images significantly improves memory retention.
- North America > United States > New York > New York County > New York City (0.14)
- North America > United States > California > Los Angeles County > Los Angeles (0.14)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
- (24 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Questionnaire & Opinion Survey (1.00)
- Education (1.00)
- Health & Medicine > Consumer Health (0.87)