harvard
SGD on Neural Networks Learns Functions of Increasing Complexity
Dimitris Kalimeris, Gal Kaplun, Preetum Nakkiran, Benjamin Edelman, Tristan Yang, Boaz Barak, Haofeng Zhang
Neural networks have been extremely successful in modern machine learning, achieving the state-of-the-art inawiderangeofdomains, including image-recognition, speech-recognition, andgame-playing [ 14, 18, 23, 37]. Practitioners often train deep neural networks with hundreds of layers and millions of parameters and manage to find networks with good out-of-sample performance.However, this practical prowess isaccompanied by feeble theoreticalunderstanding.
- North America > United States > Virginia > Arlington County > Arlington (0.04)
- North America > Canada > Quebec > Montreal (0.04)
- Law (0.95)
- Government (0.69)
AIonopedia: an LLM agent orchestrating multimodal learning for ionic liquid discovery
Yin, Yuqi, Fu, Yibo, Wang, Siyuan, Sun, Peng, Wang, Hongyu, Wang, Xiaohui, Zheng, Lei, Li, Zhiyong, Liu, Zhirong, Wang, Jianji, Sun, Zhaoxi
The discovery of novel Ionic Liquids (ILs) is hindered by critical challenges in property prediction, including limited data, poor model accuracy, and fragmented workflows. Leveraging the power of Large Language Models (LLMs), we introduce AIonopedia, to the best of our knowledge, the first LLM agent for IL discovery. Powered by an LLM-augmented multimodal domain foundation model for ILs, AIonopedia enables accurate property predictions and incorporates a hierarchical search architecture for molecular screening and design. Trained and evaluated on a newly curated and comprehensive IL dataset, our model delivers superior performance. Complementing these results, evaluations on literature-reported systems indicate that the agent can perform effective IL modification. Moving beyond offline tests, the practical efficacy was further confirmed through real-world wet-lab validation, in which the agent demonstrated exceptional generalization capabilities on challenging out-of-distribution tasks, underscoring its ability to accelerate real-world IL discovery.
- Materials > Chemicals (1.00)
- Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
- Energy (1.00)
- North America > United States > New York > New York County > New York City (0.04)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
- Law (0.94)
- Government (0.68)
RadGame: An AI-Powered Platform for Radiology Education
Baharoon, Mohammed, Raissi, Siavash, Jun, John S., Heintz, Thibault, Alabbad, Mahmoud, Alburkani, Ali, Kim, Sung Eun, Kleinschmidt, Kent, Alhumaydhi, Abdulrahman O., Alghamdi, Mohannad Mohammed G., Palacio, Jeremy Francis, Bukhaytan, Mohammed, Prudlo, Noah Michael, Akula, Rithvik, Chrisler, Brady, Galligos, Benjamin, Almutairi, Mohammed O., Alanazi, Mazeen Mohammed, Alrashdi, Nasser M., Hwang, Joel Jihwan, Jaliparthi, Sri Sai Dinesh, Nelson, Luke David, Nguyen, Nathaniel, Suryadevara, Sathvik, Kim, Steven, Mohammed, Mohammed F., Semenov, Yevgeniy R., Yu, Kun-Hsing, Aljouie, Abdulrhman, AlOmaish, Hassan, Rodman, Adam, Rajpurkar, Pranav
We introduce RadGame, an AI-powered gam-ified platform for radiology education that targets two core skills: localizing findings and generating reports. Traditional radiology training is based on passive exposure to cases or active practice with real-time input from supervising radiologists, limiting opportunities for immediate and scalable feedback. RadGame addresses this gap by combining gamification with large-scale public datasets and automated, AI-driven feedback that provides clear, structured guidance to human learners. In RadGame Localize, players draw bounding boxes around abnormalities, which are automatically compared to radiologist-drawn annotations from public datasets, and visual explanations are generated by vision-language models for user missed findings. In RadGame Report, players compose findings given a chest X-ray, patient age and indication, and receive structured AI feedback based on radiology report generation metrics, highlighting errors and omissions compared to a radiologist's written ground truth report from public datasets, producing a final performance and style score. In a prospective evaluation, participants using RadGame achieved a 68% improvement in localization accuracy compared to 17% with traditional passive methods and a 31% improvement in report-writing accuracy compared to 4% with traditional methods after seeing the same cases. RadGame highlights the potential of AI-driven gamification to deliver scalable, feedback-rich radiology training and reimagines the application of medical AI resources in education.
- North America > United States > Massachusetts > Suffolk County > Boston (0.05)
- Asia > Middle East > Saudi Arabia > Riyadh Province > Riyadh (0.05)
- Europe > Netherlands > Limburg > Maastricht (0.04)
- (4 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Instructional Material (1.00)
- Health & Medicine > Nuclear Medicine (1.00)
- Health & Medicine > Diagnostic Medicine > Imaging (1.00)
- Information Technology > Artificial Intelligence > Machine Learning (1.00)
- Information Technology > Artificial Intelligence > Natural Language (0.88)
- Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)
- Information Technology > Artificial Intelligence > Applied AI (0.68)
'I have to do it': Why one of the world's most brilliant AI scientists left the US for China
'I have to do it': Why one of the world's most brilliant AI scientists left the US for China In 2020, after spending half his life in the US, Song-Chun Zhu took a one-way ticket to China. By the time Song-Chun Zhu was six years old, he had encountered death more times than he could count. This was the early 1970s, the waning years of the Cultural Revolution, and his father ran a village supply store in rural China . There was little to do beyond till the fields and study Mao Zedong at home, and so the shop became a refuge where people could rest, recharge and share tales. Zhu grew up in that shop, absorbing a lifetime's worth of tragedies: a family friend lost in a car crash, a relative from an untreated illness, stories of suicide or starvation. "That was really tough," Zhu recalled recently. The young Zhu became obsessed with what people left behind after they died. One day, he came across a book that contained his family genealogy. When he asked the bookkeeper why it included his ancestors' dates of birth and death but nothing about their lives, the man told him matter of factly that they were peasants, so there was nothing worth recording. He resolved that his fate would be different. Today, at 56, Zhu is one of the world's leading authorities in artificial intelligence. In 1992, he left China for the US to pursue a PhD in computer science at Harvard. Later, at University of California, Los Angeles (UCLA), he led one of the most prolific AI research centres in the world, won numerous major awards, and attracted prestigious research grants from the Pentagon and the National Science Foundation. He was celebrated for his pioneering research into how machines can spot patterns in data, which helped lay the groundwork for modern AI systems such as ChatGPT and DeepSeek. He and his wife, and their two US-born daughters, lived in a hilltop home on Los Angeles's Mulholland Drive. He thought he would never leave. But in August 2020, after 28 years in the US, Zhu astonished his colleagues and friends by suddenly moving back to China, where he took up professorships at two top Beijing universities and a directorship in a state-sponsored AI institute.
- North America > United States > California > Los Angeles County > Los Angeles (0.74)
- Asia > China > Beijing > Beijing (0.26)
- Pacific Ocean > North Pacific Ocean > South China Sea (0.04)
- (12 more...)
- Leisure & Entertainment > Sports (1.00)
- Law (1.00)
- Government > Regional Government > North America Government > United States Government (1.00)
- (3 more...)
Explainable AI for Accelerated Microstructure Imaging: A SHAP-Guided Protocol on the Connectome 2.0 scanner
Uhl, Quentin, Pavan, Tommaso, Gerold, Julianna, Chan, Kwok-Shing, Jun, Yohan, Fujita, Shohei, Bhatt, Aneri, Ma, Yixin, Wang, Qiaochu, Lee, Hong-Hsi, Huang, Susie Y., Bilgic, Berkin, Jelescu, Ileana
The diffusion MRI Neurite Exchange Imaging model offers a promising framework for probing gray matter microstructure by estimating parameters such as compartment sizes, diffusivities, and inter-compartmental water exchange time. However, existing protocols require long scan times. This study proposes a reduced acquisition scheme for the Connectome 2.0 scanner that preserves model accuracy while substantially shortening scan duration. We developed a data-driven framework using explainable artificial intelligence with a guided recursive feature elimination strategy to identify an optimal 8-feature subset from a 15-feature protocol. The performance of this optimized protocol was validated in vivo and benchmarked against the full acquisition and alternative reduction strategies. Parameter accuracy, preservation of anatomical contrast, and test-retest reproducibility were assessed. The reduced protocol yielded parameter estimates and cortical maps comparable to the full protocol, with low estimation errors in synthetic data and minimal impact on test-retest variability. Compared to theory-driven and heuristic reduction schemes, the optimized protocol demonstrated superior robustness, reducing the deviation in water exchange time estimates by over two-fold. In conclusion, this hybrid optimization framework enables viable imaging of neurite exchange in 14 minutes without loss of parameter fidelity. This approach supports the broader application of exchange-sensitive diffusion magnetic resonance imaging in neuroscience and clinical research, and offers a generalizable method for designing efficient acquisition protocols in biophysical parameter mapping.
- North America > United States > California > San Francisco County > San Francisco (0.14)
- Europe > Switzerland > Vaud > Lausanne (0.05)
- North America > United States > Massachusetts > Suffolk County > Boston (0.04)
- (4 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Health & Medicine > Therapeutic Area > Neurology (1.00)
- Health & Medicine > Diagnostic Medicine > Imaging (1.00)
A Novel Evaluation Benchmark for Medical LLMs: Illuminating Safety and Effectiveness in Clinical Domains
Wang, Shirui, Tang, Zhihui, Yang, Huaxia, Gong, Qiuhong, Gu, Tiantian, Ma, Hongyang, Wang, Yongxin, Sun, Wubin, Lian, Zeliang, Mao, Kehang, Jiang, Yinan, Huang, Zhicheng, Ma, Lingyun, Shen, Wenjie, Ji, Yajie, Tan, Yunhui, Wang, Chunbo, Gao, Yunlu, Ye, Qianling, Lin, Rui, Chen, Mingyu, Niu, Lijuan, Wang, Zhihao, Yu, Peng, Lang, Mengran, Liu, Yue, Zhang, Huimin, Shen, Haitao, Chen, Long, Zhao, Qiguang, Liu, Si-Xuan, Zhou, Lina, Gao, Hua, Ye, Dongqiang, Meng, Lingmin, Yu, Youtao, Liang, Naixin, Wu, Jianxiong
Large language models (LLMs) hold promise in clinical decision support but face major challenges in safety evaluation and effectiveness validation. We developed the Clinical Safety-Effectiveness Dual-Track Benchmark (CSEDB), a multidimensional framework built on clinical expert consensus, encompassing 30 criteria covering critical areas like critical illness recognition, guideline adherence, and medication safety, with weighted consequence measures. Thirty-two specialist physicians developed and reviewed 2,069 open-ended Q&A items aligned with these criteria, spanning 26 clinical departments to simulate real-world scenarios. Benchmark testing of six LLMs revealed moderate overall performance (average total score 57.2%, safety 54.7%, effectiveness 62.3%), with a significant 13.3% performance drop in high-risk scenarios (p < 0.0001). Domain-specific medical LLMs showed consistent performance advantages over general-purpose models, with relatively higher top scores in safety (0.912) and effectiveness (0.861). The findings of this study not only provide a standardized metric for evaluating the clinical application of medical LLMs, facilitating comparative analyses, risk exposure identification, and improvement directions across different scenarios, but also hold the potential to promote safer and more effective deployment of large language models in healthcare environments.