AITopics | Yang, Sen

Yang, Sen

Exploring Runtime Decision Support for Trauma Resuscitation

Li, Keyi, Yang, Sen, Sullivan, Travis M., Burd, Randall S., Marsic, Ivan

arXiv.org Artificial Intelligence, Jul-6-2022

AI-based recommender systems have been successfully applied in many domains (e.g., e-commerce, feed ranking). Medical experts believe that incorporating such methods into a clinical decision support system may help reduce medical team errors and improve patient outcomes during treatment processes (e.g., trauma resuscitation, surgical processes). Limited research, however, has been done to develop automatic data-driven treatment decision support. We explored the feasibility of building a treatment recommender system to provide runtime next-minute activity predictions. The system uses patient context (e.g., demographics and vital signs) and process context (e.g., activities) to continuously predict activities that will be performed in the next minute. We evaluated our system on a pre-recorded dataset of trauma resuscitation and conducted an ablation study on different model variants. The best model achieved an average F1-score of 0.67 for 61 activity types. We include medical team feedback and discuss future work.
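The abstract reports an average F1-score over activity types but does not say how the average is taken. As a minimal sketch, assuming a macro average (one F1 per activity type, then an unweighted mean) over minute-level multi-label predictions, the metric could be computed as follows. The function name and the set-per-minute input format are illustrative assumptions, not the authors' evaluation code.

```python
from collections import defaultdict


def macro_f1(true_sets, pred_sets):
    """Macro-averaged F1 over activity types.

    true_sets / pred_sets: one set of activity labels per minute
    (assumed input format, chosen for illustration).
    """
    tp = defaultdict(int)  # activity predicted and actually performed
    fp = defaultdict(int)  # activity predicted but not performed
    fn = defaultdict(int)  # activity performed but not predicted

    for truth, pred in zip(true_sets, pred_sets):
        for activity in pred & truth:
            tp[activity] += 1
        for activity in pred - truth:
            fp[activity] += 1
        for activity in truth - pred:
            fn[activity] += 1

    activities = set(tp) | set(fp) | set(fn)
    if not activities:
        return 0.0

    scores = []
    for activity in sorted(activities):
        denom = 2 * tp[activity] + fp[activity] + fn[activity]
        scores.append(2 * tp[activity] / denom if denom else 0.0)
    return sum(scores) / len(scores)
```

A macro average weights rare and frequent activities equally; a micro or frequency-weighted average would give a different number on the same predictions.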

artificial intelligence, machine learning, prediction, (17 more...)

arXiv.org Artificial Intelligence

2207.02922

Genre: Research Report (0.82)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Health Care Providers & Services (0.88)
Health & Medicine > Diagnostic Medicine > Vital Signs (0.37)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

OpenKBP-Opt: An international and reproducible evaluation of 76 knowledge-based planning pipelines

Babier, Aaron, Mahmood, Rafid, Zhang, Binghao, Alves, Victor G. L., Barragán-Montero, Ana Maria, Beaudry, Joel, Cardenas, Carlos E., Chang, Yankui, Chen, Zijie, Chun, Jaehee, Diaz, Kelly, Eraso, Harold David, Faustmann, Erik, Gaj, Sibaji, Gay, Skylar, Gronberg, Mary, Guo, Bingqi, He, Junjun, Heilemann, Gerd, Hira, Sanchit, Huang, Yuliang, Ji, Fuxin, Jiang, Dashan, Giraldo, Jean Carlo Jimenez, Lee, Hoyeon, Lian, Jun, Liu, Shuolin, Liu, Keng-Chi, Marrugo, José, Miki, Kentaro, Nakamura, Kunio, Netherton, Tucker, Nguyen, Dan, Nourzadeh, Hamidreza, Osman, Alexander F. I., Peng, Zhao, Muñoz, José Darío Quinto, Ramsl, Christian, Rhee, Dong Joo, Rodriguez, Juan David, Shan, Hongming, Siebers, Jeffrey V., Soomro, Mumtaz H., Sun, Kay, Hoyos, Andrés Usuga, Valderrama, Carlos, Verbeek, Rob, Wang, Enpei, Willems, Siri, Wu, Qi, Xu, Xuanang, Yang, Sen, Yuan, Lulin, Zhu, Simeng, Zimmermann, Lukas, Moore, Kevin L., Purdie, Thomas G., McNiven, Andrea L., Chan, Timothy C. Y.

arXiv.org Artificial Intelligence, Feb-16-2022

We establish an open framework for developing plan optimization models for knowledge-based planning (KBP) in radiotherapy. Our framework includes reference plans for 100 patients with head-and-neck cancer and high-quality dose predictions from 19 KBP models that were developed by different research groups during the OpenKBP Grand Challenge. The dose predictions were input to four optimization models to form 76 unique KBP pipelines that generated 7600 plans. The predictions and plans were compared to the reference plans via: dose score, which is the average mean absolute voxel-by-voxel difference in dose a model achieved; the deviation in dose-volume histogram (DVH) criterion; and the frequency of clinical planning criteria satisfaction. We also performed a theoretical investigation to justify our dose mimicking models. The range in rank order correlation of the dose score between predictions and their KBP pipelines was 0.50 to 0.62, which indicates that the quality of the predictions is generally positively correlated with the quality of the plans. Additionally, compared to the input predictions, the KBP-generated plans performed significantly better (P<0.05; one-sided Wilcoxon test) on 18 of 23 DVH criteria. Similarly, each optimization model generated plans that satisfied a higher percentage of criteria than the reference plans. Lastly, our theoretical investigation demonstrated that the dose mimicking models generated plans that are also optimal for a conventional planning model. This was the largest international effort to date for evaluating the combination of KBP prediction and optimization models. In the interest of reproducibility, our data and code are freely available at https://github.com/ababier/open-kbp-opt.
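The dose score described above (the mean absolute voxel-by-voxel dose difference, averaged over patients) can be illustrated with a short sketch. It assumes each patient's dose is given as a flat list of voxel values and that every voxel counts equally; any masking of voxels in the actual evaluation is not described in the abstract and is not modeled here. The function name is hypothetical and this is not code from the linked repository.

```python
def dose_score(reference_doses, candidate_doses):
    """Average over patients of the mean absolute voxel-wise dose difference.

    reference_doses / candidate_doses: one flat list of voxel doses per
    patient (assumed format), with matching lengths per patient.
    """
    if not reference_doses or len(reference_doses) != len(candidate_doses):
        raise ValueError("need the same, non-zero number of patients")

    per_patient = []
    for ref, cand in zip(reference_doses, candidate_doses):
        if not ref or len(ref) != len(cand):
            raise ValueError("voxel counts must match and be non-zero")
        mean_abs_diff = sum(abs(r - c) for r, c in zip(ref, cand)) / len(ref)
        per_patient.append(mean_abs_diff)

    return sum(per_patient) / len(per_patient)
```

A lower score means the prediction or plan is closer to the reference dose.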

artificial intelligence, knowledge-based planning pipeline, oncology, (3 more...)

arXiv.org Artificial Intelligence

2202.08303

Genre: Research Report > New Finding (0.53)

Industry: Health & Medicine > Therapeutic Area > Oncology (0.73)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (0.60)

SpeechNAS: Towards Better Trade-off between Latency and Accuracy for Large-Scale Speaker Verification

Zhu, Wentao, Kong, Tianlong, Lu, Shun, Li, Jixiang, Zhang, Dawei, Deng, Feng, Wang, Xiaorui, Yang, Sen, Liu, Ji

arXiv.org Artificial Intelligence, Sep-18-2021

Recently, the x-vector has been a successful and popular approach for speaker verification. It employs a time delay neural network (TDNN) and statistics pooling to extract speaker-characterizing embeddings from variable-length utterances. Improving upon the x-vector has been an active research area, and numerous neural networks have been elaborately designed based on it, e.g., extended TDNN (E-TDNN), factorized TDNN (F-TDNN), and densely connected TDNN (D-TDNN). In this work, we try to identify the optimal architectures from a TDNN-based search space by employing neural architecture search (NAS), in an approach named SpeechNAS. Leveraging recent advances in speaker recognition, such as high-order statistics pooling, the multi-branch mechanism, D-TDNN, and angular additive margin softmax (AAM) loss with a minimum hyper-spherical energy (MHE), SpeechNAS automatically discovers five network architectures, from SpeechNAS-1 to SpeechNAS-5, with various numbers of parameters and GFLOPs on the large-scale text-independent speaker recognition dataset VoxCeleb1. Our derived best neural network achieves an equal error rate (EER) of 1.02% on the standard test set of VoxCeleb1, which surpasses previous TDNN-based state-of-the-art approaches by a large margin. Code and trained weights are available at https://github.com/wentaozhu/speechnas.git
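The equal error rate quoted above is the operating point at which the false-accept rate and the false-reject rate coincide. The sketch below is a simple threshold sweep under the assumption that higher scores mean "same speaker"; it returns the midpoint of the two rates at the threshold where they are closest. It is an illustration of the metric, not the evaluation script from the linked repository, and the function name is hypothetical.

```python
def equal_error_rate(scores, labels):
    """Approximate EER by sweeping every observed score as a threshold.

    scores: similarity scores, higher = more likely the same speaker.
    labels: 1 for target (same-speaker) trials, 0 for impostor trials.
    """
    targets = [s for s, l in zip(scores, labels) if l == 1]
    impostors = [s for s, l in zip(scores, labels) if l == 0]
    if not targets or not impostors:
        raise ValueError("need at least one target and one impostor trial")

    best_gap = None
    eer = None
    for threshold in sorted(set(scores)):
        # Impostor trials accepted at this threshold.
        far = sum(s >= threshold for s in impostors) / len(impostors)
        # Target trials rejected at this threshold.
        frr = sum(s < threshold for s in targets) / len(targets)
        gap = abs(far - frr)
        if best_gap is None or gap < best_gap:
            best_gap = gap
            eer = (far + frr) / 2
    return eer
```

With few trials the two rates may never be exactly equal, which is why the sketch reports the midpoint at the closest threshold; production toolkits typically interpolate along the ROC curve instead.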

deep learning, neural network, speaker verification, (18 more...)

arXiv.org Artificial Intelligence

2109.08839

Country: Asia (0.14)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Speech (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Learning with Non-Convex Truncated Losses by SGD

Xu, Yi, Zhu, Shenghuo, Yang, Sen, Zhang, Chi, Jin, Rong, Yang, Tianbao

arXiv.org Machine Learning, May-20-2018

Learning with a convex loss function has been a dominating paradigm for many years. It remains an interesting question how non-convex loss functions help improve the generalization of learning with broad applicability. In this paper, we study a family of objective functions formed by truncating traditional loss functions, which is applicable to both shallow learning and deep learning. Truncating loss functions has the potential to be less vulnerable and more robust to large noise in observations that could be adversarial. More importantly, it is a generic technique that does not assume knowledge of the noise distribution. To justify non-convex learning with truncated losses, we establish excess risk bounds of empirical risk minimization based on truncated losses for heavy-tailed output, and the statistical error of an approximate stationary point found by the stochastic gradient descent (SGD) method. Our experiments for shallow and deep learning for regression with outliers, corrupted data, and heavy-tailed noise further justify the proposed method.
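To make the idea of truncation concrete, the sketch below caps a squared loss at a threshold `tau`. This hard cap is only one simple way to truncate a loss and is an assumption made for illustration; the abstract does not specify the truncation functions the paper studies, and the names here are hypothetical.

```python
def truncated_squared_loss(prediction, target, tau):
    """Squared error capped at tau, so a single outlier costs at most tau."""
    residual = prediction - target
    return min(residual * residual, tau)


def truncated_squared_grad(prediction, target, tau):
    """Derivative of the capped loss with respect to the prediction.

    Inside the cap it equals the usual squared-loss gradient; once the
    squared residual reaches tau the loss is flat, so the gradient is zero
    and the sample stops influencing an SGD step.
    """
    residual = prediction - target
    if residual * residual < tau:
        return 2.0 * residual
    return 0.0
```

The flat region is what makes the objective non-convex, and it is also the source of robustness: samples with very large residuals contribute a bounded loss and no gradient.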

deep learning, neural network, truncated loss, (20 more...)

arXiv.org Machine Learning

1805.0788

Country: North America > United States > Iowa > Johnson County > Iowa City (0.14)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Mathematical & Statistical Methods (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.54)

Fused Multiple Graphical Lasso

Yang, Sen, Lu, Zhaosong, Shen, Xiaotong, Wonka, Peter, Ye, Jieping

arXiv.org Machine Learning, Dec-30-2013

In this paper, we consider the problem of estimating multiple graphical models simultaneously using the fused lasso penalty, which encourages adjacent graphs to share similar structures. A motivating example is the analysis of brain networks of Alzheimer's disease using neuroimaging data. Specifically, we may wish to estimate a brain network for the normal controls (NC), a brain network for the patients with mild cognitive impairment (MCI), and a brain network for Alzheimer's patients (AD). We expect the two brain networks for NC and MCI to share common structures but not to be identical to each other; similarly for the two brain networks for MCI and AD. The proposed formulation can be solved using a second-order method. Our key technical contribution is to establish the necessary and sufficient condition for the graphs to be decomposable. Based on this key property, a simple screening rule is presented, which decomposes the large graphs into small subgraphs and allows an efficient estimation of multiple independent (small) subgraphs, dramatically reducing the computational cost. We perform experiments on both synthetic and real data; our results demonstrate the effectiveness and efficiency of the proposed approach.
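The abstract does not state the objective, but a fused-lasso-penalized estimator for K ordered graphical models is commonly written as below. The symbols are assumptions introduced for illustration: Θ^(k) is the precision matrix of group k (for example NC, MCI, AD), S^(k) is the corresponding sample covariance, λ1 controls sparsity, and λ2 controls how strongly adjacent graphs are encouraged to agree.

```latex
\min_{\Theta^{(1)},\dots,\Theta^{(K)} \succ 0}\;
\sum_{k=1}^{K} \Bigl( -\log\det \Theta^{(k)}
  + \operatorname{tr}\bigl(S^{(k)} \Theta^{(k)}\bigr) \Bigr)
+ \lambda_1 \sum_{k=1}^{K} \sum_{i \neq j} \bigl|\Theta^{(k)}_{ij}\bigr|
+ \lambda_2 \sum_{k=1}^{K-1} \sum_{i \neq j}
  \bigl|\Theta^{(k)}_{ij} - \Theta^{(k+1)}_{ij}\bigr|
```

The first sum is the Gaussian negative log-likelihood for each group, the second is the usual lasso penalty on off-diagonal entries, and the third is the fused term that couples each graph to its neighbor in the ordering.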

neurology, optimization problem, screening rule, (19 more...)

arXiv.org Machine Learning

1209.2139

Country: North America > United States > Minnesota > Hennepin County > Minneapolis (0.28)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine > Therapeutic Area > Neurology > Alzheimer's Disease (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Multi-task Vector Field Learning

Lin, Binbin, Yang, Sen, Zhang, Chiyuan, Ye, Jieping, He, Xiaofei

Neural Information Processing Systems, Dec-31-2012

Multi-task learning (MTL) aims to improve generalization performance by learning multiple related tasks simultaneously and identifying the shared information among tasks. Most existing MTL methods focus on learning linear models under the supervised setting. We propose a novel semi-supervised and nonlinear approach for MTL using vector fields. A vector field is a smooth mapping from the manifold to the tangent spaces, which can be viewed as a directional derivative of functions on the manifold. We argue that vector fields provide a natural way to exploit the geometric structure of data as well as the shared differential structure of tasks, both of which are crucial for semi-supervised multi-task learning. In this paper, we develop multi-task vector field learning (MTVFL), which learns the prediction functions and the vector fields simultaneously. MTVFL has the following key properties: (1) the learned vector fields are close to the gradient fields of the prediction functions; (2) within each task, the vector field is required to be as parallel as possible, and is therefore expected to span a low-dimensional subspace; (3) the vector fields from all tasks share a low-dimensional subspace. We formalize our idea in a regularization framework and also provide a convex relaxation method to solve the original non-convex problem. The experimental results on synthetic and real data demonstrate the effectiveness of our proposed approach.

artificial intelligence, machine learning, vector field, (15 more...)

Neural Information Processing Systems

Country:

Europe (0.29)
North America > United States > Arizona (0.14)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)
