otus
Human Limits in Machine Learning: Prediction of Plant Phenotypes Using Soil Microbiome Data
Aghdam, Rosa, Tang, Xudong, Shan, Shan, Lankau, Richard, Solís-Lemus, Claudia
The preservation of soil health has been identified as one of the main challenges of the XXI century given its vast (and potentially threatening) ramifications in agriculture, human health and biodiversity. Here, we provide the first deep investigation of the predictive potential of machine-learning models to understand the connections between soil and biological phenotypes. Indeed, we investigate an integrative framework performing accurate machine-learning-based prediction of plant phenotypes from biological, chemical and physical properties of the soil via two models: random forest and Bayesian neural network. We show that prediction is improved, as evidenced by higher weighted F1 scores, when incorporating into the models environmental features like soil physicochemical properties and microbial population density in addition to the microbiome information. Furthermore, by exploring multiple data preprocessing strategies such as normalization, zero replacement, and data augmentation, we confirm that human decisions have a huge impact on the predictive performance. In particular, we show that the naive total sum scaling normalization that is commonly used in microbiome research is not the optimal strategy to maximize predictive power. In addition, we find that accurately defined labels are more important than normalization, taxonomic level or model characteristics. That is, if humans are unable to classify the samples and provide accurate labels, the performance of machine-learning models will be limited. Lastly, we present strategies for domain scientists via a full model selection decision tree to identify the human choices that maximize the prediction power of the models. Our work is accompanied by open source reproducible scripts (https://github.com/solislemuslab/soil-microbiome-nn) for maximum outreach among the microbiome research community.
- North America > United States > Wisconsin > Dane County > Madison (0.28)
- North America > United States > Minnesota (0.04)
- Health & Medicine (1.00)
- Food & Agriculture > Agriculture (1.00)
Kneron Acquires OTUS in a Move to Accelerate Autonomous Vehicle Applications
Leading full-stack edge AI company Kneron announces its acquisition of OTUS, a subsidiary of VIVOTEK (a Delta Group company). OTUS is a camera and imaging solutions provider specializing in applications for automotive, virtual reality, as well as other panoramic scenarios. Over the past two years, the company has worked closely with Kneron to jointly commercialize several automotive applications. As a vehicle solutions provider, OTUS has experienced over the past few years substantial business expansion as well as strong customer stickiness due to the high growth trajectory of automotive markets. The company provides ADAS and DMS functions such as road object detection and driver behavior monitoring.
Latent Prompt Tuning for Text Summarization
Zhang, Yubo, Zhang, Xingxing, Wang, Xun, Chen, Si-qing, Wei, Furu
Prompts with different control signals (e.g., length, keywords, etc.) can be used to control text summarization. When control signals are available, they can control the properties of generated summaries and potentially improve summarization quality (since more information are given). Unfortunately, control signals are not already available during inference time. In this paper, we propose Lotus (shorthand for Latent Prompt Tuning for Summarization), which is a single model that can be applied in both controlled and uncontrolled (without control signals) modes. During training, Lotus learns latent prompt representations from prompts with gold control signals using a contrastive learning objective. Experiments show Lotus in uncontrolled mode consistently improves upon strong (uncontrollable) summarization models across four different summarization datasets. We also demonstrate generated summaries can be controlled using prompts with user specified control tokens.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- Asia > China > Hong Kong (0.04)
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
- Europe > Germany > Berlin (0.04)
The Phylogenetic LASSO and the Microbiome
Rush, Stephen T, Lee, Christine H, Mio, Washington, Kim, Peter T
Scientific investigations that incorporate next generation sequencing involve analyses of high-dimensional data where the need to organize, collate and interpret the outcomes are pressingly important. Currently, data can be collected at the microbiome level leading to the possibility of personalized medicine whereby treatments can be tailored at this scale. In this paper, we lay down a statistical framework for this type of analysis with a view toward synthesis of products tailored to individual patients. Although the paper applies the technique to data for a particular infectious disease, the methodology is sufficiently rich to be expanded to other problems in medicine, especially those in which coincident `-omics' covariates and clinical responses are simultaneously captured.
- North America > Canada > Ontario > Hamilton (0.14)
- North America > United States > New York (0.04)
- North America > Canada > Ontario > Wellington County > Guelph (0.04)
- (2 more...)