practical machine learning
Practical Machine Learning for Aphasic Discourse Analysis
Pittman, Jason M., Phillips, Anton Jr., Medina-Santos, Yesenia, Stark, Brielle C.
Analyzing spoken discourse is a valid means of quantifying language ability in persons with aphasia. There are many ways to quantify discourse, one common way being to evaluate the informativeness of the discourse. That is, given the total number of words produced, how many of those are context-relevant and accurate. This type of analysis is called Correct Information Unit (CIU) analysis and is one of the most prevalent discourse analyses used by speech-language pathologists (SLPs). Despite this, CIU analysis in the clinic remains limited due to the manual labor needed by SLPs to code and analyze collected speech. Recent advances in machine learning (ML) seek to augment such labor by automating modeling of propositional, macrostructural, pragmatic, and multimodal dimensions of discourse. To that end, this study evaluated five ML models for reliable identification of Correct Information Units (CIUs, Nicholas & Brookshire, 1993), during a picture description task. The five supervised ML models were trained using randomly selected human-coded transcripts and accompanying words and CIUs from persons with aphasia. The baseline model training produced a high accuracy across transcripts for word vs non-word, with all models achieving near perfect performance (0.995) with high AUC range (0.914 min, 0.995 max). In contrast, CIU vs non-CIU showed a greater variability, with the k-nearest neighbor (k-NN) model the highest accuracy (0.824) and second highest AUC (0.787). These findings indicate that while the supervised ML models can distinguish word from not word, identifying CIUs is challenging.
- North America > United States > Maryland (0.04)
- North America > United States > Indiana (0.04)
Investigating the Impact of Quantization on Adversarial Robustness
Li, Qun, Meng, Yuan, Tang, Chen, Jiang, Jiacheng, Wang, Zhi
Quantization is a promising technique for reducing the bit-width of deep models to improve their runtime performance and storage efficiency, and thus becomes a fundamental step for deployment. In real-world scenarios, quantized models are often faced with adversarial attacks which cause the model to make incorrect inferences by introducing slight perturbations. However, recent studies have paid less attention to the impact of quantization on the model robustness. More surprisingly, existing studies on this topic even present inconsistent conclusions, which prompted our in-depth investigation. In this paper, we conduct a first-time analysis of the impact of the quantization pipeline components that can incorporate robust optimization under the settings of Post-Training Quantization and Quantization-Aware Training. Through our detailed analysis, we discovered that this inconsistency arises from the use of different pipelines in different studies, specifically regarding whether robust optimization is performed and at which quantization stage it occurs. Our research findings contribute insights into deploying more secure and robust quantized networks, assisting practitioners in reference for scenarios with high-security requirements and limited resources.
- Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
- Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
- Asia > China > Guangdong Province > Shenzhen (0.04)
- Information Technology > Security & Privacy (0.36)
- Government > Military (0.36)
Bridging Diversity and Uncertainty in Active learning with Self-Supervised Pre-Training
Doucet, Paul, Estermann, Benjamin, Aczel, Till, Wattenhofer, Roger
This study addresses the integration of diversity-based and uncertainty-based sampling strategies in active learning, particularly within the context of self-supervised pre-trained models. We introduce a straightforward heuristic called TCM that mitigates the cold start problem while maintaining strong performance across various data levels. By initially applying TypiClust for diversity sampling and subsequently transitioning to uncertainty sampling with Margin, our approach effectively combines the strengths of both strategies. Our experiments demonstrate that TCM consistently outperforms existing methods across various datasets in both low and high data regimes.
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.05)
- Oceania > Australia > New South Wales > Sydney (0.04)
- North America > United States > Maryland > Baltimore (0.04)
- (8 more...)
SUPClust: Active Learning at the Boundaries
Ono, Yuta, Aczel, Till, Estermann, Benjamin, Wattenhofer, Roger
Active learning is a machine learning paradigm designed to optimize model performance in a setting where labeled data is expensive to acquire. In this work, we propose a novel active learning method called SUPClust that seeks to identify points at the decision boundary between classes. By targeting these points, SUP-Clust aims to gather information that is most informative for refining the model's prediction of complex decision regions. We demonstrate experimentally that labeling these points leads to strong model performance. This improvement is observed even in scenarios characterized by strong class imbalance.
- North America > United States > Florida > Miami-Dade County > Miami (0.04)
- North America > Canada > Ontario > Toronto (0.04)
- Europe > Switzerland > Zürich > Zürich (0.04)
- (4 more...)
Practical Machine Learning
One of the most common tasks performed by data scientists and data analysts are prediction and machine learning. This course will cover the basic components of building and applying prediction functions with an emphasis on practical applications. The course will provide basic grounding in concepts such as training and tests sets, overfitting, and error rates. The course will also introduce a range of model based and algorithmic machine learning methods including regression, classification trees, Naive Bayes, and random forests. The course will cover the complete process of building prediction functions including data collection, feature creation, algorithms, and evaluation.
- Education > Educational Technology > Educational Software > Computer Based Training (0.40)
- Education > Educational Setting > Online (0.40)
Practical Machine Learning in R: Nwanganga, Fred, Chapple, Mike + Free Shipping
Mike Chapple is Teaching Professor of IT, Analytics, and Operations at the University of Notre Dame's Mendoza College of Business where he teaches graduate and undergraduate courses in cybersecurity and business analytics. Prior to joining Notre Dame's faculty, Mike served as Senior Director for IT Service Delivery at the University. In this role, he oversaw the information security, IT compliance, cloud computing, data governance, IT architecture, learning platforms, project management, strategic planning and product management functions for the Office of Information Technologies. Mike led Notre Dame's Cloud First strategy which moved 80% of the institution's IT services into the cloud over three years. Mike previously served as Senior Advisor to the Executive Vice President at Notre Dame for two years.
- North America > United States > Indiana > St. Joseph County > Notre Dame (0.07)
- North America > United States > Idaho (0.07)
- Information Technology > Security & Privacy (0.87)
- Education > Educational Setting > Higher Education (0.42)
- Government > Military > Cyberwarfare (0.39)
O'Reilly - Practical Machine Learning for Computer Vision - ch3
Using machine learning models to extract information from images is one of the trickiest ML tasks--but it often yields invaluable insights. What's more, image classification is the "Hello World" of deep learning: It's a stepping stone to other deep learning domains, such as natural language processing. In chapter 3, the authors of Practical Machine Learning for Computer Vision lay out the techniques and model architectures that take advantage of the special properties of images. And did we mention, it's free?
Practical Machine Learning in JavaScript: TensorFlow.js for Web Developers: Gerard, Charlie: 9781484264171: Amazon.com: Books
You'll learn not only theory, but also dive into code samples and example projects with TensorFlow.js. Using these skills and your knowledge as a web developer, you'll add a whole new field of development to your tool set. This will give you a more concrete understanding of the possibilities offered by machine learning. Discover how ML will impact the future of not just programming in general, but web development specifically.
- Education > Educational Technology > Educational Software > Computer Based Training (0.40)
- Education > Educational Setting > Online (0.40)
- Asia > India > NCT > New Delhi (0.14)
- North America > United States (0.05)
- Asia > India > Uttar Pradesh (0.05)
- Asia > India > NCT > Delhi (0.05)
- Instructional Material > Online (0.40)
- Instructional Material > Course Syllabus & Notes (0.40)
- Information Technology > Security & Privacy (1.00)
- Education > Educational Technology > Educational Software > Computer Based Training (0.51)
- Education > Educational Setting > Online (0.51)
- Information Technology > Software (1.00)
- Information Technology > Security & Privacy (1.00)
- Information Technology > Communications (1.00)
- (2 more...)