dcis
DCIS: Efficient Length Extrapolation of LLMs via Divide-and-Conquer Scaling Factor Search
Yang, Lei, Xu, Shaoyang, Xiong, Deyi
Large language models (LLMs) based on the Transformer architecture usually have their context length limited due to the high training cost. Recent advancements extend the context window by adjusting the scaling factors of RoPE and fine-tuning. However, suboptimal initialization of these factors results in increased fine-tuning costs and reduced performance at target length. To address these challenges, we propose an innovative RoPE-based fine-tuning framework that diverges from conventional scaling factors search. Specifically, we present a Divide-and-Conquer Incremental Search (DCIS) algorithm that strategically determines the better scaling factors. Further fine-tuning with the identified scaling factors effectively extends the context window of LLMs. Empirical results demonstrate that our methodology not only mitigates performance decay at extended target lengths but also allows the model to fine-tune on short contexts and generalize to long contexts, thereby reducing the cost of fine-tuning. The scaling factors obtained through DCIS can even perform effectively without fine-tuning. Further analysis of the search space reveals that DCIS achieves twice the search efficiency compared to other methods. We also examine the impact of the non-strictly increasing scaling factors utilized in DCIS and evaluate the general capabilities of LLMs across various context lengths.
Multi-Stain Multi-Level Convolutional Network for Multi-Tissue Breast Cancer Image Segmentation
Modi, Akash, Jha, Sumit Kumar, Mishra, Purnendu, Kumar, Rajiv, Aatre, Kiran, Singh, Gursewak, Mathur, Shubham
Digital pathology and microscopy image analysis are widely employed in the segmentation of digitally scanned IHC slides, primarily to identify cancer and pinpoint regions of interest (ROI) indicative of tumor presence. However, current ROI segmentation models are either stain-specific or suffer from the issues of stain and scanner variance due to different staining protocols or modalities across multiple labs. Also, tissues like Ductal Carcinoma in Situ (DCIS), acini, etc. are often classified as Tumors due to their structural similarities and color compositions. In this paper, we proposed a novel convolutional neural network (CNN) based Multi-class Tissue Segmentation model for histopathology whole-slide Breast slides which classify tumors and segments other tissue regions such as Ducts, acini, DCIS, Squamous epithelium, Blood Vessels, Necrosis, etc. as a separate class. Our unique pixel-aligned non-linear merge across spatial resolutions empowers models with both local and global fields of view for accurate detection of various classes. Our proposed model is also able to separate bad regions such as folds, artifacts, blurry regions, bubbles, etc. from tissue regions using multi-level context from different resolutions of WSI. Multi-phase iterative training with context-aware augmentation and increasing noise was used to efficiently train a multi-stain generic model with partial and noisy annotations from 513 slides. Our training pipeline used 12 million patches generated using context-aware augmentations which made our model stain and scanner invariant across data sources. To extrapolate stain and scanner invariance, our model was evaluated on 23000 patches which were for a completely new stain (Hematoxylin and Eosin) from a completely new scanner (Motic) from a different lab. The mean IOU was 0.72 which is on par with model performance on other data sources and scanners.
Can Occult Invasive Disease in Ductal Carcinoma In Situ Be Predicted Using Computer-extracted Mammographic Features? - PubMed
Rationale and objectives: This study aimed to determine whether mammographic features assessed by radiologists and using computer algorithms are prognostic of occult invasive disease for patients showing ductal carcinoma in situ (DCIS) only in core biopsy. Materials and methods: In this retrospective study, we analyzed data from 99 subjects with DCIS (74 pure DCIS, 25 DCIS with occult invasion). We developed a computer-vision algorithm capable of extracting 113 features from magnification views in mammograms and combining these features to predict whether a DCIS case will be upstaged to invasive cancer at the time of definitive surgery. In comparison, we also built predictive models based on physician-interpreted features, which included histologic features extracted from biopsy reports and Breast Imaging Reporting and Data System-related mammographic features assessed by two radiologists. The generalization performance was assessed using leave-one-out cross validation with the receiver operating characteristic curve analysis.
Data Uncertainty without Prediction Models
Data acquisition processes for machine learning are often costly. To construct a high-performance prediction model with fewer data, a degree of difficulty in prediction is often deployed as the acquisition function in adding a new data point. The degree of difficulty is referred to as uncertainty in prediction models. We propose an uncertainty estimation method named a Distance-weighted Class Impurity without explicit use of prediction models. We estimated uncertainty using distances and class impurities around the location, and compared it with several methods based on prediction models for uncertainty estimation by active learning tasks. We verified that the Distance-weighted Class Impurity works effectively regardless of prediction models.
A Computer-Aided Diagnosis System for Breast Pathology: A Deep Learning Approach with Model Interpretability from Pathological Perspective
Hsu, Wei-Wen, Wu, Yongfang, Hao, Chang, Hou, Yu-Ling, Gao, Xiang, Shao, Yun, Zhang, Xueli, He, Tao, Tai, Yanhong
Objective: We develop a computer-aided diagnosis (CAD) system using deep learning approaches for lesion detection and classification on whole-slide images (WSIs) with breast cancer. The deep features being distinguishing in classification from the convolutional neural networks (CNN) are demonstrated in this study to provide comprehensive interpretability for the proposed CAD system using pathological knowledge. Methods: In the experiment, a total of 186 slides of WSIs were collected and classified into three categories: Non-Carcinoma, Ductal Carcinoma in Situ (DCIS), and Invasive Ductal Carcinoma (IDC). Instead of conducting pixel-wise classification into three classes directly, we designed a hierarchical framework with the multi-view scheme that performs lesion detection for region proposal at higher magnification first and then conducts lesion classification at lower magnification for each detected lesion. Results: The slide-level accuracy rate for three-category classification reaches 90.8% (99/109) through 5-fold cross-validation and achieves 94.8% (73/77) on the testing set. The experimental results show that the morphological characteristics and co-occurrence properties learned by the deep learning models for lesion classification are accordant with the clinical rules in diagnosis. Conclusion: The pathological interpretability of the deep features not only enhances the reliability of the proposed CAD system to gain acceptance from medical specialists, but also facilitates the development of deep learning frameworks for various tasks in pathology. Significance: This paper presents a CAD system for pathological image analysis, which fills the clinical requirements and can be accepted by medical specialists with providing its interpretability from the pathological perspective.
AI at Case Western Reserve lab predicts which pre-malignant breast lesions will progress to invasive cancer
New research at Case Western Reserve University could help better determine which patients diagnosed with the pre-malignant breast cancer commonly referred to as stage 0 are likely to progress to invasive breast cancer and therefore might benefit from additional therapy over and above surgery alone. Once a lumpectomy of breast tissue reveals this pre-cancerous tumor, most women have surgery to remove the remainder of the affected tissue and some are given radiation therapy as well, said Anant Madabhushi, the F. Alex Nason Professor II of Biomedical Engineering at the Case School of Engineering. "Current testing places patients in high risk, low risk and indeterminate risk--but then treats those indeterminates with radiation, anyway," said Madabhushi, whose Center for Computational Imaging and Personalized Diagnostics (CCIPD) conducted the new research. "They err on the side of caution, but we're saying that it appears that it should go the other way--the middle should be classified with the lower risk. "In short, we're probably overtreating patients," Madabhushi continued.
Deep learning analysis of breast MRIs for prediction of occult invasive disease in ductal carcinoma in situ
To determine whether deep learning-based algorithms applied to breast MR images can aid in the prediction of occult invasive disease following the diagnosis of ductal carcinoma in situ (DCIS) by core needle biopsy. The data was collected from 2000 to 2014. In this institutional review board-approved study, we analyzed dynamic contrast-enhanced fat-saturated T1-weighted MRI sequences from 131 patients with a core needle biopsy-confirmed diagnosis of DCIS. We explored two different deep learning approaches to predict whether there was an occult invasive component in the analyzed tumors that was ultimately identified at surgical excision. In the first approach, we adopted the transfer learning strategy.
New AI Delivers More Accurate Breast Cancer Diagnoses Than Human Doctors
"It is critical to get a correct diagnosis from the beginning so that we can guide patients to the most effective treatments," said Dr. Joann Elmore, the study's senior author and a professor of medicine at the David Geffen School of Medicine at UCLA. Why would there be a need for such a study? Well, because, according to a 2015 study led by Elmore, pathologists often disagree on the outcome of breast biopsies. Furthermore, research has also found that diagnostic errors occurred in about one out of every six women who had ductal carcinoma in situ (DCIS) and incorrect diagnoses were given in about half of the biopsy cases of breast atypia. These are quite some significant errors.
Artificial intelligence could diagnose breast cancer better than doctors
A computer could be better than a doctor at diagnosing certain types of cancerous and precancerous breast lesions, new research suggests. Researchers at the University of California, Los Angeles, trained an artificial intelligence system using 240 biopsy images, and tested it against 87 pathologists. The machine performed more or less as well as doctors at detecting and classifying all of the breast biopsies. However, it was better at making one crucial distinction: telling the difference between DCIS (ductal carcinoma in situ), a type of cancer, and atypical hyperplasia, a high-risk lesion that has very similar hallmarks but does is not cancerous and does not require the same level of treatment. 'Medical images of breast biopsies contain a great deal of complex data and interpreting them can be very subjective,' said Dr Joann Elmore, lead author of the study published in the JAMA Network Open journal.
A tree augmented naive Bayesian network experiment for breast cancer prediction
In order to investigate the breast cancer prediction problem on the aging population with the grades of DCIS, we conduct a tree augmented naive Bayesian network experiment trained and tested on a large clinical dataset including consecutive diagnostic mammography examinations, consequent biopsy outcomes and related cancer registry records in the population of women across all ages. Our tasks are to classify the conventional "Benign vs. Malignant" and the new "Benign/LG vs. IntG/HG/Invasive" based on mammography examination features and patient demographic information, specifically to predict the probability of malignancy, for the biopsy threshold setting and the biopsy decision making. The aggregated results of our tenfold cross validation method recommend a biopsy threshold higher than 2% for the aging population. The Receiver Operating Characteristic curves and the Precision-Recall curves by aggregating the tenfold cross validation results are interesting.