Overijssel
Improving OCR for Historical Texts of Multiple Languages
Westerdijk, Hylke, Blankenborg, Ben, Islam, Khondoker Ittehadul
This paper presents our methodology and findings from three tasks across Optical Character Recognition (OCR) and Document Layout Analysis using advanced deep learning techniques. First, for the historical Hebrew fragments of the Dead Sea Scrolls, we enhanced our dataset through extensive data augmentation and employed the Kraken and TrOCR models to improve character recognition. In our analysis of 16th to 18th-century meeting resolutions task, we utilized a Convolutional Recurrent Neural Network (CRNN) that integrated DeepLabV3+ for semantic segmentation with a Bidirectional LSTM, incorporating confidence-based pseudolabeling to refine our model. Finally, for modern English handwriting recognition task, we applied a CRNN with a ResNet34 encoder, trained using the Connectionist Temporal Classification (CTC) loss function to effectively capture sequential dependencies. This report offers valuable insights and suggests potential directions for future research.
- Europe > Norway > Eastern Norway > Oslo (0.04)
- Europe > Netherlands > Overijssel (0.04)
- Research Report (1.00)
- Overview (0.68)
Automatic and standardized surgical reporting for central nervous system tumors
Bouget, David, Faanes, Mathilde Gajda, Jakola, Asgeir Store, Barkhof, Frederik, Ardon, Hilko, Bello, Lorenzo, Berger, Mitchel S., Hervey-Jumper, Shawn L., Furtner, Julia, Idema, Albert J. S., Kiesel, Barbara, Widhalm, Georg, Tewarie, Rishi Nandoe, Mandonnet, Emmanuel, Robe, Pierre A., Wagemakers, Michiel, Smith, Timothy R., Hamer, Philip C. De Witt, solheim, Ole, Reinertsen, Ingerid
Magnetic resonance (MR) imaging is essential for evaluating central nervous system (CNS) tumors, guiding surgical planning, treatment decisions, and assessing postoperative outcomes and complication risks. While recent work has advanced automated tumor segmentation and report generation, most efforts have focused on preoperative data, with limited attention to postoperative imaging analysis. This study introduces a comprehensive pipeline for standardized postsurtical reporting in CNS tumors. Using the Attention U-Net architecture, segmentation models were trained for the preoperative (non-enhancing) tumor core, postoperative contrast-enhancing residual tumor, and resection cavity. Additionally, MR sequence classification and tumor type identification for contrast-enhancing lesions were explored using the DenseNet architecture. The models were integrated into a reporting pipeline, following the RANO 2.0 guidelines. Training was conducted on multicentric datasets comprising 2000 to 7000 patients, using a 5-fold cross-validation. Evaluation included patient-, voxel-, and object-wise metrics, with benchmarking against the latest BraTS challenge results. The segmentation models achieved average voxel-wise Dice scores of 87%, 66%, 70%, and 77% for the tumor core, non-enhancing tumor core, contrast-enhancing residual tumor, and resection cavity, respectively. Classification models reached 99.5% balanced accuracy in MR sequence classification and 80% in tumor type classification. The pipeline presented in this study enables robust, automated segmentation, MR sequence classification, and standardized report generation aligned with RANO 2.0 guidelines, enhancing postoperative evaluation and clinical decision-making. The proposed models and methods were integrated into Raidionics, open-source software platform for CNS tumor analysis, now including a dedicated module for postsurgical analysis.
- North America > United States > California > San Francisco County > San Francisco (0.14)
- Europe > Sweden > Vaestra Goetaland > Gothenburg (0.14)
- Europe > Austria > Vienna (0.14)
- (10 more...)
- Research Report > Experimental Study (0.45)
- Research Report > New Finding (0.34)
- Health & Medicine > Therapeutic Area > Oncology > Central Nervous System Cancer (1.00)
- Health & Medicine > Therapeutic Area > Neurology (1.00)
Processing of synthetic data in AI development for healthcare and the definition of personal data in EU law
Vallevik, Vibeke Binz, Befring, Anne Kjersti C., Elvatun, Severin, Nygaard, Jan Franz
Artificial intelligence (AI) has the potential to transform healthcare, but it requires access to health data. Synthetic data that is generated through machine learning models trained on real data, offers a way to share data while preserving privacy. However, uncertainties in the practical application of the General Data Protection Regulation (GDPR) create an administrative burden, limiting the benefits of synthetic data. Through a systematic analysis of relevant legal sources and an empirical study, this article explores whether synthetic data should be classified as personal data under the GDPR. The study investigates the residual identification risk through generating synthetic data and simulating inference attacks, challenging common perceptions of technical identification risk. The findings suggest synthetic data is likely anonymous, depending on certain factors, but highlights uncertainties about what constitutes reasonably likely risk. To promote innovation, the study calls for clearer regulations to balance privacy protection with the advancement of AI in healthcare.
- Europe > Germany (0.45)
- Europe > Norway > Eastern Norway > Oslo (0.04)
- Europe > Netherlands > Overijssel (0.04)
- (8 more...)
- Law (1.00)
- Information Technology > Security & Privacy (1.00)
- Government > Regional Government > Europe Government (1.00)
- Health & Medicine > Therapeutic Area > Oncology (0.93)
On the expressivity of deep Heaviside networks
Kong, Insung, Chen, Juntong, Langer, Sophie, Schmidt-Hieber, Johannes
The Heaviside activation function is for instance used in Hopfield networks [ 1 ] that have recently seen a resurge due to their connections t o attention layers [ 2, 3 ] and the 2024 Nobel Prize in Physics that was partially award ed for their development. Moreover, the Heaviside activation function is closely related to quantized neural networks [ 4, 5 ], playing a key role in enabling energy efficient deployment o f large language models (LLMs) [ 6, 7 ]. We refer to neural networks with several hidden layers and th e Heaviside activation function as deep Heaviside (neural) networks (DHNs). These networks are also known as (linear) threshold networks. The Heaviside activation function can be traced back to the fi rst attempts to build an artificial counterpart of a biological neuron. In the brain, the inputs of a neuron contribute to its membrane potential and the neuron discharges/fires if th e membrane potential exceeds a certain threshold.
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Europe > Netherlands > Overijssel (0.04)
- (2 more...)
- Research Report (0.63)
- Personal > Honors (0.54)
Deep Learning for automated multi-scale functional field boundaries extraction using multi-date Sentinel-2 and PlanetScope imagery: Case Study of Netherlands and Pakistan
Zahid, Saba, Ghuffar, Sajid, Obaid-ur-Rehman, null, Shah, Syed Roshaan Ali
This study explores the effectiveness of multi-temporal satellite imagery for better functional field boundary delineation using deep learning semantic segmentation architecture on two distinct geographical and multi-scale farming systems of Netherlands and Pakistan. Multidate images of April, August and October 2022 were acquired for PlanetScope and Sentinel-2 in sub regions of Netherlands and November 2022, February and March 2023 for selected area of Dunyapur in Pakistan. For Netherlands, Basic registration crop parcels (BRP) vector layer was used as labeled training data. while self-crafted field boundary vector data were utilized for Pakistan. Four deep learning models with UNET architecture were evaluated using different combinations of multi-date images and NDVI stacks in the Netherlands subregions. A comparative analysis of IoU scores assessed the effectiveness of the proposed multi-date NDVI stack approach. These findings were then applied for transfer learning, using pre-trained models from the Netherlands on the selected area in Pakistan. Additionally, separate models were trained using self-crafted field boundary data for Pakistan, and combined models were developed using data from both the Netherlands and Pakistan. Results indicate that multi-date NDVI stacks provide additional temporal context, reflecting crop growth over different times of the season. The study underscores the critical role of multi-scale ground information from diverse geographical areas in developing robust and universally applicable models for field boundary delineation. The results also highlight the importance of fine spatial resolution for extraction of field boundaries in regions with small scale framing. The findings can be extended to multi-scale implementations for improved automatic field boundary delineation in heterogeneous agricultural environments.
- Asia > Pakistan > Islamabad Capital Territory > Islamabad (0.04)
- Asia > Bangladesh (0.04)
- Europe > Netherlands > Flevoland (0.04)
- (9 more...)
Extracting Lexical Features from Dialects via Interpretable Dialect Classifiers
Xie, Roy, Ahia, Orevaoghene, Tsvetkov, Yulia, Anastasopoulos, Antonios
Identifying linguistic differences between dialects of a language often requires expert knowledge and meticulous human analysis. This is largely due to the complexity and nuance involved in studying various dialects. We present a novel approach to extract distinguishing lexical features of dialects by utilizing interpretable dialect classifiers, even in the absence of human experts. We explore both post-hoc and intrinsic approaches to interpretability, conduct experiments on Mandarin, Italian, and Low Saxon, and experimentally demonstrate that our method successfully identifies key language-specific lexical features that contribute to dialectal variations.
- Asia > Taiwan (0.05)
- Europe > Croatia > Dubrovnik-Neretva County > Dubrovnik (0.04)
- Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
- (19 more...)
Towards Global Glacier Mapping with Deep Learning and Open Earth Observation Data
Maslov, Konstantin A., Persello, Claudio, Schellenberger, Thomas, Stein, Alfred
Accurate global glacier mapping is critical for understanding climate change impacts. It is challenged by glacier diversity, difficult-to-classify debris and big data processing. Here we propose Glacier-VisionTransformer-U-Net (GlaViTU), a convolutional-transformer deep learning model, and five strategies for multitemporal global-scale glacier mapping using open satellite imagery. Assessing the spatial, temporal and cross-sensor generalisation shows that our best strategy achieves intersection over union >0.85 on previously unobserved images in most cases, which drops to >0.75 for debris-rich areas such as High-Mountain Asia and increases to >0.90 for regions dominated by clean ice. Additionally, adding synthetic aperture radar data, namely, backscatter and interferometric coherence, increases the accuracy in all regions where available. The calibrated confidence for glacier extents is reported making the predictions more reliable and interpretable. We also release a benchmark dataset that covers 9% of glaciers worldwide. Our results support efforts towards automated multitemporal and global glacier mapping.
- North America > United States > Alaska (0.05)
- Europe > Sweden (0.05)
- Europe > Denmark (0.05)
- (14 more...)
Data-Centric Machine Learning for Geospatial Remote Sensing Data
Roscher, Ribana, Rußwurm, Marc, Gevaert, Caroline, Kampffmeyer, Michael, Santos, Jefersson A. dos, Vakalopoulou, Maria, Hänsch, Ronny, Hansen, Stine, Nogueira, Keiller, Prexl, Jonathan, Tuia, Devis
Recent developments and research in modern machine learning have led to substantial improvements in the geospatial field. Although numerous deep learning models have been proposed, the majority of them have been developed on benchmark datasets that lack strong real-world relevance. Furthermore, the performance of many methods has already saturated on these datasets. We argue that shifting the focus towards a complementary data-centric perspective is necessary to achieve further improvements in accuracy, generalization ability, and real impact in end-user applications. This work presents a definition and precise categorization of automated data-centric learning approaches for geospatial data. It highlights the complementary role of data-centric learning with respect to model-centric in the larger machine learning deployment cycle. We review papers across the entire geospatial field and categorize them into different groups. A set of representative experiments shows concrete implementation examples. These examples provide concrete steps to act on geospatial data with data-centric machine learning approaches.
- Europe > United Kingdom (0.14)
- Asia > India > Maharashtra > Mumbai (0.05)
- North America > Mexico > Mexico City > Mexico City (0.04)
- (12 more...)
Segmentation of glioblastomas in early post-operative multi-modal MRI with deep neural networks
Helland, Ragnhild Holden, Ferles, Alexandros, Pedersen, André, Kommers, Ivar, Ardon, Hilko, Barkhof, Frederik, Bello, Lorenzo, Berger, Mitchel S., Dunås, Tora, Nibali, Marco Conti, Furtner, Julia, Hervey-Jumper, Shawn, Idema, Albert J. S., Kiesel, Barbara, Tewari, Rishi Nandoe, Mandonnet, Emmanuel, Müller, Domenique M. J., Robe, Pierre A., Rossi, Marco, Sagberg, Lisa M., Sciortino, Tommaso, Aalders, Tom, Wagemakers, Michiel, Widhalm, Georg, Witte, Marnix G., Zwinderman, Aeilko H., Majewska, Paulina L., Jakola, Asgeir S., Solheim, Ole, Hamer, Philip C. De Witt, Reinertsen, Ingerid, Eijgelaar, Roelant S., Bouget, David
Extent of resection after surgery is one of the main prognostic factors for patients diagnosed with glioblastoma. To achieve this, accurate segmentation and classification of residual tumor from post-operative MR images is essential. The current standard method for estimating it is subject to high inter- and intra-rater variability, and an automated method for segmentation of residual tumor in early post-operative MRI could lead to a more accurate estimation of extent of resection. In this study, two state-of-the-art neural network architectures for pre-operative segmentation were trained for the task. The models were extensively validated on a multicenter dataset with nearly 1000 patients, from 12 hospitals in Europe and the United States. The best performance achieved was a 61\% Dice score, and the best classification performance was about 80\% balanced accuracy, with a demonstrated ability to generalize across hospitals. In addition, the segmentation performance of the best models was on par with human expert raters. The predicted segmentations can be used to accurately classify the patients into those with residual tumor, and those with gross total resection.
- North America > United States > California > San Francisco County > San Francisco (0.28)
- Europe > Austria > Vienna (0.14)
- Europe > Netherlands > North Holland > Amsterdam (0.05)
- (9 more...)
- Research Report > Experimental Study (0.68)
- Research Report > New Finding (0.66)
- Health & Medicine > Therapeutic Area > Oncology > Brain Cancer (1.00)
- Health & Medicine > Therapeutic Area > Neurology (1.00)
- Health & Medicine > Therapeutic Area > Oncology > Childhood Cancer (0.73)
Modeling the Social Influence of COVID-19 via Personalized Propagation with Deep Learning
Liu, Yufei, Cao, Jie, Pi, Dechang
Social influence prediction has permeated many domains, including marketing, behavior prediction, recommendation systems, and more. However, traditional methods of predicting social influence not only require domain expertise,they also rely on extracting user features, which can be very tedious. Additionally, graph convolutional networks (GCNs), which deals with graph data in non-Euclidean space, are not directly applicable to Euclidean space. To overcome these problems, we extended DeepInf such that it can predict the social influence of COVID-19 via the transition probability of the page rank domain. Furthermore, our implementation gives rise to a deep learning-based personalized propagation algorithm, called DeepPP. The resulting algorithm combines the personalized propagation of a neural prediction model with the approximate personalized propagation of a neural prediction model from page rank analysis. Four social networks from different domains as well as two COVID-19 datasets were used to demonstrate the efficiency and effectiveness of the proposed algorithm. Compared to other baseline methods, DeepPP provides more accurate social influence predictions. Further, experiments demonstrate that DeepPP can be applied to real-world prediction data for COVID-19.
- Asia > China > Jiangsu Province > Nanjing (0.04)
- Asia > China > Hubei Province > Wuhan (0.04)
- Europe > Italy (0.04)
- (7 more...)