Goto

Collaborating Authors

 chromosome


Graph Coloring via Neural Networks for Haplotype Assembly and Viral Quasispecies Reconstruction

Neural Information Processing Systems

Understanding genetic variation, e.g., through mutations, in organisms is crucial to unravel their effects on the environment and human health. A fundamental characterization can be obtained by solving the haplotype assembly problem, which yields the variation across multiple copies of chromosomes. Variations among fast evolving viruses that lead to different strains (called quasispecies) are also deciphered with similar approaches. In both these cases, high-throughput sequencing technologies that provide oversampled mixtures of large noisy fragments (reads) of genomes, are used to infer constituent components (haplotypes or quasispecies). The problem is harder for polyploid species where there are more than two copies of chromosomes. State-of-the-art neural approaches to solve this NP-hard problem do not adequately model relations among the reads that are important for deconvolving the input signal. We address this problem by developing a new method, called NeurHap, that combines graph representation learning with combinatorial optimization. Our experiments demonstrate the substantially better performance of NeurHap in real and synthetic datasets compared to competing approaches.


Efficient Chromosome Parallelization for Precision Medicine Genomic Workflows

Montserrat, Daniel Mas, Verma, Ray, Barrabés, Míriam, de la Vega, Francisco M., Bustamante, Carlos D., Ioannidis, Alexander G.

arXiv.org Artificial Intelligence

Large-scale genomic workflows used in precision medicine can process datasets spanning tens to hundreds of gigabytes per sample, leading to high memory spikes, intensive disk I/O, and task failures due to out-of-memory errors. Simple static resource allocation methods struggle to handle the variability in per-chromosome RAM demands, resulting in poor resource utilization and long runtimes. In this work, we propose multiple mechanisms for adaptive, RAM-efficient par-allelization of chromosome-level bioinformatics workflows. First, we develop a symbolic regression model that estimates per-chromosome memory consumption for a given task and introduces an interpolating bias to conservatively minimize over-allocation. Second, we present a dynamic scheduler that adaptively predicts RAM usage with a polynomial regression model, treating task packing as a Knapsack problem to optimally batch jobs based on predicted memory requirements. Additionally, we present a static scheduler that optimizes chromosome processing order to minimize peak memory while preserving throughput. Our proposed methods, evaluated on simulations and real-world genomic pipelines, provide new mechanisms to reduce memory overruns and balance load across threads. We thereby achieve faster end-to-end execution, showcasing the potential to optimize large-scale genomic workflows.



A MatNet variants

Neural Information Processing Systems

A.1 Multiple data matrices A combinatorial optimization problem can be presented with multiple ( f) relationship features between two groups of items. In FFSP, for example, a production cost could be different for each process that one has to take into account for scheduling in addition to the processing time for each pair of the job and the machine. "Trainable element-wise function" block in Figure A.1 is now an MLP with f + 1 input nodes and 1 output node. The bold line indicates the change from the original. A.3 Alternatives to one-hot initial node embeddings For initial node representations of nodes in group B, one only needs mutually distinct-enough N We have chosen to present our model with one-hot vectors because it is the simplest to implement this way.



DPCformer: An Interpretable Deep Learning Model for Genomic Prediction in Crops

Deng, Pengcheng, Liu, Kening, Zhou, Mengxi, Li, Mingxi, Yang, Rui, Cao, Chuzhe, Wang, Maojun, Zhang, Zeyu

arXiv.org Artificial Intelligence

Genomic Selection (GS) uses whole-genome information to predict crop phenotypes and accelerate breeding. Traditional GS methods, however, struggle with prediction accuracy for complex traits and large datasets. We propose DPCformer, a deep learning model integrating convolutional neural networks with a self-attention mechanism to model complex genotype-phenotype relationships. We applied DPCformer to 13 traits across five crops (maize, cotton, tomato, rice, chickpea). Our approach uses an 8-dimensional one-hot encoding for SNP data, ordered by chromosome, and employs the PMF algorithm for feature selection. Evaluations show DPCformer outperforms existing methods. In maize datasets, accuracy for traits like days to tasseling and plant height improved by up to 2.92%. For cotton, accuracy gains for fiber traits reached 8.37%. On small-sample tomato data, the Pearson Correlation Coefficient for a key trait increased by up to 57.35%. In chickpea, the yield correlation was boosted by 16.62%. DPCformer demonstrates superior accuracy, robustness in small-sample scenarios, and enhanced interpretability, providing a powerful tool for precision breeding and addressing global food security challenges.


Precise classification of low quality G-banded Chromosome Images by reliability metrics and data pruning classifier

Moattari, Mojtaba

arXiv.org Artificial Intelligence

In the last decade, due to high resolution cameras and accurate meta - phase analyzes, the accuracy of chromosome classification has improved substantially. However, current Karyotyping systems demand large number of high quality train data to have an adequa tely plausible Precision per each chromosome. Such provision of high quality train data with accurate devices are not yet accomplished in some out - reached pathological laboratories. To prevent false positive detections in low - cost systems and low - quality i mages settings, this paper improves the classification Precision of chromosomes using proposed reliability thresholding metrics and deliberately engineered features. The proposed method has been evaluated using a variation of deep Alex - Net neural network, SVM, K - Nearest - Neighbors, and their cascade pipelines to an automated filtering of semi - straight chromosome. The classification results have highly improved over 90% for the chromosomes with more common defections and translocations. Furthermore, a compara tive analysis over the proposed thresholding metrics has been conducted and the best metric is bolded with its salient characteristics. The high Precision results provided for a very low - quality G - banding database verifies suitability of the proposed metri cs and pruning method for Karyotyping facilities in poor countries and low - budget pathological laboratories. Keywords: G - banded Karyotyping, Precision, Reliability metrics, Pattern Recognition, Medical Imaging 1 Introduction One of the ways to study and dia gnose birth - defects and biological disorders is through using Cytogenetics. This branch of science endeavors to analyze chromosome shapes and patterns to find out common defects. The methods used for such analyzes includes G - Banding, Fluorescent In - Situ Hy bridization (FISH), Comparative Genomic Hybridization (CGH) and Chromosome - specific unique - sequence probes [27] . While Molecular Cytogenetics methods are effective in biological disorders, they do not necessarily manifest specific chromosome defects. FISH methods, though having higher accuracy results in stains, are costly and unable to identify all chromosome abnorm alities. Being temporary in sustaining fluorescence detector, they demand higher provision effort and substance supply that might not be affordable for some countries . Furthermore, detecting some abnormalities implies having G - banding technique involved an d not merely using stains.


A MatNet variants

Neural Information Processing Systems

A.1 Multiple data matrices A combinatorial optimization problem can be presented with multiple ( f) relationship features between two groups of items. In FFSP, for example, a production cost could be different for each process that one has to take into account for scheduling in addition to the processing time for each pair of the job and the machine. "Trainable element-wise function" block in Figure A.1 is now an MLP with f + 1 input nodes and 1 output node. The bold line indicates the change from the original. A.3 Alternatives to one-hot initial node embeddings For initial node representations of nodes in group B, one only needs mutually distinct-enough N We have chosen to present our model with one-hot vectors because it is the simplest to implement this way.


Scientists reveal the surprising reason why women live longer than men

Daily Mail - Science & tech

Shroud of Turin mystery deepens as surgeon spots hidden detail that points to Jesus' resurrection I was so happy after trying a trendy new cosmetic procedure. But 10 years later I suffered a devastating side effect... the doctor had lied I'm no longer sleeping with my husband - and never will again, says MOLLY RYDDELL. I love him, but counted down the moments until he climaxed. Then I couldn't bear it any more and the truth spilled out... so many women feel the same The'middle-class kinks' saving marriages: Wives reveal the eight buzzy sex trends that revived their lagging libidos - including the fantasy husbands are secretly obsessed with Lori Loughlin's husband Mossimo Giannulli seen with mystery brunette in tiny skirt day after shock split I'm a woman with autism... here are the signs you might be masking, even from yourself Cake-faced 90s sitcom star looks unrecognizable as she ditches the heavy eyeshadow for an LA errand run can you guess who? Trump dollar coin design released by Treasury... and it's inspired by the most iconic political photo of the century I've loved Taylor Swift for years. Mystery deepens over Hulk Hogan's death as his widow faces fresh anguish Body count from Houston's bayous rises as serial killer whispers grip city and residents are told: 'Be vigilant' Prison chief reveals exactly where Diddy could end up... and the one horrifying jail he MUST avoid READ MORE: Women could have'virgin births' without men, experts say It's a fact that has been known for more than two centuries - wherever you are around the world, women on average live longer than men.


Women really do live longer than men. Here's why.

Popular Science

Environment Animals Wildlife Women really do live longer than men. Female mammals live 12 percent longer than males, on average. Breakthroughs, discoveries, and DIY tips sent every weekday. It's fairly obvious that women live longer than men on average. This pattern holds true across most countries and historical time periods.

  Country:
  Genre: Research Report > New Finding (1.00)
  Industry: Health & Medicine (1.00)