
Collaborating Authors

Chattopadhyay


Identifying Heart Attack Risk in Vulnerable Population: A Machine Learning Approach

Chattopadhyay, Subhagata, Chattopadhyay, Amit K

arXiv.org Artificial Intelligence

The COVID-19 pandemic has significantly increased the incidence of post-infection cardiovascular events, particularly myocardial infarction, in individuals over 40. While the underlying mechanisms remain elusive, this study employs a hybrid machine learning approach to analyze epidemiological data and assess 13 key heart attack risk factors and the susceptibility they confer. Based on a unique dataset that combines demographic, biochemical, ECG, and thallium stress-test data, the study categorizes distinct subpopulations by risk profile and then divides the population into 'at-risk' (AR) and 'not-at-risk' (NAR) groups using clustering algorithms. The analysis reveals a strong association between the likelihood of experiencing a heart attack and the 13 risk factors studied. The elevated risk for postmenopausal patients points to individual risk factors compromised by estrogen depletion, which may be further aggravated by extraneous stress impacts such as anxiety and fear, aspects that have traditionally eluded data-driven prediction.
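The abstract describes splitting a population into 'at-risk' and 'not-at-risk' groups via clustering on risk-factor data. As a minimal sketch of that step, the toy example below runs a two-cluster k-means on synthetic, well-separated risk-factor vectors; the data, the initialization, and the choice of plain k-means are all illustrative assumptions, not the paper's actual pipeline.

```python
import numpy as np

# Synthetic stand-in data: two populations described by 13 standardized
# risk-factor scores (the paper's real features and dataset differ).
rng = np.random.default_rng(0)
low_risk = rng.normal(loc=-1.0, scale=0.5, size=(50, 13))
high_risk = rng.normal(loc=1.0, scale=0.5, size=(50, 13))
X = np.vstack([low_risk, high_risk])

def kmeans(X, init_idx, iters=20):
    """Minimal k-means: assign each point to its nearest centroid,
    then recompute centroids, for a fixed number of iterations."""
    centroids = X[list(init_idx)]
    for _ in range(iters):
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        centroids = np.array([X[labels == j].mean(axis=0)
                              for j in range(len(centroids))])
    return labels

# Deterministic initialization from one point of each synthetic group.
labels = kmeans(X, init_idx=(0, len(X) - 1))
```

With cleanly separated synthetic groups, the two clusters recover the simulated 'AR' and 'NAR' populations; on real epidemiological data, choosing features, scaling, and the clustering algorithm is the substantive modeling work.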


Evaluating LLMs and Pre-trained Models for Text Summarization Across Diverse Datasets

Rehman, Tohida, Ghosh, Soumabha, Das, Kuntal, Bhattacharjee, Souvik, Sanyal, Debarshi Kumar, Chattopadhyay, Samiran

arXiv.org Artificial Intelligence

Text summarization plays a crucial role in natural language processing by condensing large volumes of text into concise, coherent summaries. As digital content continues to grow rapidly and the demand for effective information retrieval increases, text summarization has become a focal point of research in recent years. This study offers a thorough evaluation of four leading pre-trained and open-source large language models (BART, FLAN-T5, LLaMA-3-8B, and Gemma-7B) across five diverse datasets: CNN/DM, Gigaword, News Summary, XSum, and BBC News. The evaluation employs widely recognized automatic metrics, including ROUGE-1, ROUGE-2, ROUGE-L, BERTScore, and METEOR, to assess the models' capabilities in generating coherent and informative summaries. The results reveal the comparative strengths and limitations of these models in processing various text types.
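To make the metrics concrete, here is a toy ROUGE-1 F1 computation: unigram overlap between a candidate summary and a reference. This is a simplified sketch for intuition only; real evaluations such as this study's use official implementations with tokenization, stemming, and multi-reference handling.

```python
from collections import Counter

def rouge1_f(candidate: str, reference: str) -> float:
    """Toy ROUGE-1 F1: harmonic mean of unigram precision and recall.
    Overlap counts are clipped by the reference counts (Counter &)."""
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    overlap = sum((cand & ref).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)

score = rouge1_f("the cat sat on the mat", "the cat lay on the mat")
# 5 of 6 unigrams overlap, so precision = recall = 5/6
```

ROUGE-2 and ROUGE-L follow the same pattern over bigrams and longest common subsequences, while BERTScore replaces exact token matching with contextual-embedding similarity.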


Quality check of a sample partition using multinomial distribution

Modak, Soumita

arXiv.org Machine Learning

In this paper, we propose a novel measure for checking the quality of a cluster partition that divides a sample into several distinct classes, and thereby for determining the true but unknown number of clusters in the data. Our approach applies the multinomial distribution to the distances between data members clustered in a group and their respective cluster representatives. This procedure is carried out independently for each cluster, and the resulting statistics are combined to form the proposed measure. Each cluster separately possesses category-wise probabilities corresponding to the different positions of its members relative to a typical member, the cluster representative, taken as the centroid, medoid, or mode. The method is robust in the sense of being distribution-free: it is devised irrespective of the parent distribution of the underlying sample. It also has a quality rare among existing cluster accuracy measures: the ability to investigate whether the sample possesses any inherent clusters at all, beyond a single group containing all members. Its simple concept, easy algorithm, fast runtime, good performance, and wide applicability, demonstrated through extensive simulations and diverse case studies, make the measure appealing.
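One possible reading of the core idea, sketched below purely for illustration: bin each member's distance to its cluster representative into categories, and score the cluster by the multinomial log-likelihood of the observed category counts. The number of bins, the quantile-based binning, and the log-likelihood combination are all assumptions made for this sketch; the paper's actual statistic is more involved.

```python
import numpy as np

def cluster_multinomial_loglik(points, representative, n_bins=3):
    """Illustrative multinomial scoring of one cluster: categorize
    member-to-representative distances into quantile bins, estimate
    category probabilities from the counts, and return the multinomial
    log-likelihood of those counts (closer to 0 = more even spread)."""
    d = np.linalg.norm(points - representative, axis=1)
    edges = np.quantile(d, np.linspace(0, 1, n_bins + 1))
    counts = np.histogram(d, bins=edges)[0]
    probs = counts / counts.sum()
    nonzero = counts > 0
    return float((counts[nonzero] * np.log(probs[nonzero])).sum())

rng = np.random.default_rng(1)
pts = rng.normal(size=(30, 2))
score = cluster_multinomial_loglik(pts, pts.mean(axis=0))
```

Per the abstract, such a statistic would be computed independently per cluster (with the centroid, medoid, or mode as representative) and combined across clusters into a single quality measure; being built only on distances, it makes no assumption about the sample's parent distribution.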


A posteriori learning for quasi-geostrophic turbulence parametrization

Frezat, Hugo, Sommer, Julien Le, Fablet, Ronan, Balarac, Guillaume, Lguensat, Redouane

arXiv.org Artificial Intelligence

The use of machine learning to build subgrid parametrizations for climate models is receiving growing attention. State-of-the-art strategies address the problem as a supervised learning task, optimizing algorithms that predict subgrid fluxes from coarse-resolution model information. In practice, training data are generated from higher-resolution numerical simulations transformed to mimic coarse-resolution simulations. In essence, these strategies optimize subgrid parametrizations to meet so-called $\textit{a priori}$ criteria. But the actual purpose of a subgrid parametrization is good performance on $\textit{a posteriori}$ metrics, which involve computing entire model trajectories. In this paper, we focus on the representation of energy backscatter in two-dimensional quasi-geostrophic turbulence and compare parametrizations obtained with different learning strategies at fixed computational complexity. We show that strategies based on $\textit{a priori}$ criteria yield parametrizations that tend to be unstable in direct simulations, and we describe how subgrid parametrizations can instead be trained end-to-end to meet $\textit{a posteriori}$ criteria. End-to-end learning strategies yield parametrizations that outperform known empirical and data-driven schemes in performance, stability, and the ability to generalize to different flow configurations. These results support the relevance of differentiable programming paradigms for future climate models.
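The a priori / a posteriori distinction can be illustrated with a toy system (this is not the paper's quasi-geostrophic solver; the dynamics, the "true" subgrid term, and the learned stand-in below are invented for demonstration): an a priori loss compares the subgrid term itself at individual states, while an a posteriori loss compares entire simulated trajectories, where errors can accumulate or destabilize the run.

```python
import numpy as np

def step(x, subgrid, dt=0.1):
    """One explicit-Euler step of a toy damped system plus a subgrid term."""
    return x + dt * (-x + subgrid(x))

def rollout(x0, subgrid, n_steps):
    """Integrate a full trajectory with the given subgrid closure."""
    traj = [x0]
    for _ in range(n_steps):
        traj.append(step(traj[-1], subgrid))
    return np.array(traj)

true_subgrid = lambda x: 0.5 * np.sin(x)   # stand-in for high-res "truth"
learned = lambda x: 0.5 * x                # hypothetical learned closure

x0 = np.array([1.0])
# a priori: pointwise error of the predicted subgrid term at one state
a_priori = float(np.abs(true_subgrid(x0) - learned(x0)).mean())
# a posteriori: error accumulated over a whole model trajectory
ref = rollout(x0, true_subgrid, 50)
sim = rollout(x0, learned, 50)
a_posteriori = float(np.abs(ref - sim).mean())
```

Training end-to-end on the a posteriori loss requires differentiating through `rollout`, i.e. through the solver itself, which is exactly where the differentiable-programming paradigm mentioned above comes in.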


A new nonparametric interpoint distance-based measure for assessment of clustering

Modak, Soumita

arXiv.org Artificial Intelligence

A new interpoint distance-based measure is proposed to identify the optimal number of clusters present in a data set. Designed within a nonparametric framework, it is independent of the distribution of the given data. Because it relies on interpoint distances between data members, our cluster validity index applies to univariate and multivariate data measured on arbitrary scales, including observations in spaces where the number of study variables exceeds the sample size. The proposed criterion is compatible with any clustering algorithm and can be used to determine the unknown number of clusters or to assess the quality of the resulting clusters for a data set. Demonstrations on synthetic and real-life data establish its superiority over well-known clustering accuracy measures in the literature.
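The general idea behind interpoint-distance validity indices can be sketched as follows: a good partition keeps within-cluster pairwise distances small relative to between-cluster ones. The generic ratio index below is for illustration only; it is not the measure proposed in the paper.

```python
import numpy as np

def validity(X, labels):
    """Generic interpoint-distance index: mean between-cluster distance
    divided by mean within-cluster distance (larger = better partition).
    Needs only pairwise distances, so it works in any dimension."""
    D = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    same = labels[:, None] == labels[None, :]
    iu = np.triu_indices(len(X), k=1)          # each unordered pair once
    within = D[iu][same[iu]]
    between = D[iu][~same[iu]]
    return between.mean() / within.mean()

rng = np.random.default_rng(2)
X = np.vstack([rng.normal(0, 0.3, (20, 2)), rng.normal(3, 0.3, (20, 2))])
good = np.repeat([0, 1], 20)   # labels matching the true groups
bad = np.tile([0, 1], 20)      # arbitrary alternating labels
```

Scanning such an index over candidate partitions with k = 1, 2, 3, ... clusters is one way to pick the number of clusters, which is the use case the abstract describes for its own, more refined measure.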


Researchers use AI to predict crime, biased policing in major U.S. cities like L.A.

Los Angeles Times

For once, algorithms that predict crime might be used to uncover bias in policing, instead of reinforcing it. A group of social and data scientists developed a machine learning tool it hoped would better predict crime. The scientists say they succeeded, but their work also revealed inferior police protection in poorer neighborhoods in eight major U.S. cities, including Los Angeles. Instead of justifying more aggressive policing in those areas, however, the hope is the technology will lead to "changes in policy that result in more equitable, need-based resource allocation," including sending officials other than law enforcement to certain kinds of calls, according to a report published Thursday in the journal Nature Human Behaviour. The tool, developed by a team led by University of Chicago professor Ishanu Chattopadhyay, forecasts crime by spotting patterns amid vast amounts of public data on property crimes and crimes of violence, learning from the data as it goes.


Researchers are using AI to predict crime, again

#artificialintelligence

Scientists are looking for a way to predict crime using, you guessed it, artificial intelligence. There are loads of studies showing that using AI to predict crime produces consistently racist outcomes. For instance, one AI crime prediction model that the Chicago Police Department tried out in 2016 was meant to shed racist biases but had the opposite effect. It used a model to predict who might be most at risk of being involved in a shooting, yet 56% of Black men in the city aged 20 to 29 appeared on the list. Despite it all, scientists are still trying to use such tools to find out when, and where, crime might occur.


AI Algorithm Predicts Future Crimes One Week in Advance With 90% Accuracy

#artificialintelligence

Our model enables discovery of these connections." The new model isolates crime by looking at the time and spatial coordinates of discrete events and detecting patterns to predict future events. It divides the city into spatial tiles roughly 1,000 feet across and predicts crime within these areas instead of relying on traditional neighborhood or political boundaries, which are also subject to bias. The model performed just as well with data from seven other U.S. cities: Atlanta, Austin, Detroit, Los Angeles, Philadelphia, Portland, and San Francisco. "We demonstrate the importance of discovering city-specific patterns for the prediction of reported crime, which generates a fresh view on neighborhoods in the city, allows us to ask novel questions, and lets us evaluate police action in new ways," Evans said. Chattopadhyay is careful to note that the tool's accuracy does not mean that it should be used to direct law enforcement, with police departments using it to swarm neighborhoods proactively to prevent crime. Instead, it should be added to a toolbox of urban policies and policing strategies to address crime. "We created a digital twin of urban environments.
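The tiling step described above can be sketched in a few lines: events with spatial coordinates are binned into square tiles roughly 1,000 feet across, so counts can be modeled per tile rather than per neighborhood or political boundary. The event coordinates below are made up for illustration and bear no relation to the study's data.

```python
from collections import Counter

TILE_FT = 1000  # tile edge length in feet, per the description above

def tile_of(x_ft, y_ft):
    """Map an event's planar coordinates (in feet) to its (col, row) tile."""
    return (int(x_ft // TILE_FT), int(y_ft // TILE_FT))

# Hypothetical event locations; the first two fall in the same tile.
events = [(120, 80), (950, 40), (1100, 60), (1900, 2050)]
counts = Counter(tile_of(x, y) for x, y in events)
```

Per-tile event counts over time form the sequences of discrete events from which the model then learns its predictive patterns.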


AI predicts crime a week in advance with 90 per cent accuracy

New Scientist

An artificial intelligence can now predict the location and rate of crime across a city a week in advance with up to 90 per cent accuracy. Similar systems have been shown to perpetuate racist bias in policing, and the same could be true in this case, but the researchers who created this AI claim that it can also be used to expose those biases. Ishanu Chattopadhyay at the University of Chicago and his colleagues created an AI model that analysed historical crime data from Chicago, Illinois, from 2014 to the end of 2016, then predicted crime levels for the weeks that followed this training period. The model predicted the likelihood of certain crimes occurring across the city, which was divided into squares about 300 metres across, a week in advance with up to 90 per cent accuracy. It was also trained and tested on data for seven other major US cities, with a similar level of performance.


Chattopadhyay

AAAI Conferences

As AI continues to advance, human-AI teams are inevitable. However, progress in AI is routinely measured in isolation, without a human in the loop. It is crucial to benchmark progress in AI, not just in isolation, but also in terms of how it translates to helping humans perform certain tasks, i.e., the performance of human-AI teams. In this work, we design a cooperative game -- GuessWhich -- to measure human-AI team performance in the specific context of the AI being a visual conversational agent. GuessWhich involves live interaction between the human and the AI.