AITopics | fashion mnist dataset

Collaborating Authors

fashion mnist dataset

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

ac450d10e166657ec8f93a1b65ca1b14-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-9-2026, 19:17:44 GMT

algorithm, convergence rate, gradient, (13 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.74)

Add feedback

ac450d10e166657ec8f93a1b65ca1b14-AuthorFeedback.pdf

Neural Information Processing SystemsAug-15-2025, 18:13:04 GMT

algorithm, convergence rate, gradient, (13 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.74)

Add feedback

Selective Feature Re-Encoded Quantum Convolutional Neural Network with Joint Optimization for Image Classification

Sarkar, Shaswata Mahernob, Ahmed, Sheikh Iftekhar, Mahmud, Jishnu, Fattah, Shaikh Anowarul, Sharma, Gaurav

arXiv.org Artificial IntelligenceJul-4-2025

Quantum Machine Learning (QML) has seen significant advancements, driven by recent improvements in Noisy Intermediate-Scale Quantum (NISQ) devices. Leveraging quantum principles such as entanglement and superposition, quantum convolutional neural networks (QCNNs) have demonstrated promising results in classifying both quantum and classical data. This study examines QCNNs in the context of image classification and proposes a novel strategy to enhance feature processing and a QCNN architecture for improved classification accuracy. First, a selective feature re-encoding strategy is proposed, which directs the quantum circuits to prioritize the most informative features, thereby effectively navigating the crucial regions of the Hilbert space to find the optimal solution space. Secondly, a novel parallel-mode QCNN architecture is designed to simultaneously incorporate features extracted by two classical methods, Principal Component Analysis (PCA) and Autoencoders, within a unified training scheme. The joint optimization involved in the training process allows the QCNN to benefit from complementary feature representations, enabling better mutual readjustment of model parameters. To assess these methodologies, comprehensive experiments have been performed using the widely used MNIST and Fashion MNIST datasets for binary classification tasks. Experimental findings reveal that the selective feature re-encoding method significantly improves the quantum circuit's feature processing capability and performance. Furthermore, the jointly optimized parallel QCNN architecture consistently outperforms the individual QCNN models and the traditional ensemble approach involving independent learning followed by decision fusion, confirming its superior accuracy and generalization capabilities.

artificial intelligence, machine learning, qubit, (20 more...)

arXiv.org Artificial Intelligence

2507.02086

Country:

North America > United States (0.14)
Asia > Bangladesh > Dhaka Division > Dhaka District > Dhaka (0.04)

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Enhancing Federated Learning Through Secure Cluster-Weighted Client Aggregation

Ranaweera, Kanishka, Neiat, Azadeh Ghari, Liu, Xiao, Kashyap, Bipasha, Pathirana, Pubudu N.

arXiv.org Artificial IntelligenceMar-29-2025

Federated learning (FL) has emerged as a promising paradigm in machine learning, enabling collaborative model training across decentralized devices without the need for raw data sharing. In FL, a global model is trained iteratively on local datasets residing on individual devices, each contributing to the model's improvement. However, the heterogeneous nature of these local datasets, stemming from diverse user behaviours, device capabilities, and data distributions, poses a significant challenge. The inherent heterogeneity in federated learning gives rise to various issues, including model performance discrepancies, convergence challenges, and potential privacy concerns. As the global model progresses through rounds of training, the disparities in local data quality and quantity can impede the overall effectiveness of federated learning systems. Moreover, maintaining fairness and privacy across diverse user groups becomes a paramount concern. To address this issue, this paper introduces a novel FL framework, ClusterGuardFL, that employs dissimilarity scores, k-means clustering, and reconciliation confidence scores to dynamically assign weights to client updates. The dissimilarity scores between global and local models guide the formation of clusters, with cluster size influencing the weight allocation. Within each cluster, a reconciliation confidence score is calculated for individual data points, and a softmax layer generates customized weights for clients. These weights are utilized in the aggregation process, enhancing the model's robustness and privacy. Experimental results demonstrate the efficacy of the proposed approach in achieving improved model performance in diverse datasets.

artificial intelligence, clusterguardfl, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2503.22971

Country: Oceania > Australia > Queensland (0.04)

Genre: Research Report > New Finding (0.88)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.49)

Add feedback

Interpreting Adversarial Attacks and Defences using Architectures with Enhanced Interpretability

Rao, Akshay G, Lakshminarayanan, Chandrashekhar, Rajkumar, Arun

arXiv.org Artificial IntelligenceFeb-20-2025

Adversarial attacks in deep learning represent a significant threat to the integrity and reliability of machine learning models. Adversarial training has been a popular defence technique against these adversarial attacks. In this work, we capitalize on a network architecture, namely Deep Linearly Gated Networks (DLGN), which has better interpretation capabilities than regular deep network architectures. Using this architecture, we interpret robust models trained using PGD adversarial training and compare them with standard training. Feature networks in DLGN act as feature extractors, making them the only medium through which an adversary can attack the model. We analyze the feature network of DLGN with fully connected layers with respect to properties like alignment of the hyperplanes, hyperplane relation with PCA, and sub-network overlap among classes and compare these properties between robust and standard models. We also consider this architecture having CNN layers wherein we qualitatively (using visualizations) and quantitatively contrast gating patterns between robust and standard models. We uncover insights into hyperplanes resembling principal components in PGD-AT and STD-TR models, with PGD-AT hyperplanes aligned farther from the data points. We use path activity analysis to show that PGD-AT models create diverse, non-overlapping active subnetworks across classes, preventing attack-induced gating overlaps. Our visualization ideas show the nature of representations learnt by PGD-AT and STD-TR models.

hyperplane, mnist dataset, std-tr model, (13 more...)

arXiv.org Artificial Intelligence

2502.15017

Country: North America > United States (0.04)

Genre: Research Report > New Finding (0.46)

Industry:

Information Technology > Security & Privacy (0.91)
Government > Military (0.82)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

Add feedback

Gradient Correlation Subspace Learning against Catastrophic Forgetting

Dubnov, Tammuz, Thengane, Vishal

arXiv.org Artificial IntelligenceMar-4-2024

Efficient continual learning techniques have been a topic of significant research over the last few years. A fundamental problem with such learning is severe degradation of performance on previously learned tasks, known also as catastrophic forgetting. This paper introduces a novel method to reduce catastrophic forgetting in the context of incremental class learning called Gradient Correlation Subspace Learning (GCSL). The method detects a subspace of the weights that is least affected by previous tasks and projects the weights to train for the new task into said subspace. The method can be applied to one or more layers of a given network architectures and the size of the subspace used can be altered from layer to layer and task to task. Code will be available at https://github.com/vgthengane/GCSL Traditionally, as a neural network learns multi-class classification, the network learns to extract features indicative of the target labels and to perform the classification. When learning on all target labels simultaneously, networks are able to reach the highest accuracy presumably because they can learn the features relevant to all labels at the same time. If a network is trained on a set of labels and then at a later point trained on a different set of labels, the network often "catastrophically forgets" how to classify the original labels.

configuration, dataset, experiment, (13 more...)

arXiv.org Artificial Intelligence

2403.02334

Country:

Europe > Italy > Sardinia (0.04)
Asia > Middle East > Israel (0.04)
Asia > India (0.04)

Genre: Research Report > Promising Solution (0.48)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

How to Backdoor HyperNetwork in Personalized Federated Learning?

Lai, Phung, Phan, NhatHai, Khalil, Issa, Khreishah, Abdallah, Wu, Xintao

arXiv.org Artificial IntelligenceDec-11-2023

This paper explores previously unknown backdoor risks in HyperNet-based personalized federated learning (HyperNetFL) through poisoning attacks. Based upon that, we propose a novel model transferring attack (called HNTroj), i.e., the first of its kind, to transfer a local backdoor infected model to all legitimate and personalized local models, which are generated by the HyperNetFL model, through consistent and effective malicious local gradients computed across all compromised clients in the whole training process. As a result, HNTroj reduces the number of compromised clients needed to successfully launch the attack without any observable signs of sudden shifts or degradation regarding model utility on legitimate data samples making our attack stealthy. To defend against HNTroj, we adapted several backdoor-resistant FL training algorithms into HyperNetFL. An extensive experiment that is carried out using several benchmark datasets shows that HNTroj significantly outperforms data poisoning and model replacement attacks and bypasses robust training algorithms even with modest numbers of compromised clients.

gradient, hnt roj, hypernetfl, (14 more...)

arXiv.org Artificial Intelligence

2201.07063

Country:

North America > United States > New Jersey (0.04)
North America > United States > Arkansas (0.04)
Asia > Nepal (0.04)
Asia > Middle East > Qatar (0.04)

Genre: Research Report > Promising Solution (0.34)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

"Task-relevant autoencoding" enhances machine learning for human neuroscience

Orouji, Seyedmehdi, Taschereau-Dumouchel, Vincent, Cortese, Aurelio, Odegaard, Brian, Cushing, Cody, Cherkaoui, Mouslim, Kawato, Mitsuo, Lau, Hakwan, Peters, Megan A. K.

arXiv.org Artificial IntelligenceSep-22-2023

In human neuroscience, machine learning can help reveal lower-dimensional neural representations relevant to subjects' behavior. However, state-of-the-art models typically require large datasets to train, so are prone to overfitting on human neuroimaging data that often possess few samples but many input dimensions. Here, we capitalized on the fact that the features we seek in human neuroscience are precisely those relevant to subjects' behavior. We thus developed a Task-Relevant Autoencoder via Classifier Enhancement (TRACE), and tested its ability to extract behaviorally-relevant, separable representations compared to a standard autoencoder, a variational autoencoder, and principal component analysis for two severely truncated machine learning datasets. We then evaluated all models on fMRI data from 59 subjects who observed animals and objects. TRACE outperformed all models nearly unilaterally, showing up to 12% increased classification accuracy and up to 56% improvement in discovering "cleaner", task-relevant representations. These results showcase TRACE's potential for a wide variety of data related to human behavior.

bottleneck dimensionality, dataset, dimensionality, (13 more...)

arXiv.org Artificial Intelligence

2208.08478

Country:

North America > United States > California > Orange County > Irvine (0.28)
North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > Florida > Alachua County > Gainesville (0.14)
(5 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Health Care Technology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Explainable Convolutional Neural Networks with PyTorch + SHAP

#artificialintelligenceAug-16-2022, 19:56:38 GMT

Complex technologies such as deep learning used to be a kind of black-box model since you couldn't have a thorough idea of what was happening inside. However, tools like SHAP (SHapely Additive exPlanations) make it a thing of the past. With SHAP, you can easily interpret the predictions of deep learning models with minimal coding. CNNs aren't among the most straightforward concepts to understand. A network using mathematical calculations learns the kernels for images and detects the useful patterns to classify unseen images correctly.

explainable convolutional neural network, script src, shap, (11 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

A Gentle Introduction to tensorflow.data API

#artificialintelligenceJul-25-2022, 12:58:28 GMT

Before we see how the tf.data API works, let's review how we usually train a Keras model. First, we need a dataset. An example is the fashion MNIST dataset that comes with the Keras API, which we have 60,000 training samples and 10,000 test samples of 28 28 pixels in grayscale and the corresponding classification label is encoded with integers 0 to 9. The dataset is a NumPy array. Then we can build a Keras model for classification, and with the model's fit() function, we provide the NumPy array as data.

dataset, generator function, numpy array, (16 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.71)

Add feedback