Goto

Collaborating Authors

 accuracy increase


CHIP: CHannel Independence-based Pruning for Compact Neural Networks

Neural Information Processing Systems

Filter pruning has been widely used for neural network compression because of its enabled practical acceleration. To date, most of the existing filter pruning works explore the importance of filters via using intra-channel information. In this paper, starting from an inter-channel perspective, we propose to perform efficient filter pruning using Channel Independence, a metric that measures the correlations among different feature maps. The less independent feature map is interpreted as containing less useful information$/$knowledge, and hence its corresponding filter can be pruned without affecting model capacity. We systematically investigate the quantification metric, measuring scheme and sensitiveness$/$reliability of channel independence in the context of filter pruning. Our evaluation results for different models on various datasets show the superior performance of our approach. Notably, on CIFAR-10 dataset our solution can bring $0.75\%$ and $0.94\%$ accuracy increase over baseline ResNet-56 and ResNet-110 models, respectively, and meanwhile the model size and FLOPs are reduced by $42.8\%$ and $47.4\%$ (for ResNet-56) and $48.3\%$ and $52.1\%$ (for ResNet-110), respectively. On ImageNet dataset, our approach can achieve $40.8\%$ and $44.8\%$ storage and computation reductions, respectively, with $0.15\%$ accuracy increase over the baseline ResNet-50 model.


Appendix for Evaluating Efficient Performance Estimators of Neural Architectures

Neural Information Processing Systems

As the train ing goes on, the pa ram e ter sizes of top - ranked ar chi tec tures by the OSEs be come larger . We visualize the Pareto frontiers discovered by OSEs in the right subplots of the two figures. The blue lines with square markers show the one-shot scores of the GT Pareto frontier, while the orange/green/red lines show the GT scores of the OS Pareto frontier.


Scaling Semantic Categories: Investigating the Impact on Vision Transformer Labeling Performance

arXiv.org Artificial Intelligence

This study explores the impact of scaling semantic categories on the image classification performance of vision transformers (ViTs). In this specific case, the CLIP server provided by Jina AI is used for experimentation. The research hypothesizes that as the number of ground truth and artificially introduced semantically equivalent categories increases, the labeling accuracy of ViTs improves until a theoretical maximum or limit is reached. A wide variety of image datasets were chosen to test this hypothesis. These datasets were processed through a custom function in Python designed to evaluate the model's accuracy, with adjustments being made to account for format differences between datasets. By exponentially introducing new redundant categories, the experiment assessed accuracy trends until they plateaued, decreased, or fluctuated inconsistently. The findings show that while semantic scaling initially increases model performance, the benefits diminish or reverse after surpassing a critical threshold, providing insight into the limitations and possible optimization of category labeling strategies for ViTs.


CHIP: CHannel Independence-based Pruning for Compact Neural Networks

Neural Information Processing Systems

Filter pruning has been widely used for neural network compression because of its enabled practical acceleration. To date, most of the existing filter pruning works explore the importance of filters via using intra-channel information. In this paper, starting from an inter-channel perspective, we propose to perform efficient filter pruning using Channel Independence, a metric that measures the correlations among different feature maps. The less independent feature map is interpreted as containing less useful information / knowledge, and hence its corresponding filter can be pruned without affecting model capacity. We systematically investigate the quantification metric, measuring scheme and sensitiveness / reliability of channel independence in the context of filter pruning.


A Novel Review of Stability Techniques for Improved Privacy-Preserving Machine Learning

arXiv.org Artificial Intelligence

Data, especially private data, has become increasingly valuable in the modern era. From hospital records to personal search histories, the increased collection and use of private data means that data analysis conducted on these data sets must protect sensitive information about individuals. Without this protection, a leak of sensitive information could easily have lasting consequences for an individual, even from seemingly innocuous data like a photo. The rise of machine learning has exacerbated these concerns even further due to its need for specific and abundant data to produce accurate predictions. This large amount of required data and machine learning models' tendency to memorize specific yet unnecessary information, such as specific IP addresses during text responses, makes private machine learning especially important[1].


A chaotic maps-based privacy-preserving distributed deep learning for incomplete and Non-IID datasets

arXiv.org Artificial Intelligence

Federated Learning is a machine learning approach that enables the training of a deep learning model among several participants with sensitive data that wish to share their own knowledge without compromising the privacy of their data. In this research, the authors employ a secured Federated Learning method with an additional layer of privacy and proposes a method for addressing the non-IID challenge. Moreover, differential privacy is compared with chaotic-based encryption as layer of privacy. The experimental approach assesses the performance of the federated deep learning model with differential privacy using both IID and non-IID data. In each experiment, the Federated Learning process improves the average performance metrics of the deep neural network, even in the case of non-IID data.


Lumos: Heterogeneity-aware Federated Graph Learning over Decentralized Devices

arXiv.org Artificial Intelligence

Graph neural networks (GNN) have been widely deployed in real-world networked applications and systems due to their capability to handle graph-structured data. However, the growing awareness of data privacy severely challenges the traditional centralized model training paradigm, where a server holds all the graph information. Federated learning is an emerging collaborative computing paradigm that allows model training without data centralization. Existing federated GNN studies mainly focus on systems where clients hold distinctive graphs or sub-graphs. The practical node-level federated situation, where each client is only aware of its direct neighbors, has yet to be studied. In this paper, we propose the first federated GNN framework called Lumos that supports supervised and unsupervised learning with feature and degree protection on node-level federated graphs. We first design a tree constructor to improve the representation capability given the limited structural information. We further present a Monte Carlo Markov Chain-based algorithm to mitigate the workload imbalance caused by degree heterogeneity with theoretically-guaranteed performance. Based on the constructed tree for each client, a decentralized tree-based GNN trainer is proposed to support versatile training. Extensive experiments demonstrate that Lumos outperforms the baseline with significantly higher accuracy and greatly reduced communication cost and training time.


6 Important Steps to build a Machine Learning System

#artificialintelligence

Creating a great machine learning system is an art. There are a lot of things to consider while building a great machine learning system. But often it happens that we as data scientists only worry about certain parts of the project. Most of the time that happens to be modeling, but in reality, the success or failure of a Machine Learning project depends on a lot of other factors. It is essential to understand what happens before training a model and after training the model and deploying it in production.