Goto

Collaborating Authors

 index 0


YOLO V2 Configuration file Explained!!

#artificialintelligence

The first step in NMS is to remove all the predicted bounding boxes that have a detection probability that is less than a given NMS threshold.


A New Shortest Path Algorithm using Lists

#artificialintelligence

In graph theory, the shortest path problem can be defined as a problem of finding a path between two vertices (or nodes) in a graph such that the sum of the weights of its constituent edges is minimized. A topic that is revered in the field of graph theory and has numerous practical applications including vehicular routing, network designs, etc., the shortest path problem has had several notable approaches to it, including Floyd-Warshall Algorithm, Bellman-Ford Algorithm, and the most renowned Dijkstra's Algorithm. When I came across the algorithm behind Dijkstra's method of solving the shortest path problem in a weighted directed graph (though it can be applied to undirected graphs as well), I wondered if I could come up with my own algorithm for solving the problem using one of Python's data structures: Lists. After over a month of tens of trial-and-error attempts, my peer Syed Abdul Azeem and I came up with an algorithm to solve the problem. The code will not be shared on account of confidentiality, however, you, the reader, might as well be able to figure out the code as we go ahead and explain our approach to a given problem.


Explainable Artificial Intelligence Based Fault Diagnosis and Insight Harvesting for Steel Plates Manufacturing

arXiv.org Artificial Intelligence

With the advent of Industry 4.0, Data Science and Explainable Artificial Intelligence (XAI) has received considerable intrest in recent literature. However, the entry threshold into XAI, in terms of computer coding and the requisite mathematical apparatus, is really high. For fault diagnosis of steel plates, this work reports on a methodology of incorporating XAI based insights into the Data Science process of development of high precision classifier. Using Synthetic Minority Oversampling Technique (SMOTE) and notion of medoids, insights from XAI tools viz. Ceteris Peribus profiles, Partial Dependence and Breakdown profiles have been harvested. Additionally, insights in the form of IF-THEN rules have also been extracted from an optimized Random Forest and Association Rule Mining. Incorporating all the insights into a single ensemble classifier, a 10 fold cross validated performance of 94% has been achieved. In sum total, this work makes three main contributions viz.: methodology based upon utilization of medoids and SMOTE, of gleaning insights and incorporating into model development process. Secondly the insights themselves are contribution, as they benefit the human experts of steel manufacturing industry, and thirdly a high precision fault diagnosis classifier has been developed.


How to Connect Model Input Data With Predictions for Machine Learning

#artificialintelligence

Fitting a model to a training dataset is so easy today with libraries like scikit-learn. A model can be fit and evaluated on a dataset in just a few lines of code. It is so easy that it has become a problem. The same few lines of code are repeated again and again and it may not be obvious how to actually use the model to make a prediction. Or, if a prediction is made, how to relate the predicted values to the actual input values.


ShuffleNASNets: Efficient CNN models through modified Efficient Neural Architecture Search

arXiv.org Machine Learning

Abstract--Neural network architectures found by sophistic search algorithms achieve strikingly good test performance, surpassing most human-crafted network models by significant margins. Although computationally efficient, their design is often very complex, impairing execution speed. Additionally, finding models outside of the search space is not possible by design. While our space is still limited, we implement undiscoverable expert knowledge into the economic search algorithm Efficient Neural Architecture Search (ENAS), guided by the design principles and architecture of ShuffleNet V2. While maintaining baselinelike 2.85%test error on CIFAR-10, our ShuffleNASNets are significantly less complex, require fewer parameters, and are two times faster than the ENAS baseline in a classification task.


Derivation of Convolutional Neural Network from Fully Connected Network Step-By-Step

@machinelearnbot

In image analysis, convolutional neural networks (CNNs or ConvNets for short) are time and memory efficient than fully connected (FC) networks. What are the advantages of ConvNets over FC networks in image analysis? How is ConvNet derived from FC networks? Where the term convolution in CNNs came from? These questions are to be answered in this article.


Learning Graph-Level Representation for Drug Discovery

arXiv.org Machine Learning

Predicating macroscopic influences of drugs on human body, like efficacy and toxicity, is a central problem of small-molecule based drug discovery. Molecules can be represented as an undirected graph, and we can utilize graph convolution networks to predication molecular properties. However, graph convolutional networks and other graph neural networks all focus on learning node-level representation rather than graph-level representation. Previous works simply sum all feature vectors for all nodes in the graph to obtain the graph feature vector for drug predication. In this paper, we introduce a dummy super node that is connected with all nodes in the graph by a directed edge as the representation of the graph and modify the graph operation to help the dummy super node learn graph-level feature. Thus, we can handle graph-level classification and regression in the same way as node-level classification and regression. In addition, we apply focal loss to address class imbalance in drug datasets. The experiments on MoleculeNet show that our method can effectively improve the performance of molecular properties predication.


Bank distress in the news: Describing events through deep learning

arXiv.org Artificial Intelligence

While many models are purposed for detecting the occurrence of significant events in financial systems, the task of providing qualitative detail on the developments is not usually as well automated. We present a deep learning approach for detecting relevant discussion in text and extracting natural language descriptions of events. Supervised by only a small set of event information, comprising entity names and dates, the model is leveraged by unsupervised learning of semantic vector representations on extensive text data. We demonstrate applicability to the study of financial risk based on news (6.6M articles), particularly bank distress and government interventions (243 events), where indices can signal the level of bank-stress-related reporting at the entity level, or aggregated at national or European level, while being coupled with explanations. Thus, we exemplify how text, as timely, widely available and descriptive data, can serve as a useful complementary source of information for financial and systemic risk analytics.