 Regression


The Most Intuitive and Easiest Guide for Artificial Neural Network

#artificialintelligence

Anyone living in 2019 has probably heard these words more than once. You have probably also seen impressive work in areas such as image classification, computer vision, and speech recognition. So are you interested in building those cool AI projects but still have no idea what an artificial neural network is? There are already hundreds of articles explaining the concept of the artificial neural network, with titles such as "a beginner's guide on back propagation in ANN" or "A gentle introduction of the artificial neural network." They are already great, but I found they can still be hard to follow for someone who is not comfortable with mathematical expressions.



Machine Learning - Predictions with ordinal logistic regression - Michael Fuchs

#artificialintelligence

Now, let's look at the fit on the training data and the corresponding confusion matrix. Our model performs only marginally better on the training data than our baseline model. We can see why this is the case: it predicts the average class (1) very often. Now we want to try this again with the test set. As you can see, we get a pretty much identical situation.
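The comparison described above can be sketched in a few lines of plain Python. This is only an illustration: the labels and predictions below are made up, and the baseline is assumed to be a majority-class predictor, which the excerpt does not confirm.

```python
from collections import Counter


def confusion_matrix(y_true, y_pred, labels):
    """Rows are true classes, columns are predicted classes."""
    index = {label: i for i, label in enumerate(labels)}
    matrix = [[0] * len(labels) for _ in labels]
    for t, p in zip(y_true, y_pred):
        matrix[index[t]][index[p]] += 1
    return matrix


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical ordinal classes 0 < 1 < 2 (not the article's data).
y_true = [0, 1, 1, 2, 1, 0, 2, 1]
y_model = [1, 1, 1, 1, 1, 0, 2, 1]  # a model that favours the middle class

# Assumed baseline: always predict the most frequent class.
majority = Counter(y_true).most_common(1)[0][0]
y_baseline = [majority] * len(y_true)

print("baseline accuracy:", accuracy(y_true, y_baseline))
print("model accuracy:   ", accuracy(y_true, y_model))
for row in confusion_matrix(y_true, y_model, labels=[0, 1, 2]):
    print(row)
```

A middle column that dominates the confusion matrix is the pattern the text describes: most observations are assigned to class 1 regardless of their true class.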


Nonparametric Inverse Dynamic Models for Multimodal Interactive Robots

arXiv.org Artificial Intelligence

Direct design of a robot's rendered dynamics, such as in impedance control, is now a well-established control mode in uncertain environments. When the physical interaction port variables are not measured directly, dynamic and kinematic models are required to relate the measured variables to the interaction port variables. A typical example is serial manipulators with joint torque sensors, where the interaction occurs at the end-effector. As interactive robots perform increasingly complex tasks, they will be intermittently coupled with additional dynamic elements such as tools, grippers, or workpieces, some of which should be compensated and brought to the robot side of the interaction port, making the inverse dynamics multimodal. Furthermore, there may also be unavoidable and unmeasured external input when the desired system cannot be totally isolated. Towards semi-autonomous robots capable of handling such applications, a multimodal Gaussian process regression approach to manipulator dynamic modelling is developed. A sampling-based approach clusters different dynamic modes from unlabelled data, also allowing the separation of perturbed data with significant, irregular external input. The passivity of the overall approach is shown analytically, and experiments examine the performance and safety of this approach on a test actuator.


ORIGAMI: A Heterogeneous Split Architecture for In-Memory Acceleration of Learning

arXiv.org Machine Learning

The memory bandwidth bottleneck is a major challenge in processing machine learning (ML) algorithms. In-memory acceleration has the potential to address this problem; however, it needs to address two challenges. First, an in-memory accelerator should be general enough to support a large set of different ML algorithms. Second, it should be efficient enough to utilize bandwidth while meeting the limited power and area budgets of the logic layer of a 3D-stacked memory. We observe that previous work fails to simultaneously address both challenges. We propose ORIGAMI, a heterogeneous set of in-memory accelerators that supports the compute demands of different ML algorithms and also uses an off-the-shelf compute platform (e.g., FPGA, GPU, TPU) to utilize bandwidth without violating strict area and power budgets. ORIGAMI offers a pattern-matching technique to identify similar computation patterns of ML algorithms and extracts a compute engine for each pattern. These compute engines constitute heterogeneous accelerators integrated on the logic layer of a 3D-stacked memory. A combination of these compute engines can execute any type of ML algorithm. To utilize available bandwidth without violating the area and power budgets of the logic layer, ORIGAMI comes with a computation-splitting compiler that divides an ML algorithm between in-memory accelerators and an out-of-the-memory platform in a balanced way and with minimum inter-communication. The combination of pattern matching and split execution offers a new design point for acceleration of ML algorithms. Evaluation results across 12 popular ML algorithms show that ORIGAMI outperforms the state-of-the-art accelerator with 3D-stacked memory in terms of performance and energy-delay product (EDP) by 1.5x and 29x (up to 1.6x and 31x), respectively. Furthermore, results are within a 1% margin of an ideal system that has unlimited compute resources on the logic layer of a 3D-stacked memory.


Development of Mobile-Interfaced Machine Learning-Based Predictive Models for Improving Students Performance in Programming Courses

arXiv.org Machine Learning

Student performance modelling (SPM) is a critical step in assessing and improving students' performance in their learning discourse. However, most existing SPMs are based on statistical approaches, which on the one hand are based on probability, meaning that results are estimates; and on the other hand, the actual influences of hidden factors that are peculiar to students, lecturers, the learning environment and the family, together with their overall effect on student performance, have not been exhaustively investigated. In this paper, Student Performance Models (SPM) for improving students' performance in programming courses were developed using M5P Decision Tree (MDT) and Linear Regression Classifier (LRC). The data used were gathered using a structured questionnaire from 295 students in 200 and 300 levels of study who offered Web programming, C or JAVA at Federal University, Oye-Ekiti, Nigeria between 2012 and 2016. Hidden factors that are significant to students' performance in programming were identified. The relevant data were gathered, normalized, coded and prepared as variable and factor datasets, and fed into the MDT algorithm and LRC to develop the predictive models. The evaluation results indicate that the variable-based LRC produced the best model in terms of MAE, RMSE, RAE and RRSE, having yielded the lowest values in all the evaluations conducted. Further results established the strong significance of the attitude of students and lecturers, fearful perception of students, erratic power supply, university facilities, student health and student attendance to the performance of students in programming courses. The variable-based LRC model presented in this paper could provide baseline information about students' performance, thereby supporting better decision making towards improving teaching and learning outcomes in programming courses.
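For readers unfamiliar with the four error measures named in the abstract, the following sketch computes them with their usual definitions: the relative measures (RAE, RRSE) divide the model's error by the error of always predicting the mean of the actual values. The scores used here are invented for illustration and are not taken from the paper.

```python
import math


def regression_errors(actual, predicted):
    """Return MAE, RMSE, RAE and RRSE for paired actual/predicted values."""
    n = len(actual)
    mean_actual = sum(actual) / n
    abs_err = sum(abs(p - a) for a, p in zip(actual, predicted))
    sq_err = sum((p - a) ** 2 for a, p in zip(actual, predicted))
    # Errors of the trivial predictor that always outputs the mean.
    abs_dev = sum(abs(a - mean_actual) for a in actual)
    sq_dev = sum((a - mean_actual) ** 2 for a in actual)
    return {
        "MAE": abs_err / n,
        "RMSE": math.sqrt(sq_err / n),
        "RAE": abs_err / abs_dev,
        "RRSE": math.sqrt(sq_err / sq_dev),
    }


# Hypothetical course scores and model predictions.
actual = [50.0, 60.0, 70.0, 80.0]
predicted = [55.0, 60.0, 65.0, 80.0]

errors = regression_errors(actual, predicted)
for name, value in errors.items():
    print(f"{name}: {value:.4f}")
```

Lower is better for all four; RAE or RRSE values below 1 mean the model beats the mean predictor.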


Optimizing Software Effort Estimation Models Using Firefly Algorithm

arXiv.org Artificial Intelligence

Software development effort estimation is considered a fundamental task in the software development life cycle as well as for managing project cost, time and quality. Therefore, accurate estimation is a substantial factor in project success and in reducing risks. In recent years, software effort estimation has received a considerable amount of attention from researchers and has become a challenge for the software industry. In the last two decades, many researchers and practitioners have proposed statistical and machine learning-based models for software effort estimation. In this work, the Firefly Algorithm is proposed as a metaheuristic optimization method for optimizing the parameters of three COCOMO-based models. These models include the basic COCOMO model and two other models proposed in the literature as extensions of the basic COCOMO model. The developed estimation models are evaluated using different evaluation metrics. Experimental results show high accuracy and significant error minimization of the Firefly Algorithm over other metaheuristic optimization algorithms, including Genetic Algorithms and Particle Swarm Optimization.
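As background, the basic COCOMO model estimates effort in person-months as a * KLOC^b, and tuning a and b is the kind of parameter optimization the abstract refers to. The sketch below shows the model and one possible error objective. The project data are hypothetical, MMRE is assumed here as the metric (the abstract does not name its metrics), and the Firefly Algorithm itself is not implemented.

```python
# Standard basic COCOMO coefficients (a, b) per project class.
BASIC_COCOMO = {
    "organic": (2.4, 1.05),
    "semi-detached": (3.0, 1.12),
    "embedded": (3.6, 1.20),
}


def cocomo_effort(kloc, a, b):
    """Basic COCOMO effort in person-months for a size in thousands of lines."""
    return a * kloc ** b


def mmre(projects, a, b):
    """Mean magnitude of relative error over (kloc, actual_effort) pairs.

    Assumed objective: a metaheuristic optimizer would search for the
    (a, b) pair that minimizes this value.
    """
    total = sum(
        abs(actual - cocomo_effort(kloc, a, b)) / actual
        for kloc, actual in projects
    )
    return total / len(projects)


# Hypothetical historical projects: (size in KLOC, actual person-months).
projects = [(10.0, 30.0), (50.0, 160.0), (100.0, 310.0)]

a, b = BASIC_COCOMO["organic"]
print("effort for 10 KLOC:", round(cocomo_effort(10.0, a, b), 2))
print("MMRE with default coefficients:", round(mmre(projects, a, b), 4))
```

Any optimizer, whether Firefly, Genetic Algorithms or Particle Swarm Optimization, can be plugged in by treating `mmre(projects, a, b)` as the function to minimize over `a` and `b`.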


Data Masking with Privacy Guarantees

arXiv.org Machine Learning

We study the problem of data release with privacy, where data is made available with privacy guarantees while keeping the usability of the data as high as possible; this is important in health care and other domains with sensitive data. In particular, we propose a method of masking the private data with a privacy guarantee while ensuring that a classifier trained on the masked data is similar to the classifier trained on the original data, to maintain usability. We analyze the theoretical risks of the proposed method and the traditional input perturbation method. Results show that the proposed method achieves lower risk than input perturbation, especially when the number of training samples gets large. We illustrate the effectiveness of the proposed method of data masking for privacy-sensitive learning on 12 benchmark datasets.


Location-Centered House Price Prediction: A Multi-Task Learning Approach

arXiv.org Machine Learning

Accurate house price prediction is of great significance to various real estate stakeholders such as house owners, buyers, investors, and agents. We propose a location-centered prediction framework that differs from existing work in terms of data profiling and prediction model. Regarding data profiling, we define and capture a fine-grained location profile powered by a diverse range of location data sources, such as a transportation profile (e.g., distance to nearest train station), an education profile (e.g., school zones and ranking), a suburb profile based on census data, and a facility profile (e.g., nearby hospitals, supermarkets). Regarding the choice of prediction model, we observe that a variety of approaches either consider the entire house data for modeling, or split the entire data and model each partition independently. However, such modeling ignores the relatedness between partitions, and for all prediction scenarios, there may not be sufficient training samples per partition for the latter approach. We address this problem by conducting a careful study of exploiting the Multi-Task Learning (MTL) model. Specifically, we map the strategies for splitting the entire house data to the ways the tasks are defined in MTL, and each partition obtained is aligned with a task. Furthermore, we select specific MTL-based methods with different regularization terms to capture and exploit the relatedness between tasks. Based on real-world house transaction data collected in Melbourne, Australia, we design extensive experimental evaluations, and the results indicate a significant superiority of MTL-based methods over state-of-the-art approaches. Meanwhile, we conduct an in-depth analysis of the impact of task definitions and method selections in MTL on the prediction performance, and demonstrate that the impact of task definitions on prediction performance far exceeds that of method selections.


45 Best Data Science Certification for Data Scientists JA Directives

#artificialintelligence

Are you looking for the best online data science degree? This list of online data science courses will help you become a top data scientist. Data science, or data-driven science, is one of today's fastest-growing fields. Do you want to become a data scientist in 2019? The list of data science degrees will give you a clear idea of the field, from the definition of data science to expert level. If you don't know how to get a data scientist certification, these online data science certificate programs will help you get one. You will be able to get a Microsoft data science certification or even a Harvard data science certificate with this collection of online courses. This data science training will also give you an idea about data science, Python, big data, analytics, machine learning, deep learning and artificial intelligence (AI), which are among the most booming topics now. You can be a data science master in a short period of time. Big companies, publishers, advertisers, and other industries are now highly dependent on data science or machine learning. So it is high time to learn some data science skills, for example by earning an in-demand online data science certification. How does it work at present, and why are data science careers and jobs in a top position? If you like a trendy career, you have the opportunity right now to get hired by big industry players. At the same time, online entrepreneurs and business people also need to update themselves with fundamental machine learning skills to compete in a fast-moving industry. Below are a few of the best online data science courses that might help you jump-start your knowledge of the data science sector.
The listing of the best online data science tutorials and programs displays the 'Best Course,' 'Product Description,' 'Rating,' 'Students Enrolled,' and 'Product's Image,' as well as an Enroll button to purchase the courses from the respective learning platforms for your convenience. Description: If you want to become a successful data scientist, then you should take this course. Just learning statistics, data visualization and data wrangling is not enough. You also need to know how to ask the right questions and tell the right story from your data. Description: If you want to learn machine learning, then this is the perfect course for you. Two professional data scientists designed this course so that you can learn the theory and algorithms behind machine learning. If you only learn the coding libraries, you will not know what is actually going on in the back end, and you will not be able to perform well in industry. That is why this is a very good course for getting started in the machine learning world. The course also includes study materials about coding libraries. The two data scientists walk you through the course step by step.