Collaborating Authors


Insights Discovery in Data Science Through Novel Machine Learning Approaches


I have always appreciated the unusual, unexpected, and surprising in science and in data. As famous science author Arthur C. Clarke once said, "The most exciting phrase to hear in science, the one that heralds new discoveries, is not'Eureka!' (I found it) but'That's funny!'" This is the primary reason that I motivated most of the doctoral students that I mentored at GMU to work on some variation of Novelty Discovery (or Surprise Discovery) for their Ph.D. dissertations. "Surprise discovery" for me is a much more positive, exciting phrase than "outlier detection" or "anomaly detection", and it is much richer in meaning, in algorithms, and in new opportunities. Finding the surprising unexpected thing in your data is what inspires our exclamation "That's funny!" that may be signaling a great discovery (either about your data's quality, or about your data pipeline's deficiencies, or about some wholly new scientific concept). As famous astronomer, Vera Rubin said, "Science progresses best when observations force us to alter our preconceptions."

Udacity Machine Learning vs. Simplilearn Machine Learning - for your ML Career


You will receive 58 hours of applied instructor-led training. To earn the certification, you should attend a full batch of online training and submit a completed project for the flexi-pass learners or complete at least 85% of the course and submit one completed project for the self-paced learners. The machine learning certification course by Simplilearn is designed for learners with intermediate-level machine learning knowledge and skills in various roles, including business analysis, data analysis, information architecture, data science, machine learning, and others. To take this course, you need a college-level understanding of statistics and mathematics as well as Python programming knowledge. Simplilearn offers a blended learning approach that gives learners access to both live instructor-led training and recorded-videos.

Linear Regression and Logistic Regression using R Studio


In this section we will learn - What does Machine Learning mean. What are the meanings or different terms associated with machine learning? You will see some examples so that you understand what machine learning actually is. It also contains steps involved in building a machine learning model, not just linear models, any machine learning model.

Slow-Reading is The New Deep Learning


I was just a youth when Evelyn Wood debuted her speed-reading course back in 1959. For years, I was fascinated with the prospect of getting my reading assignments over with as quickly as possible so that I could get on to the fun part of life. Fortunately, I massively turned that around. The Evelyn Wood Reading Dynamics course became a huge sensation. So much so that the Kennedy White House sent staff members to take the course.

Classification with Localization: Convert any Keras Classifier to a Detector


Image classification is used to solve several Computer Vision problems; right from medical diagnoses, to surveillance systems, on to monitoring agricultural farms. There are innumerable possibilities to explore using Image Classification. If you have completed the basic courses on Computer Vision, you are familiar with the tasks and routines involved in Image Classification tasks. Image Classification tasks follow a standard flow – where you pass an image to a deep learning model and it outcomes the class or the label of the object present. While learning Computer Vision, most often a project that would be equivalent to your first hello world project, will most likely be an image classifier. You attempt to solve something like the digit recognition on MNIST Digits dataset or maybe the Cats and Dog Classification problem.

The Use of AI for Accessible Education


Many times AI has been put on a pedestal as the future of x y & z, however, many seem to agree that education is a sector in particular which will see stark changes in both admin, teaching styles, personalisation and more. I had the pleasure of speaking to three individuals working in the field, including, Vinod Bakthavachalam, Senior Data Scientist at Coursera, Kian Katanforoosh, Lecturer at Stanford University & Sergey Karayev, Co-Founder and CTO of Gradescope. We began by having Sergey of Gradescope walk us through his product, which has been recently acquired by turnitin. The concept, it seemed was formed from the simple and widespread issue of both lack of consistency, lack of insight through time constraint and delayed feedback on academic work. Sergey found that scanning the papers onto an online interface when paired with a rubric can allow for accurate marking in seconds across several papers.

One robot on Mars is robotics, ten robots are automation


The difference between robotics and automation is almost nonexistent and yet has a huge difference in everything from trade shows, marketing, publications to academic conferences and journals. This week, the difference was expressed as an opportunity in the Dear Colleague Letter below from Professor Ken Goldberg, CITRIS CPAR and UC Berkeley, who suggested that students whose papers were rejected from ICRA, revise them for CASE, the Conference on Automation Science and Engineering. This opportunity was expressed beautifully in the title quote from Professor Raja Chatila, ex President of IEEE Robotics and Automation Society and current President of IEEE Global Society on Ethics of Autonomous and Intelligent Systems. "One robot on Mars is robotics, ten robots on Mars is automation." Over 2000 papers were declined by ICRA today, including many that can be effectively revised for another conference such as IEEE CASE (deadline 15 March).

Kaskada data science automation platform aims to speed machine learning models into production - SiliconANGLE


More than a year after announcing plans to automate the feature engineering phase of artificial intelligence projects, Seattle-based startup Kaskada Inc. is bringing its first product to market. Kaskada says it aims to democratize feature engineering, an often laborious process that requires data scientists to select, clean and validate the data to be fed into machine learning training models prior to moving them into production. A model intended to predict housing prices, for example, would be feature engineered with predictor data such as the square footage of properties, number of bedrooms and location. The larger and more complete the training data set, the better the results. The resources required to collect data and move machine learning models into production can be so significant that the capabilities are out of reach of all but the largest companies.

Why universities will need to digitalise to survive


Why universities will need to digitalise to survive Dave Sherwood 27 February 2021 Universities, and the role they play in society, are under threat from the impact of the ongoing pandemic. While rarely a sector in financial crisis, university leaders in seven of the higher education systems in Europe now predict a fall in core national funding as a result of COVID-19, compounding the huge hits universities have taken on rental and commercial services and contractual research. Fourteen national university sectors in Europe have also predicted a fall in income from international students, with travel restrictions limiting student mobility. Estimates of losses to the United Kingdom university sector range from £3 billion (US4.2 billion) to £19 billion (US$26.7 billion) per year as a result of the coronavirus, while the picture is no less bleak across the pond. The University of Michigan alone anticipates losses of up to US$1 billion this year across its three campuses.

A Journey through XGBoost: Milestone 1


Welcome to another article series! This time, we are discussing XGBoost (Extreme Gradient Boosting) -- The leading and the most preferred machine learning algorithm among data scientists in the 21st century. Most people say XGBoost is a money-making algorithm because it easily outperforms any other algorithms, gives the best possible scores and helps its users to claim luxury cash prizes from data science competitions. The topic we are discussing is broad and important so that we discuss it through a series of articles. It is like a journey, maybe a long journey for newcomers.