More and more organizations are taking on the challenge of analyzing big data. This course teaches you how to use the Hadoop technologies in Microsoft Azure HDInsight to build batch processing solutions that cleanse and reshape data for analysis. In this five-week course, you'll learn how to use technologies like Hive, Pig, Oozie, and Sqoop with Hadoop in HDInsight; and how to work with HDInsight clusters from Windows, Linux, and Mac OSX client computers. Course Syllabus Module 1: Getting Started with HDInsight The course begins with an introduction to big data concepts and Hadoop, before examining Microsoft Azure HDInsight and the Hadoop distribution it provides. You'll learn how to provision an HDInsight cluster, upload data to the cluster, and perform Map/Reduce jobs that process the data.
Hadoop might be dead, but that's not stopping public cloud providers from using it. The latest to make a move is Microsoft Azure, which in July announced that it would begin developing its own distribution under its HDInsight brand. Microsoft, of course, has been providing Hadoop software on its Azure cloud for many years. It was an early partner of Hortonworks, and basically had an OEM version of the Hortonworks Data Platform (HDP) for the cloud that it called HDInsight. But Hortonworks merged with Cloudera in early 2019, and the HDP product line is no longer being developed, although it is still being supported by Cloudera, along with its legacy Hadoop distribution, Cloudera Distribution including Hadoop (CDH), until at least 2022.
Apache Spark enhancements are on the way, as are a whole host of new products. Microsoft's cloud-based big data offering HDInsight is being handed some new capabilities, but that's not the only thing that users will be looking forward to. The Azure hosted service, which is based off Apache Hadoop, HBase, Spark, Microsoft R Server and more, is having its pricing restructured across a number of product areas. First up is an up to 52% price reduction in Azure HDInsight, which will include the replacement of the Premium cluster tier with the Enterprise Security Package. Microsoft's bumper Christmas savings bonanza also includes an additional 80% price reduction for R Server for Azure HDInsight.
Household names such as Adobe, Jet, ASOS, Schneider Electric, and Milliman are amongst hundreds of enterprises that are powering their Big Data Analytics using Azure HDInsight. Azure HDInsight launched nearly six years ago and has since become the best place to run Apache Hadoop and Spark analytics on Azure. We will monitor the cluster and all the services, detect and repair common issues and respond to issues 24/7. Your big data applications can run more reliably as your HDInsight service monitors the health and automatically recovers from failures. Isolate your HDInsight cluster within VNETs and take advantage of transparent data encryption.