AITopics | hdinsight

hdinsight

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Sentiment analysis and face recognition - Azure Example Scenarios

#artificialintelligence Feb-10-2023, 21:30:45 GMT

This article presents a solution for gauging public opinion in tweets. The goal is to create a transformation pipeline that outputs clusters of comments and trending subjects. Apache, Apache NiFi, Apache Hadoop, Apache Hive, and Apache Airflow are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. No endorsement by The Apache Software Foundation is implied by the use of these marks. The Twitter ingestion pipeline consists of four stages.

hdinsight, information, sentiment analysis and face recognition, (11 more...)

Country: North America > United States (0.25)

Industry:

Law > Intellectual Property & Technology Law (0.76)
Information Technology > Security & Privacy (0.50)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.92)
Information Technology > Data Science > Data Mining > Big Data (0.92)

Machine Learning and BIG Data Analytics on Microsoft AZURE

#artificialintelligence Feb-28-2020, 18:42:26 GMT

This course is all about learning the cloud analytics and machine learning options available on the Microsoft Azure cloud platform. We will create resources for Stream Analytics, Spark, and HDInsight and explore their options. We will cover each analytics service with some use cases. Machine learning and cloud computing are trending domains with many job opportunities; if you are interested in both machine learning and cloud computing, then this course is for you. It will let you deploy your machine learning skills in the cloud.

analytic, learning and big data analytic, machine learning, (4 more...)

Industry: Information Technology > Services (1.00)

Technology:

Information Technology > Cloud Computing (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.85)

What is Azure Databricks?

@machinelearnbot Nov-21-2017, 08:45:17 GMT

Azure Databricks (documentation and user guide) was announced at Microsoft Connect, and with this post I'll try to explain its use case. At a high level, think of it as a tool for curating and processing massive amounts of data, for developing, training, and deploying models on that data, and for managing the whole workflow throughout the project. It is for those who are comfortable with Apache Spark, as it is 100% based on Spark and is extensible with support for Scala, Java, R, and Python alongside Spark SQL, GraphX, Streaming, and the Machine Learning Library (MLlib). It has built-in integration with Azure Blob Storage, Azure Data Lake Storage (ADLS), Azure SQL Data Warehouse (SQL DW), Cosmos DB, Azure Event Hub, Apache Kafka for HDInsight, and Power BI (see Spark Data Sources). Think of it as an alternative to HDInsight (HDI) and Azure Data Lake Analytics (ADLA).

artificial intelligence, azure databrick, machine learning, (8 more...)

Technology:

Information Technology > Data Science (0.82)
Information Technology > Information Management (0.59)
Information Technology > Artificial Intelligence > Machine Learning (0.59)

Analyze Twitter data with Apache Hive - Azure HDInsight

@machinelearnbot Nov-20-2017, 21:00:09 GMT

Learn how to use Apache Hive to process Twitter data. The result is a list of Twitter users who sent the most tweets that contain a certain word. The steps in this document were tested on HDInsight 3.6. Linux is the only operating system used on HDInsight version 3.4 or greater. For more information, see HDInsight retirement on Windows.
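The kind of result described here, a ranking of the users who sent the most tweets containing a given word, can be sketched in a few lines of Python. This is an illustration only and not the HiveQL from the article: the `top_users` function, the `user` and `text` field names, and the sample tweets are all assumptions made for the example.

```python
from collections import Counter


def top_users(tweets, word, n=3):
    """Return the n users with the most tweets containing `word`.

    `tweets` is a list of dicts with hypothetical keys "user" and "text".
    Matching is a case-insensitive substring test.
    """
    needle = word.lower()
    counts = Counter(
        tweet["user"] for tweet in tweets if needle in tweet["text"].lower()
    )
    return counts.most_common(n)


# Made-up sample data, not taken from the article.
sample_tweets = [
    {"user": "alice", "text": "Trying out Azure HDInsight today"},
    {"user": "bob", "text": "Hive on Azure is handy"},
    {"user": "alice", "text": "More Azure experiments"},
    {"user": "carol", "text": "Nothing relevant here"},
]

print(top_users(sample_tweets, "azure", n=2))
```

On a cluster the same idea would be expressed as a grouped, filtered count over the tweet table; the in-memory version above only shows the logic.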

data mining, natural language, tweet, (14 more...)

Genre: Press Release (0.39)

Industry: Information Technology > Services (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.75)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.62)

r-server-data-factory.html?utm_content=bufferd52a1&utm_medium=social&utm_source=twitter.com&utm_campaign=buffer

@machinelearnbot Sep-10-2017, 01:00:07 GMT

Beginning in 2016, Microsoft rolled out a preview of Microsoft R Server (MRS) for Azure HDInsight clusters. Recent blog posts (by Max Kaznady and David Smith) have highlighted how to use and tune this service for large-scale machine learning tasks. In this post, we push the envelope and show how to build an end-to-end, fully operationalized analytics pipeline using Azure Data Factory (ADF) and MRS with HDInsight (specifically Apache Spark). By integrating Azure Data Factory with Microsoft R Server and Spark, we show how to configure a scalable training and testing pipeline that operates on large volumes of data.

artificial intelligence, big data, microsoft, (16 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.63)
Information Technology > Data Science > Data Mining > Big Data (0.40)

Spark with HDInsight - Enterprise Ready Machine Learning and Interactive Data Analysis at Scale - Silicon Valley, CA

#artificialintelligence Apr-6-2017, 08:08:17 GMT

Spark is particularly amenable to machine learning and interactive data workloads, and can provide an order of magnitude greater performance than traditional Hadoop data processing tools. In this course, we will provide a deep dive into Spark as a framework, understand its design and how to use that design optimally, and learn how to develop effective machine learning applications with Spark on HDInsight. The course covers the fundamentals of Spark, its core APIs and design, relational data processing with Spark SQL, the fundamentals of Spark job execution, performance tuning, tracking, and debugging. Users will get hands-on experience with processing streaming data with Spark Streaming, training machine learning algorithms with Spark ML and R Server on Spark, as well as HDInsight configuration and platform-specific considerations such as remote development and access with Livy and IntelliJ, secure Spark, multi-user notebooks with Zeppelin, and virtual networking with other HDInsight clusters.

artificial intelligence, hdinsight, machine learning, (14 more...)

Country: North America > United States > California (0.40)

Genre: Instructional Material > Course Syllabus & Notes (0.52)

Industry: Information Technology (0.96)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Microsoft moves ahead on cloud, data, AI fronts | ZDNet

#artificialintelligence Mar-23-2017, 18:05:38 GMT

Microsoft has a tricky job in the data world. On the one hand, it has a 25-year legacy in the on-premises relational database business with SQL Server and needs to keep that lucrative business relevant and stable. On the other hand, as the company pivots toward the cloud, it needs to proffer relational OLTP, data warehouse, NoSQL, Big Data and machine learning technologies. And it needs to make them credible and competitive against offerings from so many startups in the data and analytics world. And then there was Strata... Microsoft also needs to make all of this technology accessible to developers, including its core constituency of .NET developers, but also those working with Java, Node/JavaScript, Python and a slew of other programming platforms.

hdinsight, integration, microsoft, (13 more...)

Country: North America > United States > California > Santa Clara County > San Jose (0.05)

Technology:

Information Technology > Data Science > Data Mining > Big Data (0.56)
Information Technology > Artificial Intelligence > Machine Learning (0.55)
Information Technology > Software > Programming Languages (0.35)

R Server for HDInsight now generally available | Blog | Microsoft Azure

#artificialintelligence Nov-18-2016, 08:50:24 GMT

Today, we announced the general availability of R Server for Azure HDInsight. This gives Azure HDInsight the most comprehensive set of ML algorithms and statistical functions in the cloud that also leverages Hadoop and Spark. R is one of the most popular programming languages, helping millions of data scientists solve their most challenging problems in fields ranging from computational biology to quantitative marketing. R Server for Azure HDInsight is a scale-out implementation of R integrated with Spark clusters created from HDInsight. This gives you the familiarity of the R language for machine learning while leveraging the scalability and reliability built into Spark.

data mining, machine learning, programming language, (10 more...)

Industry: Information Technology > Services (0.40)

Technology:

Information Technology > Software > Programming Languages (0.69)
Information Technology > Artificial Intelligence > Machine Learning (0.55)
Information Technology > Data Science > Data Mining > Big Data (0.44)

Exploring NYC Taxi Data with Microsoft R Server and HDInsight

#artificialintelligence Apr-21-2016, 11:36:26 GMT

As I mentioned yesterday, Microsoft R Server is now available for HDInsight, which means that you can now run R code (including the big-data algorithms of Microsoft R Server) on a managed, cloud-based Hadoop instance. Debraj GuhaThakurta, Senior Data Scientist, and Shauheen Zahirazami, Senior Machine Learning Engineer at Microsoft, demonstrate some of these capabilities in their analysis of 170M taxi trips in New York City in 2013 (about 40 GB). Their goal was to show the use of Microsoft R Server on an HDInsight Hadoop cluster, and to that end, they created machine learning models using distributed R functions to predict (1) whether a tip was given for a taxi ride (a binary classification problem), and (2) the amount of tip given (a regression problem). The analyses involved building and testing different kinds of predictive models. Debraj and Shauheen uploaded the NYC Taxi data to HDFS on Azure Blob Storage, provisioned an HDInsight Hadoop cluster with 2 head nodes (D12), 4 worker nodes (D12), and 1 R Server node (D4), and installed RStudio Server on the HDInsight cluster to conveniently communicate with the cluster and drive the computations from R. To predict the tip amount, Debraj and Shauheen used linear regression on the training set (75% of the full dataset, about 127M rows).
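The regression step described above (hold out a test set, fit a linear model on the remaining 75%) can be illustrated with a minimal, single-machine Python sketch. It is not the distributed R code used in the analysis: the helper names `train_test_split` and `fit_line`, the single `fare` predictor, and the synthetic rides are assumptions introduced purely for illustration.

```python
import random


def train_test_split(rows, train_fraction=0.75, seed=0):
    """Shuffle a copy of `rows` and split it into (train, test)."""
    rng = random.Random(seed)
    shuffled = rows[:]
    rng.shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


def fit_line(pairs):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(pairs)
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in pairs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Synthetic (fare, tip) pairs, not real taxi data: tip = 15% of fare + 1.
rides = [(float(fare), 0.15 * fare + 1.0) for fare in range(5, 45)]

train, test = train_test_split(rides)   # 75% train, 25% test
slope, intercept = fit_line(train)

# Mean absolute error on the held-out rides.
mae = sum(abs(slope * x + intercept - y) for x, y in test) / len(test)
print(len(train), len(test), slope, intercept, mae)
```

Because the synthetic tips are an exact linear function of the fare, the fitted line recovers the generating coefficients up to floating-point error; real trip data would of course be noisy and would call for more predictors.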

artificial intelligence, data mining, machine learning, (8 more...)

Country: North America > United States > New York (0.29)

Industry:

Transportation > Passenger (0.75)
Transportation > Ground > Road (0.75)

Technology:

Information Technology > Data Science > Data Mining > Big Data (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.41)
