AITopics

Country: North America > United States > California > San Francisco County > San Francisco (0.16)

Industry: Education > Educational Setting > Higher Education (0.36)

Technology:

Information Technology > Cloud Computing (0.96)
Information Technology > Data Science > Data Mining > Big Data (0.78)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.30)

#artificialintelligenceFeb-22-2019, 03:02:17 GMT

Apache Spark Machine Learning Tutorial

Editor's Note: Download this Free eBook: Getting Started with Apache Spark 2.x – from Inception to Production In this blog post, we will give an introduction to machine learning and deep learning, and we will go over the main Spark machine learning algorithms and techniques with some real-world use cases. The goal is to give you a better understanding of what you can do with machine learning. Machine learning is becoming more accessible to developers, and data scientists work with domain experts, architects, developers, and data engineers, so it is important for everyone to have a better understanding of the possibilities. Every piece of information that your business generates has potential to add value. This overview is meant to provoke a review of your own data to identify new opportunities.

algorithm, learning, spark 2, (13 more...)

Genre: Overview (0.89)

Industry: Banking & Finance (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.38)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.33)

#artificialintelligenceAug-10-2018, 05:07:58 GMT

Scala and Spark for Big Data and Machine Learning

Learn how to utilize some of the most valuable tech skills on the market today, Scala and Spark! In this course we will show you how to use Scala and Spark to analyze Big Data. Scala and Spark are two of the most in demand skills right now, and with this course you can learn them quickly and easily! This course comes with full projects for you including topics such as analyzing financial data or using machine learning to classify Ecommerce customer behavior! We teach the latest methodologies of Spark 2.0 so you can learn how to use SparkSQL, Spark DataFrames, and Spark's MLlib!

data mining, machine learning, scala and spark, (5 more...)

Genre: Instructional Material (0.93)

Technology:

Information Technology > Data Science > Data Mining > Big Data (0.89)
Information Technology > Artificial Intelligence > Machine Learning (0.67)

@machinelearnbotApr-18-2018, 14:10:13 GMT

Deep Learning With Apache Spark: Part 1

My journey into Deep Learning In this post I'll share how I've been studying Deep Learning and using it to solve data science problems.

apache spark, deep learning, learning, (12 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

@machinelearnbotApr-8-2018, 20:49:20 GMT

Advanced Machine Learning with Spark 2.x Udemy

The aim of this course is to provide a practical understanding of advanced Machine Learning algorithms in Apache Spark to make predictions and recommendation and derive insights from large distributed datasets. This course starts with an introduction to the key concepts and data types that are fundamental to understanding distributed data processing and Machine Learning with Spark. Further to this, we provide practical recipes that demonstrate some of the most popular algorithms in Spark, leading to the creation of sophisticated Machine Learning pipelines and applications. The final sections are dedicated to more advanced use cases for Machine Learning: streaming, Natural Language Processing, and Deep Learning. In each section, we briefly establish the theoretical basis of the topic under discussion and then cement our understanding with practical use cases.

advanced machine learning, machine learning, spark 2, (4 more...)

Country:

Europe > United Kingdom > England > Leicestershire > Leicester (0.08)
Europe > Poland > Lesser Poland Province > Kraków (0.08)

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (0.40)
Education > Educational Setting > Online (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.40)
Information Technology > Data Science > Data Mining > Big Data (0.39)

#artificialintelligenceMar-8-2018, 20:57:35 GMT

What's so 'unified' about universal data analytics? - CW Developer Network

Ground level definitions out of the way, what has Databricks been doing to add to unified utopia? The company has this month announced Apache Spark open-source cluster-computing framework. This means that the company is the vendor to support Apache Spark 2.3 within a compute engine, Databricks Runtime 4.0, which is now generally available. In addition to support for Spark 2.3, Databricks Runtime 4.0 introduces new features including Machine Learning Model Export to simplify production deployments and performance optimizations. "The community continues to expand on Apache Spark's role as a unified analytics engine for big data and AI. This is a major milestone to introduce the continuous processing mode of Structured Streaming with millisecond low-latency, as well as other features across the project," said Matei Zaharia, creator of Apache Spark and chief technologist and co-founder of Databricks.

artificial intelligence, data mining, machine learning, (9 more...)

Technology:

Information Technology > Data Science > Data Mining (0.87)
Information Technology > Artificial Intelligence > Machine Learning (0.61)

@machinelearnbotJan-11-2018, 21:12:24 GMT

Processing a Trillion Rows Per Second on a Single Machine: How Can Nested Loop Joins be this Fast?

This blog post describes our experience debugging a failing test case caused by a cross join query running "too fast." Because the root cause of fail test case spans across multiple layers--from Apache Spark to the JVM JIT compiler-- we wanted to share our analysis in this post. The vast majority of big data SQL or MPP engines follow the Volcano iterator architecture that is inefficient for analytical workloads. Since Spark 2.0 release, the new Tungsten execution engine in Apache Spark implements whole-stage code generation, a technique inspired by modern compilers to collapse the entire query into a single function. This JIT compiler approach is a far superior architecture than the row-at-a-time processing or code generation model employed by other engines, making Spark one of the most efficient in the market.

artificial intelligence, natural language, optimization, (16 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Retrieval > Query Processing (0.91)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.76)
Information Technology > Communications > Social Media (0.72)

@machinelearnbotDec-16-2017, 17:00:32 GMT

Top Hortonworks Blogs from 2017 - Hortonworks

Try Apache Spark 2.1 & Zeppelin in Hortonworks Data Cloud by Vinay Shukla Wanna try Spark 2.1 Now? Well, you are in luck… Hortonworks Data Cloud ("HDCloud") for AWS gives you a quick way to launch a Spark cluster in the cloud. Read More Machine Learning & its Impact on the Future for Insurance by Cindy Maike First and foremost, machine learning WILL change the way insurers do business. The insurance industry is founded on forecasting future events and estimating the value/impact of those events and has used established predictive modeling practices – especially in claims loss prediction and pricing – for some time now. Read More A Reference Architecture for the Open Banking Standard… by Vamsi Chemitiganti Financial services firms specifically deal with manifold data types ranging from Customer Account data, Transaction Data, Wire Data, Trade Data, Customer Relationship Management (CRM), General Ledger and other systems supporting core banking functions. When one factors in social media feeds, mobile clients & other non traditional data types, the challenge is not just one of data volumes but also variety and the need to draw conclusions from fast moving data streams by commingling them with years of historical data.

cloud computing, data mining, machine learning, (18 more...)

Industry:

Banking & Finance > Financial Services (0.78)
Banking & Finance > Insurance (0.56)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.78)
Information Technology > Cloud Computing (0.59)
Information Technology > Modeling & Simulation (0.56)
Information Technology > Data Science > Data Mining > Big Data (0.53)

@machinelearnbotNov-27-2017, 14:00:15 GMT

Spark tutorial: Get started with Apache Spark

Apache Spark has become the de facto standard for processing data at scale, whether for querying large datasets, training machine learning models to predict future trends, or processing streaming data. In this article, we'll show you how to use Apache Spark to analyze data in both Python and Spark SQL. And we'll extend our code to support Structured Streaming, the new current state of the art for handling streaming data within the platform. We'll be using Apache Spark 2.2.0 here, but the code in this tutorial should also work on Spark 2.1.0 Before we begin, we'll need an Apache Spark installation.

apache spark, artificial intelligence, social media, (4 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.60)
Information Technology > Communications > Social Media (0.40)

#artificialintelligenceAug-28-2017, 01:05:26 GMT

Machine Learning using Spark and R - Dataconomy

R is ubiquitous in the machine learning community. Its ecosystem of more than 8,000 packages makes it the Swiss Army knife of modeling applications. Similarly, Apache Spark has rapidly become the big data platform of choice for data scientists. Its ability to perform calculations relatively quickly (due to features like in-memory caching) makes it ideal for interactive tasks--such as exploratory data analysis. R (SparkR) is the latest addition and support for it certainly lags the other three languages. In Spark 1.x there was no support for accessing the Spark ML (machine learning) libraries from R. The performance of R code on Spark was also considerably worse than could be achieved using, say, Scala.

artificial intelligence, data mining, machine learning, (11 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.71)