SparkR
Applied Machine Learning and Deep Learning with R
In this course, we will examine in detail the R software, which has been the most popular statistical programming language of recent years. You will start by exploring different learning methods, clustering, classification, model evaluation methods, and performance metrics. From there, you will dive into the general structure of clustering algorithms and develop applications in the R environment by using clustering and classification algorithms on real-life problems. Next, you will learn general definitions of artificial neural networks, and the concept of deep learning will be introduced. The elements of deep learning neural networks, types of deep learning networks, and frameworks used for deep learning applications will be addressed, and applications will be developed with the R TensorFlow package. Finally, you will dive into developing machine learning applications with SparkR and learn to run distributed jobs on SparkR.
- Education > Educational Technology > Educational Software > Computer Based Training (0.40)
- Education > Educational Setting > Online (0.40)
Machine Learning using Spark and R - Dataconomy
R is ubiquitous in the machine learning community. Its ecosystem of more than 8,000 packages makes it the Swiss Army knife of modeling applications. Similarly, Apache Spark has rapidly become the big data platform of choice for data scientists. Its ability to perform calculations relatively quickly (due to features like in-memory caching) makes it ideal for interactive tasks such as exploratory data analysis. Of Spark's language bindings (Scala, Java, Python, and R), R support via SparkR is the latest addition, and it certainly lags the other three languages. In Spark 1.x there was no support for accessing the Spark ML (machine learning) libraries from R. The performance of R code on Spark was also considerably worse than could be achieved using, say, Scala.
Crunching Statistics at Scale with SparkR on Amazon EMR
Christopher Crosbie is a Healthcare and Life Science Solutions Architect with Amazon Web Services. This post is co-authored by Gopal Wunnava, a Senior Consultant with AWS Professional Services. SparkR is an R package that allows you to integrate complex statistical analysis with large datasets. In this blog post, we introduce you to running R with the Apache SparkR project on Amazon EMR. The diagram of SparkR below is provided as a reference, but this video provides an overview of what is depicted.
- Professional Services (0.55)
- Information Technology (0.35)
- Information Technology > Communications > Web (0.50)
- Information Technology > Artificial Intelligence > Machine Learning (0.49)
Announcing SparkR: R on Apache Spark
I am excited to announce that the upcoming Apache Spark 1.4 release will include SparkR, an R package that allows data scientists to analyze large datasets and interactively run jobs on them from the R shell. R is a popular statistical programming language with a number of extensions that support data processing and machine learning tasks. However, interactive data analysis in R is usually limited, as the runtime is single-threaded and can only process data sets that fit in a single machine's memory. SparkR provides an R frontend to Apache Spark and, by using Spark's distributed computation engine, allows us to run large-scale data analysis from the R shell. The SparkR project was initially started in the AMPLab as an effort to explore different techniques for integrating the usability of R with the scalability of Spark.
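A minimal sketch of what such an interactive session looks like, assuming a local Spark 1.x installation with the SparkR package on the library path (function names follow the Spark 1.4-to-1.6 SparkR API; later releases consolidate initialization into `sparkR.session()`):

```r
library(SparkR)

# Start a local Spark context and a SQL context (Spark 1.x API).
sc <- sparkR.init(master = "local[*]", appName = "SparkRDemo")
sqlContext <- sparkRSQL.init(sc)

# Distribute a familiar local data set as a SparkR DataFrame.
df <- createDataFrame(sqlContext, faithful)

# Run a distributed aggregation and pull a few rows back to the R shell.
head(summarize(groupBy(df, df$waiting), count = n(df$waiting)))

sparkR.stop()
```

Because the DataFrame is partitioned across the cluster, the same code scales from this toy example to datasets far larger than a single machine's memory.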
How-to: Train Models in R and Python using Apache Spark MLlib and H2O - Cloudera Engineering Blog
Creating and training machine-learning models is more complex on distributed systems, but there are lots of frameworks for abstracting that complexity. There are more options now than ever from proven open source projects for doing distributed analytics, with Python and R becoming increasingly popular. In this post, you'll learn the options for setting up a simple read-eval-print loop (REPL) environment with Python and R within the Cloudera QuickStart VM, using APIs for two of the most popular cluster computing frameworks: Apache Spark (with MLlib) and H2O (from the company of the same name). To compare these approaches, you'll train a linear regression against a data set with known coefficients. Spark includes PySpark (supported by Cloudera), the Python API for Spark.
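On the R side, a linear regression of this kind can be fitted through SparkR's `glm`, which is backed by MLlib. The sketch below assumes a local Spark 1.6 installation and uses R's built-in `faithful` data in place of the post's synthetic data set with known coefficients:

```r
library(SparkR)

# Local Spark 1.x setup (replaced by sparkR.session() in Spark 2.x).
sc <- sparkR.init(master = "local[*]", appName = "SparkRGlm")
sqlContext <- sparkRSQL.init(sc)

# Distribute the Old Faithful data and fit a Gaussian GLM,
# i.e. an ordinary least-squares linear regression, via MLlib.
df <- createDataFrame(sqlContext, faithful)
model <- glm(waiting ~ eruptions, data = df, family = "gaussian")

# Inspect the fitted coefficients, much as with a local R model.
summary(model)

sparkR.stop()
```

When the true coefficients are known, as in the post's experiment, comparing them against the `summary(model)` output is a quick sanity check that the distributed fit behaves like its single-machine counterpart.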
SparkR (R on Spark) - Spark 1.6.0 Documentation
SparkR is an R package that provides a light-weight frontend for using Apache Spark from R. In Spark 1.6.0, SparkR provides a distributed data frame implementation that supports operations like selection, filtering, and aggregation (similar to R data frames and dplyr), but on large datasets. SparkR also supports distributed machine learning using MLlib. A DataFrame is a distributed collection of data organized into named columns. It is conceptually equivalent to a table in a relational database or a data frame in R, but with richer optimizations under the hood.
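The selection, filtering, and aggregation operations named above can be sketched as follows, assuming a local Spark 1.6.0 installation (the verbs mirror the Spark 1.6.0 SparkR documentation):

```r
library(SparkR)

# Local Spark 1.6-style initialization.
sc <- sparkR.init(master = "local[*]", appName = "SparkRDataFrames")
sqlContext <- sparkRSQL.init(sc)

df <- createDataFrame(sqlContext, faithful)

# Selection and filtering, analogous to base R / dplyr verbs.
head(select(df, df$eruptions))
head(filter(df, df$waiting < 50))

# Aggregation: average eruption length for each waiting time.
head(summarize(groupBy(df, df$waiting),
               avg_eruptions = avg(df$eruptions)))

sparkR.stop()
```

Under the hood each of these calls builds a logical plan that Spark's optimizer rewrites before execution, which is the "richer optimizations" advantage a DataFrame has over a plain R data frame.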