Machine Learning with Spark - Second Edition

Author : Rajdeep Dua
Publisher :
Total Pages : 572
Release : 2016-10-31
ISBN-10 : 1785889931
ISBN-13 : 9781785889936
Rating : 4/5 (936 Downloads)

Book Synopsis: Machine Learning with Spark - Second Edition, by Rajdeep Dua

Machine Learning with Spark - Second Edition, written by Rajdeep Dua, was released on 2016-10-31 and has 572 pages. It is available in PDF, EPUB, and Kindle formats.

Book excerpt: Develop intelligent machine learning systems with Spark.

About This Book
* Get to grips with the latest version of Apache Spark
* Utilize Spark's machine learning library to implement predictive analytics
* Leverage Spark's powerful tools to load, analyze, clean, and transform your data

Who This Book Is For
If you have a basic knowledge of machine learning and want to implement various machine learning concepts in the context of Spark ML, this book is for you. You should be well versed in the Scala and Python languages.

What You Will Learn
* Get hands-on with the latest version of Spark ML
* Create your first Spark program with Scala and Python
* Set up and configure a development environment for Spark on your own computer, as well as on Amazon EC2
* Access public machine learning datasets and use Spark to load, process, clean, and transform data
* Use Spark's machine learning library to implement programs by utilizing well-known machine learning models
* Deal with large-scale text data, including feature extraction and using text data as input to your machine learning models
* Write Spark functions to evaluate the performance of your machine learning models

In Detail
Spark ML is the machine learning module of Spark. It uses in-memory RDDs to process machine learning models faster for clustering, classification, and regression.

This book will teach you about popular machine learning algorithms and their implementation. You will learn how various machine learning concepts are implemented in the context of Spark ML. You will start by installing Spark on a single-node and a multinode cluster. Next, you will see how to execute Scala- and Python-based programs for Spark ML. You will then work through a few datasets and go deeper into clustering, classification, and regression. Toward the end, the book also covers text processing using Spark ML.

Once you have learned the concepts, you can apply them either in green-field implementations or when migrating existing systems to this new platform, for example from Mahout or Scikit to Spark ML.


Machine Learning with Spark - Second Edition Related Books

Machine Learning with Spark - Second Edition
Language: en
Pages: 572
Authors: Rajdeep Dua
Categories:
Type: BOOK - Published: 2016-10-31 - Publisher:

Develop intelligent machine learning systems with Spark. About This Book: Get to grips with the latest version of Apache Spark. Utilize Spark's machine learning
Learning Spark
Language: en
Pages: 400
Authors: Jules S. Damji
Categories: Computers
Type: BOOK - Published: 2020-07-16 - Publisher: O'Reilly Media

Data is bigger, arrives faster, and comes in a variety of formats—and it all needs to be processed at scale for analytics or machine learning. But how can you
Machine Learning in Python
Language: en
Pages: 361
Authors: Michael Bowles
Categories: Computers
Type: BOOK - Published: 2015-04-27 - Publisher: John Wiley & Sons

Learn a simpler and more effective way to analyze data and predict outcomes with Python. Machine Learning in Python shows you how to successfully analyze data us
Advanced Analytics with Spark
Language: en
Pages: 276
Authors: Sandy Ryza
Categories: Computers
Type: BOOK - Published: 2015-04-02 - Publisher: "O'Reilly Media, Inc."

In this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark. The authors
Learning Spark, 2nd Edition
Language: en
Pages: 300
Authors: Jules Damji
Categories:
Type: BOOK - Published: 2020 - Publisher:

Data is getting bigger, arriving faster, and coming in varied formats, and it all needs to be processed at scale for analytics or machine learning. How can you p