Mastering Apache Spark 2.x - Second Edition

This is the code repository for Mastering Apache Spark 2.x - Second Edition, published by Packt. It contains all the supporting project files necessary to work through the book from start to finish.

About the Book

Apache Spark is an in-memory cluster based parallel processing system that provides a wide range of functionality like graph processing, machine learning, stream processing and SQL. This book aims to take your limited knowledge of Spark to the next level by teaching you how to expand Spark functionality and implement your data flows and machine/deep learning programs on top of the platform.

Instructions and Navigation

All of the code is organized into folders. Each folder starts with a number followed by the application name. For example, Chapter02.

The code will look like the following:

import org.apache.spark.SparkContext
import org.apache.spark.SparkContext._
import org.apache.spark.SparkConf

You will need the following to work with the examples in this book:

A laptop or PC with at least 6 GB main memory running Windows, macOS, or Linux
VirtualBox 5.1.22 or above
Hortonworks HDP Sandbox V2.6 or above
Eclipse Neon or above
Maven
Eclipse Maven Plugin
Eclipse Scala Plugin
Eclipse Git Plugin

Related Products

Download a free PDF

If you have already purchased a print or Kindle version of this book, you can get a DRM-free PDF version at no cost.
Simply click on the link to claim your free PDF.

https://packt.link/free-ebook/9781786462749

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
Chapter 10		Chapter 10
Chapter 11		Chapter 11
Chapter 12		Chapter 12
Chapter 14		Chapter 14
Chapter 2		Chapter 2
Chapter 3		Chapter 3
Chapter 4		Chapter 4
Chapter 6		Chapter 6
Chapter 8		Chapter 8
Chapter 9		Chapter 9
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Mastering Apache Spark 2.x - Second Edition

About the Book

Instructions and Navigation

Related Products

Download a free PDF

About

Releases

Packages

Contributors 4

Languages

License

PacktPublishing/Mastering-Apache-Spark-2x

Folders and files

Latest commit

History

Repository files navigation

Mastering Apache Spark 2.x - Second Edition

About the Book

Instructions and Navigation

Related Products

Download a free PDF

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages