Last updated: 2021-06-18 19:07:31
Cover
Copyright information
Packt Upsell
Why subscribe?
PacktPub.com
Contributors
About the authors
About the reviewer
Packt is searching for authors like you
Preface
Who this book is for
What this book covers
To get the most out of this book
Download the example code files
Download the color images
Conventions used
Sections
Getting ready
How to do it...
How it works...
There's more...
See also
Get in touch
Reviews
Installing and Configuring Spark
Introduction
Installing Spark requirements
Installing Java
Installing Python
Installing R
Installing Scala
Installing Maven
Updating PATH
Installing Spark from sources
Installing Spark from binaries
Configuring a local instance of Spark
Configuring a multi-node instance of Spark
Installing Jupyter
Configuring a session in Jupyter
Working with Cloudera Spark images
Abstracting Data with RDDs
Creating RDDs
Spark context parallelize method
.take(...) method
Reading data from files
.textFile(...) method
.map(...) method
Partitions and performance
Overview of RDD transformations
.map(...) transformation
.filter(...) transformation
.flatMap(...) transformation
.distinct() transformation
.sample(...) transformation
.join(...) transformation
.repartition(...) transformation
.zipWithIndex() transformation