#spark
#scheduling
#data-locality
#streaming
This is the third post in my notes on Spark in Action by Petar Zečević and Marko Bonaći. In this post, I will summarize the runtime components and ...
read more →
#spark
#partitioning
#shuffle
#rdd
This is the second post in my notes on Spark in Action by Petar Zečević and Marko Bonaći. In the first post, I looked at Spark’s basic execution fl...
read more →
#spark
#hadoop
#mapreduce
#rdd
I am going to record my notes from reading Spark in Action by Petar Zečević and Marko Bonaći in three parts. In this first post, I will start with ...
read more →