In our earlier post, we built a fairly lightweight two-node Apache Spark cluster without Hadoop HDFS or YARN underneath. We didn’t point the Spark installation to any Hadoop distribution or set “HADOOP_HOME” as an environment variable, and we deliberately set the “master” parameter to the Spark master node. Today,