Follow the installation procedure to set up:
- Hadoop 2.7.2 installed from Apache Hadoop download archive
- Spark 2.4.3 installed from Apache Spark project
- For Windows platforms needed is also specific integration:
Set up HADOOP_GREMLIN_LIBS OS environment variable as of TinkerPop Documentation.
NOTE: HADOOP_GREMLIN_LIBS=/share/hadoop/common/lib seems good enough
The installation procedure should have set up the environment variables:
- HADOOP_HOME
- SPARK_HOME
- HADOOP_GREMLIN_LIBS
- PATH including the HADOOP_HOME and SPARK_HOME
On the command line run:
hadoop version
spark-shell --version