- Create an environment with your favorite venv manager, ala:
conda create --name etl_spark_standalone python=3.8 && activate etl_spark_standalone pip install -r requirements.txt. Note: This installs pyspark.
- Do you want standalone spark/hadoop (any version you want) or
- Built-in Spark that comes with pip installs ala
pip install pyspark==3.0.1