To set up Pyspark 2.3.1 on ubuntu , you need to have java, 1.8+,scala,python 3.5.x and py4j package need to be installed.And as an IDE , we will be using jupyter-notebook here. Setting up pyspark in ubuntu is very simple if you follow the below steps in order Open your Terminal on ubuntu OS ubuntu 18.04 will have python 3.5.x by default or else you need install it as per the instructions in the below link(its a very good discussion forum for installation): https://askubuntu.com/questions/865554/how-do-i-install-python-3-6-using-apt-get To keep the latest updates in sync with your os, run the below command: sudo apt-get update once update is done, install java 1.8 using below command: sudo apt install openjdk-8-jre-headless Once java is installed validate it by using java --version: it should display jdk version as 1.8.x once validated, install scala as below: sudo apt-get install scala once installation is done,validate it using scala --version: it should display the late...