Spark hands-on with Databricks
Date: 10th Feb, 2021
I started spark hands-on once I finished with basic architecture and working of spark core I needed to install spark on my system but somehow couldn't manage hence thought of starting on databricks, which I find really amazing , your spark setup is just few clicks away and we can try spark with many resources:
1. RDS
2. S3
3. Databricks file system
4. Upload file from local system
Set up working environment:
Step1: Create Cluster
Select Cluster from LHS menu and fill up the required settings as per your version availability. Spark 3.0 was latest when I was trying hence I selected spark 3.0 as shown in below picture:
Once filled up click on Create Cluster button on top and wait for few minutes...Once cluster has been created add Data
Step2: Add Data source
We have so many options as a data source, I chose to upload file from my local to get my hands dirty with Movie dataset (http://media.sundog-soft.com/es/ml-100k.zip)
Comments
Post a Comment