Spark template using Scala

Spark application template for running on Cloud

Quick Tips

Example Use Case

  1. Running Spark application on Google Cloud Dataproc. Tutorial can be found here
  2. Save the output to a Parquet to Google Cloud Storage
  3. Import to Google BigQuery and further process it