I started from this five-year-old but still quite useful file in this repo: https://github.com/alexmilowski/emr/tree/master/spark
Last active: September 21, 2019 02:31
Spark-word-count-on-aws-emr
The problem may be the Snappy-compressed files ☝️