- copy files to a directory:
git clone https://gist.github.com/cc7c8cec1188fd387cc2e3ec0f4fed7a.git wordcountand thencd wordcount. - see the input files:
cat *.txt - make sure mapper and reducer are executable
chmod +x *.scala - see how mapper works:
cat baa.txt | ./mapper.scala - see how reducer works:
cat baa.txt | ./mapper.scala | ./reducer.scala
- copy files to a directory:
git clone https://gist.github.com/cc7c8cec1188fd387cc2e3ec0f4fed7a.git wordcountand thencd wordcount. - create a directory on HDFS:
hadoop fs -mkdir -p /wc/in