Skip to content

Instantly share code, notes, and snippets.

@cj-praveen
Created October 30, 2025 05:13
Show Gist options
  • Select an option

  • Save cj-praveen/34a7b027e307ea2d53a8b4b48d5c19b5 to your computer and use it in GitHub Desktop.

Select an option

Save cj-praveen/34a7b027e307ea2d53a8b4b48d5c19b5 to your computer and use it in GitHub Desktop.

Hadoop commands

Hadoop version
Hadoop fs -mkdir path
Hadoop fs -ls path
Hadoop fs -put source path
Hadoop fs -cat file.txt
Hadoop fs -touchz file.txt
Hadoop fsck path
Hadoop fs -df path
Hadoop fs 
Echo “text” > file.txt
Hadoop fs -moveFromLocal src dest
Hadoop fs -rm path
Hadoop fs -cp src dest
Hadoop fs -mv src dest 
Hadoop fs -setrep -w 2 file.txt
Hadoop-expunge 

Word Counter

Docker run -d —name namenode hadoop-namenode
Docker run -d —name datanode hadoop-datanode
Docker exec -it namenode bash
Cd /root
Docker cp input.txt namenode:/root
Hadoop fs -put input.txt /input
Hadoop jar wordcounter.jar /input /output
Hadoop fs -ls /output
Hadoop fs -cat /output/part-00000

Max Temperature

  • Dockerfile
FROM bitnami/spark:latest
WORKDIR /app
COPY max_temperature.py .
COPY temperature_data.csv .
CMD ["spark-submit", "max_temperature.py"]
  • Build and run
docker build -t spark-max-temp .
docker run -it spark-max-temp
  • max_temperature.py
from pyspark.sql import SparkSession, Row

spark = SparkSession.builder \
    .appName("Max Temperature with Year") \
     .getOrCreate()


data = spark.read.csv("temperature_data.csv", header = True, inferSchema = True)

max_temp_row = data.orderBy(data["temperature"].desc()).first()

if max_temp_row:
    max_temp_df = spark.createDataFrame([
        Row(year = max_temp_row["year"], max_temperature = max_temp_row["temperature"])
    ])

    max_temp_df.show()

else:
    print("No temperature data available")

spark.stop()

Hive server setup

docker exec -it hive-hive-server hive
@guider23
Copy link

C:\Users\sidhe\Desktop\BDS>docker-compose up -d

C:\Users\sidhe\Desktop\BDS>docker exec -it namenode bash

File name is - docker-compose.yml

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment