Visesh Agarwal viseshagarwal

viseshagarwal / Reddit-Data-Analysis.md
Last active September 2, 2025 08:22
Reddit Data Analysis: Insights from Machine Learning Models

Introduction

In the age of social media, Reddit stands out as a unique platform where users engage in discussions across a wide range of topics. This article presents an in-depth analysis of Reddit comments from various subreddits related to data science, programming, and technology. We'll explore the sentiment, emotions, and content of these comments using several machine learning techniques, including sentiment analysis, topic modeling, and text classification.

Data Collection and Preprocessing

Our analysis begins with comments collected from eight subreddits: Python, DataScience, MachineLearning, DataAnalysis, DataMining, Data, DataSets, and DataCenter. We used the PRAW (Python Reddit API Wrapper) library to retrieve the comments through Reddit's API.

Here's a snippet of the code used for data collection: