Skip to content

Instantly share code, notes, and snippets.

View russellb23's full-sized avatar

russellb23

View GitHub Profile
@vinovator
vinovator / pdfTextMiner.py
Last active November 24, 2025 14:30
A sample code which uses pdfminer module to extract text from pdf files
# pdfTextMiner.py
# Python 2.7.6
# For Python 3.x use pdfminer3k module
# This link has useful information on components of the program
# https://euske.github.io/pdfminer/programming.html
# http://denis.papathanasiou.org/posts/2010.08.04.post.html
''' Important classes to remember
PDFParser - fetches data from pdf file