@hugovk
Last active January 12, 2026 15:52
PyPI names, specs and versions

Names, specs and versions from PyPI as text files, extracted from https://github.com/pypi-data/pypi-json-data/releases/download/latest/pypi-data.sqlite.gz

For use with benchmark scripts from https://iscinumpy.dev/post/packaging-faster/

wc -l *.txt
 8344947 names.txt
 4907383 specs.txt
 8339540 versions.txt

sort --unique names.txt | wc -l
  754567

sort --unique specs.txt | wc -l
    6167

sort --unique versions.txt | wc -l
  725451

du -h *.{sqlite,txt,zip}
 10G	pypi-data.sqlite
120M	names.txt
 38M	specs.txt
 61M	versions.txt
4.7M	names.txt.zip
1.3M	specs.txt.zip
 10M	versions.txt.zip
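
The line counts above can also be reproduced without the shell tools. The following is a minimal Python sketch, assuming the three text files sit in the current directory and hold one entry per line; the helper name `count_lines` is hypothetical and not part of the gist. It counts lines by iteration, so a file without a trailing newline would report one more line than `wc -l`.

```python
from pathlib import Path


def count_lines(path):
    """Return (total, unique) line counts for a text file.

    Roughly mirrors `wc -l FILE` and `sort --unique FILE | wc -l`.
    """
    total = 0
    seen = set()
    with open(path, encoding="utf-8") as f:
        for line in f:
            total += 1
            # Drop the newline so a final unterminated line compares equal.
            seen.add(line.rstrip("\n"))
    return total, len(seen)


if __name__ == "__main__":
    # File names taken from the listing above; missing files are skipped.
    for name in ("names.txt", "specs.txt", "versions.txt"):
        if Path(name).exists():
            total, unique = count_lines(name)
            print(f"{total:>8} lines, {unique:>8} unique  {name}")
```

The set of unique lines is held in memory, which should be acceptable at the sizes listed above (under a million unique entries per file).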