Skip to content

Instantly share code, notes, and snippets.

@abdellatifLabr
Last active February 28, 2026 11:32
Show Gist options
  • Select an option

  • Save abdellatifLabr/7011304f74816b20b694fd3d07450c1c to your computer and use it in GitHub Desktop.

Select an option

Save abdellatifLabr/7011304f74816b20b694fd3d07450c1c to your computer and use it in GitHub Desktop.
parquet-cli skill for coding agents
name description license compatibility metadata
parquet-cli
Work with Parquet files - view schema, metadata, contents, and convert between formats
MIT
opencode
audience use-case
developers
data-files

What I do

This skill enables you to inspect, convert, and manipulate Parquet files using the parquet CLI tool.

Available commands

Command Description
meta Print Parquet file metadata (row groups, columns, compression)
schema Print the Avro schema for a file
cat Print the first N records from a file
head Print the first N records from a file
scan Scan all records from a file
pages Print page summaries for a Parquet file
dictionary Print dictionaries for a Parquet column
footer Print Parquet file footer in JSON format
column-index Print column and offset indexes
column-size Print column sizes
bloom-filter Check bloom filters for a Parquet column
check-stats Check for corrupt page and column stats
convert Create a Parquet file from a data file (CSV, JSON, etc.)
to-avro Create an Avro file from a Parquet file
convert-csv Create a Parquet file from CSV data
csv-schema Build a schema from a CSV data sample
rewrite Rewrite one or more Parquet files with new settings

Common examples

# View file metadata
parquet meta data.parquet

# View schema
parquet schema data.parquet

# View first 10 records
parquet head -n 10 data.parquet

# View all records (as JSON)
parquet cat data.parquet

# View footer in JSON format
parquet footer data.parquet

# Check column statistics
parquet check-stats data.parquet

# Convert CSV to Parquet
parquet convert-csv -s schema.avsc data.csv data.parquet

# Convert Parquet to Avro
parquet to-avro data.parquet output.avro

# Rewrite with different compression
parquet rewrite -c SNAPPY data.parquet output.parquet

# Print column dictionary values
parquet dictionary data.parquet column_name

When to use me

  • Inspect Parquet file structure and schemas
  • Debug Parquet file issues
  • Convert between Parquet, CSV, Avro formats
  • Extract sample data from large Parquet files
  • Analyze column statistics and bloom filters
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment