| name | description | license | compatibility | metadata |
|---|---|---|---|---|
| parquet-cli | Work with Parquet files - view schema, metadata, contents, and convert between formats | MIT | opencode | |

This skill enables you to inspect, convert, and manipulate Parquet files using the parquet CLI tool.

| Command | Description |
|---|---|
| `meta` | Print Parquet file metadata (row groups, columns, compression) |
| `schema` | Print the Avro schema for a file |
| `cat` | Print the first N records from a file |
| `head` | Print the first N records from a file |
| `scan` | Scan all records from a file |
| `pages` | Print page summaries for a Parquet file |
| `dictionary` | Print dictionaries for a Parquet column |
| `footer` | Print Parquet file footer in JSON format |
| `column-index` | Print column and offset indexes |
| `column-size` | Print column sizes |
| `bloom-filter` | Check bloom filters for a Parquet column |
| `check-stats` | Check for corrupt page and column stats |
| `convert` | Create a Parquet file from a data file (CSV, JSON, etc.) |
| `to-avro` | Create an Avro file from a Parquet file |
| `convert-csv` | Create a Parquet file from CSV data |
| `csv-schema` | Build a schema from a CSV data sample |
| `rewrite` | Rewrite one or more Parquet files with new settings |
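
The table gives only one-line summaries, and the exact flags of each subcommand can differ between releases. A minimal sketch for checking the options your installed version accepts, assuming the binary is on `PATH` under the name `parquet` and that it provides the `help` subcommand (not listed in the table above); `parquet_help` is a hypothetical helper name:

```bash
#!/usr/bin/env bash
# Hypothetical helper: print the CLI's built-in help for one subcommand,
# or fail with a clear message when the CLI is not installed.
# Assumes the binary is named `parquet` and supports `parquet help <command>`.
parquet_help() {
  local cmd="${1:-}"
  if ! command -v parquet >/dev/null 2>&1; then
    echo "parquet CLI not found on PATH" >&2
    return 127
  fi
  # With no argument this falls back to the top-level help.
  parquet help ${cmd:+"$cmd"}
}

# Example: parquet_help rewrite
```

Checking the help output first is a cheap way to confirm option names before scripting a conversion or rewrite.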

```bash
# View file metadata
parquet meta data.parquet

# View schema
parquet schema data.parquet

# View first 10 records
parquet head -n 10 data.parquet

# View all records (as JSON)
parquet cat data.parquet

# View footer in JSON format
parquet footer data.parquet

# Check column statistics
parquet check-stats data.parquet

# Convert CSV to Parquet
parquet convert-csv -s schema.avsc data.csv data.parquet

# Convert Parquet to Avro
parquet to-avro data.parquet output.avro

# Rewrite with different compression
parquet rewrite -c SNAPPY data.parquet output.parquet

# Print dictionary values for one column (the column is passed with -c)
parquet dictionary -c column_name data.parquet
```

- Inspect Parquet file structure and schemas
- Debug Parquet file issues
- Convert between Parquet, CSV, Avro formats
- Extract sample data from large Parquet files
- Analyze column statistics and bloom filters
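
For the inspection and sampling use cases, the individual commands can be chained into one quick overview. The following is a sketch, not part of the CLI itself: `inspect_parquet` is a hypothetical helper, the default of 5 sample records is an arbitrary choice, and the file check assumes a local path.

```bash
#!/usr/bin/env bash
# Hypothetical helper: quick overview of a local Parquet file using only
# the schema, meta, and head subcommands.
inspect_parquet() {
  local file="$1"
  local rows="${2:-5}"   # number of sample records; 5 is an arbitrary default

  # Assumption: the file is on the local filesystem.
  if [ ! -f "$file" ]; then
    echo "no such file: $file" >&2
    return 1
  fi

  echo "== schema =="
  parquet schema "$file" || return
  echo "== metadata =="
  parquet meta "$file" || return
  echo "== first $rows records =="
  parquet head -n "$rows" "$file"
}

# Example: inspect_parquet data.parquet 10
```

Stopping at the first failing step keeps the output short when a file is unreadable, which is usually the first thing to establish when debugging.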