Skip to content

Instantly share code, notes, and snippets.

@apcamargo
Created July 13, 2025 21:11
Show Gist options
  • Save apcamargo/7511a4a89b224ccaa6e4e349de0f768a to your computer and use it in GitHub Desktop.
Save apcamargo/7511a4a89b224ccaa6e4e349de0f768a to your computer and use it in GitHub Desktop.
Query SRA metadata stored as Parquet files in S3 using DuckDB
duckdb -c "
INSTALL httpfs;
LOAD httpfs;
INSTALL parquet;
LOAD parquet;
COPY (
SELECT *
FROM read_parquet('s3://sra-pub-metadata-us-east-1/sra/metadata/*')
) TO STDOUT WITH (FORMAT CSV, DELIMITER E'\t', HEADER);"
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment