A fast, ergonomic CLI tool for data engineers. Query across file trees using SQL, profile data, add lineage, and more.
Built in Rust on top of DuckDB.
cargo build --release./target/release/dtoo query testdata/trips.parquet --output out/trips.csv./target/release/dtoo query \
--glob "testdata/**/*" \
--on-error skip \
--post-sql "SELECT passenger_count, COUNT(*) AS trips FROM _ GROUP BY 1 ORDER BY 2 DESC" \
--output out/agg.csv./target/release/dtoo inspect testdata/trips.parquet --rows 10
./target/release/dtoo profile testdata/trips.parquet --format html --output out/profile.html./target/release/dtoo --help
./target/release/dtoo query --helpFor full usage, options, and recipes, see USER_GUIDE.md.
See DESIGN.md for the full specification.
