Skip to content

Add data-eng sandbox image.#78

Open
apurvvkumaria wants to merge 1 commit into
NVIDIA:mainfrom
apurvvkumaria:add-data-eng-sandbox
Open

Add data-eng sandbox image.#78
apurvvkumaria wants to merge 1 commit into
NVIDIA:mainfrom
apurvvkumaria:add-data-eng-sandbox

Conversation

@apurvvkumaria
Copy link
Copy Markdown

What this adds

A data engineering sandbox pre-configured for agents doing local data
processing, transformation, and analysis workflows.

Pre-installed tools

  • Python 3.12 — pinned runtime for stable data engineering workflows
  • DuckDB — fast in-process analytical SQL engine
  • pandas — DataFrame library for data manipulation and analysis
  • pyarrow — Apache Arrow columnar format, Parquet read/write
  • httpx — async-capable HTTP client for fetching remote datasets

Usage

openshell sandbox create --from data-eng

Testing

Built and verified locally. All packages install cleanly from the
base image. Entrypoint prints installed versions on sandbox start.

…gured with Python 3.12,

DuckDB, pandas, pyarrow, and httpx for local data processing
workflows inside OpenShell.

Signed-off-by: Apurv Kumaria <kumaria.apurv@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant