Skip to content
#

google-dataproc

Here are 10 public repositories matching this topic...

This project orchestrates a data processing workflow using Apache Airflow, Spark, Google Cloud Storage (GCS), and Snowflake. The workflow is designed to handle daily data updates, filter completed orders, and update a Snowflake target table with the latest information. The project leverages Apache Airflow for workflow scheduling and management.

  • Updated Jan 4, 2024
  • Python

Welcome to the MiniProjects Playground—an interactive space where learning meets doing! This repository is a collection of hands-on mini-projects that I've crafted after delving into various tech stacks and frameworks. From theory to application, each project is a testament to the practical side of coding.

  • Updated Dec 31, 2023
  • Python

Welcome to the Learning and Experiments Hub—a dynamic repository capturing my journey of exploration and experimentation in the vast world of technology. This space serves as a digital canvas where I document my learning process, experiments, and discoveries.

  • Updated Mar 26, 2025
  • Jupyter Notebook

🏗️⌾Terraform demo project provisioning a Google Cloud Dataproc cluster on GCP with configurable master and worker node types, autoscaling policies, initialization actions, custom metadata, Kerberos security, IAM service account bindings, staging bucket settings, and optional Spark and Hadoop component configuration for managed data processing work

  • Updated Apr 3, 2026
  • HCL

Improve this page

Add a description, image, and links to the google-dataproc topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the google-dataproc topic, visit your repo's landing page and select "manage topics."

Learn more