amansarohadev/README.md

🎯 The Origin Story

"I spent 14 months keeping Microsoft's Azure SQL databases alive in production at HCL Technologies. Now I'm building the data pipelines and warehouses behind them."

I'm a B.Tech graduate (2022) with a background that runs through enterprise Microsoft support → deliberate cloud engineering pivot → live 2026 portfolio sprint.

While most engineers learn in sandboxes, I was diagnosing critical production SQL deadlocks, performance bottlenecks, and high-priority escalations as an Azure SQL DB Technical Support Engineer at HCL Technologies, with Microsoft as the direct client.

In January 2024, I made a calculated decision: go deep into Azure Data Engineering. I call the 26-month period that followed my "Advanced Cloud Residency".

Today I'm applying for Data Analyst and Azure Data Specialist roles simultaneously, and committing to GitHub every week.


🔴 What I'm Building Right Now

| Status | Project | Stack | Repo |
| --- | --- | --- | --- |
| ● LIVE | SQL Data Warehouse: Medallion Architecture | Azure Synapse · PySpark · Bronze/Silver/Gold | → View |
| ● LIVE | ADF Pipeline System: Incremental Loads | Azure Data Factory · Parent-Child Pipelines | → View |
| ⚡ Next | Power BI Dashboard: Advanced DAX | Power BI · DAX · Data Modeling | Coming soon |

🛤️ The Journey

Oct 2022 ─────── Dec 2023        Jan 2024 ────────────── Feb 2026        March 2026 → Now
      │                                   │                                      │
 HCL Technologies               Advanced Cloud                           Active Sprint
 Microsoft Client               Residency                                DA + ADE Roles
 Azure SQL DB                   ADF · Databricks                         Applying Now
 Tech Support Eng.              Synapse · PySpark                        GitHub: Active
      │                         Medallion Arch.                               │
      └───────────────────────────────────────────────────────────────────────┘
                    Every step intentional. Every commit deliberate.

⚡ Data Engineering Track

| Tool | Proficiency |
| --- | --- |
| Azure Data Factory | ██████████░ 85% |
| Azure Databricks | █████████░░ 80% |
| PySpark | ████████░░░ 78% |
| Azure Synapse Analytics | ███████░░░░ 75% |
| Medallion Architecture | ██████████░ 85% |
| Incremental Load + Parent-Child Patterns | █████████░░ 83% |

Cloud & Compute: Azure · ADF · Databricks · Synapse · PySpark · Delta Lake


📊 Data Analytics Track

BI & Visualization: Power BI · Tableau · Excel

Languages & Databases: Python · SQL · PostgreSQL · MySQL · Pandas · NumPy


🗂️ Featured Projects

⚡ ADF Pipeline System

Enterprise-grade incremental load pipelines with Parent-Child orchestration. Schema drift handling, retry logic, parameterized ingestion.

Stack: Azure Data Factory · PySpark · Azure

View Live
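
For readers new to the pattern, here is a minimal sketch of a watermark-based incremental load with a parent step that calls a child step once per table. It is plain Python, not ADF pipeline JSON, and it is not taken from the repo: the function names, the `modified_at` field, and the `orders` table are all hypothetical.

```python
from datetime import datetime

def load_increment(rows, last_watermark):
    """Child step: keep rows modified after the watermark and compute the new one."""
    new_rows = [r for r in rows if r["modified_at"] > last_watermark]
    # If nothing changed, the watermark stays where it was.
    new_watermark = max((r["modified_at"] for r in new_rows), default=last_watermark)
    return new_rows, new_watermark

def run_parent(sources, watermarks):
    """Parent step: loop over source tables and call the child once per table."""
    loaded = {}
    for table, rows in sources.items():
        start = watermarks.get(table, datetime.min)  # first run loads everything
        loaded[table], watermarks[table] = load_increment(rows, start)
    return loaded

# Hypothetical source data and stored watermark, for illustration only.
sources = {
    "orders": [
        {"id": 1, "modified_at": datetime(2026, 3, 1)},
        {"id": 2, "modified_at": datetime(2026, 3, 5)},
    ],
}
watermarks = {"orders": datetime(2026, 3, 2)}

first_run = run_parent(sources, watermarks)   # picks up only the row changed after 2 March
second_run = run_parent(sources, watermarks)  # nothing new, so nothing is reloaded
```

Storing the watermark per table is what lets one parent drive many child runs without reloading rows that were already ingested.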

🏗️ SQL Data Warehouse: Medallion

End-to-end DW from scratch. Bronze (raw) → Silver (clean) → Gold (analytics-ready). Full transformation layer.

Stack: Azure Synapse · SQL DW · Medallion Architecture

View Live
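
The Bronze → Silver → Gold flow can be illustrated with a small, self-contained sketch. It uses plain Python lists instead of Synapse tables or PySpark DataFrames, and the sample rows, column names, and cleaning rules are assumptions made up for the example, not the project's actual schema.

```python
from collections import defaultdict

# Bronze: raw records exactly as ingested (strings, duplicates, missing values).
bronze = [
    {"order_id": "1", "region": " north ", "amount": "10.50"},
    {"order_id": "1", "region": " north ", "amount": "10.50"},  # duplicate row
    {"order_id": "2", "region": "South", "amount": None},       # missing amount
    {"order_id": "3", "region": "south", "amount": "4.50"},
]

def to_silver(rows):
    """Silver: drop incomplete rows, deduplicate on order_id, cast and normalize."""
    seen, clean = set(), []
    for r in rows:
        if r["amount"] is None or r["order_id"] in seen:
            continue
        seen.add(r["order_id"])
        clean.append({
            "order_id": int(r["order_id"]),
            "region": r["region"].strip().lower(),
            "amount": float(r["amount"]),
        })
    return clean

def to_gold(rows):
    """Gold: aggregate cleaned rows into an analytics-ready total per region."""
    totals = defaultdict(float)
    for r in rows:
        totals[r["region"]] += r["amount"]
    return dict(totals)

silver = to_silver(bronze)
gold = to_gold(silver)
```

Keeping each layer as its own function mirrors the point of the architecture: raw data stays untouched in Bronze, so Silver and Gold can be rebuilt whenever the cleaning or aggregation rules change.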

📦 Vendor Performance Analysis

End-to-end retail pipeline. 2GB+ dataset, Python + SQLAlchemy, statistical hypothesis testing, Power BI dashboard.

Stack: Python · SQLAlchemy · Power BI · Statistics

View

🏑 Seattle Airbnb Market Insights

Full EDA revealing pricing patterns, seasonality, and neighborhood intelligence. Tableau dashboard with 15+ views.

Stack: Python · Pandas · Tableau · EDA

View


📊 GitHub Stats

[Images: GitHub stats card, top languages card, contribution snake]

💡 The Gap Isn't the Weakness. It's the Differentiator.

From production support for Microsoft's Azure SQL databases at HCL Technologies to engineering the data pipelines behind enterprise decisions. Every month of the "residency" was a deposit. March 2026 is the withdrawal.

→ Portfolio · → LinkedIn · → Email

Pinned

  1. adf (Public)

    Azure Data Factory pipeline implementations | Incremental loads, parameterized pipelines, ForEach + Get Metadata patterns | Production-ready ADF

  2. Vendor-Performance-Analysis (Public)

    End-to-end retail analytics pipeline | Python, SQLAlchemy, Power BI | 2GB+ dataset | Statistical testing on vendor KPIs | Business-ready insights

    Jupyter Notebook

  3. azure-de-learning-journal (Public)

    Azure Data Engineering build log | ADF, Databricks, PySpark, Synapse | Medallion Architecture implementation | Documented end-to-end

    Jupyter Notebook

  4. hr-analytics-mysql (Public)

    HR analytics using advanced SQL | Window functions, CTEs, views, indexing | Optimized for dashboard consumption | MySQL