duc-dn/duc-dn

👋 Hi, I'm Duc

🚀 Data Engineer @ VNPT AI
🔹 Building scalable Lakehouse & Data Platforms
🔹 Experienced with Big Data, Streaming, and Cloud Infrastructure
🔹 Passionate about Data Infrastructure, APIs, and Workflow Orchestration


🔧 Tech Stack

💻 Programming Languages

  • Python (ETL, APIs, data pipelines, orchestration)
  • Java (Big Data, Kafka, Flink, Spark ecosystem)

📊 Data & Lakehouse

  • Apache Iceberg, Delta Lake
  • Apache Spark, Apache Flink
  • Kafka, Kafka Connect, Debezium (CDC from Postgres/MySQL/MongoDB)

☁️ Cloud & Storage

  • Google BigQuery, Cloud Scheduler
  • AWS S3, MinIO
  • GCS (Google Cloud Storage)

🗄️ Databases & Vector Search

  • PostgreSQL, MySQL, MongoDB
  • Qdrant (Vector Database)

📈 BI & Visualization

  • Apache Superset

🕒 Workflow Orchestration & Scheduling

  • Apache Airflow, cron jobs, Cloud Scheduler

βš™οΈ DevOps & Infra

  • Docker, Docker Compose
  • Kubernetes, Helm
  • Terraform, GitHub Actions

🌐 API & Software

  • FastAPI
  • Git, GitHub (version control & collaboration)

📌 Featured Projects


🌱 What I’m Learning

  • Data mesh & federated query engines (Trino/Presto, Dremio)
  • Advanced Iceberg optimizations (partitioning, compaction, metadata scaling)
  • Hybrid pipelines (batch + streaming with Flink + Spark)
  • AI/LLM integration with vector databases (Qdrant)

📫 Connect with Me


⭐️ From ducdn

📊 GitHub Stats:



