This project implements a production-style analytics platform for a fictional Seoul-based café selling coffees and cakes.
The system simulates transactional café orders, ingests them into a relational database with proper constraints, and exposes governed analytics through Apache Superset — all running in a fully containerized environment.
🎯 Objective
Build, debug, and operate a realistic data engineering pipeline rather than a demo-only dashboard.
┌──────────────────┐
│ Python Ingestion │
│   (Batch Job)    │
└────────┬─────────┘
         ↓
┌──────────────────┐
│  PostgreSQL 15   │
│  Orders & Items  │
└────────┬─────────┘
         ↓
┌──────────────────┐
│ Apache Superset  │
│ BI & Dashboards  │
└──────────────────┘
(All services orchestrated via Docker Compose)
- Designed around real POS systems
- Supports multi-item orders
- Optimized for analytics & aggregations
- Enforces data integrity via constraints
Menu items (one row per menu item):

| Column | Type / Constraint |
|---|---|
| item_no | INT, PRIMARY KEY |
| item_name | TEXT |
| category | coffee \| cake |
| description | TEXT |
| price | NUMERIC(8,2) |

Order line items (one row per item per order):

| Column | Type / Constraint |
|---|---|
| order_no, item_no | composite PRIMARY KEY |
| quantity | INT |
| order_ts | TIMESTAMP |

Grain: one row per item per order. This enables accurate revenue, quantity, and time-series analysis.
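A minimal DDL sketch of this layout is shown below. The table names items and order_items are assumptions for illustration (the project's actual initialization scripts may differ), and the connection parameters mirror the SQLAlchemy URI used later in this README, so the snippet only runs where the postgres hostname resolves (i.e. inside the Compose network):

```python
import psycopg2

# Illustrative DDL matching the columns described above.
DDL = """
CREATE TABLE IF NOT EXISTS items (
    item_no     INT PRIMARY KEY,
    item_name   TEXT NOT NULL,
    category    TEXT NOT NULL CHECK (category IN ('coffee', 'cake')),
    description TEXT,
    price       NUMERIC(8, 2) NOT NULL
);

CREATE TABLE IF NOT EXISTS order_items (
    order_no  INT NOT NULL,
    item_no   INT NOT NULL REFERENCES items (item_no),  -- FK to the menu
    quantity  INT NOT NULL CHECK (quantity > 0),
    order_ts  TIMESTAMP NOT NULL,
    PRIMARY KEY (order_no, item_no)                     -- line-item grain
);
"""

with psycopg2.connect(
    host="postgres", dbname="cafe", user="cafe_user", password="cafe_pass"
) as conn:
    with conn.cursor() as cur:
        cur.execute(DDL)
```

The composite primary key on (order_no, item_no) is what defines the line-item grain; aggregating quantity and price over it yields revenue and order-size metrics directly.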
The ingestion service:
- Inserts café menu items (idempotent)
- Generates realistic order volumes
- Creates multi-item orders
- Spreads orders across operating hours
- Enforces foreign-key relationships
📁 Location: ingestion/ingest.py
The ingestion container runs as a batch job and exits after completion.
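A condensed sketch of such a batch job follows, assuming the items / order_items tables from the schema section; the menu rows, order count, and the 08:00-22:00 operating window are illustrative placeholders rather than the values used in ingestion/ingest.py:

```python
import random
from datetime import datetime, timedelta

import psycopg2

# Illustrative menu rows: (item_no, item_name, category, description, price)
MENU = [
    (1, "Americano", "coffee", "Double shot over hot water", 4.50),
    (2, "Cafe Latte", "coffee", "Espresso with steamed milk", 5.00),
    (3, "Cheesecake", "cake", "Classic baked slice", 6.50),
]

def ingest(conn, n_orders: int = 200) -> None:
    with conn.cursor() as cur:
        # Idempotent menu load: re-running the job never duplicates items.
        cur.executemany(
            """INSERT INTO items (item_no, item_name, category, description, price)
               VALUES (%s, %s, %s, %s, %s)
               ON CONFLICT (item_no) DO NOTHING""",
            MENU,
        )
        opening = datetime.now().replace(hour=8, minute=0, second=0, microsecond=0)
        for order_no in range(1, n_orders + 1):
            # Spread orders across a 14-hour operating window.
            order_ts = opening + timedelta(minutes=random.randint(0, 14 * 60))
            # Multi-item orders: 1-3 distinct menu items, each with its own quantity.
            for item_no, *_ in random.sample(MENU, k=random.randint(1, 3)):
                cur.execute(
                    """INSERT INTO order_items (order_no, item_no, quantity, order_ts)
                       VALUES (%s, %s, %s, %s)""",
                    (order_no, item_no, random.randint(1, 3), order_ts),
                )

if __name__ == "__main__":
    # Connection details follow the URI documented below; "postgres" resolves
    # only on the Compose network, e.g. when run from the ingestion container.
    with psycopg2.connect(
        host="postgres", dbname="cafe", user="cafe_user", password="cafe_pass"
    ) as conn:
        ingest(conn)
```

Because order_items.item_no references items, the foreign-key relationship listed above is enforced by the database itself rather than by the generator script.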
Superset is used as the analytics layer, enabling:
- Revenue analysis by time
- Coffee vs cake sales comparison
- Average order size
- Item-level performance
- Peak café hours
Superset connects to PostgreSQL using the SQLAlchemy URI `postgresql+psycopg2://cafe_user:cafe_pass@postgres:5432/cafe`. Note that the host is the Compose service name postgres, not localhost.
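As an example of the analyses above, revenue by hour reduces to a single aggregation over the line-item grain. The sketch below assumes the items / order_items names used earlier and must run where the postgres hostname resolves (inside a container on the Compose network); the same SQL can be pasted into Superset's SQL Lab:

```python
from sqlalchemy import create_engine, text

engine = create_engine(
    "postgresql+psycopg2://cafe_user:cafe_pass@postgres:5432/cafe"
)

# Hourly revenue: line-item quantity times the menu price.
REVENUE_BY_HOUR = text("""
    SELECT date_trunc('hour', oi.order_ts) AS sales_hour,
           SUM(oi.quantity * i.price)      AS revenue
    FROM order_items AS oi
    JOIN items       AS i USING (item_no)
    GROUP BY sales_hour
    ORDER BY sales_hour
""")

with engine.connect() as conn:
    for sales_hour, revenue in conn.execute(REVENUE_BY_HOUR):
        print(sales_hour, revenue)
```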
This project intentionally implements governed analytics, not open access.
| Role | Capabilities |
|---|---|
| Admin | Full access (DB, SQL Lab, users, dashboards) |
| Analyst | Query datasets, build charts & dashboards |
| Viewer | Read-only dashboard access |
- Mirrors enterprise BI environments
- Prevents unrestricted SQL access
- Enables controlled data exposure
- Validates multi-user workflows
The platform was accessed and validated from:
- Multiple laptops
- Browser-only sessions
- Different user roles
This confirmed:
- Stateless UI behavior
- Dockerized service reliability
- Role-specific access enforcement
- Production-like usability
| Service | Purpose |
|---|---|
| postgres | Relational data store |
| ingestion | Batch data generator |
| superset | BI & analytics UI |
All services communicate via a private Docker network.
Superset requires database drivers to be installed inside its runtime environment.
A custom Dockerfile is used to safely enable PostgreSQL support.
📁 superset/Dockerfile
FROM apache/superset:latest
USER root
RUN apt-get update && apt-get install -y \
    build-essential \
    libpq-dev \
    gcc \
    && rm -rf /var/lib/apt/lists/*

RUN pip install --no-cache-dir \
    psycopg2-binary \
    sqlalchemy \
    sqlalchemy-utils \
    redis \
    pymysql \
    clickhouse-connect

USER superset

This resolves common PostgresEngineSpec errors in Superset.
git clone <repo-url>
cd seoul-cafe-analytics

docker compose down -v
docker compose build --no-cache
docker compose up

- URL: http://localhost:8088
- Username: admin
- Password: admin
To verify the PostgreSQL driver inside the Superset container:

docker exec -it cafe_superset superset shell

import psycopg2
psycopg2.__version__

Key issues worked through along the way:

- Superset driver loading issues
- Docker build vs runtime Python environments
- Foreign-key constraint handling
- Duplicate primary key errors
- Secure, role-based analytics access
Harsh Indoria
