Quantitative engineer working on AI SRE evaluation and benchmarks. Independent researcher (Columbia University alumnus), based in the UK.
- Independent writing on AI evaluation methodology, observability, and the structural overlap between quant trading and LLM eval.
- Solutions to information asymmetry in ML evaluation, surfacing what labs know internally about benchmark noise and drift.
- Calibration tooling for benchmark drift, distinguishing genuine model improvement from movement in the eval itself.
- A foundational essay on production observability for LLM agents.
- A weekly paper digest series on alignment, evaluation methodology, and AI safety research.
- "Benchmark crowding": mapping factor decay in quant finance to benchmark saturation in LLM evaluation.
- Substack: substack.com/@sandeeprai1
- Hugging Face: huggingface.co/SandeepRai1
- Google Scholar: scholar.google.co.uk/citations?user=GF8g3_QAAAAJ
- Email: sandeeprai_dsp@hotmail.com
