WatchTree-19

Follow

WatchTree-19

Follow

1 follower · 0 following

Achievements

Achievements

WatchTree-19/README.md

Hi, I'm Sandeep

Quantitative engineer working on AI SRE eval and benchmarks. Independent Researcher (Columbia University, alumnus), UK-based.

What I'm building

Independent writing on AI evaluation methodology, observability, and the structural overlap between quant trading and LLM eval.
Asymmetric-information solutions in ML evaluation, surfacing what labs know internally about benchmark noise and drift.
Calibration tooling for benchmark drift, distinguishing genuine model improvement from eval movement.

Currently working on

A foundational essay on production observability for LLM agents.
A weekly paper digest series on alignment, evaluation methodology, and AI safety research.
"Benchmark crowding": mapping factor decay in quant finance to benchmark saturation in LLM evaluation.

Around the web

Substack: substack.com/@sandeeprai1
Hugging Face: huggingface.co/SandeepRai1
Google Scholar: scholar.google.co.uk/citations?user=GF8g3_QAAAAJ
Email: sandeeprai_dsp@hotmail.com

Popular repositories Loading

About-me About-me Public

Mostly interested in optimization and web3.
QR_Backtest_Sentiment-analysis_Stock-market QR_Backtest_Sentiment-analysis_Stock-market Public

Jupyter Notebook
Crypto_NFT-100-commission Crypto_NFT-100-commission Public

Basic script demonstrating full commission given

Solidity
Crypto_NFT-pre-sale Crypto_NFT-pre-sale Public

Solidity
Crypto_Decentralized-voting-attempt Crypto_Decentralized-voting-attempt Public

May not be fully functional anymore
General_Python_Anonymous-google-extraction- General_Python_Anonymous-google-extraction- Public

Anonymous google extraction via Tor

Python