Biraj

i sometimes build things from scratch to understand how they work. i also write blogs, play chess (1900 rapid on lichess) and badminton, and read books.

biraj.pub@gmail.com

GitHub: github.com/biraj21 | Twitter: x.com/biraj21_ | Lichess: lichess.org/@/biraj21 | LinkedIn: linkedin.com/in/biraj21

work experience


projects

LLM serving engine from scratch (Python)

a toy LLM server built from scratch with batched inference, an autoregressive generation loop for token streaming, request queueing, and independent sequence completion (a finished sequence leaves the batch without waiting for the others). supports both streaming and non-streaming responses via FastAPI. also includes Whisper for speech-to-text (STT).
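the batching idea above can be sketched in a few lines. this is a minimal illustration, not the project's actual code: `generate_batch`, `next_tokens`, and the toy "model" are hypothetical names, and a real engine would run a forward pass of the model where the stub is called.

```python
from typing import Callable, List


def generate_batch(
    prompts: List[List[int]],
    next_tokens: Callable[[List[List[int]]], List[int]],
    eos: int,
    max_new_tokens: int,
) -> List[List[int]]:
    """Autoregressive loop over a batch; each sequence stops on its own EOS."""
    sequences = [list(p) for p in prompts]
    active = list(range(len(sequences)))  # indices of unfinished sequences

    for _ in range(max_new_tokens):
        if not active:
            break
        # One batched "forward pass" over only the unfinished sequences.
        new_tokens = next_tokens([sequences[i] for i in active])
        still_active = []
        for i, token in zip(active, new_tokens):
            sequences[i].append(token)  # a streaming server would emit it here
            if token != eos:
                still_active.append(i)
        active = still_active
    return sequences


def toy_model(batch: List[List[int]]) -> List[int]:
    """Hypothetical stand-in for a model: predicts last token + 1."""
    return [seq[-1] + 1 for seq in batch]


result = generate_batch([[1], [3]], toy_model, eos=5, max_new_tokens=10)
print(result)  # [[1, 2, 3, 4, 5], [3, 4, 5]]
```

the second sequence reaches the end-of-sequence token after two steps and drops out of the batch, while the first keeps generating.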

DotDB: in-browser vector database (TypeScript)

a lightweight vector database that runs entirely in the browser. supports cosine similarity search, persistent storage via IndexedDB, and batch operations. uses transformers.js with bge-small-en-v1.5 for generating embeddings. implements brute-force k-NN search with Float32Array for memory efficiency.
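brute-force k-NN with cosine similarity is simple enough to sketch. the snippet below is a Python illustration of the technique, not DotDB's TypeScript code: the function names are hypothetical, and `array("f", ...)` stands in for `Float32Array` as a compact 32-bit float buffer.

```python
import math
from array import array


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def knn(query, vectors, k):
    """Score every stored vector against the query and keep the best k."""
    scored = [(cosine_similarity(query, v), i) for i, v in enumerate(vectors)]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]


docs = [
    array("f", [1.0, 0.0]),
    array("f", [0.0, 1.0]),
    array("f", [1.0, 1.0]),
]
top = knn(array("f", [1.0, 0.1]), docs, k=2)
print([index for _, index in top])  # [0, 2]
```

every query scans all stored vectors, so the cost grows linearly with the collection size. that is usually acceptable for the small collections a browser tab holds.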

Focker: Linux containers (Go)

Focker is a toy container runtime written in Go to understand Docker internals. implements Linux namespace isolation (PID, mount, UTS), runs processes inside an extracted Ubuntu 22.04 root filesystem, and supports bind mounts for sharing host directories with containers.

Ask My AI: Chrome extension (TypeScript)

a Chrome extension that brings AI assistants into a sidebar. highlight any text and press a shortcut to get instant explanations without switching tabs. hit 100+ users and 16+ five-star ratings within days of launch. built it because existing extensions charged for wrapping AI APIs in a new UI, which seemed redundant when users already pay for ChatGPT/Claude/Gemini subscriptions. available on the Chrome Web Store.

NumPy from scratch in Go (Go)

i was implementing neural networks from scratch in Go and, as a byproduct, created my own version of NumPy for tensor operations like +, -, *, /, matrix multiplication, transpose, etc. it also supports broadcasting.
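broadcasting comes down to one shape rule: align the two shapes from the right, and each pair of dimensions must either match or contain a 1. the sketch below shows that rule as NumPy defines it, written in Python for brevity. `broadcast_shape` is a hypothetical helper, not a function from the Go library.

```python
def broadcast_shape(a, b):
    """Return the shape two operands broadcast to, following NumPy's rule."""
    result = []
    for offset in range(1, max(len(a), len(b)) + 1):
        # Walk both shapes from the trailing dimension; missing dims act as 1.
        dim_a = a[-offset] if offset <= len(a) else 1
        dim_b = b[-offset] if offset <= len(b) else 1
        if dim_a != dim_b and dim_a != 1 and dim_b != 1:
            raise ValueError(f"shapes {a} and {b} cannot be broadcast")
        # A dimension of 1 stretches to match the other operand.
        result.append(dim_a if dim_b == 1 else dim_b)
    return tuple(reversed(result))


print(broadcast_shape((3, 1), (4,)))       # (3, 4)
print(broadcast_shape((2, 3, 4), (3, 1)))  # (2, 3, 4)
```

once the output shape is known, an element-wise operation reads each operand by treating its stretched dimensions as index 0, so no data has to be copied.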

TCP server (C)

a single-threaded TCP server written in C with an event loop using the poll() system call, without any third-party library. the server listens on a port and echoes back received data. i've also written a simple client to test the server.
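the shape of a poll()-based event loop is easier to see in code. this is a rough Python sketch using the standard library's `select.poll` wrapper around the same system call (Unix only), not the C implementation. `serve_echo` and its `max_events` parameter are hypothetical, the latter added only so the loop can stop in a demo.

```python
import select
import socket


def serve_echo(listener, max_events=None):
    """Echo received data back to clients from a single-threaded poll() loop."""
    listener.setblocking(False)
    poller = select.poll()
    poller.register(listener, select.POLLIN)
    clients = {}  # file descriptor -> connected socket
    handled = 0

    while max_events is None or handled < max_events:
        for fd, event in poller.poll(1000):  # wait up to 1000 ms
            handled += 1
            if fd == listener.fileno():
                # The listening socket is readable: a client is waiting.
                conn, _ = listener.accept()
                clients[conn.fileno()] = conn
                poller.register(conn, select.POLLIN)
                continue
            conn = clients[fd]
            try:
                data = conn.recv(4096) if event & select.POLLIN else b""
            except OSError:
                data = b""
            if data:
                # Simplification: a blocking send. A fuller server would
                # buffer the data and wait for POLLOUT before writing.
                conn.sendall(data)
            else:
                # Empty read, hang-up, or error: drop the client.
                poller.unregister(fd)
                conn.close()
                del clients[fd]

    for conn in clients.values():
        conn.close()
```

to try it, bind and listen on a socket, pass it to `serve_echo`, and connect with any TCP client (for example `nc 127.0.0.1 <port>`); whatever the client sends comes straight back.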

Obsy: AI observability platform (Next.js, Node.js, TypeScript)

Obsy is an AI observability tool that provides insights into AI operations. it has a Node.js SDK that automatically instruments OpenAI, Pinecone, and Vercel's AI SDK. the dashboard shows LLM traces with timing and usage metrics for each stage. built this at a hackathon.

Gomon: nodemon for Go files (Go)

a CLI tool to run Go programs in watch mode. to watch for changes, it uses the kqueue() system call on macOS and inotify on Linux. no third-party dependencies.


blogs