Who is Kaushik Saravanan?

Kaushik Saravanan is an AI/ML engineer and MS in Artificial Intelligence Engineering candidate at Carnegie Mellon University (ECE, expected December 2027), based in Pittsburgh, PA. He was previously an Associate Application Engineer at SAP Labs India (2024–2026), where he shipped production GDPR-compliant RAG and LLM systems to 400+ users. IEEE-published researcher and Smart India Hackathon 2022 winner.

Is Kaushik Saravanan open to new AI/ML roles?

Yes. Kaushik is open to Summer 2027 AI/ML and RAG internships in the US, and full-time AI engineering roles starting January 2028 after his CMU MS-AIE graduation. Reach out via LinkedIn (linkedin.com/in/kaushiksss) or X (@Kaushiks0).

Does Kaushik need visa sponsorship?

Kaushik is an F-1 international student at Carnegie Mellon University. He has 3-year STEM OPT eligibility after his December 2027 graduation, and is open to employers who sponsor H-1B afterward.

What did Kaushik build at SAP Labs India?

At SAP Labs India (2024–2026) he engineered a GDPR-compliant, privacy-first RAG platform for SAP's internal chatbot. He scaled it to 2M+ documents and 400+ users with <2s p95 end-to-end latency, fine-tuned DeBERTa for Germany-specific PII detection (94% recall@10, MRR@10=0.82), and rewrote a credential-fetch client in dependency-free Go for 9,000+ Linux servers.

What are Kaushik's IEEE publications?

Two IEEE papers: 'Swarm Intelligence-Based Cooperative Intelligent Transportation System' (ICCIES 2025) and 'Cognitive Intrusion Detection System in Autonomous Vehicles Using Machine Learning' (ICPECTS 2024).

What is Kaushik's tech stack?

Python, Go, FastAPI, PyTorch, TensorFlow, Hugging Face Transformers, LangChain, PostgreSQL, Docker, Kubernetes, NVIDIA CUDA, Google Cloud Platform, and Microsoft Azure. Specializes in RAG pipelines, LLM fine-tuning (DeBERTa, QLoRA), and cloud observability.

Blog

Q: What are Kaushik's IEEE publications?

Two IEEE papers: 'Swarm Intelligence-Based Cooperative Intelligent Transportation System' (ICCIES 2025) and 'Cognitive Intrusion Detection System in Autonomous Vehicles Using Machine Learning' (ICPECTS 2024).

Q: What is Kaushik's tech stack?

Python, Go, FastAPI, PyTorch, TensorFlow, Hugging Face Transformers, LangChain, PostgreSQL, Docker, Kubernetes, NVIDIA CUDA, Google Cloud Platform, and Microsoft Azure. Specializes in RAG pipelines, LLM fine-tuning (DeBERTa, QLoRA), and cloud observability.

Technical insights on software engineering, AI, cloud computing, and more.

Local

An LRU Key-Rotation State Machine for a Personal Credential Vault

July 4, 2026

9 min read

Why I stopped hardcoding API keys in .env files and built CipherStack instead. The four-state machine behind LRU vending, the PostgreSQL lock that saved it under concurrency, and what an evening's worth of race conditions taught me about treating provider keys as a fleet.

SecurityPostgreSQLRate LimitingCipherStack

Local

A Dependency-Free Go Binary Is the Right Answer for a 9,000-Server Fleet

July 4, 2026

9 min read

Why I stopped shipping a Python client to 9,000 Linux servers for a security-critical credential fetch, what static linking actually buys you at fleet scale, and the surprising moment a stripped Go binary weighed less than the Python container image it replaced.

GoInfrastructureSecuritySAP

Local

The <700ms Latency Budget for a Personal AI Voicemail Line

July 4, 2026

8 min read

I gave my phone number to an AI. The naive path was Gemini Live for ~200ms speech-to-speech. That didn't survive contact with reality. Here's the three-stage pipeline I fell back to, the per-stage latency budget it forced, and the UX trick that makes 700ms feel like 300.

VoiceLiveKitLLMLatency

Local

20x GPU Speedup on Multimedia Indexing: Cache Locality, Batch Shape, and Where I Stopped

July 4, 2026

9 min read

How a TF-IDF/NLP indexer for 1,000+ multimedia files went from 30 seconds to 1.5 on a single GPU. Batch shape mattered more than batch size, torch.compile earned its keep for a reason I didn't expect, and I burned three engineer-days chasing the last 10% before I quit.

GPUCUDAPyTorchPerformance

Local

Guard-Rails Every Personal AI Should Have (Lessons from Shipping Dyx)

July 4, 2026

11 min read

My phone number is on the internet, and an LLM answers it. Six months in, here are the guard-rails that actually mattered — and the one 'never do this' rule I had to soften because it contradicted the site that advertised the bot.

AI SafetyLLMGuard RailsPrompt Engineering

Local

What HyperFrames Taught Me About Deterministic Video Rendering

July 4, 2026

8 min read

Same input, same pixels, every render. The non-negotiable behind HyperFrames — and the places non-determinism kept sneaking back in. Font loaders, rAF cadence, GC pauses, and the one adapter that almost broke the contract.

VideoDeterminismHyperFramesBrowser Rendering

Local

Redact at Retrieval, Not at Ingest: A GDPR-Compliant RAG Architecture

July 4, 2026

9 min read

The naive PII strategy is to scrub the corpus at index time. It's also the strategy that quietly destroys recall on every query that legitimately mentions a public entity. Here's why I moved the redaction pass downstream of retrieval — and how a DeBERTa PII model, an HNSW index, and a cross-encoder reranker fit inside a sub-2s p95 budget without stepping on each other.

RAGGDPRArchitectureSAP

Local

IEEE ICCIES 2025: Swarm Intelligence for Cooperative ITS — and the Parts We Cut

June 30, 2026

7 min read

The paper that landed at ICCIES 2025 argued for swarm heuristics over MARL for cooperative intersection control. This is the honest version — what we shipped, the RL baseline that never converged, the traffic-sim adapter that didn't make review, and what I'd take further at CMU.

ResearchIEEESwarm IntelligenceAutonomous Vehicles

Local

Why We Fine-Tuned DeBERTa-base and Not XLM-R for German PII

June 25, 2026

10 min read

The multilingual model was the obvious pick and it lost by six F1 points. Why tokenizer coverage beat parameter breadth on Bundesdatenschutzgesetz entities, what disentangled attention did to German compound nouns, and the cost of specializing a model inside a regulated pipeline.

NLPDeBERTaFine-tuningGDPR

Local

HNSW or IVF-PQ? What I Actually Chose at 2M Documents

June 18, 2026

7 min read

The recall-vs-memory decision behind a GDPR-compliant RAG platform. Why I stopped reaching for IVF-PQ, what the graph index cost me in RAM, and the one thing about HNSW's insert path I didn't see coming until we crossed a million vectors.

RAGVector SearchHNSWSAP

Medium

Synced

Claude Code Wouldn’t Start on Windows — The Real Reason Took Me Hours to Find

May 5, 2026

3 min read

I Couldn’t Get Claude Code to Start on Windows. It Took Me Way Too Long to Figure Out Why.So there I was, trying to start Claude Code on my Windows machine.I typed:claudeHit Enter… and nothing.The trust prompt just sat...

Medium

Synced

Installing Burp Suite’s CA Certificate in Chrome (2026 Updated Guide)

February 3, 2026

3 min read

If you’ve tried following PortSwigger’s official documentation for installing Burp Suite’s CA certificate in Chrome, you probably noticed the screenshots and instructions don’t match what you see on your screen. That’s...

Local

🎧 I Reverse-Engineered ChatGPT’s Voice Data Flow and Found My Own Voice Hidden in a ZIP File

August 11, 2025

3 min read

How I went from missing transcript frustration to finding my actual ChatGPT voice recordings through reverse engineering, DevTools, Burp Suite, and… a simple ZIP file.

Reverse EngineeringChatGPTJavaScript

Substack

Synced

Love the Hunt, Not the Prize

August 9, 2025

4 min read

Subscribe nowYou do something you were confident about, but suddenly your momentum falters. Now, it’s up to you to identify the issue and restore yourself to your previous stable state.I Reverse-Engineered ChatGPT’s Voi...

Substack

Synced

ampersnow: where thoughts take shape

August 9, 2025

1 min read

This is ampersnow. a space for ideas, thoughts, and questions. Here, every post is a spark, every thought gets a second glance, because curiosity drives us. Follow me for the journeyI am going to nerd out here, no more...