Who is Kaushik Saravanan?

Kaushik Saravanan is an AI/ML engineer and MS in Artificial Intelligence Engineering candidate at Carnegie Mellon University (ECE, expected December 2027), based in Pittsburgh, PA. He was previously an Associate Application Engineer at SAP Labs India (2024–2026), where he shipped production GDPR-compliant RAG and LLM systems to 400+ users. IEEE-published researcher and Smart India Hackathon 2022 winner.

Is Kaushik Saravanan open to new AI/ML roles?

Yes. Kaushik is open to Summer 2027 AI/ML and RAG internships in the US, and full-time AI engineering roles starting January 2028 after his CMU MS-AIE graduation. Reach out via LinkedIn (linkedin.com/in/kaushiksss) or X (@Kaushiks0).

Does Kaushik need visa sponsorship?

Kaushik is an F-1 international student at Carnegie Mellon University. He has 3-year STEM OPT eligibility after his December 2027 graduation, and is open to employers who sponsor H-1B afterward.

What did Kaushik build at SAP Labs India?

At SAP Labs India (2024–2026) he engineered a GDPR-compliant, privacy-first RAG platform for SAP's internal chatbot. He scaled it to 2M+ documents and 400+ users with <2s p95 end-to-end latency, fine-tuned DeBERTa for Germany-specific PII detection (94% recall@10, MRR@10=0.82), and rewrote a credential-fetch client in dependency-free Go for 9,000+ Linux servers.

What are Kaushik's IEEE publications?

Two IEEE papers: 'Swarm Intelligence-Based Cooperative Intelligent Transportation System' (ICCIES 2025) and 'Cognitive Intrusion Detection System in Autonomous Vehicles Using Machine Learning' (ICPECTS 2024).

What is Kaushik's tech stack?

Python, Go, FastAPI, PyTorch, TensorFlow, Hugging Face Transformers, LangChain, PostgreSQL, Docker, Kubernetes, NVIDIA CUDA, Google Cloud Platform, and Microsoft Azure. Specializes in RAG pipelines, LLM fine-tuning (DeBERTa, QLoRA), and cloud observability.

IEEE ICCIES 2025: Swarm Intelligence for Cooperative ITS — and the Parts We Cut

Q: What are Kaushik's IEEE publications?

Two IEEE papers: 'Swarm Intelligence-Based Cooperative Intelligent Transportation System' (ICCIES 2025) and 'Cognitive Intrusion Detection System in Autonomous Vehicles Using Machine Learning' (ICPECTS 2024).

Q: What is Kaushik's tech stack?

Python, Go, FastAPI, PyTorch, TensorFlow, Hugging Face Transformers, LangChain, PostgreSQL, Docker, Kubernetes, NVIDIA CUDA, Google Cloud Platform, and Microsoft Azure. Specializes in RAG pipelines, LLM fine-tuning (DeBERTa, QLoRA), and cloud observability.

The problem

Cooperative Intelligent Transportation Systems (C-ITS) have a specific coordination problem that classical traffic-light optimization does not: the vehicles themselves are the decision agents. There is no central signal head at the intersection deciding who goes. There is a fleet of connected vehicles approaching a shared conflict zone, each with its own local view, each latency-bound to sub-100ms decisions, and none of them are allowed to assume a working uplink to a cloud coordinator.

The paper we submitted to IEEE ICCIES 2025 — "Swarm Intelligence-Based Cooperative Intelligent Transportation System" — was about the decision layer that sits underneath that. Given a four-way intersection, a set of approaching CAVs (connected autonomous vehicles), and no central authority, how do the agents negotiate ordering and speed profiles fast enough that the intersection clears without a stop?

The constraint we actually cared about was not throughput. It was behavior under partial connectivity. Every C-ITS paper I read at the time reported gorgeous throughput curves under the assumption that every agent could talk to every other agent, every tick. In our simulation, that assumption held for exactly zero of the real-world V2X traces we could get our hands on.

Why swarm heuristics over MARL

The reflex, in 2024–2025, was to reach for multi-agent reinforcement learning. QMIX, MADDPG, MAPPO — the shelf was full. And on the paper benchmarks, MARL wins.

We didn't pick MARL. Three reasons:

Convergence under non-stationarity. Every vehicle's policy is another vehicle's environment. MARL papers handle this with centralized training and decentralized execution, which needs a training-time oracle we did not have and could not fake.
Explainability at review time. A swarm heuristic answers "why did the vehicle yield?" with a pheromone value and a local rule. A neural policy answers with an activation vector. Guess which one gets through peer review faster.
Failure mode when connectivity drops. A swarm agent that loses its neighbors falls back to a conservative local rule and stops. A MARL agent runs a policy trained on a joint observation it no longer has. In our early runs, the MARL fallback was worse than "just stop."

Swarm intelligence — specifically an ACO-flavored (ant colony optimization) heuristic with a PSO-flavored velocity update for the speed profile — was the boring choice that composed cleanly with the constraint. Each vehicle deposits a virtual pheromone on the intersection lanes it plans to cross, decays over time, and reads its neighbors' pheromones through V2X broadcasts. The intersection clears in the order that emerges from the pheromone gradient, not the order a central authority picks.

The tradeoff

Axis	MARL (QMIX / MAPPO family)	Our swarm heuristic
Peak throughput in fully-connected simulation	higher	comparable
Behavior under 30–50% packet loss	degrades sharply	graceful degradation
Training data required	large — millions of joint episodes	none — heuristic parameters only
Explainability to a traffic engineer	opaque activations	pheromone value + local rule
Compute at the vehicle	GPU-class for inference on some architectures	fits on the ECU we targeted
Time to a working baseline	weeks	days
Failure mode on comms drop	policy runs on stale joint obs	falls back to local yield rule
Formal safety-argument story	hard	tractable

The trade was explicit. We traded ceiling throughput for floor safety, and we traded end-to-end learned behavior for something a domain reviewer could actually read.

The decision loop, roughly

# Per-vehicle decision loop, called every planning tick (~50ms in sim).
# The two things that mattered were the pheromone decay rate and the
# yield-rule threshold — everything else was second-order.
 
def swarm_decide(self, neighbors, intersection):
    # 1. Read pheromones from neighbors' V2X broadcasts (may be partial).
    field = pheromone_field(neighbors, decay=self.rho)
 
    # 2. Score each candidate maneuver: {go, yield, slow}.
    scored = {}
    for m in candidate_maneuvers(self.state, intersection):
        conflict = field.conflict_score(m.path, m.arrival_window)
        urgency  = self.urgency(m)              # local: fuel, delay, priority
        safety   = self.safety_margin(m, neighbors)
        scored[m] = (safety, -conflict, urgency)  # lex order
 
    # 3. Pick best; if conflict above threshold, fall back to yield rule.
    best = max(scored, key=scored.get)
    if field.conflict_score(best.path, best.arrival_window) > self.yield_tau:
        best = local_yield_rule(self.state, intersection)   # comms-independent
 
    # 4. Deposit pheromone on chosen path for downstream agents.
    self.broadcast_pheromone(best.path, mass=self.tau_dep)
 
    return best

The local_yield_rule at step 3 is the entire reason the paper cleared review. It is a boring right-of-way rule — the same one a human driver would use at an unsignalized intersection with no other information. It is what runs when V2X is dead. Everything above it is optimization; that line is the safety floor.

A four-way intersection viewed from above, with four approaching vehicles and their V2X reception arcs drawn as teal shaded disks. The intersection lanes carry a pheromone-decay heat-map overlay — warmer teal patches under recently-passed vehicles, cooler patches where the field has decayed. The northbound vehicle's arc is clipped on the east side (dashed, simulating 50% packet loss) yet the local_yield_rule still fires because the pheromone in front of it is fresh. A right-side inset shows the MARL baseline: a joint-observation vector π(a₁..a₄ | o₁..o₄) with o₂ struck through and a 'policy undefined' box beneath it, contrasted with the swarm form a_i = f(o_i, φ(local)) that stays defined when a neighbor drops. A bottom callout reads 'swarm: slope · MARL: cliff' at 50% packet loss. — Swarm coordination degrades along a slope; the MARL baseline degrades along a cliff. The local yield rule is the safety floor — it is what runs when V2X is dead.

The result table and one honest ablation

The paper reports the throughput and average intersection-clearing time under three connectivity regimes: full V2X, 30% packet loss, and 50% packet loss. Full-connectivity numbers are competitive with the MARL baselines we could get to converge; the interesting result is the shape of the degradation curve. Ours slopes; theirs cliff.

The honest ablation is the one on pheromone decay rate rho. There is a sweet spot around a decay half-life that matches the typical intersection-crossing time — decay too fast and neighbors don't have time to read your intent, decay too slow and stale intent pollutes the field long after the vehicle has passed. The paper reports the sweep. What the paper does not fully advertise is that this parameter is the load-bearing knob of the entire system. If a downstream implementer misses this, the whole thing degrades to random.

I mention it here because it's the thing I'd flag first to anyone building on the work.

The parts we cut

Two things did not make the submitted version.

The RL baseline that didn't converge in time. We ran a MAPPO baseline against the same intersection scenario, and it never got to a policy we were willing to compare on. The training was under-budgeted — a few days of GPU time we did not really have — and the reward shaping was doing more work than it should have. In our simulation, the swarm heuristic outperformed the MAPPO agent, but I do not believe that comparison. A properly-trained MARL agent could plausibly meet or beat the swarm on peak throughput in the fully-connected regime. The paper claims a different thing — behavior under degraded connectivity — and we cut the half-cooked MARL numbers rather than defend a comparison we knew was thin.

The microscopic-traffic-sim adapter. Most of our simulation ran in a custom lightweight harness — enough to model vehicle kinematics, V2X packet drops, and intersection geometry, but not mixed traffic with human-driven vehicles. I had a partial adapter to SUMO that would have let us run the swarm agents with human-driven traffic as background. It ran, it produced numbers, but the numbers were sensitive to SUMO configuration in ways I could not fully explain in the review window. We cut it. That cut is the one I regret — the follow-up work has to build that adapter from scratch.

What I'd take further at CMU

The ICCIES paper is the ceiling of what the swarm-only formulation can do. The natural next questions:

Learned pheromone deposition. The decay rate rho is a hand-tuned scalar. In reality it should be a policy — a small model that decides how much pheromone to deposit given local state. That is a MARL problem again, but a much smaller one, and the safety floor is still the local yield rule.
Formal guarantees on the yield rule. We argued informally that the fallback is safe. A responsibility-sensitive-safety or barrier-function certificate would let the whole system inherit that guarantee.
A real SUMO integration, done properly. The adapter that got cut is the piece the community will actually want to reproduce — with human-driven background traffic, calibrated geometries, and reproducible seeds.
Heterogeneous fleets. Every simulation ran with identical agents. The real question is what happens when a subset runs the swarm policy and the rest run something else — MARL, legacy ADAS, or a human driver.

Cooperative ITS is a problem area where the ceiling is set by the modelling assumptions, not the algorithms. The paper bet on a specific set of assumptions — partial connectivity is the default, explainability is not optional, and the safety floor has to hold when the optimization ceiling doesn't. That bet held for review. What comes after is a different set of bets.

Full paper: IEEE ICCIES 2025 (document 11033077).