
Things I've built

01

Network Validation Platform

Pre- and Post-Change Automation
  • Built pre- and post-change validation for Cisco NX-OS, IOS, and IOS XE, cutting validation time from weeks to 10 minutes and catching the "quiet failures" that manual checks miss.
  • Automated canonical device snapshots: routing tables, ARP/ND, STP, interface states, error counters, and subnet reachability with fan-out pings.
  • Aggregated CPU, memory, and interface utilization pre- and post-change, enabling regression detection at scale.
  • AI-driven diff summarisation highlights missing VLANs, neighbor changes, and error growth, while the full evidence-backed reports are preserved alongside each summary.
  • Expanded coverage to switch RMAs and refreshes, normalising OS/version discrepancies for reliable device-to-device comparisons.
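
A minimal sketch of the pre/post comparison idea, assuming each snapshot has already been parsed into a plain dictionary. The field names (`vlans`, `neighbors`, `errors`) and the function `diff_snapshots` are hypothetical and chosen for illustration; they are not the platform's actual schema.

```python
from typing import Any


def diff_snapshots(pre: dict[str, Any], post: dict[str, Any]) -> dict[str, Any]:
    """Compare two device snapshots and report likely regressions.

    Assumed snapshot shape (hypothetical):
      {"vlans": [10, 20], "neighbors": ["core1"], "errors": {"Eth1/1": 5}}
    """
    # VLANs present before the change but absent afterwards.
    missing_vlans = sorted(set(pre["vlans"]) - set(post["vlans"]))

    # Neighbors that disappeared or appeared across the change.
    lost_neighbors = sorted(set(pre["neighbors"]) - set(post["neighbors"]))
    new_neighbors = sorted(set(post["neighbors"]) - set(pre["neighbors"]))

    # Interfaces whose error counters grew during the change window.
    error_growth = {}
    for interface, before in pre["errors"].items():
        after = post["errors"].get(interface, before)
        if after > before:
            error_growth[interface] = after - before

    return {
        "missing_vlans": missing_vlans,
        "lost_neighbors": lost_neighbors,
        "new_neighbors": new_neighbors,
        "error_growth": error_growth,
    }
```

A structured result like this is the kind of input a summarisation step could condense, while the raw snapshots stay available as evidence.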

02

Automated Device Certification Platform

PDF Ingestion → pytest → Allure
  • Replaced a weeks-long contractor-driven process with a repeatable automated workflow executed in minutes.
  • PDF ingestion pipeline using prompt chaining + RAG to normalise multi-page documents into structured markdown test specs.
  • Auto-generates pytest scripts with PyShark and testbed YAML files for scalable, repeatable validation.
  • Allure-pytest reporting delivers visual, versioned certification reports with full traceability for board-level compliance.
  • Eliminated contractor costs, standardised certification quality, and cut device onboarding time from weeks to minutes.

03

AI Multi-Agent Workflow Platform

LangGraph · FastAPI · Kafka
  • LangGraph supervisor/sub-agent architecture where a supervisor orchestrates domain-specific agents for monitoring, troubleshooting, and automation.
  • Agents choose and invoke tools dynamically based on the task, which reduced manual troubleshooting by 40%.
  • Full-stack chatbot (FastAPI, Kafka, MongoDB, React, TypeScript) streamlining cross-team collaboration.
  • Agentic workflows handle SSH multi-command tasks, upgrade validation, and device searches across ExtraHop, NetBrain, and Cisco ACI.
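
To illustrate the supervisor/sub-agent pattern, here is a plain-Python sketch in which a keyword router stands in for the LangGraph supervisor. It does not use the LangGraph API, and the agent functions, keywords, and `ROUTES` table are hypothetical.

```python
from typing import Callable

Agent = Callable[[str], str]


def monitoring_agent(task: str) -> str:
    return f"[monitoring] checked: {task}"


def troubleshooting_agent(task: str) -> str:
    return f"[troubleshooting] investigating: {task}"


def automation_agent(task: str) -> str:
    return f"[automation] executing: {task}"


# Hypothetical routing table: keywords that suggest which agent owns a task.
ROUTES: list[tuple[tuple[str, ...], Agent]] = [
    (("cpu", "utilization", "alert"), monitoring_agent),
    (("down", "flapping", "unreachable"), troubleshooting_agent),
    (("upgrade", "ssh", "run"), automation_agent),
]


def supervisor(task: str) -> str:
    """Dispatch a task to the first agent whose keywords match a word in it."""
    words = set(task.lower().split())
    for keywords, agent in ROUTES:
        if words.intersection(keywords):
            return agent(task)
    return "[supervisor] no agent matched; escalating to a human"
```

In an LLM-backed supervisor the routing decision would come from the model rather than a keyword table, but the dispatch shape is the same.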

04

Network Topology & Visibility Platform

Neo4j · Airflow · Real-time Impact Analysis
  • Real-time topology visibility of device interconnections — enables teams to predict impact zones during change management and incident response.
  • Airflow DAGs schedule daily multi-source data collection, ensuring up-to-date device state visibility.
  • Progressive ingestion (device-by-device) avoids heavy loads and improves pipeline efficiency.
  • Neo4j graph database for device relationships — operations teams quickly identify dependency chains and failure points.
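
The impact-zone idea can be sketched as a breadth-first search over device links. This in-memory version stands in for a graph database query and assumes links are undirected pairs of device names; `impact_zone` and the sample device names are hypothetical.

```python
from collections import deque


def impact_zone(links: list[tuple[str, str]], start: str, max_hops: int) -> dict[str, int]:
    """Return devices within `max_hops` of `start`, mapped to their hop distance."""
    # Build an undirected adjacency map from the link list.
    adjacency: dict[str, set[str]] = {}
    for a, b in links:
        adjacency.setdefault(a, set()).add(b)
        adjacency.setdefault(b, set()).add(a)

    hops = {start: 0}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if hops[node] == max_hops:
            continue  # do not expand beyond the hop limit
        for neighbor in sorted(adjacency.get(node, ())):
            if neighbor not in hops:
                hops[neighbor] = hops[node] + 1
                queue.append(neighbor)

    hops.pop(start)  # the changed device itself is not part of its impact zone
    return hops
```

For example, with links core1-dist1, core1-dist2, dist1-access1, and access1-host1, a two-hop search from core1 reaches dist1, dist2, and access1 but not host1.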

05

Enterprise Data Platform MVP

Kafka · Airflow · AWS S3 · Grafana
  • ETL pipelines (Kafka + Airflow) extracting and transforming metrics from Grafana, PostgreSQL, Elasticsearch, and Dynatrace into an AWS S3 data lake.
  • Real-time monitoring dashboards in Grafana — cut data-access latency by 35%.
  • AWS Glue Data Catalog + Lake Formation for governance; microservices on OpenShift/Kubernetes with 99.9% uptime.
  • Enabled enterprise-wide analytics: anomaly detection, proactive infrastructure optimisation, and stakeholder reporting.

06

Real-Time Chatbot with LLM RAG

LangChain · Kafka · Kubernetes
  • Decomposed a monolithic chatbot into three scalable microservices that communicate through Kafka events.
  • LangChain + Hugging Face RAG pipeline merging LLM knowledge with internal data sources for domain-specific responses.
  • MongoDB caching eliminated redundant processing; WebSocket notifications via Webex and Slack for live change updates.
  • Docker/Kubernetes autoscaling deployment — reduced manual troubleshooting and accelerated change validation for network ops.
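
A small sketch of the caching idea: answers are stored under a hash of the normalised question so that repeated questions skip the expensive pipeline call. A dictionary stands in for MongoDB here, and `AnswerCache` and `answer_fn` are hypothetical names.

```python
import hashlib
from typing import Callable


class AnswerCache:
    """Cache answers keyed by a hash of the normalised question text."""

    def __init__(self, answer_fn: Callable[[str], str]) -> None:
        self._answer_fn = answer_fn          # the expensive call, e.g. a RAG pipeline
        self._store: dict[str, str] = {}     # stand-in for a database collection
        self.misses = 0

    @staticmethod
    def _key(question: str) -> str:
        # Lowercase and collapse whitespace so trivial variations share a key.
        normalized = " ".join(question.lower().split())
        return hashlib.sha256(normalized.encode("utf-8")).hexdigest()

    def ask(self, question: str) -> str:
        key = self._key(question)
        if key not in self._store:
            self.misses += 1
            self._store[key] = self._answer_fn(question)
        return self._store[key]
```

Only exact matches after normalisation hit the cache; semantically similar but differently worded questions would still reach the pipeline.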