01
Network Validation Platform
Pre- and Post-Change Automation
- Built pre/post-change validation for Cisco NX-OS, IOS and IOS XE — reduced validation time from weeks to 10 minutes while preventing "quiet failures" missed by manual checks.
- Automated canonical device snapshots: routing tables, ARP/ND, STP, interface states, error counters, and subnet reachability with fan-out pings.
- Aggregated CPU, memory, and interface utilization pre- and post-change, enabling regression detection at scale.
- AI-driven diff summarization highlights missing VLANs, neighbor changes, and error growth while preserving full evidence-backed reports.
- Expanded coverage to switch RMAs and refreshes, normalising OS/version discrepancies for reliable device-to-device comparisons.
02
Automated Device Certification Platform
PDF Ingestion → pytest → Allure
- Replaced a weeks-long contractor-driven process with a repeatable automated workflow executed in minutes.
- PDF ingestion pipeline using prompt chaining + RAG to normalise multi-page documents into structured markdown test specs.
- Auto-generates pytest scripts with PyShark and testbed YAML files for scalable, repeatable validation.
- Allure-pytest reporting delivers visual, versioned certification reports with full traceability for board-level compliance.
- Eliminated contractor costs, standardised certification quality, and accelerated onboarding from weeks to minutes.
03
AI Multi-Agent Workflow Platform
LangGraph · FastAPI · Kafka
- LangGraph supervisor/sub-agent architecture where a supervisor orchestrates domain-specific agents for monitoring, troubleshooting, and automation.
- Dynamic tool invocation and intelligent decision-making — reduced manual troubleshooting by 40%.
- Full-stack chatbot (FastAPI, Kafka, MongoDB, React, TypeScript) streamlining cross-team collaboration.
- Agentic workflows handle SSH multi-command tasks, upgrade validation, and device searches across ExtraHop, NetBrain, and Cisco ACI.
04
Network Topology & Visibility Platform
Neo4j · Airflow · Real-time Impact Analysis
- Real-time topology visibility of device interconnections — enables teams to predict impact zones during change management and incident response.
- Airflow DAGs schedule daily multi-source data collection, ensuring up-to-date device state visibility.
- Progressive ingestion (device-by-device) avoids heavy loads and improves pipeline efficiency.
- Neo4j graph database for device relationships — operations teams quickly identify dependency chains and failure points.
05
Enterprise Data Platform MVP
Kafka · Airflow · AWS S3 · Grafana
- ETL pipelines (Kafka + Airflow) extracting and transforming metrics from Grafana, PostgreSQL, Elasticsearch, and Dynatrace into an AWS S3 data lake.
- Real-time monitoring dashboards in Grafana — cut data-access latency by 35%.
- AWS Glue Data Catalog + Lake Formation for governance; microservices on OpenShift/Kubernetes with 99.9% uptime.
- Enabled enterprise-wide analytics: anomaly detection, proactive infrastructure optimisation, and stakeholder reporting.
06
Real-Time Chatbot with LLM RAG
LangChain · Kafka · Kubernetes
- Decomposed monolithic chatbot into 3 scalable microservices with Kafka event-driven communication.
- LangChain + Hugging Face RAG pipeline merging LLM knowledge with internal data sources for domain-specific responses.
- MongoDB caching eliminated redundant processing; WebSocket notifications via Webex and Slack for live change updates.
- Docker/Kubernetes autoscaling deployment — reduced manual troubleshooting and accelerated change validation for network ops.