Core Services
API Gateway
Primary request routing and load balancing
12ms
Healthy
BGP Route Processor
Route table computation and advertisement
8ms
Healthy
LLM Agent Cluster
AI-assisted network operations (13/14 nodes active)
284ms
Degraded
Configuration Store
Distributed config management and sync
5ms
Healthy
Infrastructure
PostgreSQL Primary
Main database cluster (CN-BJ-01)
3ms
Healthy
Redis Cache
Session store and rate limiter
1ms
Healthy
Vector Store (Milvus)
Knowledge base embeddings for LLM agent
18ms
Healthy
Uptime (last 30 days) 99.94%
30 days ago Today
Recent Incidents
2026-03-16 09:14 CST
LLM Agent Node #11 Unresponsive
Agent node 11 became unresponsive due to GPU memory exhaustion. Traffic automatically rerouted to remaining nodes. Node is being reprovisioned.
2026-03-08 02:31 CST
Elevated API Latency
API response times increased to ~450ms for approximately 12 minutes during a BGP route table full recomputation triggered by upstream peer changes.
Resolved automatically
2026-02-22 18:05 CST
Scheduled Maintenance
Planned maintenance for PostgreSQL version upgrade and schema migration. Total downtime: 4 minutes.
Completed as planned