Hi all,
I’m building an AI/ML model to predict Kubernetes failures (pod crashes, resource exhaustion, network issues, etc.) using historical and real-time cluster metrics.
🔍 Looking for a dataset that includes:
✅ CPU & Memory usage
✅ Pod & Node status
✅ Network I/O & latency
✅ Failure logs & events
submitted by /u/Gold_Educator_6655
[link] [comments]