Hi everyone,
I’m working on an anomaly detection project using logs from an all-in-one OpenStack deployment (Ansible-based). The logs come from multiple sources , and are collected via Fluentd and sent to OpenSearch.
My main problem is that I don’t have a dataset, and I don’t have enough time to build one manually.
I’m considering running OpenStack for a full day to generate a large amount of logs, then using a tool to generate more data to have a huge and good dataset for anomaly detection.
Are there any tools or approaches that can help generate a good dataset from my own logs in this kind of setup? (Logs are json lines!)
Thanks in advance!
submitted by /u/Substantial_Elk_2999
[link] [comments]