This is a cross-post from r/dataengineering via recommendation from the comments there.
I am trying to curate and crowdsource a list of real-time datasets and sources into an “Awesome List” in a GitHub repo – https://github.com/bytewax/awesome-public-real-time-datasets. It is something I found difficult when building hobby projects or trying to learn about streaming data.
If you have any recommendations please share a link in the comments or open a PR in the repo :).
submitted by /u/math-bw
[link] [comments]