Dear everyone,
I humbly seek your assistance in my current endeavor. I am tasked with conducting a data analysis as part of my school project. The initial (and, for me, the most challenging) step is to identify two datasets that are interrelated and can be merged. Subsequently, I will proceed with the analytical work, which does not intimidate me. The datasets need not provide instant, magical solutions to one another, but there should be a logical basis for their integration.
The primary dataset should have approximately 20 columns (variables), most of them categorical. It should be in a format that can reasonably be merged with the second dataset, which should come from a different data structure or source.
Honestly, after hours of diligent searching, I find myself somewhat disoriented. I would greatly appreciate any insights or suggestions. Initially, we contemplated working with a dataset pertaining to train delays in Poland, aiming to correlate it with weather data based on the date. Unfortunately, the dataset concerning Polish trains contains only 8 columns.
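To make the idea concrete, here is a minimal sketch of the kind of date-based merge we had in mind, using pandas. Everything in it is made up for illustration: the two toy tables, the column names (`date`, `route`, `delay_min`, `condition`, `temp_c`), and the values are assumptions, not taken from any real dataset.

```python
import pandas as pd

# Hypothetical toy data; tables, column names, and values are invented for illustration.
delays = pd.DataFrame({
    "date": ["2023-01-01", "2023-01-01", "2023-01-02", "2023-01-03"],
    "route": ["Warszawa-Krakow", "Gdansk-Poznan", "Warszawa-Krakow", "Gdansk-Poznan"],
    "delay_min": [5, 12, 0, 30],
})
weather = pd.DataFrame({
    "date": ["2023-01-01", "2023-01-02"],
    "condition": ["snow", "clear"],
    "temp_c": [-3.0, 1.5],
})

# Parse the date strings so both tables share the same key type.
delays["date"] = pd.to_datetime(delays["date"])
weather["date"] = pd.to_datetime(weather["date"])

# Left join keeps every delay record, even on days with no weather row.
# indicator=True adds a "_merge" column showing which rows found a match;
# validate="many_to_one" raises an error if a date appears twice in weather.
merged = delays.merge(
    weather, on="date", how="left", indicator=True, validate="many_to_one"
)

# Count delay records that have no matching weather observation.
unmatched = int((merged["_merge"] == "left_only").sum())

# Average delay per weather condition (rows without weather are skipped).
mean_delay_by_condition = merged.groupby("condition")["delay_min"].mean()

print(merged)
print("Delay rows without weather data:", unmatched)
print(mean_delay_by_condition)
```

The left join is a deliberate choice here: it shows how many delay records lack weather data instead of silently dropping them, which seems like a useful check before any analysis.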
I will be immensely thankful for any guidance or counsel. Thank you!
submitted by /u/M4tel0te