I’m looking for a dataset that contains a large amount of features (100+) for clustering purposes. I’m applying sparse clustering, which means that my method removes features that are not important for the clusters. Either a data set that only contains categorical variables, or a mix of numerical and categorical variables is fine.
Does anyone have any ideas?
submitted by /u/y_zh
[link] [comments]