Looking For A Dataset For Clustering And PCA Project

Hi guys, I’m new in this data science world. I’m looking for a real-world dataset for a data science portfolio project focused on clustering and PCA (no classification labels required)

  • At least 4–10 numerical features
  • Preferably 500+ rows
  • Suitable for customer/user segmentation or behavioral clustering
  • Clean or moderately clean data
  • Must be publicly available

The goal is to apply dimensionality reduction (PCA) and clustering algorithms and interpret meaningful segments.

Any suggestions for datasets that fit this use case would be highly appreciated

-> Any suggestions regarding suitable datasets for this use case would be also very helpful. Instead of direct dataset recommendations, I would be very grateful if you could give me some ideas on where I can look.

submitted by /u/persephone_y
[link] [comments]

Leave a Reply

Your email address will not be published. Required fields are marked *