I started on this idea of finding a comprehensive book dataset which for sure has a description and more than one genre (makes things more realistic), since I wanted to cluster them based on similarity to find some good ones to read for myself 😉 The only ones I could find on Kaggle were ones with a single genre label, so collected it on my own.
So sharing it here in case it helps someone else too:
[Dataset](https://www.kaggle.com/datasets/ishikajohari/best-books-10k-multi-genre-data)
The data was collected from Goodreads from their list – Books That Everyone Should Read At Least Once and contains Description, Ratings and Multiple Genre classifiers.
submitted by /u/ishika_jo
[link] [comments]