I need dataset for this
Using the concept of topic modeling, implement it using:
(i) Rule-based method
(ii) Latent Dirichlet Allocation LDA method
For your convenience, take any unlabeled dataset
Perform data cleaning
Use TF-IDF vectorizer and any clustering method in case of Rule-based method
Fit LatentDirichletAllocation estimator in case of LDA method
submitted by /u/Particular-Pie-1640
[link] [comments]