Will Using Synthetic Data Affect My ML Model Accuracy Or My Resume?

Hey everyone ๐Ÿ‘‹ Iโ€™m currently working on my final year engineering project based on disease prediction using Machine Learning.

Since real medical datasets are hard to find, I decided to generate synthetic data for training and testing my model. Some people told me itโ€™s not a good idea โ€” that it might affect my model accuracy or even look bad on my resume.

But my main goal is to learn the entire ML workflow โ€” from preprocessing to model building and evaluation.

So I wanted to ask: ๐Ÿ‘‰ Will using synthetic data affect my modelโ€™s performance or generalization? ๐Ÿ‘‰ Does it look bad on a resume or during interviews if I mention that I used synthetic data? ๐Ÿ‘‰ Any suggestions to make my project more authentic or practical despite using synthetic data?

Would really appreciate honest opinions or experiences from others whoโ€™ve been in the same situation ๐Ÿ™Œ

submitted by /u/shrinivas-2003
[link] [comments]

Leave a Reply

Your email address will not be published. Required fields are marked *