{"id":31018,"date":"2024-10-21T06:28:17","date_gmt":"2024-10-21T04:28:17","guid":{"rendered":"https:\/\/www.graviton.at\/letterswaplibrary\/merging-datasets-for-one-single-project\/"},"modified":"2024-10-21T06:28:17","modified_gmt":"2024-10-21T04:28:17","slug":"merging-datasets-for-one-single-project","status":"publish","type":"post","link":"https:\/\/www.graviton.at\/letterswaplibrary\/merging-datasets-for-one-single-project\/","title":{"rendered":"Merging Datasets For One Single Project?"},"content":{"rendered":"<p><!-- SC_OFF --><\/p>\n<div class=\"md\">\n<p>There\u2019s more of like two parts with this question, so yeah. <\/p>\n<p>First question: Let\u2019s say I want to train a ML model to detect a basic disease based off an image, say a brain. I can find a large dataset on regular. Then, I find multiple smaller datasets with not as many brain with disease images. Thus, I take all these smaller datasets of brains with diseases, combine them into one, then use this new dataset (brain with diseases) and the other dataset (large dataset with regular brain), and use them for classification. Is this possible?<\/p>\n<p>Second question: can we extend this to multiple classes? Say we have a disease that requires many conditions\/symptoms to detect. Can I find these conditions from multiple data sets (One dataset contains characteristics, one dataset contains duration, one dataset includes images, etc) and essentially merge them all into one as long as they classify the same disease??<\/p>\n<\/div>\n<p><!-- SC_ON -->   submitted by   <a href=\"https:\/\/www.reddit.com\/user\/ResearchingTinBot\"> \/u\/ResearchingTinBot <\/a> <br \/> <span><a href=\"https:\/\/www.reddit.com\/r\/datasets\/comments\/1g8h87n\/merging_datasets_for_one_single_project\/\">[link]<\/a><\/span>   <span><a href=\"https:\/\/www.reddit.com\/r\/datasets\/comments\/1g8h87n\/merging_datasets_for_one_single_project\/\">[comments]<\/a><\/span><\/p><div class='watch-action'><div class='watch-position align-right'><div class='action-like'><a class='lbg-style1 like-31018 jlk' href='javascript:void(0)' data-task='like' data-post_id='31018' data-nonce='614a020375' rel='nofollow'><img class='wti-pixel' src='https:\/\/www.graviton.at\/letterswaplibrary\/wp-content\/plugins\/wti-like-post\/images\/pixel.gif' title='Like' \/><span class='lc-31018 lc'>0<\/span><\/a><\/div><\/div> <div class='status-31018 status align-right'><\/div><\/div><div class='wti-clear'><\/div>","protected":false},"excerpt":{"rendered":"<p>There\u2019s more of like two parts with this question, so yeah. First question: Let\u2019s say I want&#8230;<\/p>\n","protected":false},"author":0,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[85],"tags":[],"class_list":["post-31018","post","type-post","status-publish","format-standard","hentry","category-datatards","wpcat-85-id"],"_links":{"self":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts\/31018","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/comments?post=31018"}],"version-history":[{"count":0,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts\/31018\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/media?parent=31018"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/categories?post=31018"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/tags?post=31018"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}