{"id":14693,"date":"2023-01-01T08:23:40","date_gmt":"2023-01-01T07:23:40","guid":{"rendered":"https:\/\/www.graviton.at\/letterswaplibrary\/cleanvision-audit-your-image-datasets-for-better-computer-vision\/"},"modified":"2023-01-01T08:23:40","modified_gmt":"2023-01-01T07:23:40","slug":"cleanvision-audit-your-image-datasets-for-better-computer-vision","status":"publish","type":"post","link":"https:\/\/www.graviton.at\/letterswaplibrary\/cleanvision-audit-your-image-datasets-for-better-computer-vision\/","title":{"rendered":"CleanVision: Audit Your Image Datasets For Better Computer Vision"},"content":{"rendered":"<p><!-- SC_OFF --><\/p>\n<div class=\"md\">\n<p>To all my computer vision friends working on real-world applications with messy image data, I just open-sourced a Python library you may find useful!<\/p>\n<p>CleanVision audits any image dataset to automatically detect common issues such as images that are blurry, under\/over-exposed, oddly sized, or near duplicates of others. It\u2019s just 3 lines of code to discover what issues lurk in your data before you dive into modeling, and CleanVision can be used for <strong>any<\/strong> image dataset \u2014 regardless of whether your task is image generation, classification, segmentation, object detection, etc.<\/p>\n<p> from cleanvision.imagelab import Imagelab imagelab = Imagelab(data_path=&#8221;path_to_dataset&#8221;) imagelab.find_issues() imagelab.report()  <\/p>\n<p>As leaders like Andrew Ng and OpenAI have lately repeated: models can only be as good as the data they are trained on. Before diving into modeling, quickly run your images through CleanVision to make sure they are ok \u2014 it\u2019s super easy!<\/p>\n<p>Github: <a href=\"https:\/\/github.com\/cleanlab\/cleanvision\">https:\/\/github.com\/cleanlab\/cleanvision<\/a><\/p>\n<p>Disclaimer: I am affiliated with Cleanlab.<\/p>\n<\/div>\n<p><!-- SC_ON -->   submitted by   <a href=\"https:\/\/www.reddit.com\/user\/jonas__m\"> \/u\/jonas__m <\/a> <br \/> <span><a href=\"https:\/\/www.reddit.com\/r\/datasets\/comments\/11ym95l\/cleanvision_audit_your_image_datasets_for_better\/\">[link]<\/a><\/span>   <span><a href=\"https:\/\/www.reddit.com\/r\/datasets\/comments\/11ym95l\/cleanvision_audit_your_image_datasets_for_better\/\">[comments]<\/a><\/span><\/p><div class='watch-action'><div class='watch-position align-right'><div class='action-like'><a class='lbg-style1 like-14693 jlk' href='javascript:void(0)' data-task='like' data-post_id='14693' data-nonce='65e0e39b87' rel='nofollow'><img class='wti-pixel' src='https:\/\/www.graviton.at\/letterswaplibrary\/wp-content\/plugins\/wti-like-post\/images\/pixel.gif' title='Like' \/><span class='lc-14693 lc'>0<\/span><\/a><\/div><\/div> <div class='status-14693 status align-right'><\/div><\/div><div class='wti-clear'><\/div>","protected":false},"excerpt":{"rendered":"<p>To all my computer vision friends working on real-world applications with messy image data, I just open-sourced&#8230;<\/p>\n","protected":false},"author":0,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[7],"tags":[],"class_list":["post-14693","post","type-post","status-publish","format-standard","hentry","category-othernonsense","wpcat-7-id"],"_links":{"self":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts\/14693","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/comments?post=14693"}],"version-history":[{"count":0,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts\/14693\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/media?parent=14693"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/categories?post=14693"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/tags?post=14693"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}