{"id":31519,"date":"2024-11-15T21:28:23","date_gmt":"2024-11-15T20:28:23","guid":{"rendered":"https:\/\/www.graviton.at\/letterswaplibrary\/selling-preprocesed-and-cleaned-job-description-dataset-latest-linkedin-and-indeed-stem-postings-from-us-the-dataset-contains-both-uncleaned-and-preprocessed-data-for-ai-training-please-let-me-kno\/"},"modified":"2024-11-15T21:28:23","modified_gmt":"2024-11-15T20:28:23","slug":"selling-preprocesed-and-cleaned-job-description-dataset-latest-linkedin-and-indeed-stem-postings-from-us-the-dataset-contains-both-uncleaned-and-preprocessed-data-for-ai-training-please-let-me-kno","status":"publish","type":"post","link":"https:\/\/www.graviton.at\/letterswaplibrary\/selling-preprocesed-and-cleaned-job-description-dataset-latest-linkedin-and-indeed-stem-postings-from-us-the-dataset-contains-both-uncleaned-and-preprocessed-data-for-ai-training-please-let-me-kno\/","title":{"rendered":"Selling Preprocessed And Cleaned Job Description Dataset (Latest LinkedIn And Indeed STEM Postings From US). The Dataset Contains Both Uncleaned And Preprocessed Data For AI Training. Please Let Me Know If Anyone Would Like It, I&#8217;m Trying To Raise Some Money For My Startup. Thanks!!!"},"content":{"rendered":"<p><!-- SC_OFF --><\/p>\n<div class=\"md\">\n<p>Hey!<\/p>\n<p>I have around 700K lines of job descriptions processed for AI and ML training. The processing involves extracting just the requirements and responsibilities, splitting them into individual lines, correcting all grammatical mistakes, extracting keywords into software skills and experience, classifying the job description, and adding an H1B filter to it. <\/p>\n<p>The dataset is from LinkedIn and Indeed; I scrape and process around 15K every day. I also have uncleaned, purely scraped data that comes to around 60K every day. They are all STEM jobs in the US.<\/p>\n<p>I have attached an example of both datasets with this. 
You can find them <a href=\"https:\/\/drive.google.com\/drive\/folders\/1m2Tbutacuq5QGBuo6ulhQ7n_sTxJtmyi?usp=sharing\">here<\/a>.<\/p>\n<p>I&#8217;m trying to raise around $2000 for my startup and this would help me a lot. However, there&#8217;s no pressure; I&#8217;m not trying to solicit, just trying to sell a good dataset.<\/p>\n<p>Let me know if anyone has any questions, and please no hate. <\/p>\n<p>Thanks!<\/p>\n<\/div>\n<p><!-- SC_ON -->   submitted by   <a href=\"https:\/\/www.reddit.com\/user\/assassinator444\"> \/u\/assassinator444 <\/a> <br \/> <span><a href=\"https:\/\/www.reddit.com\/r\/datasets\/comments\/1gs5rk7\/selling_preprocesed_and_cleaned_job_description\/\">[link]<\/a><\/span>   <span><a href=\"https:\/\/www.reddit.com\/r\/datasets\/comments\/1gs5rk7\/selling_preprocesed_and_cleaned_job_description\/\">[comments]<\/a><\/span><\/p><div class='watch-action'><div class='watch-position align-right'><div class='action-like'><a class='lbg-style1 like-31519 jlk' href='javascript:void(0)' data-task='like' data-post_id='31519' data-nonce='65e0e39b87' rel='nofollow'><img class='wti-pixel' src='https:\/\/www.graviton.at\/letterswaplibrary\/wp-content\/plugins\/wti-like-post\/images\/pixel.gif' title='Like' \/><span class='lc-31519 lc'>0<\/span><\/a><\/div><\/div> <div class='status-31519 status align-right'><\/div><\/div><div class='wti-clear'><\/div>","protected":false},"excerpt":{"rendered":"<p>Hey! I have around 700K lines of job descriptions processed for AI and ML training. 
This extracting&#8230;<\/p>\n","protected":false},"author":0,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[85],"tags":[],"class_list":["post-31519","post","type-post","status-publish","format-standard","hentry","category-datatards","wpcat-85-id"],"_links":{"self":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts\/31519","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/comments?post=31519"}],"version-history":[{"count":0,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts\/31519\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/media?parent=31519"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/categories?post=31519"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/tags?post=31519"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}