{"id":30103,"date":"2024-08-20T10:28:02","date_gmt":"2024-08-20T08:28:02","guid":{"rendered":"https:\/\/www.graviton.at\/letterswaplibrary\/recommendations-for-extensive-datasets-in-process-engineering-and-optimization-for-end-to-end-ds-de-projects\/"},"modified":"2024-08-20T10:28:02","modified_gmt":"2024-08-20T08:28:02","slug":"recommendations-for-extensive-datasets-in-process-engineering-and-optimization-for-end-to-end-ds-de-projects","status":"publish","type":"post","link":"https:\/\/www.graviton.at\/letterswaplibrary\/recommendations-for-extensive-datasets-in-process-engineering-and-optimization-for-end-to-end-ds-de-projects\/","title":{"rendered":"Recommendations For Extensive Datasets In Process Engineering And Optimization For End-to-End DS\/DE Projects"},"content":{"rendered":"<p><!-- SC_OFF --><\/p>\n<div class=\"md\">\n<p>Hi everyone,<\/p>\n<p>I\u2019m a data science researcher focusing on process engineering and optimization, and I\u2019m looking to strengthen my knowledge through different use cases. I\u2019m reaching out for recommendations on very large datasets that can be processed on cloud platforms.<\/p>\n<p>My goal is to create an end-to-end Data Science\/Data Engineering project that involves ingesting these large datasets and applying domain knowledge to derive insights. I\u2019m particularly interested in <strong>time series<\/strong> modeling, which is crucial for capturing temporal trends.<\/p>\n<p>Some areas I\u2019m considering include:<\/p>\n<ul>\n<li>Oil and gas unit operations datasets<\/li>\n<li>Carbon Capture, Utilization, and Storage (CCUS) datasets<\/li>\n<li>FMCG manufacturing datasets, such as edible oil or biomass production<\/li>\n<li>Water treatment units, especially where time-sensitive data is key<\/li>\n<\/ul>\n<p>To give you an idea of my background, I\u2019ve worked on modeling and optimization in amine treating, sulfur recovery, and carbon capture datasets. I\u2019ve also successfully developed an anomaly detection model for the Tennessee Eastman process. 
However, I\u2019m eager to dive deeper into time series modeling for my next project.<\/p>\n<p><strong>Major requirements:<\/strong><\/p>\n<ul>\n<li>Focus on time series data<\/li>\n<li>Can involve classification or regression tasks<\/li>\n<li>Comparatively large datasets with many columns (variables) and data points<\/li>\n<\/ul>\n<p>I would greatly appreciate any suggestions or pointers to datasets that fit these criteria.<\/p>\n<p>Thanks in advance!<\/p>\n<\/div>\n<p><!-- SC_ON -->   submitted by   <a href=\"https:\/\/www.reddit.com\/user\/ryanroy0698\"> \/u\/ryanroy0698 <\/a> <br \/> <span><a href=\"https:\/\/www.reddit.com\/r\/datasets\/comments\/1ewpqup\/recommendations_for_extensive_datasets_in_process\/\">[link]<\/a><\/span>   <span><a href=\"https:\/\/www.reddit.com\/r\/datasets\/comments\/1ewpqup\/recommendations_for_extensive_datasets_in_process\/\">[comments]<\/a><\/span><\/p>","protected":false},"excerpt":{"rendered":"<p>Hi everyone, I\u2019m a data science researcher focusing on process engineering and optimization, and I\u2019m looking 
to&#8230;<\/p>\n","protected":false},"author":0,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[85],"tags":[],"class_list":["post-30103","post","type-post","status-publish","format-standard","hentry","category-datatards","wpcat-85-id"],"_links":{"self":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts\/30103","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/comments?post=30103"}],"version-history":[{"count":0,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts\/30103\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/media?parent=30103"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/categories?post=30103"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/tags?post=30103"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}