{"id":32517,"date":"2025-02-05T16:27:22","date_gmt":"2025-02-05T15:27:22","guid":{"rendered":"https:\/\/www.graviton.at\/letterswaplibrary\/high-school-ap-research-project-need-help-replacing-pushshift-api-for-reddit-data-collection\/"},"modified":"2025-02-05T16:27:22","modified_gmt":"2025-02-05T15:27:22","slug":"high-school-ap-research-project-need-help-replacing-pushshift-api-for-reddit-data-collection","status":"publish","type":"post","link":"https:\/\/www.graviton.at\/letterswaplibrary\/high-school-ap-research-project-need-help-replacing-pushshift-api-for-reddit-data-collection\/","title":{"rendered":"High School AP Research Project: Need Help Replacing Pushshift API For Reddit Data Collection"},"content":{"rendered":"<p><!-- SC_OFF --><\/p>\n<div class=\"md\">\n<p>Hi everyone,<\/p>\n<p>I\u2019m a high school student working on my AP Research project, and I\u2019m running into some issues with data collection that I could really use help with. My study focuses on <strong>analyzing how Reddit-driven stock recommendations impact long-term investment decisions.<\/strong> I\u2019m specifically looking at subreddits like <a href=\"https:\/\/www.reddit.com\/r\/wallstreetbets\">r\/wallstreetbets<\/a>, <a href=\"https:\/\/www.reddit.com\/r\/stock\">r\/stock<\/a>, <a href=\"https:\/\/www.reddit.com\/r\/investing\">r\/investing<\/a>, and <a href=\"https:\/\/www.reddit.com\/r\/SecurityAnalysis\">r\/SecurityAnalysis<\/a> to track sentiment around different stocks and see if that sentiment can predict stock performance over time.<\/p>\n<p>I had originally planned to use the Pushshift API to collect historical Reddit data, but with Reddit\u2019s recent API changes, Pushshift no longer works. Since I\u2019m pretty new to programming and APIs, I\u2019m not sure what the best alternative is. I\u2019ve tried looking into PRAW, but I\u2019m concerned about its limitations when it comes to accessing older posts.<\/p>\n<p><strong>Here\u2019s what I need:<\/strong><\/p>\n<p>  A reliable way to collect historical Reddit posts (from 2022 to 2025 if possible). Advice on whether PRAW can handle this, or if there\u2019s another tool or method I should use. Suggestions for workarounds or public datasets that might help with historical Reddit data.  <\/p>\n<p>Since this is part of a project I hope to eventually publish, I\u2019m really eager to find a solution. I\u2019d love any advice, resources, or guidance you can offer, especially considering I\u2019m new to this and learning as I go.<\/p>\n<p>Here&#8217;s a link to my original methodology plan if it helps clear up some questions. Feel free to add coments to the document!<\/p>\n<p><a href=\"https:\/\/docs.google.com\/document\/d\/1p_pI7Cq9EJgc1MMsYSHRiobdf5_Oirkc61p5_lCF3Ck\/edit?usp=sharing\">Methodology Plan<\/a><\/p>\n<\/div>\n<p><!-- SC_ON -->   submitted by   <a href=\"https:\/\/www.reddit.com\/user\/Immediate-Today-8157\"> \/u\/Immediate-Today-8157 <\/a> <br \/> <span><a href=\"https:\/\/www.reddit.com\/r\/datasets\/comments\/1iibwna\/high_school_ap_research_project_need_help\/\">[link]<\/a><\/span>   <span><a href=\"https:\/\/www.reddit.com\/r\/datasets\/comments\/1iibwna\/high_school_ap_research_project_need_help\/\">[comments]<\/a><\/span><\/p><div class='watch-action'><div class='watch-position align-right'><div class='action-like'><a class='lbg-style1 like-32517 jlk' href='javascript:void(0)' data-task='like' data-post_id='32517' data-nonce='614a020375' rel='nofollow'><img class='wti-pixel' src='https:\/\/www.graviton.at\/letterswaplibrary\/wp-content\/plugins\/wti-like-post\/images\/pixel.gif' title='Like' \/><span class='lc-32517 lc'>0<\/span><\/a><\/div><\/div> <div class='status-32517 status align-right'><\/div><\/div><div class='wti-clear'><\/div>","protected":false},"excerpt":{"rendered":"<p>Hi everyone, I\u2019m a high school student working on my AP Research project, and I\u2019m running into&#8230;<\/p>\n","protected":false},"author":0,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[85],"tags":[],"class_list":["post-32517","post","type-post","status-publish","format-standard","hentry","category-datatards","wpcat-85-id"],"_links":{"self":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts\/32517","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/comments?post=32517"}],"version-history":[{"count":0,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts\/32517\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/media?parent=32517"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/categories?post=32517"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/tags?post=32517"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}