{"id":36097,"date":"2025-10-19T22:27:37","date_gmt":"2025-10-19T20:27:37","guid":{"rendered":"https:\/\/www.graviton.at\/letterswaplibrary\/dataset-massive-free-airbnb-dataset-1000-largest-markets-with-revenue-occupancy-calendar-rates-and-more\/"},"modified":"2025-10-19T22:27:37","modified_gmt":"2025-10-19T20:27:37","slug":"dataset-massive-free-airbnb-dataset-1000-largest-markets-with-revenue-occupancy-calendar-rates-and-more","status":"publish","type":"post","link":"https:\/\/www.graviton.at\/letterswaplibrary\/dataset-massive-free-airbnb-dataset-1000-largest-markets-with-revenue-occupancy-calendar-rates-and-more\/","title":{"rendered":"[Dataset] Massive Free Airbnb Dataset: 1,000 Largest Markets With Revenue, Occupancy, Calendar Rates And More"},"content":{"rendered":"<p><!-- SC_OFF --><\/p>\n<div class=\"md\">\n<p>Hi folks,<\/p>\n<p>I work on the data science team at <a href=\"https:\/\/www.airroi.com\/\">AirROI<\/a>, we are one of the largest Airbnb data analytics platform.<\/p>\n<p>FYI, we&#8217;ve released free Airbnb datasets on nearly <strong>1,000 largest markets<\/strong>, and we&#8217;re releasing it for free to the community. This is one of the most granular free datasets available, containing not just listing details but critical performance metrics like trailing-twelve-month revenue, occupancy rates, and future calendar rates. We also <strong>refresh<\/strong> this free datasets on <strong>monthly<\/strong> basis.<\/p>\n<p><strong>Direct Download Link (No sign-up required):<\/strong><br \/> <a href=\"http:\/\/www.airroi.com\/data-portal\">www.airroi.com\/data-portal<\/a> -&gt; then download from each market<\/p>\n<h1>Dataset Overview &amp; Schemas<\/h1>\n<p>The data is structured into several interconnected tables, provided as CSV files per market.<\/p>\n<p><strong>1. Listings Data (65 Fields)<\/strong><br \/> This is the core table with detailed property information and\u2014most importantly\u2014<strong>performance metrics<\/strong>.<\/p>\n<ul>\n<li><strong>Core Attributes:<\/strong> <code>listing_id<\/code>, <code>listing_name<\/code>, <code>property_type<\/code>, <code>room_type<\/code>, <code>neighborhood<\/code>, <code>latitude<\/code>, <code>longitude<\/code>, <code>amenities<\/code> (list), <code>bedrooms<\/code>, <code>baths<\/code>.<\/li>\n<li><strong>Host Info:<\/strong> <code>host_id<\/code>, <code>host_name<\/code>, <code>superhost<\/code> status, <code>professional_management<\/code> flag.<\/li>\n<li><strong>Performance &amp; Revenue Metrics (The Gold):<\/strong>\n<ul>\n<li><code>ttm_revenue<\/code> \/ <code>ttm_revenue_native<\/code> (Total revenue last 12 months)<\/li>\n<li><code>ttm_avg_rate<\/code> \/ <code>ttm_avg_rate_native<\/code> (Average daily rate)<\/li>\n<li><code>ttm_occupancy<\/code> \/ <code>ttm_adjusted_occupancy<\/code><\/li>\n<li><code>ttm_revpar<\/code> \/ <code>ttm_adjusted_revpar<\/code> (Revenue Per Available Room)<\/li>\n<li><code>l90d_revenue<\/code>, <code>l90d_occupancy<\/code>, etc. (Last 90-day snapshot)<\/li>\n<li><code>ttm_reserved_days<\/code>, <code>ttm_blocked_days<\/code>, <code>ttm_available_days<\/code><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p><strong>2. Calendar Rates Data (14 Fields)<\/strong><br \/> Monthly aggregated future pricing and availability data for forecasting.<\/p>\n<ul>\n<li><strong>Key Fields:<\/strong> <code>listing_id<\/code>, <code>date<\/code> (monthly), <code>vacant_days<\/code>, <code>reserved_days<\/code>, <code>occupancy<\/code>, <code>revenue<\/code>, <code>rate_avg<\/code>, <code>booked_rate_avg<\/code>, <code>booking_lead_time_avg<\/code>.<\/li>\n<\/ul>\n<p><strong>3. Reviews Data (4 Fields)<\/strong><br \/> Temporal review data for sentiment and volume analysis.<\/p>\n<ul>\n<li><strong>Key Fields:<\/strong> <code>listing_id<\/code>, <code>date<\/code> (monthly), <code>num_reviews<\/code>, <code>reviewers<\/code> (list of IDs).<\/li>\n<\/ul>\n<p><strong>4. Host Data (11 Fields)<\/strong> <strong><em>Coming Soon<\/em><\/strong><br \/> Profile and portfolio information for hosts.<\/p>\n<ul>\n<li><strong>Key Fields:<\/strong> <code>host_id<\/code>, <code>is_superhost<\/code>, <code>listing_count<\/code>, <code>member_since<\/code>, <code>ratings<\/code>.<\/li>\n<\/ul>\n<h1>Why This Dataset is Unique<\/h1>\n<p>Most free datasets stop at basic listing info. This one includes the <strong>performance data<\/strong> needed for serious analysis:<\/p>\n<ul>\n<li><strong>Investment Analysis:<\/strong> Model ROI using actual <code>ttm_revenue<\/code> and <code>occupancy<\/code> data.<\/li>\n<li><strong>Pricing Strategy:<\/strong> Analyze how <code>rate_avg<\/code> fluctuates with seasonality and <code>booking_lead_time<\/code>.<\/li>\n<li><strong>Market Sizing:<\/strong> Use <code>professional_management<\/code> and <code>superhost<\/code> flags to understand market maturity.<\/li>\n<li><strong>Geospatial Studies:<\/strong> Plot revenue heatmaps using <code>latitude<\/code>\/<code>longitude<\/code> and <code>ttm_revpar<\/code>.<\/li>\n<\/ul>\n<h1>Potential Use Cases<\/h1>\n<ul>\n<li><strong>Academic Research:<\/strong> Economics, urban studies, and platform economy research.<\/li>\n<li><strong>Competitive Analysis:<\/strong> Benchmark property performance against market averages.<\/li>\n<li><strong>Machine Learning:<\/strong> Build models to predict <code>occupancy<\/code> or <code>revenue<\/code> based on amenities, location, and host data.<\/li>\n<li><strong>Data Visualization:<\/strong> Create dashboards showing revenue density, occupancy calendars, and amenity correlations.<\/li>\n<li><strong>Portfolio Projects:<\/strong> A fantastic dataset for a standout data science portfolio piece.<\/li>\n<\/ul>\n<h1>License &amp; Usage<\/h1>\n<p>The data is provided under a permissive license for academic and personal use. We request attribution to <a href=\"http:\/\/www.airroi.com\/\">AirROI<\/a> in public work.<\/p>\n<h1>For Custom Needs<\/h1>\n<p>This free dataset is updated monthly. If you need <strong>real-time, hyper-specific data, or larger historical dumps<\/strong>, we offer a low-cost API for developers and researchers:<br \/> <a href=\"http:\/\/www.airroi.com\/api\">www.airroi.com\/api<\/a><\/p>\n<p>Alternatively, we also provide bespoke data services if your needs go beyond the scope of the free datasets.<\/p>\n<p>We hope this data is useful. Happy analyzing!<\/p>\n<\/div>\n<p><!-- SC_ON -->   submitted by   <a href=\"https:\/\/www.reddit.com\/user\/jason-airroi\"> \/u\/jason-airroi <\/a> <br \/> <span><a href=\"https:\/\/www.reddit.com\/r\/datasets\/comments\/1oazsov\/dataset_massive_free_airbnb_dataset_1000_largest\/\">[link]<\/a><\/span>   <span><a href=\"https:\/\/www.reddit.com\/r\/datasets\/comments\/1oazsov\/dataset_massive_free_airbnb_dataset_1000_largest\/\">[comments]<\/a><\/span><\/p><div class='watch-action'><div class='watch-position align-right'><div class='action-like'><a class='lbg-style1 like-36097 jlk' href='javascript:void(0)' data-task='like' data-post_id='36097' data-nonce='65e0e39b87' rel='nofollow'><img class='wti-pixel' src='https:\/\/www.graviton.at\/letterswaplibrary\/wp-content\/plugins\/wti-like-post\/images\/pixel.gif' title='Like' \/><span class='lc-36097 lc'>0<\/span><\/a><\/div><\/div> <div class='status-36097 status align-right'><\/div><\/div><div class='wti-clear'><\/div>","protected":false},"excerpt":{"rendered":"<p>Hi folks, I work on the data science team at AirROI, we are one of the largest&#8230;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[85],"tags":[],"class_list":["post-36097","post","type-post","status-publish","format-standard","hentry","category-datatards","wpcat-85-id"],"_links":{"self":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts\/36097","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/comments?post=36097"}],"version-history":[{"count":0,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/posts\/36097\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/media?parent=36097"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/categories?post=36097"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.graviton.at\/letterswaplibrary\/wp-json\/wp\/v2\/tags?post=36097"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}