Category: Datatards

Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting; I’m interested. Maybe they know where the chix are. But what do they need it for? World domination?

NFL Ticket Price Database 2023 Regular Season

Hi! I’m looking for a database that includes ticket prices (highest, lowest, average, etc.) for every NFL game in the regular season. I found data (the SeatGeek API) for future games, but I’m missing the first 100 games of the season, as the API only includes games still posted on their website. Help???

submitted by /u/theasummerall

Muscle Distribution DataSet Available?

I am searching for a data set that has workout information, the muscle groups each workout hits, and the percentage of the work each muscle group does.

Most datasets I have found list the workout and the muscles it hits, but not how much each muscle is hit.

I am looking for a list that will say “Squats (40% Quad, 40% Hamstrings, 20% Glute)”

Is there a dataset out there that would have the distribution data I am looking for?

I’ve done some research on this subreddit and across the web and haven’t been able to find anything. Any help will be appreciated.
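
To make the requested format concrete, here is a minimal Python sketch of how one such entry could be parsed and stored. The function name, the dictionary layout, and the 100% sanity check are assumptions for illustration, and the numbers are only the example figures from above, not real measurements.

```python
import re


def parse_distribution(entry: str) -> tuple[str, dict[str, float]]:
    """Parse 'Squats (40% Quad, 40% Hamstrings, 20% Glute)' into a name and shares."""
    match = re.fullmatch(r"\s*(.+?)\s*\((.+)\)\s*", entry)
    if match is None:
        raise ValueError(f"unrecognised entry: {entry!r}")
    exercise, body = match.groups()

    shares = {}
    for part in body.split(","):
        # Each part looks like '40% Quad': a percentage, then the muscle name.
        percent, muscle = part.strip().split("%", 1)
        shares[muscle.strip()] = float(percent) / 100

    # Assumed sanity check: the shares for one exercise should add up to 100%.
    if abs(sum(shares.values()) - 1.0) > 1e-9:
        raise ValueError(f"shares for {exercise!r} do not sum to 100%")
    return exercise, shares


exercise, shares = parse_distribution("Squats (40% Quad, 40% Hamstrings, 20% Glute)")
print(exercise, shares)
```

A dataset in this shape would be one row per exercise and muscle group, with the share as a fraction between 0 and 1.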

submitted by /u/DontTouchMyNut9000

Trending Recipes / Food In Real Time

I am trying to find a way to get the names of foods that are trending on social media right now.

On Google I found articles, but I am not sure whether they are up to date.

One of my ideas is to scrape r/foodporn, but it’s not only about trending food.

Which subs or websites provide this type of data and update it frequently?

Or how can I generate my own dataset?
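
On generating your own dataset, one rough approach is to count how often food names appear in recent post titles. The Python sketch below assumes you already have a list of titles (the ones here are made up) and a hand-written watch list of food terms; how the titles are collected, for example through an API or a scraper, is left out.

```python
import re
from collections import Counter

# Hypothetical watch list of food terms to look for.
FOOD_TERMS = {"ramen", "tacos", "birria", "pizza"}


def count_food_mentions(titles, food_terms=FOOD_TERMS):
    """Count in how many titles each food term appears (at most once per title)."""
    counts = Counter()
    for title in titles:
        words = set(re.findall(r"[a-z]+", title.lower()))
        counts.update(words & food_terms)
    return counts


# Made-up titles standing in for freshly collected posts.
titles = [
    "Homemade birria tacos",
    "Birria ramen at home",
    "Sourdough pizza, first try",
]
counts = count_food_mentions(titles)
print(counts.most_common(1))
```

Running this on titles from successive days and comparing the counts would give a crude trend signal.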

submitted by /u/Universe-89

Anyone Looking/requesting For Some Datasets? Trying To See If I Can Help! [SELF-PROMOTION]

There are tons of dataset requests in this subreddit that just go unfulfilled, so I built a tool, as part of my data marketplace project, that connects your data requests with people, organizations, or companies that can fulfill them. No need for you to do the searching. I realized there really isn’t a single place where you can just drop your request and have people come to you, so hopefully this helps some people out there. It’s called sellagen.com. Please let me know if you have any questions or feedback so I can improve it!

Disclaimer: I built and own this platform

submitted by /u/nobilis_rex_

Economic Activity Data At The Census Tract Level?

I am looking for a data set that shows the economic activity of census tracts. I’ve found a good one by zip code, but because the rest of my datasets are gonna be by census tract, I need tract-level data for this as well and haven’t been able to find it. Literally any suggestions are welcome because I cannot find anything. Thank you so much!

submitted by /u/E6E6FA_FFB6C1

How To Extract The Inc 5000 List (2023) Into Excel?

Hi there, I have seen a few questions on past years’ lists and Excel sheets, but I couldn’t get the R code to work for the 2023 set. I’m not sure if it’s because I do not have the correct link format or something else.
Here is the website I am taking the data from: https://www.inc.com/inc5000/2023

This is the Reddit post I tried to follow on R: https://www.reddit.com/r/datasets/comments/wr3vyz/trying_to_extract_inc_5000_2022_list_to_excel/
More specifically I followed this code: https://gist.github.com/MattSandy/14242b5af9dce69102647e2000848bcc

When I tried to follow the above code, I just substituted 2023 for 2022 and crossed my fingers, which did not work. I can post my R error messages or the exact code I wrote if that is helpful.

submitted by /u/Character-Forever382

Radiation Spread During An Oil Tanker Explosion

Hey y’all! Got a uni project to determine zones of high risk for the scenario of an oil rig or tanker exploding in a specific area. I would like to know if there’s any dataset available that gives some radiation value (e.g. in sieverts) corresponding to some distance/intensity from an explosion.

(It doesn’t have to be specific to the problem; I just need one that has a set of radiation values.)

submitted by /u/qvuuh

[self-promotion] Git Version Controlled Datasets In S3

Ever wanted to use Git to version control datasets or large files, found GitHub LFS too expensive, and ended up with a bunch of hacky scripts that use S3 for storage but give you no version control?

We’re here to help you with that. You can use your own S3 buckets or our free LFS storage with GitHub.

Try out: https://underhive.in (please use on Desktop, the mobile version is broken right now)

Dashboard Screenshot: https://i.imgur.com/eYwGGjw.png

submitted by /u/kaisoma

Open Source Data Set To Reprocess Data

I am trying to find a data set that takes data from a database and reprocesses it by removing noise and other irrelevant information. I have been looking at REDD (a public data set for energy disaggregation), but the hardware for it seems outside my budget. I am not sure if this is the right place to post this, but if anyone knows of something, please let me know!

submitted by /u/labib0910

Looking For Data Sets Covering California Buildings And HVAC Systems

Hello, I am working on a research project that aims to understand the relationship between buildings in California and the locations and environments they sit in, specifically which types of HVAC (heating, ventilation, and air conditioning) systems (e.g. air conditioner, heat pump, etc.) are in use. I am essentially trying to map California building footprints with an attribute denoting the type of HVAC system in each building. I would map this data in GIS software, on top of other data layers showing various demographic, economic, and environmental factors. Would anyone here know if such a data set exists and is available? This could be a shapefile with building geometry, or honestly just a file listing the HVAC system for a given building address. Or would anyone know how I could assemble such a data set? Any recommendations, guidance, or help would be much appreciated. Thank you!

submitted by /u/teledude_22

Turn Your Google Sheet Of Data Into A Sellable API

Hello, I’m a professional web developer planning to build a service that does exactly what the title says. I’m looking for your feedback on which features are must-haves. I’m thinking the flow would essentially be like this:

1) You create and fill the Google Sheet that holds the dataset you wish to sell API access to. You’ve probably already done this.

2) You log in to my site, give the API a name, and define pricing. I think what makes the most sense here are rate limits (i.e. x number of GET requests per minute/hour/day), so you define 3 rate limits for your 3 pricing tiers. You provide credentials that give access to only that Google Sheet, and my service automatically generates a REST API for your sheet.

3) You can either manually generate API keys for your customers through my website, or you can use an internal API on my site to automatically provision them for your customers.

4) Your customers pay you directly for API usage. If they stop paying, you revoke their keys (again either through the UI or through one of my internal APIs).

I’m thinking of charging $30, $50 and $100 per month for 3 tiers that will allow you to provision your API based on requests/hour and number of API keys.

What concerns do you have about that? The beauty of integrating with Google Sheets is that I suspect many of you already have many datasets in Google Sheets. I’m just providing a way to give programmatic access. You can also update the Google Sheet and the API will automatically serve the updated data. You could also programmatically write updates to your Google Sheet; your customers would not have write access.
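
To illustrate the per-tier rate limits from step 2, here is a minimal Python sketch of a fixed-window limiter keyed by API key. It is not the actual service: the tier names, the requests-per-hour numbers, and the class itself are all hypothetical.

```python
import time
from collections import defaultdict

# Hypothetical requests-per-hour limits for three pricing tiers.
TIER_LIMITS = {"basic": 100, "standard": 1_000, "premium": 10_000}


class FixedWindowRateLimiter:
    """Counts requests per API key within fixed windows (one hour by default)."""

    def __init__(self, window_seconds=3600, clock=time.time):
        self.window_seconds = window_seconds
        self.clock = clock  # injectable so the limiter can be tested without waiting
        self.counts = defaultdict(int)  # (api_key, window index) -> requests seen

    def allow(self, api_key, tier):
        """Return True and record the request if the key is under its tier limit."""
        window = int(self.clock() // self.window_seconds)
        if self.counts[(api_key, window)] >= TIER_LIMITS[tier]:
            return False
        self.counts[(api_key, window)] += 1
        return True


limiter = FixedWindowRateLimiter()
if limiter.allow("key-123", "basic"):
    print("serve the GET request")
else:
    print("reject with HTTP 429 Too Many Requests")
```

A fixed window is the simplest choice but allows bursts at window boundaries, and this sketch never purges old windows; a sliding window or token bucket would smooth that out at the cost of more bookkeeping.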

submitted by /u/AnyPrize

Project Help Regarding Power BI Dashboard

Hello everyone, this is my first post here. I am currently in my second year of an undergraduate degree in AI and data science. I am working on a student complaint dashboard using Power BI. I was thinking of collecting the data through a Google Form questionnaire given to my classmates and then performing ETL on it in Power BI. But I am having trouble coming up with questions that would give me valuable insights. So far I have grouped the complaints into categories, for example: 1) hostel/mess, 2) infrastructure, 3) fees, etc. I would then ask follow-up questions within each category, for example about the hygiene of the mess, and have students give a rating as their answer.

What measures/questions should I incorporate to draw some good insights from this?
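
Whatever the final questions are, recording each response as a category plus a numeric rating keeps it easy to aggregate. Below is a minimal Python sketch with made-up responses; the field names and the 1 to 5 rating scale are assumptions, and the same grouping could be done inside Power BI instead.

```python
from collections import defaultdict
from statistics import mean

# Made-up form responses: one row per answered question.
responses = [
    {"category": "hostel/mess", "question": "mess hygiene", "rating": 2},
    {"category": "hostel/mess", "question": "food quality", "rating": 3},
    {"category": "infrastructure", "question": "classroom upkeep", "rating": 4},
]


def average_rating_by_category(rows):
    """Group ratings by complaint category and average them."""
    grouped = defaultdict(list)
    for row in rows:
        grouped[row["category"]].append(row["rating"])
    return {category: mean(ratings) for category, ratings in grouped.items()}


averages = average_rating_by_category(responses)
print(averages)
```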

submitted by /u/Lazy_Telephone6759