submitted by /u/cavedave
[link] [comments]
Category: Datatards
Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?
Hello guys I’m looking for a dataset that contains information either on the tourism industry or generally anything that has to do with tourism, hotel prices, bookings and cancellations or anything that comes to mind. If there is a resource online i still haven’t found something good I would appreciate any kind of help. Thanks.
submitted by /u/ComputersAndPunches
[link] [comments]
Hi Guys,
Im currently building a specialized language model, and trying to get access to some unique data sources (think healthcare). What is your go to data marketplace (and why)?
Any suggestions are welcomed
submitted by /u/XhoniShollaj
[link] [comments]
anyone knows where i can find a dataset for videos/images of exercises for shoulder, neck, fingers,etc.
where the patient does these exercises at home to get better
submitted by /u/dark_magician420
[link] [comments]
I am a student looking for a Diabetes Dataset( preferably Type-II ) for a research project. I have been trying my best but just found PIMA Indian Diabetes Dataset, which has been overly exploited.Any help is appreciated.
submitted by /u/Strange2118333
[link] [comments]
I need to find raw data and datasets to build my data analytics portfolio about that industry, I was considering to pay statista but I’m not 100% sure.
Not kaggle please.
submitted by /u/zardiax
[link] [comments]
For a while now I’ve trying to prove a perception of mine (and other folks too, I’m sure): scientific papers are getting much longer. I have the (strong) impression that papers now tend to have much more pages than years ago. If anyone knows of such a dataset with, say, titles of papers published by a journal during some years and then, attached to every paper, information like the number of pages.
I’d love to find data about STEM journals, but I’ll take any data that’s available.
Thanks.
submitted by /u/MasonBo_90
[link] [comments]
I’m looking for as many or a singular large dataset of human history. I did check back through this subreddit but don’t see anything. Any help would be appreciated
submitted by /u/ironic833
[link] [comments]
Are there any free intraday (e.g., 1-minute, tick) cryptocurrency data sources?
submitted by /u/BOBOLIU
[link] [comments]
Im trying to train an algorithm to determine if the shoes are a counterfeit or not but i can’t seem to find a decent dataset, can you please help me guys im doing this for my thesis project
submitted by /u/Rinzler_Uchiha
[link] [comments]
Sharing the dataset containing 27 hours of Sanskrit Audio from the News Services Division India. Find more information on the dataset page.
Link to the dataset: https://www.kaggle.com/datasets/warcoder/sanskrit-speech-recognition
submitted by /u/Chirag_Chauhan4579
[link] [comments]
Hey everyone! I am doing a project for a Hackathon related to the environment. I’m planning on making a dashboard using Tableau and analyzing a dataset related to the environment using SQL. Any resources?
submitted by /u/TwistLow1558
[link] [comments]
Under EU/UK legislation, consumers are eligible for compensation if their flights are delayed or cancelled due to reasons within a carrier’s control. This would rule out natural disasters, for example, but include reasons such as ‘an air steward was ill’.
Passengers are able to claim compensation based on the length of the delay and distance being travelled, and there’s some excellent documentation on the subject here:
The process for claiming compensation is convoluted and has spawned a mini industry of copycat legal firms who’ll do the heavy lifting on behalf of customers (for a fee).
Many of these firms provide free online tools (e.g. this one) for checking the validity of a claim. Whilst it’s trivial to check the status of any given flight (e.g. delayed by x minutes, distance, destinations, etc.), determining the airline’s provided reason for a delay is less obvious.
Is anyone familiar with an API or dataset that might provide this data? I’ve found a provider for US domestic flights (https://www.bts.gov/explore-topics-and-geography/topics/airline-time-performance-and-causes-flight-delays) but nothing for those operating within Europe.
Any pointers would be greatly appreciated.
submitted by /u/trilson
[link] [comments]
So here are some geographical location based disaster datasets:
https://www.kaggle.com/datasets/warcoder/earthquake-dataset
https://www.kaggle.com/datasets/warcoder/oil-spillage-data
https://www.kaggle.com/datasets/warcoder/civil-aviation-accidents
Here is a basic audio classification datasets that classifies either the word is bird, cat or dog
https://www.kaggle.com/datasets/warcoder/cats-vs-dogs-vs-birds-audio-classification
submitted by /u/Chirag_Chauhan4579
[link] [comments]
Hey everyone. I am learning myself R studio and would apppreciate any data sets to practice on. Perfferably I would like easy to understand data, as I am only a beginner and I am trying to get comforable with statistical analysis and data visualization. Thanks.
submitted by /u/Cyberredpanda1
[link] [comments]
Does anyone know if there are any public datasets available with information on cardiac arrests that follow Utstein or EuReCa guidelines?
Or are there any datasets that I could obtain if I am a student and need them for my master’s thesis?
Thank you!
submitted by /u/nivaznu
[link] [comments]
Hi, i am working on a research related with citation index. I am unable to find any dataset made public. It would be a great help if i could find a verified dataset rather than scraping data from google schoolar or scopus.
TIA
submitted by /u/pickle_rickstar
[link] [comments]
I understand that these are commercially valuable and personal, so rather hard to obtain. My research interest is non-commercial. I just want to establish base rates of specific terms.
Any suggestions?
submitted by /u/Forensicista
[link] [comments]
Title pretty much covers it. I’m looking for datasets on antibiotic resistant bacteria in UK waterways for a personal/portfolio project (not affiliated with any company, I am a Data Analytics student with some background in biology)
I’m especially interested in looking at the river Thames and the impact of antibiotics filtering into the environment through wastewater treatment plant “effluent”. Alternatively, hospital effluent would be really interesting to look at too!
Most of the data I’ve found has been a (thin) patchwork of time periods and areas covered and it’s been hard to find anything I can use to tell a story. Any help would be hugely appreciated. Thank you, r/datasets!
submitted by /u/Medium-Tea-
[link] [comments]
For context, I’m looking for a large food recipe datset (>5000) with nutritional information for my second personal project as a data analyst.
The goal is to identify recipes and the list of ingredients for it with the following input parameters: The amount of nutrients Dietary requirements Type of cuisine Etc.
In terms of the data source, any excel public dataset or getting it using Post API request is fine.
Thanks in advance.
submitted by /u/xu3n12
[link] [comments]
Title is pretty self-explanatory. I plan on making an interactive shot chart on RAWGraphs. Wondering if anybody has a dataset for the 2022-2023 NBA playoffs. (dataset needs X and Y coordinates)
submitted by /u/TwistLow1558
[link] [comments]
Looking for very specific use cases…
Moneyball is my best example but I’m hoping for more of something along the lines of the business of entertainment ticket sales. Any help is appreciated 🙂
submitted by /u/poiseandnerve
[link] [comments]
I looking for something like this, on the county or city level if possible. I’d need something deeper than state. Any recs?
submitted by /u/throwawayrandomvowel
[link] [comments]
I’m looking for all NCAA conferences and the colleges within them! (I think there are around ~140 conferences?)
submitted by /u/CheesyPanther
[link] [comments]
i am looking at the NASA earthdata datasets and ngl it is overwhelming haha. Can anyone suggest any dataset from there to analyse and create simple regression models (or even classification or clustering) from there?
In the meantime I will continue looking at them myself too – but it would great to hear some experienced opinions too!
submitted by /u/Icy-Bid-5585
[link] [comments]
I’m looking for an api that would give me an average price for food. Can’t find any
submitted by /u/Jerem_d
[link] [comments]
Does anyone still have access to the CT part of this dataset (Left_Atrial_Segmentation_Challenge_2013)?
Thanks!
submitted by /u/BABY_B00MERS
[link] [comments]