Hello, I am looking for a dataset that has number of tourists by city, msa, or county for the last 10 years. It is okay if its a paid dataset.
submitted by /u/Timmmaaayyy93
[link] [comments]
Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?
Hello, I am looking for a dataset that has number of tourists by city, msa, or county for the last 10 years. It is okay if its a paid dataset.
submitted by /u/Timmmaaayyy93
[link] [comments]
I’m looking for datasets containing questions that people ask to “opponents” along with questions that they ask to other people in similar situations. Examples of what I’m looking for include lawyers asking questions to their own witnesses and cross-examining other witnesses, politicians in hearings asking questions to supporters of different political parties, and detectives asking for information from suspects and from each other. I’d like to analyze any changes people make in asking questions to their “opponents” vs other people as a baseline.
submitted by /u/geartrains
[link] [comments]
Hey all! I was doing some research on companies offering AI solutions for recruiting. I remember seeing a company mentioning that they were benchmarking their algorithm’s results to make sure there was no bias (as it relates to diversity) using some public dataset.
Unfortunately, I forgot to save the link and have been having trouble remembering what that dataset was. I would greatly appreciate it if you could tell me what the dataset could have been.
Thanks!
submitted by /u/opposity
[link] [comments]
I apologize in advance for the vague request, but I need to build a Tableau dashboard and present it for an interview. Unfortunately I wasn’t given any firm requirements or data when I asked, except that it needs to support funnel analysis. My Google searches for data haven’t been successful either. The data would ideally deal with maximizing capacity at a boarding school or stay over camp, but it doesn’t have to as long as the data support funnel analysis. I’m still pretty new in BI, so I’m not sure which data would best facilitate this. Thanks in advance for any help!
submitted by /u/skittles_grabber
[link] [comments]
All I would need is a data set lol. I might just be dumb but ive looked on Harvard dataverse, world data bank and a few others. Any help would be appreciated.
EDIT: For stata
submitted by /u/DoubleBarrelEnjoyer
[link] [comments]
This dataset from the Blood Transfusion Service Center in Hsin-Chu City, Taiwan, explores blood donation behavior as a classification problem. Collected every three months from 748 randomly selected donors, it includes attributes like recency, frequency, monetary value, and time. The dataset is ideal for studying and predicting blood donation behavior, pretty cool for classification tasks focused on understanding influencing factors.
You can find it here: https://sellagen.com/item/650207244d7ce7e8220cbec5
submitted by /u/nobilis_rex_
[link] [comments]
Has anyone come across any datasets dealing with autism rates? I want to work on a personal project since I am close to the subject of autism but I have not come across any large data sets
Specifically it would be nice if the information is broken down by year, country, etc and shows how it is progressing
submitted by /u/aerost0rm
[link] [comments]
Hello,
I’m looking for a queuing system, e.g traffic junction, service in cafe/ fast food, anything. I would need to have arrival, queue and service times. Does anything like this exist readily available?
submitted by /u/big123ben321
[link] [comments]
Trying to just view the CDC datasets, and the only format it seems to open in is text document. Why!?!? I can’t tell a single thing that’s going on, not even the variables being measured, because it just looks like blocks of text arranged haphazardly in the notepad app
Some other datasets from GitHub contains EDF files and text files again, which are also super inconvenient
Like where is the option for csv or spreadsheet, or basically anything that’s readily viewable and understandable? Why isn’t that the default? I was expecting that viewing the data files would be the easier part of trying to write a research paper, but no
Also if anyone knows how to get this CDC dataset into a viewable format, please let me know! Thanks
submitted by /u/Classic-Asparagus
[link] [comments]
BigPicture.io, the company I work for, has just released the latest version of their open-source company dataset, and it’s now available for download. I’ve been in Reddit for a while now, and think that this community might find it useful.
Check it out here: https://docs.bigpicture.io/docs/free-datasets/companies/
You need to sign up first, as we’ve had problems with bots and an AWS bill one month that nearly killed us.
Please feel free to provide your feedback/suggestions as we’re always aiming to improve our services.
submitted by /u/master_in_something
[link] [comments]
Looking for just a list that contains two kinds of information about the FBI’s uniform crime reports (UCR) or the newer NIBRS (the incident-based reporting system, can’t remember what it stands for):
Which agencies (e.g., police departments, etc.) contributed data to the UCR and/or NIBRS Which agencies did NOT do that (e.g., last year)
I’m hunting around the FBI’s UCR website looking for this and haven’t found it, yet. Anyone have this info?
submitted by /u/bobbyfiend
[link] [comments]
Looking for datasets that have kids’ images and after they grow up
submitted by /u/Dizzy-Banana-94
[link] [comments]
I think the current iteration of the data marketplace sucks. You have to know a specific place, where you want to get your data from. The variety of data sets available in a specific platform also varies so much. Also, it is incredibly difficult for a non-technical person to get their hands on the data. If a business user wants to access data they have to jump through a lot of hoops to download the data. Is it a good idea to start a marketplace that solves all these problems? Did anyone try to do this before?
submitted by /u/Responsible_Bell_772
[link] [comments]
I’m looking for a dataset of sports, games or video games events with two teams of multiple players (ideally 5 to 10) facing each other with the individual composition of each team being a different combination of a limited pool of players. And of course the final score/outcome of the event.
Like if 23 players had played 100 games of counter strike together : who is playing, what is each team’s composition (not always the same 5 dudes facing the other same 5 dudes) and what is the result + maybe how long did it last ?
All I can find are datasets with teams with fixed or little variying composition like the european football dataset or broad results without individual differenciation of the team members like league of legends ranked games datasets.
Doesn’t have to be highly skilled players. It could be the dataset of one’s kid’s football games at recess.
Any idea if such à dataset exists ? I’m currently trying to make my own by recording my own practice games but at the rate of once a week this will take forever.
submitted by /u/Heliantine
[link] [comments]
I haven’t been able to find any sort of data about how much Reddit gold has been given over the years. Has any information on this been tracked or published? Would be interested in seeing the results!
submitted by /u/naxypoo
[link] [comments]
This is what I found, but I suspect they are not updated, I have looked up a few of them up and they do not match what is shown on the link, but the way they are listed and the whole structure is just perfect. thats what am I looking for, Any alternative?
https://en.wikipedia.org/wiki/List_of_cities_by_average_temperature
submitted by /u/4everonlyninja
[link] [comments]
Hello, Reddit community,
I’m working on a project that focuses on query-oriented data cleaning with human expert involvement, and I’m in search of a suitable dataset to support this research. The dataset should ideally contain messy or incomplete data.
If you know of any relevant datasets or sources where I can find such data, I would greatly appreciate your assistance. Additionally, if you have any suggestions or insights on where to look for datasets with data quality issues, please feel free to share them.
Thank you in advance for your help and suggestions!
submitted by /u/thelifeofZ080
[link] [comments]
I’m looking for a dataset that has geolocation coordinates (e.g., latitude & longitude) for bombs dropped on Gaza, especially in the past few weeks, but older, as well. Ideally, I’d like a column with location and a matched column with date/time, and any other information is gravy.
Any ideas? I’ve been searching online, trying to follow sources back for reports in WaPo, Reuters, Axios, AP, etc., but they all seem to lead to dead ends (e.g., proprietary data not shared online).
submitted by /u/bobbyfiend
[link] [comments]
I need a dataset for a university proyect about cultural distance and cultural exports. For that I need cultural exports from S.Korea (or any country at this point) to other countries.
submitted by /u/Suku_Lete
[link] [comments]
I would ideally like to have state code/name, year, month, and average temperature. I looked at NOAA, Kaggle, Weatherspark and Extreme Weather Watch but couldn’t find it in a tabular format.
submitted by /u/byrak97
[link] [comments]
Hello, so I am trying to do some real estate-related research, and am particularly trying to understand types of buildings and locations that are most likely to have houses that have certain “green” and sustainability-related features, such as certain energy efficient appliances. I do not intend for this to be a discussion about the overall sustainability and performance of heat pumps, but I am trying to find a way to obtain a database of as many houses as I can across that US that have a heat pump, or just within California. The whole US would be great, but I am most interested in California for the moment. This is real estate-related, because heat pumps are just a hot topic in general in the eco-friendly home space. I know there are certain data sources like RECS data sets that have stats on heat pump adoption, but these values are only at the census division level. I want to see how heat pump homes are distributed much more locally and granularly so that I can understand which cities, regions, districts, neighborhoods, climate zones, etc. have higher clusters of heat pumps installed than others. Additionally, I want to understand the types of homes that have heat pumps, so that I can understand if there are any trends to take note of. I at first thought this idea was absurd and this data was just unobtainable, but then it was just suggested that I take a look at Zillow’s API ,which can be used to pull real estate home data that includes (sometimes) the HVAC system of a home. So I am wondering if maybe I could actually leverage this to get a read on where heat pump households are located within California. But also, I am wondering if there are other data sources I could use for this, I am thinking like construction permit databases or tax assessor databases, where I could filter results for houses where a permit was taken out for a heat pump installation. The idea would be to match all these data points to an address, so that I can map out heat pump homes across the state with GIS. Does this sound reasonable? Would anyone here perhaps have any suggestions on how I could approach this research challenge? Thank you!
submitted by /u/teledude_22
[link] [comments]
Hello all! I’m needing to find some cyber security related large or small datasets. In particular, datasets with relational datasets. Can anyone provide me with some?
Thank you!
submitted by /u/APB7148
[link] [comments]
Hi, so straight to the point me and my team chose a project idea for the machine learning course “social media mental health monitoring” basically Mental Health Monitoring: Collect data on social media posts, online forums, or surveys. Develop a sentiment analysis model to monitor and identify signs of mental health issues, and i think it’s gonna be fun and all but the first issue to face us is the lack of usable dataset, we looked into it alot but most of what we found was papers and sources and even the datasets we found (barely a handful) were not exactly aligned with how our project should go or unavailable, our professor told us she’d prefer a dataset that’s not from Kaggle for some reason, but I’d really appreciate it if someone could help me link a similar dataset that can be used for this project be it on kaggle or not and if there was a project implementation that’s close to what we’re trying to achieve here.
Thank you.
submitted by /u/_-_VIK_-_
[link] [comments]
I’m looking for statistics of high school graduation rates based on state, ethnicity, and gender. I am also looking for statistics based on college enrollment rates based on state, ethnicity, gender, etc…
submitted by /u/iiinnnoooxxx
[link] [comments]
Looking for a csv dataset of the enslaved population in the United States for some research on throughlines between past racial oppression and current systemic racism. I found a lot of census pdf documents but haven’t been able to find any excel data with county by county or state level numbers.
submitted by /u/NPRnoose
[link] [comments]
Hi everyone, I am looking for an image dataset, consisting of only images of electric vehicles. The aim is to detect the number of electric vehicles going down a road. Preferably with vehicles that can be seen on European roads, however, not a dealbreaker if otherwise.
Any help would be appreciated! Thanks!
submitted by /u/mrsavov
[link] [comments]
And I’m not sure if people will get this things can you help me?
submitted by /u/Mr_BMO5888
[link] [comments]
Looking for data on medications – with FDA pregnancy categories specifically. Anyone know of a comprehensive list/table out there somewhere? If it needs to be scraped, any recommendations for an easily scraped source?
submitted by /u/tbosk
[link] [comments]