Pls share your inputs
submitted by /u/whateveritis2020
[link] [comments]
Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?
Hi everyone! I am fine tuning an LLM so that it is capable of decision making and function calling in robotics and AIoT applications. So for this I need a dataset that contains inputs from various different types of sensors and outputs which can be maybe moving motor by a certain angle or using an API to send notifications, etc. So are there any such datasets?
submitted by /u/Shoddy_Vegetable_115
[link] [comments]
Hello everyone, I am new here and was sent here from the r/statistics subreddit.
Like the title says, I am currently in a university class and am having trouble finding datasets for my project. What I am doing for my project is this: I am comparing the average revenue of the movie industry (meaning worldwide box office) and comparing it to the revenue of the gaming industry (meaning mobile gaming, PC gaming, and console gaming all together).
For the datasets, I don’t need any specifics other than how much the industry made per year. Something so simple seems to be impossible to find lol. Something like boxofficemojo.com but in a downloadable CSV file and one that takes every movie of the year, not just the top 100 like that site does.
If anyone can help me out, this would be greatly appreciated, thanks!
submitted by /u/THE_Best_Major
[link] [comments]
I need to get future flights on each report worldwide.
submitted by /u/alejandrobrega
[link] [comments]
I need to get live and future flights on each Airport worldwide.
submitted by /u/alejandrobrega
[link] [comments]
i need a data set that can be easily used in rapid miner project.
submitted by /u/yourmomis-gay
[link] [comments]
I’m currently working on a project that requires a comprehensive dataset of grocery items in India. I’ve searched on Kaggle and found a few datasets, but one doesn’t have images and the other doesn’t have descriptions.
I’m ideally looking for a dataset that includes the following attributes:
product_id
name
description
price
image_url
category
I’m also interested in datasets from Indian grocery delivery platforms like Instamart, Zepto, and Blinkit.
If anyone has any leads on such datasets, please share them in the comments below. I would greatly appreciate it!
submitted by /u/bazzinga2002
[link] [comments]
Hello everyone, I am building a sports bets project and I need access to historical sports data for analysis. Could you please recommend which is the best API that fits this purpose?
I understand most of these are paid, so I would like to make the correct decision before I make any type of commitment.
Thanks,
submitted by /u/xywa
[link] [comments]
I am interested in studying anti-Semitism in public administration and require a database of Jewish surnames to identify the ethnicity of the plaintiffs. Most databases I’ve found online only offer search engines without a scrappable list or downloadable CSV file. Am I overlooking any available databases?
Thank you.
(No nazi, plz)
submitted by /u/the_iriki
[link] [comments]
So far I’ve only found the percentages and 95% CIs for each category, but it would be nice if I could get the raw data
https://www-doh.state.nj.us/doh-shad/query/builder/prams/PRAMS/PostpDepress12.html
submitted by /u/Classic-Asparagus
[link] [comments]
Hello, so for an analysis project I am trying to get the number of households in the US by state as an Excel file, and then further organize these data by the type of household (e.g. single-family, multi-family, apartment, etc.), and then then further organize these state-level counts by the type of heating-cooling equipment used in the household. I am having a tough time trying to get this accomplished though, as I don’t know how to actually just retrieve the data columns I want.
I am looking at the US Census Bureau website, and specifically the American Housing Survey (AHS) section:
https://www.census.gov/programs-surveys/ahs.html
I am specifically trying to use the AHS Table Creator tool to get the columns I want:
https://www.census.gov/programs-surveys/ahs/data/interactive/ahstablecreator.html?s_areas=00000&s_year=2021&s_tablename=TABLE1&s_bygroup1=1&s_bygroup2=1&s_filtergroup1=1&s_filtergroup2=1
However, I only see limited States actually offered, and am not sure how to get the other columns I want. Organizing this data to get the data I need is really confusing me, and I would appreciate any help on how I can accomplish this if anyone knows! Thanks!
submitted by /u/teledude_22
[link] [comments]
Hi all, I’m conducting a literature review on photocatalysts and want to make a graph showing over what range of intensities different materials can absorb light (due to their bandgap) and need some data on the intensity of light at sea level vs its wavelength and for the life of me can’t find it anywhere. If anyone can point me to a website where I can find it I would be very appreciative.
submitted by /u/Willie_the_k1d
[link] [comments]
Hey friends, Simple as title. I went through many places but couldn’t find many datasets that have these three modalities. Really any dataset will do, of course the more ‘well built’ the better.
even if you don’t know any specific, I’d appreciate that if you have a suggestion for where to look Really many thanks 🙏🏻
submitted by /u/TheBamba
[link] [comments]
I’ve been trying to find for weeks now and so far I’m having trouble. I need a student performance dataset which includes GPA attained and text data which includes their comments/feedback regarding their academic performance. This would be a huge help for my model.
submitted by /u/zeyus042
[link] [comments]
Hello!
I’m looking for a dataset for my thesis on environmental protection and climate change.
I need a lot of data, a large dataset.
I’m interested in environmental protection, climate change, renewable energy sources, climate variations (air pollution, waste generation, weather data) or the same, but related to electric vehicles.
Thanks!
submitted by /u/itsnotmn
[link] [comments]
If apartment community XYZ has 5 units left of floor plan A1, and 3 are leased, there are only 2 A1’s left, and the price goes up 20% (for example).
Who determined this beforehand? And if the algorithm/whatever decision making exists for all these apartment communities, how is that not price fixing? There are only 3-4 data companies who handle the entirety of this data from what I can find online.
I’m curious because Realpage has a big lawsuit against them (link here) (another link here). But if there are only 2-3 other companies who handle the same data, it can’t be much different from how these prices are determined.
submitted by /u/fl135790135790
[link] [comments]
I’ve been given this large dataset in this format that I don’t know anything about. Is there a free software that can convert the data into excel or a reader that will be able to perform search functions?
submitted by /u/AdBene
[link] [comments]
Need help finding a data set
I’m working on a multiple regression model and I’m having so many problems with finding a data set that has interval variables.
Currently, I’m looking at (broadly) addiction/drug usage and mental health. I was using the NSDUH data set, but it is massive and a tutor told me that the variables aren’t useful for what I’m doing (bc it’s lack of scalable ones which results in too many dummy variables).
This is just for a class and I need to create a straight forward model for my final paper, but it seems so difficult. Does anyone have any recommendations? I just want to be done with it.
submitted by /u/ExperienceOk1476
[link] [comments]
Hi, there! Can anyone assist or guide me to where I can obtain correlated datasets on Cardiovascular health and how physician care/relationships impacts patients’ hospitalizations and readmissions? Seeking data by demographics (region, age, gender, etc).
submitted by /u/Curious-Mind-SG71
[link] [comments]
I understand the dataset is no longer available, but copies of it must still exist somewhere, I hope.
submitted by /u/rfurlan
[link] [comments]
I’ve been trying to look around and see if I can find any historical datasets on the Michelin guide, ideally all of it, or starting from 1931 when the star system was first implemented, but have come up with nothing.
I have this: https://www.kaggle.com/datasets/ngshiheng/michelin-guide-restaurants-2021?select=michelin_my_maps.csv which is fantastic, but the fact that its so well maintained poses a problem for my own purposes. If anyone knows of any place this sort of data can be found or place it might be digitized and accessible already, I would be endlessly grateful.
submitted by /u/martinjeden
[link] [comments]
Hello,
I’m looking for a data set of all companies both listed and de-listed. I’d like the historical Market Cap for each day for each company going back to at least 2017. Any ideas for data vendors who may provide this data? I know this wouldn’t be free and am ok paying. Just looking for a good data provider. Any info would be helpful. Thank you!
submitted by /u/valorallure01
[link] [comments]
Hey Reddit!
I just launched DataDepot, which simplifies how you find, share, and use diverse datasets and research products. Inspired by my own challenges in accessing and monetizing data, we’ve created an easy way for you to turn your datasets into a profitable asset in just a few steps.
Key Features for Providers:
💰 Secure Payments: Simple, secure transaction handling for subscribers.
🔒 Authentication: We safeguard your data while ensuring subscriber access.
🎯 Delivery: Data presented in a clear, table format for easy analysis to your subscribers.
🕥 Real-Time Updates: Your listing instantly mirrors changes made to your datasets.
🎨 Provider Dashboard: Easily centralize your listing management process.
We’re starting with a private release and would love some early feedback from providers. Comment below if you’re interested in trying it out & I will send you an early access code 🙂
submitted by /u/BobbyAxelrod9
[link] [comments]
Hello,
I’m in search of a dataset that includes the number of listeners for various artists on Spotify in the year 2015. I’m specifically interested in non-synthetic data. Does anyone happen to have something like that?
submitted by /u/highonlemonsips
[link] [comments]
Where I can get dataset on the best TV – Email – SMS – Video Ads with opening/conversion metrics ?
submitted by /u/fractai__
[link] [comments]
Hi,
I am looking for a dataset with air quality information. It would be best if it was information for a country for the year. Any help is appreciated
submitted by /u/QuackyPenguin
[link] [comments]
Hi, I am looking for a dataset which has several mental disorder symptoms and the target field must be the disorder that has to be predicted. I tried looking in several website. If anyone is working on it or has the similar kind do share. Thank You.
submitted by /u/Rahul-1078
[link] [comments]
James Hoffmann did a coffee taste test with 5k participants and 4k survey results.
James summarizes the data here: https://www.youtube.com/watch?v=bMOOQfeloH0.
Then he links the spreadsheet in the description here: https://bit.ly/gacttCSV
submitted by /u/Qinistral
[link] [comments]
I’m gathering financial data on firms and was wondering what the best way to is label/organize the data? I’m gathering data on their accounts from annual reports.
submitted by /u/jdumpz
[link] [comments]