submitted by /u/yhl3052
[link] [comments]
Category: Datatards
Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?
Hello, I am not a statistician so I’m unfamiliar with searching for data so help on where to find free data would be appreciated. I’m specifically looking for data on the quantity sold instead of sales for the Sporting Goods Retailers industry in the United States for the past 5 years (monthly). Thank you in advance!
submitted by /u/QDibie
[link] [comments]
I’m working on a master’s assignment focused on these islands. I’m aiming to integrate various kinds of geographic data, including traffic, geography, networks, services, building infrastructure, and socio-economic data. I’m looking for free sources that can provide up-to-date or relatively recent data feeds.
Any suggestions?
Thanks!
submitted by /u/TrapsterJo
[link] [comments]
I am looking at ways of extracting titles from sentences. these could be reddit Posts, comments or article titles. Can you help with where I can find a good dataset to train my model for such usecase.
Thanks for the help.
submitted by /u/romatom
[link] [comments]
Hi r/datasets
I’m learning about how to manage upwards of 500,000 customer profiles that would include things like:
Name Profession Email Phone Address Business Address Instagram Tik Tok Twitter YouTube
I’ll need to be able to search this database based on certain criteria (IE: search all Influencers, search all male customers, search all Texas customers) as well as export and share lists.
This can obviously be done in Excel or Sheets but I was looking for something with more modern UX and an inherent focus on contact management.
Any direction appreciated.
submitted by /u/Old-Act3456
[link] [comments]
Hello all,
I am curious if there is an open database of all FDA approved drugs by indication (disease state) that is regularly updated and easy to access.
Appreciate any help.
submitted by /u/Dnncir
[link] [comments]
Hi, I’m planning to build an application to help people with dyslexia get better access to documents using ML. I am looking for datasets of doc and docx files with textual content that might not be very friendly to dyslexic people for training my ML model. Can someone help me in finding such datasets?
submitted by /u/Bitter-Name-6594
[link] [comments]
Hi there, I’m currently working on a group project where we were assigned to make a dashboard for a specific mobile app developer. We were assigned SuperCell, but are finding it difficult to acquire reliable and free datasets. Kaggle has some options, but we need plenty more. Specifically we are looking for data that could be used on a dashboard such as revenues, amount of downloads, review data, active users, etc… PS: More generic datasets about the mobile gaming industry in general are also useful. THX in advance.
submitted by /u/A_Succulent_Eggplant
[link] [comments]
Hi guys i’m looking for a glaucoma detection dataset please! I know theres some on kaggle but its just good.
submitted by /u/PlatypusParticular34
[link] [comments]
I’m looking for datasets on Kuala Lumpur and Jakarta consumer tastes and spending.
Any idea where I can look for them?
Cheers
submitted by /u/saintisstat
[link] [comments]
Hello everyone!
I’m looking for a dataset of images of facial skin annotated with their type like oily, sensitive, dry.. My goal is to detect the type of the skin based on a input image
I cannot find any, so your help is appreciated!
Thank you!
submitted by /u/Expert-Damage8482
[link] [comments]
Hey there, I know it has been asked a couple of times before, but I could not get a good source from them, and besides my request is perhaps simpler.
I am looking for a dataset of chemical reactions, the simplest possible, to construct an interaction graph, e.g. from the reaction H + 2O -> H2O, I would construct two edges between (H, H2O) and (O, H2O). Is there a database with a bunch of reactions of any kind which I could use?
Alternatively, if you know a website whose HTML could be scraped, I could also work with that.
Thanks
submitted by /u/qotsalo
[link] [comments]
Could someone please recommend me a good source of free data on cryptocurrency fundamentals?
submitted by /u/BOBOLIU
[link] [comments]
My boss asked me to make the Target Group for our new products. One is an entry-level sedan, the second one is a 5-seat SUV and the last one is a full-sized 7-seater SUV. I’ve to make three TG for these 3 models. I’ve collected data on the population of Bangladesh by age group, how many people live in one urban area, and trying to relate it to the income level of population. But it is very hard to quantify how many people can buy our products. Can someone help me with this problem with suggestions or solutions?
submitted by /u/drdoctor98
[link] [comments]
I have a small doubt hoping you can clarify. I’ve been trying to collect F&O daily bhav copies from NSE from 2011 to 2022. I was successful with doing so from years 2016 onwards using some libraries on python.
However, a lot of people on the internet including myself have been facing the issue of downloading bhav copies prior to 2016 because the new NSE website is pretty shitty that way (it’s storing the csv file in a zip so the API can’t access the csv directly).
If you have some time you spare, will you be able to help me out? It’s for a research project I’m working on!
Thank you in advance 🙂
submitted by /u/jingbolosodabama
[link] [comments]
I’m working on a data mining school assignment, with a primary focus on quality education/decent work and economic growth. However, I’m open to exploring datasets related to any other SDGs as well.
I’m looking for two datasets with the following criteria:
At least 10,000 observations and six variables per dataset They must be mergeable Must be related to one of the SDGs
I’ve already searched on Kaggle but I haven’t found suitable datasets. If you have any suggestions or if you know of an easier way to filter search results effectively it would be much appreciated.
submitted by /u/Lyn03
[link] [comments]
Hello, I would need to find the number of new cases, deaths and recoveries per day in a given country for a project im making. Preferably the data should be listed as a spreadsheet from around the day of the breakout till the 100th day. Any idea where can I find this type of information? I’ve been looking for the past hour, but all I’ve found is just the total number or the newest data (I need it from the 2019-2020)
submitted by /u/Several-Ad-3048
[link] [comments]
Hi Friends,
For my Master’s i try to model pedestrian traffic – however, getting datasets with pedestrian traffic data is bit a challenge – sinds most companies that collect such data sell it for huge amounts of money 🙂 Can anyone recommend some freely available sources?
Thanks in advance!
submitted by /u/salamanta
[link] [comments]
My team mates and I wish to focus on mental health resources and stigma for our Big Data course project, and we could use some help locating data sources. Here’s the rundown:
Project Objective: Our research aims to collect and analyze data related to mental health resources and the evolving stigma around mental health, particularly on social media. We plan to compare trends over the past two decades, both globally and within North America if the global data is limited.
Data Needed: To make this project a reality, we require data on the following:
Availability of mental health facilities across countries.
Information on mental health programs in various nations.
Data regarding non-governmental organizations (NGOs) dedicated to mental health awareness globally.
The percentage of utilization of mental health facilities and programs in each region.
We initially tried to access data from the World Health Organization (WHO) through their Project Atlas Report (https://www.who.int/publications/i/item/9789240036703 ), but our efforts hit a roadblock. We’ve reached out to WHO, although we’re uncertain if they’ll share this information.
If anyone knows of alternative data sources or has any tips on where to find similar datasets, your input would be incredibly valuable. We’re committed to advancing our understanding of mental health resources and stigma, and your assistance can make a real difference in our research. Thanks in advance! 📈🧠
submitted by /u/hinberry
[link] [comments]
Hi!
I’m writing my dissertation and looking for a dataset of global Technology Adoption rates in Corporate Management. For the years 2012 through 2022.
I’m especially interested in Technology adoption rate in performance appraisal processes over the years.
submitted by /u/Joan_Hawk
[link] [comments]
Hi everyone👋,
Maybe someone has researched mapping COIOP product classification with EXIOBASE3 industry classification? I have the CPI of Germany with COICOP products, want to map with EXIOBASE industries to see how much energy sectors consume and if price increase of certain products are justifiable or not. Could someone guide me?
submitted by /u/Grindelwaldt
[link] [comments]
I’m curious if there’s any data on noise pollution over time in cities, such as NYC, London, Paris, or perhaps even in entire countries (if that even makes sense).
I’m thinking electric cars are becoming more common, and they’re more quiet, so perhaps noise pollution has gone down in recent years?
(To be honest I’d be more interested in a graph than the raw data, but this was the stats/data request sub I found (via askstatistics…) so here I am.)
Thanks!
submitted by /u/cpwnage
[link] [comments]
Hello, I’m currently working on a project and we’re trying to offer information to clients about average maintenance costs of ownership of a vehicle. So, basically, if you are planning to buy a vehicle, you would know how much and when you would have certain expenses related to maintaining your vehicle (scheduled services, oil changes, windshield wipers, tire changes, etc).
Ideally, the dataset would show information based on average mileage driven per year, different makes, years, and models, and separate information for each state.
Thank you!
submitted by /u/dumdumbadum
[link] [comments]
Hi all,
I wanted to create an image classification model for dinosaur recognition but had no success in finding any usable images. Does anyone have an idea where to find a dinosaur images dataset(s)?
Thanks
submitted by /u/Milennium-Falcon
[link] [comments]
Hello people,
I am looking for a multiclass classification dataset(more than 3 classes) for my data mining project. If you have any leads please let me know. I have been searching for sometime on kaggle, UCI repository but I am not able to find it. Thanks in advance.
Note: It shouldn’t contain any Text or Image analysis.
submitted by /u/swinging_mood7260
[link] [comments]
These are supposed to be birth dates, how to read them?
2440511
2439348
2443536
2429575
2439815
2441089
2439139
2435595
2439094
2442944
2446164
2437132
2441420
2442178
2436247
2433557
2440835
2430173
2436205
2447681
2436951
2435644
2436441
2441879
submitted by /u/Entity303BR
[link] [comments]
Dear everyone,
I humbly seek your assistance in my current endeavor. I am tasked with conducting a data analysis as part of my school project. The initial (and, for me, the most challenging) step is to identify two datasets that are interrelated and can be merged. Subsequently, I will proceed with the analytical work, which does not intimidate me. The datasets need not provide instant, magical solutions to one another, but there should be a logical basis for their integration.
The primary dataset should encompass approximately 20 categories, with a predominant emphasis on categorical data. It should be in a format that can be reasonably connected or merged with the second dataset, which should originate from a different data structure or source.
Honestly, after hours of diligent searching, I find myself somewhat disoriented. I would greatly appreciate any insights or suggestions. Initially, we contemplated working with a dataset pertaining to train delays in Poland, aiming to correlate it with weather data based on the date. Unfortunately, the dataset concerning Polish trains contains only 8 columns.
I will be immensely thankful for any guidance or counsel. Thank you!
submitted by /u/M4tel0te
[link] [comments]