Hi guys! I am new to the data world and I was wondering if there are websites that share good datasets or data analysis publicly. Thanks!
submitted by /u/Adlpg
[link] [comments]
Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?
Hi guys! I am new to the data world and I was wondering if there are websites that share good datasets or data analysis publicly. Thanks!
submitted by /u/Adlpg
[link] [comments]
Hi all, I’m looking for datasets with posts, their comments and reactions (likes, dislikes, etc.). Ideally for a platform like Twitter/X or LinkedIn. Are there any datasets? If not, is it feasible to try and scrape Twitter/X or LinkedIn to collect the data? Cheers
submitted by /u/Thick-Ad3346
[link] [comments]
Hello everyone,
I’m looking for bond data on municipalities, specifically the ratings of all municipal bonds in the United States. It would be particularly useful if this data is available as panel data, covering ratings over time. I have found this data at the state level and have seen data that includes only municipalities with AAA ratings, but I am looking for data that includes all municipalities in the United States.
Thank you!
submitted by /u/EconGesus
[link] [comments]
I’m looking for a set of data about sensory impairment amongst internet users.
Sadly google gives me nothing and I don’t really know where to look.
Do you know any datasets like this? Preferably in percent but I appreciate everything. Thanks in advance
submitted by /u/wizarddos
[link] [comments]
README file reads:
This is a list of topic-centric public data sources in high quality. They are collected and tidied from blogs, answers, and user responses. Most of the data sets listed below are free, however, some are not. This project was incubated at OMNILab, Shanghai Jiao Tong University during Xiaming Chen’s Ph.D. studies. OMNILab is now part of the BaiYuLan Open AI community.
GitHub repo: https://github.com/awesomedata/awesome-public-datasets
submitted by /u/alamiin
[link] [comments]
submitted by /u/AdDifferent9401
[link] [comments]
I am doing some ML model classification experiments and really want to operate on realistic employee calendar data, basically like a dump of a company’s outlook calendar with the meeting times and titles, attendees, and the employee’s role. I don’t care if its old or synthetic, just need something with realistic patterns and distributions. Ideally a couple months worth and at least 100 employees. Anyone know where I might find something like this?
submitted by /u/madmax_br5
[link] [comments]
CSV, with links and dates. Please feel free to upload or share.
submitted by /u/AdDifferent9401
[link] [comments]
Hi
I hav been searching for dataset has Eye color by country. I have seen one but only have two measures, Blue and Brown by country.
I’m looking for one at lease have 6 measures like Brown, Blue, Green, …etc
Can anyone help me find one in CSV/EXCEL file.
TIA
submitted by /u/Hadi_3812
[link] [comments]
Hey, I’m looking for data on investment/capital expenditure in critical minerals/energy transition minerals. I would appreciate any help, thank you!
submitted by /u/tanmaythomas
[link] [comments]
Is there any public datasets that contain individual products with things like their title and description and their daily sales data over the course of the year
submitted by /u/OneZone1923
[link] [comments]
Good morning. I have two data sets that I’d like to relate. One set has US state and county FIPS codes and the other set has US state FIPS and school district codes. The data sets are from 2023. I’d like to find some way to connect the school district codes and county FIPS codes. Would anyone happen to know where I could find this information? Thanks.
submitted by /u/bphillips1105
[link] [comments]
I have a project/assignment coming up about time series analysis and forecasting at my school. Could you please suggest me some time series data sources with large, complex and many attributes/variables datasets.
Many thanks
submitted by /u/Sensitive_Web6152
[link] [comments]
Hi, I am looking for UK datasets which are related to grocery shopping or plastic waste generated through grocery shopping or even fuel consumption per household for grocery shopping. I want to analyze the of environmental impact from grocery shopping to provide inputs so as to reduce it.
submitted by /u/naive_byes
[link] [comments]
Hello! I’m planning a project concerning substance abuse and a variety of factors around it like treatment and its effects on people’s lives [currently in the frameworks of it as I’m basing my approach off of the data available so not much more information available unfortunately] and was wondering if anyone had any dataset/database recommendations for it? I’ve been searching far and wide and haven’t found anything yet, so I’m pretty desperate. Thanks!
submitted by /u/InfiniteQuestions101
[link] [comments]
I am building a Grocery type app, and I am looking for a dataset that contains as close to all the grocery items that you might find at Walmart or some other supermarket. I simply need would need the item name and an image of the item. Does anyone know where I could find this kind of dataset?
I have tried sites like Kaggle, but I can’t seem to find any that include images.
submitted by /u/MovesLikeJagr28
[link] [comments]
Can someone assist me in finding out the unit of this water requirement column. I have made a model that predicts the Water requirement but now that i have to map that to hardware. I don’t know what is its unit so I can’t determine the duration of water. HELP
submitted by /u/Sanguinestan
[link] [comments]
I am not sure this is kosher but it seems really interesting
submitted by /u/cavedave
[link] [comments]
I needed untidy dataset.
One of the selected data sets must not follow at least of the tidy data principles. In tidy data where each variable must have its own column or Each observation must have its own row.
submitted by /u/Front-Benefit8232
[link] [comments]
Hello everyone,
I am a college student currently working on a thesis about machine learning, specifically focused on identifying Indian and Carabao mango leaves with and without anthracnose disease using a CNN model.
At this stage, I need a large number of datasets, likely 1000 and more images, from the mentioned varieties of mango. I am looking for datasets of leaves affected by anthracnose disease as well as healthy leaves from both Carabao mango and Indian mango varieties.
I am reaching out in the hope that you can help us find these datasets, as they will serve as the primary data for our thesis.
Thank you very much for considering my request.
submitted by /u/chadmomentgiga
[link] [comments]
Hi guys,
I’m currently working on a project to enhance the detection and prevention of cryptocurrency scams and phishing attempts. A crucial part of this project is identifying and analyzing scam crypto wallets that have been reported by users and security experts.
I am looking for a reliable and up-to-date dataset that contains information about cryptocurrency wallets reported as being involved in phishing or scam activities. Ideally, this dataset should include details such as:
Wallet addresses Type of scam or phishing attempt
If anyone knows where I can find such a dataset or has resources that could help, I would greatly appreciate your assistance. Open-source datasets or any repositories maintained by security communities or organizations would be extremely helpful.
Thank you in advance for your help!
submitted by /u/Funny-Accident-5612
[link] [comments]
Hello everyone,
I am currently working on a machine learning, specifically focused on identifying Philippine Indian and Carabao mango leaves with and without anthracnose disease using a CNN model.
At this stage, I need a large number of datasets, likely 1000 and more images, from the mentioned varieties of mango. I am looking for datasets of leaves affected by anthracnose disease as well as healthy leaves from both Carabao mango and Indian mango varieties.
Thank you very much for considering my request.
submitted by /u/chadmomentgiga
[link] [comments]
Hi, I am learning my companies data management system from scratch, and am trying to figure out if I copy things FROM excel INTO access in the Query section or the Table section? I am pretty sure table but want to be sure. Thanks!
submitted by /u/suzimakesthings
[link] [comments]
Hi everyone,
Maybe someone knows some open access datasets on suicides committed in the U.S. (or number of death if there is variable for the cause of death) per year (from about 2015 to at least 2020) and per state. The more addition variables there are (such as gender, age, employment status, etc.), the better.
Hope that maybe some of you have seen something of this sort🙏
submitted by /u/dollala
[link] [comments]
We are a UK FinTech company and have launched a new product that automatically extracts data (including handwritten) from 25 million filings for millions of UK companies. In addition, there are insights and easy-to-consume charts and tables. The automatically extracted data includes/ provides the following data for 2m+ private companies:
An industry-first price-per-share and last-round-valuation (market capitalisation) chart Capital structure, shareholding, and the change in shareholding Equity fundraising trends in the UK Top fundraisers and investors in the UK
I would like to hear your feedback on our UK company insights data 🙂
submitted by /u/olive_er
[link] [comments]
Hi guys, I’ve been working on a fine tuned llama3 for quite some time now and want to expand the dataset. Are there any good automated solutions to generate these datasets from pdf or html and can these be augmented automatically?
Thanks so much in advance
submitted by /u/OkVegetable2512
[link] [comments]
I’m selling a high quality dataset that includes(Email address, Full Name, Phone number, Age, Location(country), Gaming Platforms Owned (e.g., PC, PlayStation, Xbox, Android, etc.), etc.)
Price: $1.20 per individual ($120 total)
Format: CSV, Excel and PDF
Delivery: Secure download link or Direct file
DM If you are interested
submitted by /u/Money_Ad3408
[link] [comments]
I would like to invite all of you kindly visit, open and upvote this dataset.
If you found it valuable then download it and leave a comment.
Your support and appreciation means a lot.
Link: https://www.kaggle.com/…/uk-gender-pay-gap-data-2018-2023
submitted by /u/Umer_Haddii
[link] [comments]