I was wondering if there were any job datasets with statistics about employment rates and types of jobs recent graduates get. The more variables for a data point, the better.
submitted by /u/DetachedOptimist
[link] [comments]
Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting; I’m interested. Maybe they know where the chix are. But what do they need it for? World domination?
There are legitimate gripes about Java syntax, to be sure—the same is true of JavaScript and every other language. As Bjarne Stroustrup once said, “There are only two kinds of languages: the ones people complain about and the ones nobody uses.”
The JVM
submitted by /u/Upbeat-Ad-2183
[link] [comments]
I’m taking a programming module in my Economics course and I need an easy csv dataset to analyse as my first one for some coursework.
I have absolutely zero experience with this so a relatively simple one would be very nice thanks!
submitted by /u/As14nn
[link] [comments]
Working on a side project, but can’t seem to find this data. It’s weird that something that should obviously be out there is behind paywalls. Is there a free source I can get this data from?
submitted by /u/Ill_Fisherman8352
[link] [comments]
On the hunt for dollar store data from the past 5 years, including opening year. Preferably including Dollar General and Dollar Tree, but either/or is fine. I’m able to find some data through SNAP and BluePages, but they don’t have any information on when the store was opened. Any ideas?
submitted by /u/Rough-Fail-3211
[link] [comments]
Doing a project where we are finding data about airlines. I need a dataset with complex demography of passengers from the years 2019-2022. This primarily focuses on age, gender, and possibly nationality. It has been a pain in the ass to find anything that specific, and I’m guessing it is hard to find because most datasets have limited information, and others may have restrictions on how data can be used. If you do find anything, please comment.
submitted by /u/ShrimpChipCEO
[link] [comments]
Preferably the West Pomeranian University one, but it could be anything. I need a way to download and access it.
submitted by /u/Prudent_Country4074
[link] [comments]
The subject for my project is predicting forest fires and I am looking for a dataset similar to the one shared on Kaggle, but I can’t find one. I looked on Earth Engine and found some datasets, but they don’t provide dates and they are ImageCollections, not CSV. I am familiar with machine learning and with cleaning CSV datasets after turning them into dataframes, but not at all familiar with ImageCollections. So basically my question comes down to two paths:
I use the datasets from Earth Engine, but I don’t know how to work with them. So perhaps someone could give me some tips on how to predict forest fires with them.
Can someone guide me towards a suitable dataset to predict forest fires?
I appreciate all input!
submitted by /u/Ripplekipple
[link] [comments]
Are there any good tools/techniques for capturing workflow data, specifically to help train an LLM? Use case is accurate question answering around processes/best practices inside an organization.
Is this where something like a UiPath would be necessary?
submitted by /u/Constant-Potato-4712
[link] [comments]
I started on this idea of finding a comprehensive book dataset which for sure has a description and more than one genre (makes things more realistic), since I wanted to cluster them based on similarity to find some good ones to read for myself 😉 The only ones I could find on Kaggle were ones with a single genre label, so I collected it on my own.
So sharing it here in case it helps someone else too:
[Dataset](https://www.kaggle.com/datasets/ishikajohari/best-books-10k-multi-genre-data)
The data was collected from Goodreads from their list – Books That Everyone Should Read At Least Once and contains Description, Ratings and Multiple Genre classifiers.
submitted by /u/ishika_jo
[link] [comments]
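For anyone who wants to try the similarity idea on multi-genre data like this, here is a minimal sketch using Jaccard similarity on genre sets. The titles and genres are made up for illustration, and the real dataset's column names and genre values are not assumed here; this is just one simple way to compare books, not necessarily what the poster did.

```python
def jaccard(a: set[str], b: set[str]) -> float:
    """Share of genres two books have in common, from 0.0 to 1.0."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


# Hypothetical rows: title -> set of genre labels.
books = {
    "Book A": {"Classics", "Fiction", "Historical"},
    "Book B": {"Classics", "Fiction", "Romance"},
    "Book C": {"Science Fiction", "Dystopia"},
}


def most_similar(title: str, catalog: dict[str, set[str]]) -> str:
    """Return the other title whose genre set overlaps most with `title`."""
    others = (t for t in catalog if t != title)
    return max(others, key=lambda t: jaccard(catalog[title], catalog[t]))


print(most_similar("Book A", books))
```

A pairwise similarity like this can feed a clustering step later; descriptions would need a text-based measure instead.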
I’m wondering if there is a free aviation API to track arrivals and departures to a set airport. It would collect: Callsign, Aircraft Type, Gate, and Arrival/Departure airport, then plug that into a Google Sheet.
Currently I run this process manually by looking at FlightAware data, but if I can automate this for free that would be great!
submitted by /u/ModeratorOfNothing
[link] [comments]
From the assignment: “Source a data set with regard on equipment failure on the internet that can help you to illustrate the difference between causation and correlation.”
submitted by /u/Strong_Papaya94
[link] [comments]
I need each country’s population, area (preferably in square miles), GDP, and year of founding. Just raw data.
submitted by /u/PotatoSacGamingYT
[link] [comments]
I need earnings estimates, preferably multiple years into the future. Yahoo Finance only gives 2 years, and Zacks too.
submitted by /u/MrZwink
[link] [comments]
I’m working on a project that will involve identifying maintenance issues based on a worm’s eye view of the underside of vehicles. Any suggestions on where to look? Thanks in advance.
submitted by /u/mikebattaglia_com
[link] [comments]
In March 2023, a Python scraper was used to collect a dataset that comprises over 8,000 beauty products available on the Sephora online store. The dataset includes comprehensive details about the products, such as their names, brands, prices, ingredients, ratings, and all other relevant features.
Source of the dataset: https://www.kaggle.com/datasets/nadyinky/sephora-products-and-skincare-reviews
To view the dataset: https://app.gigasheet.com/spreadsheet/Sephora-Cosmetic-Reviews/e74caa44_2abf_4f49_bc0d_3477fcb1663e
submitted by /u/sheetheadd
[link] [comments]
Hi all,
Wondering if anyone has any ideas for accessing or gathering data on Nextdoor and Ring user demographics – specifically in four target cities. This is a part of a larger project that is examining the effects of technology on neighborhoods. Been beating my head against a wall trying to figure this out myself and decided I would ask y’all.
Thanks!
submitted by /u/FrightFeats
[link] [comments]
All the publicly available datasets I could find like the TCGA, GEO, NCBI don’t have the entire DNA sequence of a patient that is positive for cancer.
submitted by /u/AdagioJump
[link] [comments]
Huggingface links: 90M tweets, 150K users
It is introduced in [this blog](https://medium.com/@enryu9000/fun-with-large-scale-tweet-analysis-783c96b45df4).
Compared to some of the existing twitter datasets, this one has many tweets per user, which can be useful for some types of analysis.
submitted by /u/enryu42
[link] [comments]
Hi all,
I’m currently trying to merge 3 lists, together over 30,000 rows of companies. To make sure I can delete the duplicates, I want to merge the companies into one list with one matching cell for all of them: the exact address or coordinates.
I’ve tried using the Bing Maps API, but after checking, the results don’t show up correctly. What I can do is go into Google Maps and manually put in the company, state & city and then copy the address, but doing this for 30,000 rows will take me years.
What would my best option be to do this? I’m advanced with Zapier & Power Automate.
Many thanks!
submitted by /u/Florent-Lesage
[link] [comments]
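One possible angle before geocoding anything: since each row already has company, state, and city, a normalized key built from those three fields may catch many duplicates on its own. The sketch below assumes rows are dicts with hypothetical field names `company`, `city`, and `state`, and the suffix list is an assumption to extend as needed. It will miss duplicates that differ in more than casing, punctuation, or legal suffix.

```python
import re

# Assumed list of legal suffixes to ignore when comparing names.
SUFFIXES = {"inc", "llc", "ltd", "corp", "co", "gmbh"}


def match_key(name: str, city: str, state: str) -> tuple[str, str, str]:
    """Build a comparison key that ignores case, punctuation, and suffixes."""
    words = re.sub(r"[^a-z0-9 ]", " ", name.lower()).split()
    core = " ".join(w for w in words if w not in SUFFIXES)
    return (core, city.strip().lower(), state.strip().lower())


def merge_lists(*lists: list[dict]) -> list[dict]:
    """Merge several row lists, keeping the first row seen for each key."""
    merged: dict[tuple[str, str, str], dict] = {}
    for rows in lists:
        for row in rows:
            key = match_key(row["company"], row["city"], row["state"])
            merged.setdefault(key, row)
    return list(merged.values())


# Made-up example rows.
list_a = [{"company": "Acme, Inc.", "city": "Austin", "state": "TX"}]
list_b = [
    {"company": "ACME Inc", "city": "austin ", "state": "tx"},
    {"company": "Globex LLC", "city": "Reno", "state": "NV"},
]
print(merge_lists(list_a, list_b))
```

Only the rows left ambiguous after this pass would then need an address lookup, which shrinks the manual work.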
Additionally a histogram of the most common individual colors and pairings (and trios, etc. if some schools have 3+ colors)
Edit: The intention of the post is for anyone to submit to r/dataisbeautiful, but the chain of related subs leads me here if I don’t have that data to begin with.
submitted by /u/Exaskryz
[link] [comments]
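Once the school color data exists, the counting part is straightforward. This is a rough sketch with made-up schools and colors (the input shape, a mapping from school name to a list of colors, is an assumption); it counts individual colors and full color combinations regardless of the order they are listed in.

```python
from collections import Counter

# Hypothetical input: school name -> list of its colors.
schools = {
    "School A": ["blue", "gold"],
    "School B": ["gold", "blue"],
    "School C": ["red", "white", "blue"],
}


def color_counts(data: dict[str, list[str]]) -> tuple[Counter, Counter]:
    """Count single colors and whole combinations (pairs, trios, ...)."""
    singles: Counter = Counter()
    combos: Counter = Counter()
    for colors in data.values():
        unique = sorted(set(colors))  # order-independent, no repeats
        singles.update(unique)
        combos[tuple(unique)] += 1
    return singles, combos


singles, combos = color_counts(schools)

# Plain-text histogram of individual colors, most common first.
for color, n in singles.most_common():
    print(f"{color:<6} {'#' * n}")
```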
Hi,
I’m looking for either easily scrapable data or a publicly available database for the real-world gaming performance of computer components. I don’t want synthetic benchmark performance, but the actual frames per second that the computer achieves with a given set of components.
I’ve looked quite extensively online and have mainly come up empty-handed.
For context, I only really need the CPU and GPU the computer used when getting the performance it achieved.
Here’s what I’ve managed to find and their respective issues.
The “UserBenchMark” FPS page. This has data on around 300 games and many samples, however it is quite difficult to crawl due to its page layout seemingly only showing the average FPS a paired GPU and CPU have once you’ve selected that pair.
An OPENML database that seemingly has managed to compile a lot of data from UserBenchmark, however apparently for only around 15 games.
Tom’s Hardware benchmarks. Great for benchmarks on the newest tech, but the results are typically presented as part of an image which I’m not super well versed in getting data from.
If anyone has any good datasets for me to use to gather this information, I’d greatly appreciate the info. Thanks!
submitted by /u/Rejesto
[link] [comments]
I work extensively with addresses for mailings, and often have to look them up on websites for corrections if they don’t pass USPS CASS certification.
The existing sites are awful, and usually want to sell you background checks (and will try to trap you into it–“wait 3 minutes while we display a fake progress bar.”). Where can I find US address data (including individuals’ names and prior address history, if possible)? I envision building a clean, easy to use site with minimal ads and zero gimmicks.
submitted by /u/onebiscuit
[link] [comments]
Hello,
For school I need to perform a T-test.
I just need to use real-world data. I know how to perform a t-test, I just can’t find any suitable data to perform a t-test!
Can someone please link me to some suitable data? Thanks!
I’m sorry if this seems low-effort, but I assure you I’ve looked! Most of the data sets I’ve seen (for example on the WHO website) only provide ratios, and not means!
Please help!
submitted by /u/Even-Substance
[link] [comments]
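For anyone with the same problem, it may help to spell out what shape of data a two-sample t-test needs: either the raw observations in each group, or each group's mean, variance, and sample size. Ratio-only tables lack the variance, which is why they don't work. Below is a small sketch of Welch's t-test (the unequal-variance variant, chosen here as an assumption, not necessarily what the course requires) on made-up numbers, using only the standard library.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(a: list[float], b: list[float]) -> tuple[float, float]:
    """Return Welch's t statistic and its approximate degrees of freedom."""
    va = variance(a) / len(a)  # sample variance of group a over its size
    vb = variance(b) / len(b)
    t = (mean(a) - mean(b)) / sqrt(va + vb)
    # Welch-Satterthwaite approximation for the degrees of freedom.
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df


# Made-up measurements for two groups.
group_a = [1, 2, 3, 4, 5]
group_b = [2, 3, 4, 5, 6]

t, df = welch_t(group_a, group_b)
print(f"t = {t:.3f}, df = {df:.1f}")
```

Any dataset with one numeric column and one two-level grouping column (for example, a measurement split by treatment and control) fits this shape.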
Need a fundus photograph dataset for developing a DL model to detect diabetic retinopathy
submitted by /u/CreativeGold5336
[link] [comments]
Data analysis project!
I’m looking for a dataset about learning systems in the mother tongue, and how that affects the students compared to other kids who are studying in another language (English). If you have a dataset, please share it with me!! It’s for my data analysis project, but I couldn’t find any. I’m not sure if the way I’m searching is wrong or if I’m not being specific enough when looking for the data. Please help!!
I need it today!!!
submitted by /u/uu3333
[link] [comments]