Working on a project for www.BuriedInWork.com. Thanks!
submitted by /u/apzuckerman
[link] [comments]
Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?
Working on a project for www.BuriedInWork.com. Thanks!
submitted by /u/apzuckerman
[link] [comments]
as mentioned in the title I am looking for a pixel arts data set consisting of artwork from old 2D games. if you know any sources which can help create such a data set please comment
submitted by /u/FlowerJaded4071
[link] [comments]
I’m searching for an API, preferably free, or a dataset available for commercial use that provides streaming service information for a particular movie. I’ve come across the ReelGood API, which is priced at $95 per month, and the JustWatch API, but it’s only available for businesses, and you need to reach out to them. Are there any other alternatives you’re aware of? While a free option would be ideal, I’m open to checking out paid options as well.
submitted by /u/-Oake
[link] [comments]
Does anyone know of any data on UK pet owners broken down by demographics? Age/locations/type of pet etc?
submitted by /u/Vox_1610
[link] [comments]
Hey everyone! 👋 Exciting news – we just launched our latest product on ProductHunt:
🚀 Job Postings API: Unlock millions of fresh job opportunities every month!
Check it out here: Job Postings API on ProductHunt
Job postings provide detailed insights into jobs, companies, and technologies. Perfect for powering new job boards, uncovering sales leads, generating market reports, tracking tech trends, and more.
If you need larger datasets for in-depth data analysis or machine learning, we’ve got you covered with job postings from 140+ countries available as datasets or data feeds.
We’d love to hear your thoughts! Feel free to share your feedback. Thanks for checking us out! 🚀
submitted by /u/Techmap_io
[link] [comments]
Looking for pharma data. I literally searched the dark web. Help would be appreciated, thanks.
submitted by /u/Tabasco4realtho
[link] [comments]
Hello, everybody.
I’m interested in datasets from CCTV cameras that contain several kinds of distortions such as underexposure, overexposure, defocus and occlusion. Can someone please advise me non-synthetic datasets with such kind distortions?
submitted by /u/Anxious-Scratch4748
[link] [comments]
Hello all, I am doing a project where I want to find the avg snowfall for each event over the last 15 years for over 400 locations. Any ideas would be appreciated
submitted by /u/johndoe266
[link] [comments]
Hello everyone! I am looking for cat owners keen to help out with a research project, I am studying cats to see whether we can estimate their age just by their voices. If we manage to, this promises significant benefits in veterinary care, can aid rescue centres in creating accurate adoption profiles, and has potential implications for understanding the age demographics of feral cats.
It’s quick and simple – if you’re interested in helping out please send me a message and I will share more info & instructions 🙂
Any contribution is invaluable and will help gain insight into the development of age-related vocalisation patterns in cats!
Thank you!
submitted by /u/Asseflas
[link] [comments]
Hello, I am a PhD student working on a research project on labor economics. I am looking for job posting data (including job descriptions and requirements), especially historical data from the past 5 to 10 years (preferably). Are there any places I can find data like that? I currently know some job listing APIs, but they only have active postings, and some data consulting firms have historical data, but it costs more than 20k 🙁
submitted by /u/nycameraguy
[link] [comments]
Hey r/datasets! I wrote a bit about how we use GitHub to scrape air quality data from openAQ and store the resulting data in the same GitHub repo itself:
https://about.xethub.com/blog/simple-etl-pipelines-git-xet-github-actions
I really enjoyed writing this and it’s quite fun to set up new scrapers in just an hour or so thanks to GitHub Actions.
submitted by /u/semicausal
[link] [comments]
Hi, i need help finding the REDD dataset for a work project on nilm disaggregation but the original link to the dataset here seems to not be available anymore. Can someone help me find it anywhere or send it to me?
submitted by /u/Dandaran
[link] [comments]
Hello!
I have currently finding these datasets to perform machine learning on. I have looked through the government websites and could not find these datasets according to states in Malaysia.
Would appreciate if someone could provide me some idea on where to look for these datasets
submitted by /u/LYJ9339
[link] [comments]
Hi!
I have searched around the web and I can’t find any good dataset for Kaplan–Meier method which I need for school work. I’m looking for datasets where each entry is about an individual and has info about the start and end of some event measurement. In principle, I don’t care what the data should be about, but prefer that it isn’t about the survival rate of people.
So far I have searched for:
Tried to find a dataset about marriages (but usually no label about the end of marriages)hod. In principle, I don’t care what the data should be about, but prefer that it isn’t about the the survival rate of people. Tried to find a dataset about marriages (but usually no label about end of marriages) Unemployment duration
submitted by /u/HBlackwooder
[link] [comments]
Hello, Please I want your help with an issue in a data science project… In the step of handling missing values, I handle continuous data by replacing it with the mean, but for time data, I don’t think it’s the right approach. I found out that there are two ways to do it: Forward Fill (ffill()) or Backward Fill (bfill()) and Linear Interpolation. However, I’m still wondering which one to use because it’s the first time I’m dealing with null values for time data.
submitted by /u/t_abdessamad
[link] [comments]
Hey all,
I’ve been looking for a good source of pre-sanitized, collated social platform data organized by topic to run my LLM on. Wondering how people find such datasets (Google, Reddit, scholarly articles, etc) / if anyone has had luck with any specific providers recently. Thanks!
submitted by /u/mstahl23
[link] [comments]
So i have been trying to train model (thin plate spline motion model and try to fine tune it), but i am not been able to download voxceleb dataset too. Any tips? Or links?
submitted by /u/IntelligentUse5990
[link] [comments]
I’m looking for dataset similar to google trends (Time series behavioral data) but not on a relative scale.
I am not looking for any specific data, anything that is temporal and which involves human behaviour should do, any suggestions?
submitted by /u/No_Albatross8524
[link] [comments]
Hello, do you guyz know where I can find data on the number of console units sold from each year from the inception of the first console until now?? I need this for my college project. I wanted to make a bar chart race animation
submitted by /u/wilqqqq
[link] [comments]
Ive started playing around with custom AI models because I was bored and it looked fun from things I’ve seen in YouTube. I’ve created characters, tested different models and had loads of fun learning and playing. But now I want to “fine tune” the local model I’m using on specific data for it to pull from.
The overall goal is to have this chatbot assist me in writing wiki articles and events for an online roleplay thing, I want it to have access to all 7,567 already created articles that the community has made so it can pull information and make enhance my writing and suggestions with cannon responses.
How…. how would I do that? As in get the data and put it in a format that could be used for fine tuning. The YouTube tutorials I’ve seen generally focus on “reverse engineering” midjouney prompts or medical questions.
submitted by /u/Jakob4800
[link] [comments]
I usually study on data that is ready in the server so I have no idea how to get it from StatCan. I read their website, but it might be I’m not a dev so … still have no clue at all.
For instance, I want the report of persistence and graduation of doctoral degree students, within Canada, by student characteristics ( including sex, age, marital, father/ mother occupation, scholarship, funding, location, household income…. ) for a period.
Where I can get all the tables I need? I would prefer the flat files CSV.
I downloaded files from website, but it’s not data same as what I got from Kaggle.
TIA!
submitted by /u/Whatswrongwithman
[link] [comments]
I’m building a film recommendation system, I have a large csv file with film data scraped from the IMDB dataset which I plan to use to build the machine learning model, at the same time I’m using theMovieDB api to get some extra film details like plot summary.
I’m using around 300,000 films from IMDB, and some records are missing certain data, like editor, cinematographer etc., and I’m not sure how much more data each dataset has on a film compared to the other.
Would it be better to consistently use TMDB api to display film data on the frontend, and only use IMDB to build the ML model, or consistently use the IMDB csv throughout my system for the model and for displaying film details. Alternatively I could cross-reference both sources but I’m wary of contrasting data in both datasets.
Any advice is appreciated
submitted by /u/wobowizard
[link] [comments]
Hey, are there any datasets available of emotion recognition in populations with impairment in identifying facial emotions correctly? Whether autism, schizophrenia, etc., doesn’t matter.
Thanks!
submitted by /u/tofoyer
[link] [comments]
Hello
I want to train an AI Model with Geotagged Images. Where can I find such Dataset?
submitted by /u/Pierruno
[link] [comments]
I’m currently working on ISIC2020 dataset and I’m trying to find a dataset that contain the information about the ROI of each image. Any ideas ?
submitted by /u/Impossible_Lack3452
[link] [comments]