For quite sometime i have been looking for facial video dataset which is labeled by the mental health disorder.
i want to build a deep learning model using this data.
submitted by /u/Intrepid-Walk1227
[link] [comments]
Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?
For quite sometime i have been looking for facial video dataset which is labeled by the mental health disorder.
i want to build a deep learning model using this data.
submitted by /u/Intrepid-Walk1227
[link] [comments]
Hi, I’m currently working on a capstone project focusing on the integration of AI in Human Resources, particularly its impact on recruitment, workforce management, and employee retention. I am looking for relevant datasets that would help analyze the role of AI in HR processes. Could you suggest any sources or repositories where I can find such datasets?
Thank you!
submitted by /u/Used_Confection1949
[link] [comments]
Anyone knows if there’s an API to call ocean data?
Currently I have multiple shipments which I have to manually check status frequently. It takes so much time and energy. I was thinking if I have the Vessel# and the ocean dataset, I can make a dashboard overview. Anyone have done this before?
submitted by /u/Pineapple_Lina
[link] [comments]
Does anyone know of any particular studies or data sources for student outcomes by housing instability? Particularly in GA.
Thank you so much!!
submitted by /u/aFeelingProcess
[link] [comments]
My mom is in the ICU with a severe anoxic brain injury. She is currently sustained by a ventilator and feeding tube. She is no longer on any sedative and has not shown signs of waking.
My family is considering further care options and I would like all the data I can find on those options. As of now those options are transferring to another hospital, transitioning to Long-Term Acute Care (LTAC), and pulling the plug.
I have serious concerns about quality of life for her and my family. My family responds best to data supported arguments, so I am looking for relevant data sources to validate or assuage my concerns.
I know this is heavy so thank you for reading this far and please send anything you think might be relevant.
submitted by /u/rover_G
[link] [comments]
Hi all,
Do you know if there are free datasets about bees and honey?
Thank you
submitted by /u/annleemar
[link] [comments]
Hello,
As a learning project I’m gonna build a small mobile app to track calories intake through the day, i’ll need a database with nutritional values to do so.
I found USDA and Open Food Facts db dumps but it’s more about products or meal informations and not generic food like plain chicken or white rice.
In my case I want to track calories of unprocessed food, as the vast majority of processed food already have nutritional facts printed on.
I plan to do this in MongoDb or Postgres, I can even take a CSV file if it has the type of data i’m looking for.
submitted by /u/JoeTheOutlawer
[link] [comments]
I’m working on a machine learning project and I’m using MRI images from the ADNI dataset for Alzheimer’s. Unfortunately I downloaded the files and I’m very confused about the structure and the meanings of the folder names. If anyone has any experience working with this dataset or something similar I would be very grateful for their help.
submitted by /u/Xcuse_Me_Sir-
[link] [comments]
Iam working on a project to find the meals you are looking for and iam struggling to find good datasets.
The datasets i want need to contain detailed ingredients also maybe calories if possible.
submitted by /u/Raditzer
[link] [comments]
Wondering if this is attainable. Simplified example:
State A is 80% white and 20% black.
White: 20% no HS, 20% HS, 40% bachelors
Black: 5% no HS, 5% HS, 10% bachelors
Thank you!
submitted by /u/marketarian
[link] [comments]
Creating a cool project to track migration patterns to assess what’s happening with some housing markets.
submitted by /u/Bubbly-Sentence-4931
[link] [comments]
I’m a B2B marketer trying to figure out what type of content resonates most with data-driven professionals. Do they prefer videos, blogs, infographics, case studies, eBooks, whitepapers, webinars, or something else? Would love your insights!
submitted by /u/EntertainerKey7709
[link] [comments]
hello everyone I’m thinking to develop an plant app but I couldn’t find well rounded plant datasets mainly for plants inside house I searched on Kaggle but most of datasets are vegetables that’s fine too but I’m looking for more to plants that have small and home plants type if you have any link to something like that I really appreciate it
submitted by /u/mibappeferto
[link] [comments]
hi I’m planning to do a side project about relationship advice for women I’m looking for examples for any research or datasets about advice or behaviors in relationships I didn’t find in Kaggle or internet but maybe that’s related to I dont know what to looking for so if you have any dataset or know what to type for this I really appreciate it
submitted by /u/mibappeferto
[link] [comments]
Guys, I want a dataset for AI mock interview website. Using it , I want to measure the confidence level and fluency of the users. The only one I have found so far is the MIT dataset. Is there any other dataset available?
submitted by /u/laiba61
[link] [comments]
Is data containing per part component servicing/replacement of automobiles and motorcycles available? If yes, where can I access them?
Example: date serviced= 01/01/2020, part replaced = front driver’s side shock absorber, odometer during service = 20000kms.
submitted by /u/officialisma
[link] [comments]
There’s more of like two parts with this question, so yeah.
First question: Let’s say I want to train a ML model to detect a basic disease based off an image, say a brain. I can find a large dataset on regular. Then, I find multiple smaller datasets with not as many brain with disease images. Thus, I take all these smaller datasets of brains with diseases, combine them into one, then use this new dataset (brain with diseases) and the other dataset (large dataset with regular brain), and use them for classification. Is this possible?
Second question: can we extend this to multiple classes? Say we have a disease that requires many conditions/symptoms to detect. Can I find these conditions from multiple data sets (One dataset contains characteristics, one dataset contains duration, one dataset includes images, etc) and essentially merge them all into one as long as they classify the same disease??
submitted by /u/ResearchingTinBot
[link] [comments]
Does anyone have a working link to the million songs dataset? The original one that was hosted on aws (https://aws.amazon.com/datasets/million-song-dataset/) does not exist anymore. Even if you have a copy somewhere please do share. This is for a class project amd I’d be grateful for any help.
submitted by /u/Aspiring_DE
[link] [comments]
For my ML project I need the scan files or pdf of banking statements to train model. Maybe synthetic data will do, the main thing is that I need them in diversity.
Business banking statement are needed too.
submitted by /u/i_kramer
[link] [comments]
My question is regarding this Formula 1 dataset
https://www.kaggle.com/datasets/rohanrao/formula-1-world-championship-1950-2020
It contains multiple csv files- circuit data, driver IDs, lap times, results etc. Im currently trying to merge these into a single usable csv. I’m very new to data analysis/coding so is this something that is possible? If it is, how would I go about doing that? Appreciate the help!
submitted by /u/FalconStone95
[link] [comments]
Hi everyone,
I’m currently working on an assignment for a Junior Data Engineer role, and I could use some guidance. The task involves merging three datasets from different sources (Facebook, Google, and Company Website) into one comprehensive dataset. The columns I’m focusing on are:
Domain (most reliable) Phone Number (second most reliable) Name Category Address
I’ve mostly cleaned the datasets, but I need to merge them accurately. My main goals are to:
Merge the datasets using one or two columns (Domain and Phone Number). Ensure no overlap in information and that each row complements itself to create the most accurate and reliable data.
Could anyone suggest the best steps to take for this process? Should I use tools like Power Query or MySQL? Any recommendations for tutorials or YouTube videos would also be greatly appreciated.
Thanks in advance for your help!
submitted by /u/FortaDeMunca
[link] [comments]
Can anyone please tell me where can I find data set of US across all 50 years of this century. Particularly I am looking for Farenheit, avg per month or day for all states, doesn’t have to be for each city. I couldn’t really find a good one online
submitted by /u/Boring-Baker-3716
[link] [comments]
Hello everyone, I would like to work on my Data analysis skills and am in the hunt for a few datasets that I could work on. I want to work on my Excel, SQL and Tableau skills. I would love to get hold of some datasets that start from extremely easy to an intermediate level so that I can improve my skills gradually. Any reccomendations on a data viz tool to use and anything else is highly appreciated too. Thank you!
submitted by /u/Shoddy-Scallion4712
[link] [comments]
It would be really helpful if someone can share some sources for fetching real-time and historic data for blockchain metrics, the following parameters to be specific:
Average block size
Number of user addresses
Number of transactions
Miners’ revenue
The data should preferably begin from the year of 2017.
submitted by /u/Mustaksi
[link] [comments]
I am trying to find a way to find all bills that were in congress (senate and house) with their information (such as title of the bill, what the bill is about, etc.) and find the distribution of votes on each bill by the rep and their state
I looked into
1) https://api.congress.gov/#/bill/bill_list_all – seems like you can find a specific bill, but there is no way to search and download all say the 118 2023-2024 about 2000 bills at once. I was also unable to find vote information
2) https://projects.propublica.org/represent/ – no longer working
3) https://www.govtrack.us/congress/votes – for example https://www.govtrack.us/congress/votes/118-2024/h328#details . This option seems to have the information I am looking for but they are no longer allowing bulk data.
for 3 I guess I can brute-force it with getting all the urls from the html, then write a script to visit all urls for each page and try to parse the html data into a json/xml of sort, but that seems not great
would love to know if anyone has any suggestions
submitted by /u/psychic_shadow_lugia
[link] [comments]
I am trying to further my excel skills, eventually also python, power bi and sql. I just find it fun and i think its good skills to have.
My question is. What are some of the first things to examine after getting a dataset and cleaning it?
Im working with some datasets from kraggle.
Are there some things the experienced people always do? Like make a top 5 of valuables, or of top sellers etc, or is it something completely different that i am skipping?
submitted by /u/FuegoFlamingo
[link] [comments]