Category: Datatards

Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?

Looking For Datasets Related To AI And HR Integration

Hi, I’m currently working on a capstone project focusing on the integration of AI in Human Resources, particularly its impact on recruitment, workforce management, and employee retention. I am looking for relevant datasets that would help analyze the role of AI in HR processes. Could you suggest any sources or repositories where I can find such datasets?

Thank you!

submitted by /u/Used_Confection1949
[link] [comments]

Looking For Datasets On ICU And LTAC Survival, Relapse, Infection, Etc. Rates (any Format)

My mom is in the ICU with a severe anoxic brain injury. She is currently sustained by a ventilator and feeding tube. She is no longer on any sedative and has not shown signs of waking.

My family is considering further care options and I would like all the data I can find on those options. As of now those options are transferring to another hospital, transitioning to Long-Term Acute Care (LTAC), and pulling the plug.

I have serious concerns about quality of life for her and my family. My family responds best to data supported arguments, so I am looking for relevant data sources to validate or assuage my concerns.

I know this is heavy so thank you for reading this far and please send anything you think might be relevant.

submitted by /u/rover_G
[link] [comments]

Free SQL/noSQL Database/CSV About Generic Food Nutritional Values

Hello,

As a learning project I’m gonna build a small mobile app to track calories intake through the day, i’ll need a database with nutritional values to do so.

I found USDA and Open Food Facts db dumps but it’s more about products or meal informations and not generic food like plain chicken or white rice.

In my case I want to track calories of unprocessed food, as the vast majority of processed food already have nutritional facts printed on.

I plan to do this in MongoDb or Postgres, I can even take a CSV file if it has the type of data i’m looking for.

submitted by /u/JoeTheOutlawer
[link] [comments]

Structure Of ADNI Alzheimer’s Dataset

I’m working on a machine learning project and I’m using MRI images from the ADNI dataset for Alzheimer’s. Unfortunately I downloaded the files and I’m very confused about the structure and the meanings of the folder names. If anyone has any experience working with this dataset or something similar I would be very grateful for their help.

submitted by /u/Xcuse_Me_Sir-
[link] [comments]

Dating/relationship Advice Or Info Dataset

hi I’m planning to do a side project about relationship advice for women I’m looking for examples for any research or datasets about advice or behaviors in relationships I didn’t find in Kaggle or internet but maybe that’s related to I dont know what to looking for so if you have any dataset or know what to type for this I really appreciate it

submitted by /u/mibappeferto
[link] [comments]

Merging Datasets For One Single Project?

There’s more of like two parts with this question, so yeah.

First question: Let’s say I want to train a ML model to detect a basic disease based off an image, say a brain. I can find a large dataset on regular. Then, I find multiple smaller datasets with not as many brain with disease images. Thus, I take all these smaller datasets of brains with diseases, combine them into one, then use this new dataset (brain with diseases) and the other dataset (large dataset with regular brain), and use them for classification. Is this possible?

Second question: can we extend this to multiple classes? Say we have a disease that requires many conditions/symptoms to detect. Can I find these conditions from multiple data sets (One dataset contains characteristics, one dataset contains duration, one dataset includes images, etc) and essentially merge them all into one as long as they classify the same disease??

submitted by /u/ResearchingTinBot
[link] [comments]

Combining Multiple Files Into A Single Csv

My question is regarding this Formula 1 dataset

https://www.kaggle.com/datasets/rohanrao/formula-1-world-championship-1950-2020

It contains multiple csv files- circuit data, driver IDs, lap times, results etc. Im currently trying to merge these into a single usable csv. I’m very new to data analysis/coding so is this something that is possible? If it is, how would I go about doing that? Appreciate the help!

submitted by /u/FalconStone95
[link] [comments]

Help Needed: Merging 3 Datasets For Junior Data Engineer Assignment

Hi everyone,

I’m currently working on an assignment for a Junior Data Engineer role, and I could use some guidance. The task involves merging three datasets from different sources (Facebook, Google, and Company Website) into one comprehensive dataset. The columns I’m focusing on are:

Domain (most reliable) Phone Number (second most reliable) Name Category Address

I’ve mostly cleaned the datasets, but I need to merge them accurately. My main goals are to:

Merge the datasets using one or two columns (Domain and Phone Number). Ensure no overlap in information and that each row complements itself to create the most accurate and reliable data.

Could anyone suggest the best steps to take for this process? Should I use tools like Power Query or MySQL? Any recommendations for tutorials or YouTube videos would also be greatly appreciated.

Thanks in advance for your help!

submitted by /u/FortaDeMunca
[link] [comments]

Improving My Data Analytics Skills By Practicing On Datasets

Hello everyone, I would like to work on my Data analysis skills and am in the hunt for a few datasets that I could work on. I want to work on my Excel, SQL and Tableau skills. I would love to get hold of some datasets that start from extremely easy to an intermediate level so that I can improve my skills gradually. Any reccomendations on a data viz tool to use and anything else is highly appreciated too. Thank you!

submitted by /u/Shoddy-Scallion4712
[link] [comments]

Finding All Bills In Congress For A Specific Year/congress Session And The Votes On Each One Of Those And Downloading It

I am trying to find a way to find all bills that were in congress (senate and house) with their information (such as title of the bill, what the bill is about, etc.) and find the distribution of votes on each bill by the rep and their state

I looked into

1) https://api.congress.gov/#/bill/bill_list_all – seems like you can find a specific bill, but there is no way to search and download all say the 118 2023-2024 about 2000 bills at once. I was also unable to find vote information

2) https://projects.propublica.org/represent/ – no longer working

3) https://www.govtrack.us/congress/votes – for example https://www.govtrack.us/congress/votes/118-2024/h328#details . This option seems to have the information I am looking for but they are no longer allowing bulk data.

for 3 I guess I can brute-force it with getting all the urls from the html, then write a script to visit all urls for each page and try to parse the html data into a json/xml of sort, but that seems not great

would love to know if anyone has any suggestions

submitted by /u/psychic_shadow_lugia
[link] [comments]

My First Dataset, How Do I Proceed??

I am trying to further my excel skills, eventually also python, power bi and sql. I just find it fun and i think its good skills to have.

My question is. What are some of the first things to examine after getting a dataset and cleaning it?

Im working with some datasets from kraggle.

Are there some things the experienced people always do? Like make a top 5 of valuables, or of top sellers etc, or is it something completely different that i am skipping?

submitted by /u/FuegoFlamingo
[link] [comments]

Consent Regarding Dataset Publication

Hello, suppose I have built a “user review on products” dataset by scraping from a website.

Now I want to publish the dataset, 1. Do I need to get their consent for publishing it? 2. What if I cant reach out to them to get consent?

If yall could kindly give me solutions to this. Thanks.

submitted by /u/Second_Naf
[link] [comments]