Category: Datatards

Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting; I’m interested. Maybe they know where the chix are. But what do they need it for? World domination?

Help With Finding A Dataset With Parents And Children’s Faces

Hi everyone!

I am doing an academic project, and I am trying to find a dataset, or a source to scrape, where I could acquire the faces of parents and their children.

This would need to be on a pretty large scale, preferably thousands or tens of thousands of faces (I would use AMT to sort through the images to take out incompatible ones).

Do any of you have an idea of where I might look to find this?

P.S. For other projects, I am also looking for facial datasets from various regions within countries around the world, especially in Europe, and faces of individuals with differing degrees of Jewish admixture. This is a bit more complicated, and I would likely need to gather data by survey.

submitted by /u/SyrupAccomplished853

Nationwide Readmissions Database Comorbidities Help

I am working with the Nationwide Readmissions Database (NRD) in SPSS. HCUP provides the Elixhauser Comorbidity Software Refined for ICD-10-CM, which uses diagnosis codes to identify comorbidities in a patient population, but this software only runs in SAS (which I don’t have). According to HCUP, 18 of the comorbidities in the Elixhauser comorbidity index can only be identified using present on admission (POA) indicators, which specify whether a diagnosis was part of the patient’s prior medical history or arose during the hospital stay (the POA indicator is a binary yes or no). However, these indicators are not present in the SPSS file.

Anyone know a solution? Is the use of POA indicators necessary in NRD (this software isn’t specific to NRD and can also be used in NIS)?

submitted by /u/mygpaistrash

Observational Study: Different Types Of Backpack Usage At Columbia Vs At Barnard College

Hi! My psych lab group and I did an observational study where we stood outside a library at Columbia University and tallied the different types of bags used (totes, purses, double-strap backpacks). Then we stood outside a library at Barnard College and did the same.

I’m wondering what this information could be useful for. What further research could it support? Or what questions does the data we collected possibly answer, if that makes sense?

submitted by /u/bombasticsideeie

How To Build A Realistic Health Related Dataset

Hi, guys. I need to create a realistic health dataset to showcase how a data analytics platform can help draw useful insights, such as identifying seasonal trends, local hotspots, supply chain issues, etc.

The data needs to be recorded daily/weekly and have dimensions such as facility name, age group, and gender, plus indicators such as suspected and confirmed cases, vaccine stock, people immunized, and missed immunizations.

I tried GPT but it cannot handle this task well. Does anyone know how to do this? Thanks!

submitted by /u/Technical-Blood9031

Looking For A Gym Exercise Dataset From A Peer Reviewed Journal

Hey guys, basically I’m working on a system that would use machine learning to recommend workout plans (exercise selection etc) based on the muscles that users want to prioritise, whether their goal is strength or hypertrophy etc, and I need some datasets that I could potentially use to train my model.

My professor said to look into EMG studies and whatnot, but I was wondering if anyone could help me out and even potentially link to some datasets I could use. He said to try to use datasets that have been used in peer-reviewed journals, and to avoid places like Kaggle if I can. I really want to be able to use a dataset like this one: https://www.kaggle.com/datasets/niharika41298/gym-exercise-data/data

How would I go about finding a similar dataset that comes from a peer-reviewed journal?

Any help would really be appreciated, thanks. If this isn’t the right place to ask, then any pointers on where to ask would be appreciated too, thanks.

submitted by /u/ILikeFish69

Looking For A Paraquat Applicator/Farmers Database

Hey 👋🏻,

I’m currently working on a project and I’m trying to get my hands on a database that tracks farmers or applicators who have used Paraquat. I’m particularly interested in any datasets that could provide info on usage patterns, application history, or anything related to this herbicide.

I’ve done some basic searches but haven’t had much luck finding something concrete. Does anyone here know where I might be able to find such a dataset? Whether it’s publicly available, or even something I’d need to purchase or request through an organization, any lead would be super helpful.

Thanks in advance for any tips or suggestions! 👨‍🌾

submitted by /u/alb53

Looking For Data Set To Detect Anxiety Or Panic Attacks Or Phobia Or Stress

I’m working on a project about detecting physiological symptoms of anxiety in general, using physiological sensors (gyroscope, thermometer, heartbeat) and machine learning.

I need a dataset to feed into the system so it can tell whether a person is stressed or not, and I don’t have much time left before submitting the project to actually train the system.

Thank you all in advance

submitted by /u/Revolutionary_Bat94

MIT Technology Review Data In JSON Format [1997-2024]

MIT Technology Review magazine data from January 1997 to October 2024. I started scraping from 1890, but it looks like posts from years before 1997 aren’t posted, so I’ve excluded them from the dataset (I do have metadata about these issues, though, which includes the cover image, title, and link to the PDF file for each issue).

Format:

{
  title: "Issue Title",
  date: "2024 January",
  hero: "cover image url",
  pdfLink: "link to pdf file",
  posts: [{
    title: "Post Title",
    date: "Article publishing date",
    topic: "Policy",
    headerImg: "image url for article hero img",
    authors: [{
      name: "Author name",
      link: "Link to author profile",
    }],
    body: "<p>Article content goes here</p>",
  }]
}

All files are stored in folders named by year.
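
As a rough illustration of how the layout above might be consumed, here is a minimal Python sketch. It rests on several assumptions that are not confirmed by the post: that each year folder holds one `.json` file per issue, that the files are strict JSON (quoted keys, no trailing commas) with the fields shown in the format, and that year folders have purely numeric names. The function names `iter_issues` and `posts_by_topic` are hypothetical helpers made up for this example.

```python
import json
from pathlib import Path
from typing import Iterator


def iter_issues(root: Path) -> Iterator[dict]:
    """Yield one issue dict per JSON file found in year-named folders.

    Assumption: folders are named by year (digits only) and each
    *.json file inside contains a single issue object.
    """
    year_dirs = sorted(
        p for p in root.iterdir() if p.is_dir() and p.name.isdigit()
    )
    for year_dir in year_dirs:
        for path in sorted(year_dir.glob("*.json")):
            with path.open(encoding="utf-8") as fh:
                yield json.load(fh)


def posts_by_topic(root: Path, topic: str) -> list[dict]:
    """Collect posts whose "topic" matches, tagged with their issue title."""
    matches = []
    for issue in iter_issues(root):
        for post in issue.get("posts", []):
            if post.get("topic") == topic:
                matches.append({
                    "issue": issue.get("title"),
                    "title": post.get("title"),
                    # Keep only author names; "link" is ignored here.
                    "authors": [a.get("name") for a in post.get("authors", [])],
                })
    return matches
```

With the archive extracted to a local folder (the folder name is up to you), something like `posts_by_topic(Path("path/to/extracted/archive"), "Policy")` would return a list of small dicts, one per matching article. The `.get()` calls are a deliberate choice so that issues with missing fields are skipped over quietly instead of raising an error.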

Usage: I actually scraped this data for myself to generate EPUB and PDF files with less clutter and better readability on mobile/Kindle devices. I’m currently scraping all the popular magazines, like The Economist, The New Yorker, The Atlantic, Vanity Fair, etc., without a solid use case other than generating EPUBs/PDFs. You can generate EPUBs/HTML or combine it with other data to use in some LLM projects.

Download link: Google Drive

submitted by /u/waqarHocain