Category: Datatards

Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?

Looking For A Dataset Of Handwritten Answered Exam Papers

Hello!

I’m doing a project on auto-grading handwritten exam papers and so am looking for a dataset to help me with that. I want to specifically do this project for auto-marking GCSE/A level exam papers but it seems that no dataset with answered papers exist, so I am looking for alternatives. I am new to ML projects so any advice would be very much appreciated. Thanks!

submitted by /u/cakeandflowers2202
[link] [comments]

Dataset For Training LLMs To Translate English Into Statements Of Pure Zero-order Logic (ZOL)

All my searching so far leads me to suspect that this is a dataset that does not exist. There are a bunch of datasets that primarily focus on examples of English-to-ZOL, but the creators always insist on throwing some first-order logic in there as well. I can explain why that’s a problem if anyone is genuinely curious (as opposed to simply wanting to have an argument.)

TL;DR: I need a dataset that makes a point out of including examples of English (when a sentence actually allows for it) being translated only into ZOL, no higher-order logic whatsoever.

submitted by /u/evangelos520
[link] [comments]

ISO: Simple Format Global Elevation Data

I want to play with global elevation data, but I’m not good at parsing special files. Is there a simple text format dataset of global elevation? Something like a CSV of

LONGITUDE, LATITUDE, ELEVATION 0,0,0

It doesn’t have to be super-high resolution. I’ve found a few sources, but I don’t know how to parse an hgt or kml file.

submitted by /u/stable_maple
[link] [comments]

Wanted: Linux Kernel GitHub Contributors

Hello,
I am looking for a way to get all contributors of the Linux kernel GitHub repo, and then also get all followers from each contributor, preferably in python.
Unfortunately i have never done anything in this direction, i need this for a course at uni.
Is there any way to do this? if so, which programs, library or tutorials can recommend?
Cheers!

submitted by /u/ro-oope
[link] [comments]

Data Set For Plants With Sensor Readings

Hello everyone, I am currently working on a project that involves using R-based statistical analysis to improve precision plant growth and farming in greenhouses. I have generated a data set for a few plants, but it is not very efficient as it is randomly generated. Therefore, I am wondering if there is a real-life data set available for a few plants that includes sensor readings for temperature, humidity, and light intensity. If anybody has accomplished anything similar to this, I would very appreciate hearing about it.

submitted by /u/Biocandy93
[link] [comments]

South Africa’s Court Case Against Israel In The International Court Of Justice

This is a dataset including text from South Africa’s 84-page case submitted to the International Court of Justice accusing Israel of committing genocide against the people of Gaza.

Link to Dataset: https://www.kaggle.com/datasets/samerhijjazi/south-africa-genocide-case-against-israel-2324

Original source: https://www.pbs.org/newshour/world/read-the-full-application-bringing-genocide-charges-against-israel-at-un-top-court

submitted by /u/Embarrassed-Big-5823
[link] [comments]

Why Don’t More Companies Try To Sell Their Data? What Are The Challenges For DaaS (data As A Service) Or Companies Trying To Make Data Products?

Most people can agree that data is the new gold. There is a lot of valuable data that companies own that their customers, partners, or other companies could use and make money for both sides, so I am surprised there isn’t more data products out there especially for small-medium businesses.

Curious for the community’s thoughts on the biggest barriers of selling data (I guess both for data companies but also for other companies who just want to make extra revenue?)

submitted by /u/kitkat_126
[link] [comments]

Commercial Pools And Commercial Elevators Dataset Needed

I am looking for a data set that includes state-by-state data on the number of commercial pools and commercial elevators in the United States.

I have tried looking at government data state by state but there are a lot of inconsistencies and some states have no information available. I am looking to complete a project that requires me to look at all of the locations for pools and elevators.

Does anyone know where this data would exist? Any pointers or tips that anyone may have to lead me in the right direction would be greatly appreciated. TYIA!!

submitted by /u/ilovemarketresearch
[link] [comments]

I Need A Data Set For Medical Image Denoising. Short On Time, Please Help!

As the title says, I need a data set containing noisy medical images so that I can apply Denoising algorithms on em and maybe try new things. I have to convey the data set I would be using to my project guide by this Saturday and I am unable to find one. All the medical image data sets I find online are pure images. I want medical image data sets containing noisy images as well as the ground truth. Please help me someone.

submitted by /u/No1_unpredictablenin
[link] [comments]

Looking For A Streaming Services For A Particular Movie API/dataset

I’m searching for an API, preferably free, or a dataset available for commercial use that provides streaming service information for a particular movie. I’ve come across the ReelGood API, which is priced at $95 per month, and the JustWatch API, but it’s only available for businesses, and you need to reach out to them. Are there any other alternatives you’re aware of? While a free option would be ideal, I’m open to checking out paid options as well.

submitted by /u/-Oake
[link] [comments]

šŸš€ Launched Job Posting API On ProductHunt [self-promotion]

Hey everyone! šŸ‘‹ Exciting news – we just launched our latest product on ProductHunt:
šŸš€ Job Postings API: Unlock millions of fresh job opportunities every month!
Check it out here: Job Postings API on ProductHunt
Job postings provide detailed insights into jobs, companies, and technologies. Perfect for powering new job boards, uncovering sales leads, generating market reports, tracking tech trends, and more.
If you need larger datasets for in-depth data analysis or machine learning, we’ve got you covered with job postings from 140+ countries available as datasets or data feeds.
We’d love to hear your thoughts! Feel free to share your feedback. Thanks for checking us out! šŸš€

submitted by /u/Techmap_io
[link] [comments]

Researcher Collecting Cat Meows To Aid Novel Study On Cat Vocalisations (it’s Quick And Simple!)

Hello everyone! I am looking for cat owners keen to help out with a research project, I am studying cats to see whether we can estimate their age just by their voices. If we manage to, this promises significant benefits in veterinary care, can aid rescue centres in creating accurate adoption profiles, and has potential implications for understanding the age demographics of feral cats.

It’s quick and simple – if you’re interested in helping out please send me a message and I will share more info & instructions šŸ™‚

Any contribution is invaluable and will help gain insight into the development of age-related vocalisation patterns in cats!

Thank you!

submitted by /u/Asseflas
[link] [comments]

Looking For Historical Job Posting Data

Hello, I am a PhD student working on a research project on labor economics. I am looking for job posting data (including job descriptions and requirements), especially historical data from the past 5 to 10 years (preferably). Are there any places I can find data like that? I currently know some job listing APIs, but they only have active postings, and some data consulting firms have historical data, but it costs more than 20k šŸ™

submitted by /u/nycameraguy
[link] [comments]