Category: Datatards

Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, I’m interested. Maybe they know where the chix are. But what do they need it for? World domination?

Datasets On US Government Cheese + TEFAP Food Distribution Help

Hi all,
I’m trying to find data on government cheese: mainly how much cheese the US government bought per year under dairy subsidies, where in the US it was distributed, and, once it was supplied to Americans, how much went to each operation, e.g. the Temporary Emergency Food Assistance Program (TEFAP), and how that was distributed across the country (programmes/quantity/method). I’ve never worked with US government data before, so I’m finding it a bit tricky to navigate the different departments and how the data is laid out. I’ll keep looking, but I wanted to ask whether anyone here has any background with this. I’ve started with USDA data, but I can only find distribution and consumption figures for cheddar, not necessarily the government variety. I’ll probably try a FOIA request soon if I get stuck. If you have any information or guidance I would really appreciate it, thank you.

submitted by /u/clearwatertaffy

Looking For A Uniform GDP/Employment By Country And Economic Sector Dataset That Goes Back To At Least 2006

I am looking for a high-quality data source for growth rates and employee numbers by economic sector (economic activity), country and year. The data set should go back to 2006. At least Germany and the USA should be included, ideally also China, Nigeria, Japan and Brazil. I could look at the respective national statistical offices, but the sector classifications in particular sometimes differ a lot, which leads to methodological problems.

So far I have looked at the World Bank, the OECD and the International Monetary Fund, unfortunately without success. The OECD does have good statistics on “employment by activities and status”, but these only go back to 2008. However, 2006 must be included because of the global economic crisis that occurred in the following years. Does anyone here have any ideas?

submitted by /u/Aequitas49

Making Experimental Variograms Correctly?

I am having a bit of difficulty understanding experimental variograms, and when making one I’m not too sure what I’m looking for. Am I just adjusting the number of lags and the lag distance until it looks good? What should a good one look like? And how do you justify your choices?
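For reference, the experimental variogram is usually computed with the classical (Matheron) estimator: for each lag bin, half the mean squared difference between the values of all point pairs whose separation falls in that bin. The sketch below is a minimal NumPy version, not any particular package’s implementation; the function name `experimental_variogram` and its parameters are made up for illustration. It also returns the number of pairs per bin, since sparsely populated bins are a common reason a variogram looks noisy.

```python
import numpy as np


def experimental_variogram(coords, values, lag_width, n_lags):
    """Classical (Matheron) estimator with equal-width lag bins.

    Hypothetical helper for illustration. Bin k covers separations in
    [k * lag_width, (k + 1) * lag_width).
    """
    coords = np.asarray(coords, dtype=float)
    values = np.asarray(values, dtype=float)

    # Every unique pair of points (i < j).
    i, j = np.triu_indices(len(values), k=1)
    dists = np.linalg.norm(coords[i] - coords[j], axis=1)
    sq_diffs = (values[i] - values[j]) ** 2

    # Assign each pair to a lag bin by its separation distance.
    bins = np.floor(dists / lag_width).astype(int)

    lag_centers = (np.arange(n_lags) + 0.5) * lag_width
    gamma = np.full(n_lags, np.nan)      # NaN marks bins with no pairs
    counts = np.zeros(n_lags, dtype=int)
    for k in range(n_lags):
        mask = bins == k
        counts[k] = mask.sum()
        if counts[k] > 0:
            # Semivariance: half the mean squared difference in this bin.
            gamma[k] = 0.5 * sq_diffs[mask].mean()
    return lag_centers, gamma, counts
```

Changing `lag_width` and `n_lags` and watching how `counts` and `gamma` respond is one way to see what the lag settings actually do, rather than tuning by eye alone.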

submitted by /u/alex123711

What Is The Term For A Wiki-like Dataset

A wiki “is a website that allows any user to change or add to the information it contains”, according to Oxford’s dictionary.

What is it called when a dataset works the same way? A lot of datasets have static and/or outdated info. For example, an NBA dataset might need to be updated every season with the new rosters, and people would be willing to submit changes to it just like they do on Wikipedia.

Is there a name for this type of database/dataset, and are there good examples of it? One I found is https://openlibrary.org/about but its features go pretty far beyond just a dataset. It doesn’t need a full API, for instance.

submitted by /u/third_dude

Scraped Top Active Football Players Data

Hello everyone,

The other day I was bored, so I scraped and cleaned the data of the top 380 active football players. Each player is also linked to their images with IDs.
Feel free to check it out and play around with it. I was gonna use it for a guess-who game with football players, but I don’t have time to tackle that solo. If interested, we can make a web app game together for that.

PS: If you’re interested in the scraping script I wrote, DM me!

Cheers,
Atilla
https://www.kaggle.com/datasets/atillacolak/top-active-football-players-data

submitted by /u/AttilaTheHappyHun

Personal Project For My GitHub Profile

I’m graduating in 3 weeks, and I am thinking of a random project to showcase on my GitHub. My idea is to plan remote gas stations (like a fuel truck). The plan is to get the traffic dataset of an area and analyze the data for all days of the week, create a heatmap, and then plot the existing gas stations on the map. The goal is to select the top 5 places with heavy traffic and few gas stations (assuming gas stations are needed in high-traffic areas). I’m not sure where to start: where can I get datasets other than Kaggle? Also, can someone help me brainstorm the things I need to focus on? Thanks
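The selection step described above can be sketched roughly as follows. This is only one possible approach under stated assumptions: traffic observations and gas stations are both given as (lat, lon) points, the area is split into square grid cells, and each cell is scored as traffic count divided by (stations + 1). The names `cell_of` and `underserved_cells`, the default cell size and the scoring formula are all hypothetical choices, not an established method.

```python
from collections import Counter


def cell_of(lat, lon, cell_size=0.01):
    """Map a coordinate to a grid cell index (cell_size in degrees, an assumption)."""
    return (int(lat // cell_size), int(lon // cell_size))


def underserved_cells(traffic_points, station_points, top_n=5, cell_size=0.01):
    """Rank grid cells by traffic relative to the number of existing stations."""
    traffic = Counter(cell_of(lat, lon, cell_size) for lat, lon in traffic_points)
    stations = Counter(cell_of(lat, lon, cell_size) for lat, lon in station_points)

    # Score = traffic observations per (stations + 1); the +1 avoids
    # dividing by zero in cells that have no station at all.
    scores = {cell: n / (stations[cell] + 1) for cell, n in traffic.items()}

    # Highest score first: busy cells with few stations.
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)[:top_n]
```

The per-cell traffic counts are also what a heatmap would display, so the same grid can feed both the map and the ranking.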

submitted by /u/happyplantt

Need Assignment Help With Finding A Dataset To Work On (Data Science)

Hi everyone, I need a dataset to work on for this project. Since I have to build a business question out of it, I need something relevant. I am doing my master’s in France. Can you recommend an easy dataset to work on? It is kind of urgent, so I would appreciate a response by today.

* Already looked through Kaggle and other resources, can’t find something business related, so I have come here

You will write a project proposal that will capture the “who, what, why and how” of your work, plus any challenges that you foresee along the way. Your proposal will include:
Project specification (Word document) *

a specific business case (Business questions) or personal objective to reach,
any intended outcomes (Business values),
a description of the needs of the intended audience,
a description of the dataset to be used, and any foreseeable challenges.
Tableau Software specification
import and prepare the data (Extract data!) (Tableau document)
Analyze the data, (Tableau document)
Create dashboard and storyboard, (Tableau document)

Due date: April 28, 2024 before midnight.
Format: “Tableau” TWBX file with data and other workbooks. DOCX document for your specification*
File repository: Assignments folder

submitted by /u/Thelostmind912

Finding Or Creating The Dataset You Could Not Find Or Want To Find For Free

Hello everyone,

I am here to help you and myself with this post. Here is a brief explanation of what I want to do: I want to create a directory of extreme and absurd datasets as a side project, and I would love to help you in return for ideas. I would also appreciate challenging ideas. I will share here every dataset I manage to find or create.

I am a junior ML engineer and want to do something different for my portfolio. People are already doing, and I have done, segmentation, classification, stable diffusion, NLP or LLM projects, or open source contributions. I think they are pretty useful and a joy to learn and develop, but I want to do something different and helpful to draw some extra attention. I think it would look pretty good on a portfolio to have a unique public dataset directory that people are using, and it is also something that can be extended continuously.

I have mostly worked on computer vision so far, but I am open to anything. So far, what comes to mind:

Different Types of Beards Dataset
Feces in Cat Litter Dataset
Dog Poop Dataset: but I found it easily here, though not sure fake poop provides the best results
Emoji – Emotion Dataset: found it too link.
Firearm – Manufacturer Dataset

My ideas are mostly visual because of my work, I guess, but I hope I could give some context on the limit of absurdity you can think of. Waiting for your ideas.

Will try my best to find or create (ofc, that might take a while) one for you.

submitted by /u/Minimum_Medium_3914

Seeking Data For Correlation Study: Obesity And GPA Among University Graduates

Hello everyone,
I’m curious about exploring the correlation between obesity and academic performance (GPA) among university graduates. For that, I need data on the sex, weight, height, and GPA of graduates from various universities.
If anyone has access to or knows where I can find such data, please do share your insights or point me in the right direction.
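Once such data is available, the calculation itself is short. A minimal sketch, assuming weight in kilograms and height in metres so that BMI = weight / height², and using the plain Pearson correlation coefficient; the helper names `bmi` and `pearson_r` are made up for illustration, and this says nothing about confounders or causation.

```python
import math


def bmi(weight_kg, height_m):
    """Body mass index: weight in kg divided by height in metres squared."""
    return weight_kg / height_m ** 2


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of co-deviations and the two sums of squared deviations.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(ss_x * ss_y)
```

With per-student records, the BMI values would go in as `xs` and the GPAs as `ys`; computing it separately per sex is a straightforward extension.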

submitted by /u/Sea-Dimension2515