Category: Datatards

Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting; I’m interested. Maybe they know where the chix are. But what do they need it for? World domination?

IMF Loan And Transaction Data Is Very Hard To Find

Hey there,

I’m pretty new to this sub and am having a hard time finding a good overview of IMF loans (Stand-by Arrangements, Credit Tranche, Extended Fund Facility, Poverty Reduction and Growth Facility) from 2000-2020. The IMF website is completely unhelpful, and for the years 2000-2006 I’ve been gathering the data from the appendices of the annual reports. From 2007 onwards, however, the design and format changed, resulting in less information about loan extension, cancellation, augmentation, specific dates, etc. Does anyone happen to be aware of any database/dataset where this information can be found? Help would be greatly appreciated! Many thanks in advance 🙂

submitted by /u/Ok_Lettuce2987

[Dataset Request] Bizarre Datasets For Final Project Data Analysis

For my final project this semester I have to clean, summarize, and visualize a dataset. The professor provided datasets but since I’m graduating I kinda want to go out with a bang. So, any ideas for a very bizarre dataset that will cause my professor to question my sanity/thought process? Or at least things to look up on the interweb. Searching “bizarre datasets” has me questioning why the author thought said dataset is bizarre.

submitted by /u/zora833

Dataset Wanted: Country-Level Well-being & Wealth For Understanding The Role Of Job Quality/Opportunity In Development

Hey folks! 👋 I’m on a mission to find a dataset (or datasets that can be merged) covering as much detail as possible about a country’s wealth-at-work landscape (not only money). I’m talking productivity, workplace wealth (including happiness at work and quality of life), entrepreneurship opportunities (like successful startups and investment levels), and sustainability practices within each country’s companies.

Know of any datasets that cover these angles comprehensively? Your expertise would be invaluable!

The focus is particularly on comparing Germany, Colombia, the US, and South Africa.

submitted by /u/jucajagu

Scenarios/walkthroughs Of Utilizing SQL On Datasets And Then Inputting Into Tableau?

Howdy folks,

I’m a data analyst with two years of experience, and I’ve been job searching for the last few weeks. I’m trying to find walkthroughs/scenarios where SQL is used to join different tables (or otherwise transform the data) and the result is then loaded into Tableau and visualized.

I’m aware there are different datasets this could be done with, but I’m looking for actual walkthroughs of it being done. Although SQL isn’t all that complex, I haven’t used it for a bit, and I have much more experience in Tableau.

I’d like to run through some scenarios/walkthroughs so I can get the hang of writing the queries/transformations in SQL and then outputting the result into Tableau. I’ve already been using the search function, so please don’t ask me to just google it.

I’m wondering if anyone here has seen a good dataset to do this on, or has worked through a scenario (like a video explainer/walkthrough) that I could follow, so that afterwards I can use whatever dataset I want. I’d prefer this with PostgreSQL if possible, but it absolutely doesn’t need to be.

Any direction would help greatly.

submitted by /u/WhatsTheAnswerDude
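
One way to rehearse the workflow described in this post, as a minimal sketch under stated assumptions: the snippet uses Python's built-in sqlite3 module as a stand-in for PostgreSQL, joins two made-up tables, aggregates the result, and writes CSV text that Tableau can open as a text-file data source. The table names (customers, orders), the columns, and the rows are all invented for illustration.

```python
import csv
import io
import sqlite3


def build_demo_db() -> sqlite3.Connection:
    """Create an in-memory database with two small, made-up tables."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, region TEXT);
        CREATE TABLE orders (
            order_id INTEGER PRIMARY KEY,
            customer_id INTEGER REFERENCES customers(customer_id),
            amount REAL
        );
        INSERT INTO customers VALUES (1, 'North'), (2, 'South'), (3, 'West');
        INSERT INTO orders VALUES (10, 1, 20.0), (11, 1, 5.0), (12, 2, 7.5);
    """)
    return conn


# LEFT JOIN keeps regions that have no orders, so they show up as zero
# in the visualization instead of silently disappearing.
QUERY = """
    SELECT c.region,
           COUNT(o.order_id)            AS order_count,
           COALESCE(SUM(o.amount), 0.0) AS revenue
    FROM customers AS c
    LEFT JOIN orders AS o ON o.customer_id = c.customer_id
    GROUP BY c.region
    ORDER BY c.region
"""


def export_for_tableau(conn, out):
    """Run the join and write header plus rows as CSV to a file-like object."""
    cur = conn.execute(QUERY)
    header = [col[0] for col in cur.description]
    rows = cur.fetchall()
    writer = csv.writer(out)
    writer.writerow(header)
    writer.writerows(rows)
    return rows


conn = build_demo_db()
buffer = io.StringIO()  # swap for open("regions.csv", "w", newline="") to write a file
rows = export_for_tableau(conn, buffer)
print(buffer.getvalue())
```

With a real PostgreSQL database the same SELECT should run largely unchanged, and Tableau can also connect to the database directly; the CSV step is just the simplest hand-off to practice with.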

Does Anyone Know A Dataset Of European Railway Connections?

For a project at Uni about community finding in a graph, I wish to experiment with the railway connections graph, to see whether stations are grouped into communities by country or something similar.

Do you know of any dataset of European train stations that includes the other stations they’re connected to? I found datasets of stations but not connections.

Thank you in advance!

submitted by /u/Gogani
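
Once a connections dataset is found, one way to test the "communities by country" idea is to score the by-country grouping with Newman's modularity. The sketch below is not a community detection algorithm; it only measures how well a given grouping matches the graph's structure, so the country grouping can be compared against whatever a detection method returns. The station names and edges are made up, and the graph is assumed to be undirected and unweighted.

```python
from collections import defaultdict


def modularity(edges, group_of):
    """Newman modularity of a partition of an undirected, unweighted graph.

    edges    -- list of (station_a, station_b) pairs, each connection listed once
    group_of -- dict mapping every station to its group label (e.g. a country)
    """
    m = len(edges)
    internal = defaultdict(int)    # edges with both ends inside the same group
    degree_sum = defaultdict(int)  # summed degree of all stations in each group
    for u, v in edges:
        degree_sum[group_of[u]] += 1
        degree_sum[group_of[v]] += 1
        if group_of[u] == group_of[v]:
            internal[group_of[u]] += 1
    # Q = sum over groups of (internal edge share) - (expected share at random)
    return sum(internal[g] / m - (degree_sum[g] / (2 * m)) ** 2 for g in degree_sum)


# Hypothetical toy network: two dense national clusters, one cross-border link.
edges = [
    ("FR-1", "FR-2"), ("FR-2", "FR-3"), ("FR-1", "FR-3"),
    ("DE-1", "DE-2"), ("DE-2", "DE-3"), ("DE-1", "DE-3"),
    ("FR-3", "DE-1"),
]
stations = {s for edge in edges for s in edge}

by_country = {s: s.split("-")[0] for s in stations}
ignoring_borders = {s: "X" if s in {"FR-1", "FR-2", "DE-1"} else "Y" for s in stations}

q_country = modularity(edges, by_country)
q_mixed = modularity(edges, ignoring_borders)
print(f"by country: {q_country:.3f}, ignoring borders: {q_mixed:.3f}")
```

A clearly higher score for the country grouping than for alternative groupings would suggest that borders do line up with the network's community structure.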

Most Publicly Available Datasets Are Already Finalized In A Single Table. How Important Is Showing ‘Joins’ In An Entry Level Portfolio?

Hi guys,

I’m currently working on a data analysis portfolio for entry-level jobs, and everyone always says that knowing SQL, and joins in particular, is a very important skill to have and to demonstrate.

When obtaining datasets, whether from Kaggle, from data publicly available on an official website, by extracting data through APIs, or from wherever you get your data, the one thing I’ve noticed is that all the data is usually already put together in a single table. You can take that data and ‘clean’ it (making rows, columns, and values consistent prior to analysis, etc.) and so forth.

A few questions:

How can you demonstrate joins when most public datasets are already put together and finalized?

How important is showing joins in an entry-level portfolio?

Is finding a ready-made dataset on Kaggle, for example, writing SQL queries to answer business-related questions (e.g. what features are causing retention rates to decrease?), and then visualizing the results in Tableau good enough for entry-level roles? Again, no joins would be used, since the datasets are usually already complete.

Thanks for any help I can get, greatly appreciated!!

submitted by /u/believeinriven
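
One possible answer to the first question, offered as a sketch rather than a recommendation from the poster: a flat public table can be split into separate tables by the portfolio author, and the joins then demonstrated on the result. The example below uses Python's built-in sqlite3 module; the flat_sales table, its columns, and its rows are invented for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- A flat, already-joined table like many public downloads (made-up rows).
    CREATE TABLE flat_sales (
        order_id INTEGER, customer_name TEXT, customer_city TEXT, amount REAL
    );
    INSERT INTO flat_sales VALUES
        (1, 'Ana', 'Lisbon', 30.0),
        (2, 'Ana', 'Lisbon', 12.5),
        (3, 'Ben', 'Oslo',    8.0);

    -- Dimension table: one row per distinct customer, with a generated key.
    CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, name TEXT, city TEXT);
    INSERT INTO customers (name, city)
        SELECT DISTINCT customer_name, customer_city
        FROM flat_sales
        ORDER BY customer_name;

    -- Fact table: each order keeps only a key pointing at its customer.
    CREATE TABLE orders AS
        SELECT f.order_id, c.customer_id, f.amount
        FROM flat_sales AS f
        JOIN customers AS c
          ON c.name = f.customer_name AND c.city = f.customer_city;
""")

# The join a portfolio piece would showcase: rebuild the flat view on demand.
rebuilt = conn.execute("""
    SELECT o.order_id, c.name, c.city, o.amount
    FROM orders AS o
    JOIN customers AS c USING (customer_id)
    ORDER BY o.order_id
""").fetchall()

original = conn.execute("SELECT * FROM flat_sales ORDER BY order_id").fetchall()
customer_count = conn.execute("SELECT COUNT(*) FROM customers").fetchone()[0]
print(rebuilt == original, customer_count)
```

Checking that the rebuilt rows equal the original flat table is a simple way to show the split lost no information, which is itself a reasonable thing to document in a portfolio.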

Hi, Looking For Dataset For Crime Incident Reports With Geographic Information (New York), Arrest Records Dataset In New York And Crime Victimisation Survey Data

Hi, I urgently need three datasets: crime incident reports with geographic information, arrest records for New York, and crime victimisation survey data. The latter two should be JSON and the first should be a CSV file. Can you please point me to resources where I can find these datasets?

submitted by /u/MalayaleeKL06

Looking For California Solar Panel Incentive/Rebate Table

Looking for a historical archive of California’s solar power incentive programs (specifically with the date enacted). This type of data is available for EV incentive programs in a nice format, and I’m looking for the same thing for solar power incentives in CA. The column names include: Title, Text (not important), enacted date (important), expired date if applicable (important).

submitted by /u/UrAvgCollegeStudent

Secondary Dataset - Occupational Stress

I need to find a secondary dataset for analysis. I am most interested in evaluating burnout (or other occupational stressors) in American social workers. A different population of healthcare workers would be fine too! I’m having a hard time finding raw data, and when I do, it’s almost always too old to be relevant. Please help!!

submitted by /u/Deep_Instance2597

Looking For Large Animal Sound Dataset

I am looking for a dataset containing a large(!) number of audio files that I can use to train a generative model. It doesn’t matter which animal it is, as long as it makes a distinct sound (some birds make very short sounds that are hard to learn from). Any help would be appreciated!

submitted by /u/lubbby

Looking For Interesting County Level Data Sets To Analyze

Hey all! During a project I created a script that collects all the neighbors of a given county, which I’m now looking to leverage for some analysis. There should be a cool experiment possible comparing features of counties that border one another but are in different states against other counties within the same state, for example. Does anyone know of any interesting county-level data, available at that level of granularity, that you could point me toward? I’m avoiding typical “Census” stuff since that’s been beaten to death by political scientists. I know surveys are hard to get at this level (most people just use MRP if they even bother projecting down that far), but what other sources can I draw from?

It doesn’t need to be particularly clean, as I can manage, merge, and claw through it, but I am hoping for it to be detailed!

Thanks in advance!

submitted by /u/UrAvgCollegeStudent
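
The border comparison described in this post could start from something like the sketch below. It assumes the neighbor script produces a dict mapping each county's 5-digit FIPS code to a list of adjacent county codes (that output format is a guess), and it relies on the first two digits of a county FIPS code being the state code. The codes used here are made up, with 98 and 99 as placeholder state prefixes.

```python
def split_neighbor_pairs(neighbors):
    """Split adjacent county pairs into cross-state and same-state lists.

    neighbors -- dict of county FIPS code -> list of adjacent county FIPS codes
    """
    cross_state, same_state = set(), set()
    for county, adjacent in neighbors.items():
        for other in adjacent:
            pair = tuple(sorted((county, other)))  # store each undirected pair once
            if county[:2] == other[:2]:            # same 2-digit state prefix
                same_state.add(pair)
            else:
                cross_state.add(pair)
    return sorted(cross_state), sorted(same_state)


# Hypothetical adjacency data; these are not real county codes.
neighbors = {
    "98001": ["98002", "99001"],
    "98002": ["98001"],
    "99001": ["98001", "99002"],
    "99002": ["99001"],
}

cross, same = split_neighbor_pairs(neighbors)
print("cross-state pairs:", cross)
print("same-state pairs:", same)
```

Any county-level measure can then be differenced within each pair, giving one distribution for cross-state borders and one for same-state borders to compare.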

Looking For Plant Care & Analysis Datasets

I am interested in building an LLM that can work out from a photo of a plant what species it is and what is possibly wrong with it, and then describe a solution to me. Similar to Plant Parent.

To build this I would need a dataset of basic house plants with identification labels, a data set for disease identification and a dataset that would have symptoms/solutions for the identified disease.

I think this would make for a great learning project!

submitted by /u/horasandchorus