Category: Datatards

Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting; I’m interested. Maybe they know where the chix are. But what do they need it for? World domination?

Dataset On Global Plants And Native Area

I’m looking for a dataset connecting global native plants with their natural locations (countries, regions, cities, etc). I’ve found a few datasets that don’t have locations, but cover tons of plants!

GlobalUsefulNativeTrees – https://zenodo.org/records/7994433
World Checklist of Useful Plant Species – https://kew.iro.bl.uk/concern/datasets/7243d727-e28d-419d-a8f7-9ebef5b9e03e?locale=en
All global flora – https://www.worldfloraonline.org/
Trees and location, but no plants – https://www.bgci.org/resources/bgci-databases/globaltreesearch/

Any other datasets you all have used? Thanks!

submitted by /u/teenwent11

HELP FOR MY STATA PROJECT (FINDING DATASETS)

Hi guys, I would like to ask about datasets for Stata. Does anyone know where I can download a .dta or Excel file for a project? Official data would be best. I am searching in particular for health data, such as drug abuse and the medical use of drugs. Otherwise, I’m looking for anything interesting, as long as it gets the project a good evaluation from the professor! Thanks in advance.

submitted by /u/Academic-Muffin-5119

Seeking Data On Historical University Protests In The US

I am interested in conducting a statistical analysis comparing current protests to historical ones at universities in the US. Specifically, I would like to examine the timeline and organization of these protests using a statistical approach.

Does anyone know of an open source dataset that can be used for this analysis? Alternatively, has anyone already conducted a similar analysis that I can reference?

Thank you for any assistance!

submitted by /u/Tolure

Looking For Purchase Orders Dataset Of PDFs Provided By Procurement Managers.

I couldn’t find a dataset online, fictional or real (presumably for privacy reasons).

If there is a fictional PO dataset of PDFs with a corresponding table of data keyed by PO number, that would be helpful.

Otherwise, I’m looking to create my own dataset with fictional items generated by GPT and populated into a PDF purchase order template. Is there any GitHub code that does something similar?
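One possible starting point for the do-it-yourself route is sketched below, using only the Python standard library. It generates fictional purchase orders and the matching ground-truth table keyed by PO number. Everything in it is an assumption made for illustration: the item catalogue, vendor names, and field names are invented, a seeded random generator stands in for GPT, and rendering each order into a PDF template is left out because it would need a third-party PDF library.

```python
import csv
import io
import random

# Hypothetical catalogue and vendors; every name and price here is made up.
ITEMS = [
    ("Steel bolts M8", 0.12),
    ("Safety gloves", 4.50),
    ("Printer paper A4", 3.20),
    ("Hydraulic hose", 18.75),
]
VENDORS = ["Acme Supplies", "Northwind Traders", "Globex Industrial"]


def make_purchase_order(po_number, rng):
    """Return one fictional purchase order as a dict with its line items."""
    lines = []
    for name, unit_price in rng.sample(ITEMS, k=rng.randint(1, 3)):
        qty = rng.randint(1, 50)
        lines.append({
            "item": name,
            "qty": qty,
            "unit_price": unit_price,
            "line_total": round(qty * unit_price, 2),
        })
    return {
        "po_number": f"PO-{po_number:05d}",
        "vendor": rng.choice(VENDORS),
        "lines": lines,
        "total": round(sum(line["line_total"] for line in lines), 2),
    }


def to_csv(orders):
    """Flatten orders to one CSV row per line item (the ground-truth table)."""
    buf = io.StringIO()
    writer = csv.writer(buf)
    writer.writerow(["po_number", "vendor", "item", "qty", "unit_price", "line_total"])
    for po in orders:
        for line in po["lines"]:
            writer.writerow([po["po_number"], po["vendor"], line["item"],
                             line["qty"], line["unit_price"], line["line_total"]])
    return buf.getvalue()


rng = random.Random(42)  # fixed seed so the fictional data is reproducible
orders = [make_purchase_order(n, rng) for n in range(1, 6)]
table = to_csv(orders)
```

Each dict in `orders` holds what a template would need for one PDF, while `table` is the corresponding data to check extraction results against.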

submitted by /u/adhadse

Seeking Data Sets On Power Grids For Machine Learning Projects

Hi everyone,

I’m currently exploring machine learning applications related to power grids and am in search of relevant data sets. Specifically, I’m looking for any of the following:

Labeled Image Data: Images of power grid components such as distribution poles, power lines, substations, etc., that are labeled for machine learning models.
Failure Data: Information on failures or malfunctions within power grid elements, which could be used for predictive maintenance models.
Operational Data: Any data that captures the operational aspects of power grids, including load, demand, flow, etc. (not so much for generation).

For any dataset, the higher spatial/temporal resolution, the better, but I’m not too picky about that. I have already found some resources but I want to learn about any other datasets that might be out there, especially ones that might not be widely known. If you have or know of datasets that could fit these needs, could you please share them?

If you think that me sharing the datasets I found so far could make the post more informative, I would be happy to do that. Thanks in advance for your help!

submitted by /u/Zarashi00

Seeking Datasets For Cancer Research Project In The UK

I’m currently working on a cancer research project focusing on analyzing factors influencing cancer outcomes in the UK. As part of my project, I’m in need of datasets containing information related to cancer incidence, demographics, healthcare utilization, socioeconomic factors, environmental variables, and other relevant factors specific to the UK.

I was wondering if anyone in the community is aware of any websites or resources where I can find such datasets? Any leads or suggestions would be greatly appreciated.

submitted by /u/Blue-Croissant

Help Required In Opening Files Of A Dataset (.phys, .thermal, .pts, .ass Extensions)

We have received a dataset that consists of audio, visual, thermal, and physiological modalities. Upon exploring the dataset, we encountered some challenges in opening the following file types:

.phys with the physiological information
.thermal, .hist and .stat with the thermal information
.pts with the visual information
.ass with the auditory information

We have tried various approaches to open these files, but none has succeeded so far. We do not recognize these extensions. Please guide us on how to open files of these types.
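A common first step with unfamiliar extensions is to look at the first bytes of each file, since an extension alone does not identify a format. The sketch below is only a rough heuristic under stated assumptions: the function names `sniff` and `peek` and the 95% threshold are made up for illustration, and it says nothing about how this particular dataset was actually encoded.

```python
import string

# Byte values that count as "printable" ASCII (letters, digits, punctuation, whitespace).
PRINTABLE = set(string.printable.encode("ascii"))


def sniff(data):
    """Guess whether the leading bytes of a file look like text or binary."""
    if not data:
        return "empty"
    printable = sum(b in PRINTABLE for b in data)
    # Arbitrary threshold: mostly printable bytes suggests a text-based format.
    return "text" if printable / len(data) > 0.95 else "binary"


def peek(path, n=256):
    """Read the first n bytes of a file and return (guess, raw bytes)."""
    with open(path, "rb") as fh:
        head = fh.read(n)
    return sniff(head), head
```

If `peek` reports "text", the file can usually be inspected in any text editor to see its column layout. If it reports "binary", the raw bytes returned alongside the guess may contain a recognizable header to search for.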

submitted by /u/AnupKumarGupta_

Seeking Datasets: Construction Companies In India And UAE

Hello everyone, I’m currently working on a project focusing on the scope of construction companies in India and the UAE, and I’m in need of datasets containing information about the top construction companies in these regions. Specifically, I’m looking for datasets that include details such as the names of construction companies, their projects, number of employees, project duration, and any other relevant information. The dataset should cover the last 10 years to provide a comprehensive view of the industry’s scope and trends. I’ve searched various online platforms but haven’t been able to find suitable datasets. If anyone has access to or knows where I can find such datasets, I would greatly appreciate your help. Additionally, if you have any suggestions or advice on where to look, please feel free to share them. Thank you in advance for your assistance.

submitted by /u/Muhzin07

IMF Loan And Transaction Data Is Very Hard To Find

Hey there,

I’m pretty new to this sub and am having a not so easy time looking for a nice overview of loans (Stand-by Arrangements, Credit Tranche, Extended Fund Facility, Poverty Reduction and Growth Fund) from the IMF from 2000-2020. The IMF website is completely unhelpful, and for the years 2000-2006 I’ve been gathering the data from the appendixes of the annual reports. However, from 2007 onwards, the design and format changed, resulting in less information about loan extension, cancellation, augmentation, specific dates, etc. Does anyone happen to be aware of any database/dataset where this information can be found? Help would be greatly appreciated! Many thanks in advance 🙂

submitted by /u/Ok_Lettuce2987

[Dataset Request] Bizarre Datasets For Final Project Data Analysis

For my final project this semester I have to clean, summarize, and visualize a dataset. The professor provided datasets but since I’m graduating I kinda want to go out with a bang. So, any ideas for a very bizarre dataset that will cause my professor to question my sanity/thought process? Or at least things to look up on the interweb. Searching “bizarre datasets” has me questioning why the author thought said dataset is bizarre.

submitted by /u/zora833

Dataset Wanted: Country-Level Well-being & Wealth For Understanding The Role Of Job Quality/Opportunity As Development

Hey folks! 👋 I’m on a mission to find a dataset (or merged datasets) that covers all the possible details about a country’s wealth-at-work landscape (not only money). I’m talking productivity, workspace wealth (including happiness at work, quality of life), entrepreneurship opportunities (like successful startups and investment levels), and sustainability practices within each country’s companies.

Know of any datasets that cover these angles comprehensively? Your expertise would be invaluable!

In particular, the focus is on comparing Germany, Colombia, the US, and South Africa.

submitted by /u/jucajagu

Scenarios/walkthroughs Of Utilizing SQL On Datasets And Then Inputting Into Tableau?

Howdy folks,

I’m a data analyst with two years of experience and I’ve been job searching the last few weeks. I’m trying to find walkthroughs/scenarios where SQL is used on a data set to join different tables (or otherwise transform the data), and the result is then brought into Tableau and visualized.

I’m aware there are different data sets this could be done with, but I’m trying to find walkthroughs of it actually being done. Although SQL isn’t all that complex, I haven’t used it for a bit and I have much more experience in Tableau.

I’m trying to run through some scenarios/walkthroughs so I can get the hang of making all the queries/transformations in SQL/the database and then outputting the result into Tableau. I’ve already been using the search function, so please don’t ask me to just google it.

I’m just wondering if anyone here has seen a good dataset to do this on, or has a scenario they’ve worked through (like a video explainer/walkthrough), so I can get the hang of things and then use whatever dataset I want afterwards. I’d prefer this with PostgreSQL if possible, but it absolutely doesn’t need to be.
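For anyone after the general shape of that workflow, here is a minimal sketch under several assumptions. It uses Python's built-in SQLite rather than PostgreSQL so that it runs without a server. The `customers` and `orders` tables and their rows are invented. The final step writes a CSV string, which is one way to hand a query result to Tableau, since Tableau can read text files.

```python
import csv
import io
import sqlite3

# In-memory SQLite database standing in for PostgreSQL (an assumption for portability).
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, region TEXT);
CREATE TABLE orders (order_id INTEGER PRIMARY KEY, customer_id INTEGER, amount REAL);
INSERT INTO customers VALUES (1, 'North'), (2, 'South'), (3, 'North');
INSERT INTO orders VALUES (10, 1, 120.0), (11, 2, 80.0), (12, 1, 40.0), (13, 3, 60.0);
""")

# Join the two tables and aggregate to the grain the dashboard needs.
rows = conn.execute("""
SELECT c.region, COUNT(o.order_id) AS n_orders, SUM(o.amount) AS revenue
FROM customers AS c
JOIN orders AS o ON o.customer_id = c.customer_id
GROUP BY c.region
ORDER BY c.region
""").fetchall()

# Serialize the result with a header row; saving this text as a .csv file
# gives Tableau a flat table to connect to.
buf = io.StringIO()
writer = csv.writer(buf)
writer.writerow(["region", "n_orders", "revenue"])
writer.writerows(rows)
tableau_csv = buf.getvalue()
```

The same `SELECT` should carry over to PostgreSQL largely unchanged. With a real database, Tableau can also connect to it directly instead of going through a CSV export.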

Any direction would vastly help.

submitted by /u/WhatsTheAnswerDude

Does Anyone Know A Dataset Of European Railway Connections?

For a project at uni about community finding in a graph, I want to experiment with the railway connections graph and see whether stations are grouped into communities by country or something.

Do you know of any dataset of European train stations together with the other stations they’re connected to? I found datasets of stations but not connections.
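To make the target data shape concrete: a plain list of station pairs is enough for community finding. The sketch below assumes that edge-list format and uses invented station IDs (`DE1`, `FR1`, and so on), not real railway data. It applies one divisive step in the style of Girvan-Newman, which repeatedly removes the edge with the highest shortest-path betweenness until the graph falls apart. That is only one of several possible techniques.

```python
from collections import defaultdict, deque


def build_graph(edges):
    """Turn (station, station) pairs into an undirected adjacency map."""
    graph = defaultdict(set)
    for a, b in edges:
        graph[a].add(b)
        graph[b].add(a)
    return graph


def edge_betweenness(graph):
    """Shortest-path betweenness per undirected edge (Brandes-style BFS)."""
    score = defaultdict(float)
    for source in graph:
        dist = {source: 0}
        sigma = defaultdict(float)  # number of shortest paths from source
        sigma[source] = 1.0
        preds = defaultdict(list)
        order = []
        queue = deque([source])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in graph[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        delta = defaultdict(float)
        for w in reversed(order):  # accumulate dependencies back toward source
            for v in preds[w]:
                share = sigma[v] / sigma[w] * (1.0 + delta[w])
                score[frozenset((v, w))] += share
                delta[v] += share
    return score


def components(graph):
    """Connected components as a list of station sets."""
    seen, parts = set(), []
    for start in graph:
        if start in seen:
            continue
        part, stack = set(), [start]
        while stack:
            v = stack.pop()
            if v not in part:
                part.add(v)
                stack.extend(graph[v] - part)
        seen |= part
        parts.append(part)
    return parts


def split_once(edges):
    """Remove highest-betweenness edges until the graph gains a component."""
    graph = build_graph(edges)
    start = len(components(graph))
    while len(components(graph)) == start:
        score = edge_betweenness(graph)
        if not score:  # no edges left to remove
            break
        a, b = max(score, key=score.get)
        graph[a].discard(b)
        graph[b].discard(a)
    return components(graph)


# Made-up toy network: two tightly linked groups joined by one cross-border link.
EDGES = [("DE1", "DE2"), ("DE1", "DE3"), ("DE2", "DE3"),
         ("FR1", "FR2"), ("FR1", "FR3"), ("FR2", "FR3"),
         ("DE3", "FR1")]
communities = split_once(EDGES)
```

On this toy input the single cross-border link carries the most shortest paths, so removing it separates the `DE` stations from the `FR` stations. Whether real networks split along country lines is exactly the open question in the post.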

Thank you in advance!

submitted by /u/Gogani

Most Publicly Available Datasets Are Already Finalized In A Single Table. How Important Is Showing ‘Joins’ In An Entry Level Portfolio?

Hi guys,

I’m currently working on a data analysis portfolio for entry level jobs, and everyone always says that knowing SQL, and more specifically joins, is a very important skill to have and to demonstrate.

When obtaining datasets, whether from Kaggle, data publicly available from an official website, data extracted through APIs, or wherever you get your data from, the one thing I’ve noticed is that all the data is usually already put together in a single table. You can take that data and ‘clean’ it (making rows, columns, and values consistent prior to analysis, etc.) and so forth.

Few questions:

How can you demonstrate joins when most public datasets are already put together and finalized?
How important is showing joins in an entry level portfolio?
Is finding a ready dataset on Kaggle, for example, writing SQL queries to answer business-related questions (ex: what features are causing retention rates to decrease?), and then visualizing the results in Tableau good enough for entry level roles? Again, no joins would be used, since the datasets are usually already complete.
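One way around the single-table problem is to split a flat table into related tables yourself and then join them back, which shows both normalization and joins. The sketch below is a toy illustration under stated assumptions: the `flat` table, its columns, and its rows are invented, and SQLite (via Python's standard library) is used only so the example is self-contained.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Stand-in for a downloaded single-table dataset (made-up rows).
CREATE TABLE flat (order_id INTEGER, customer TEXT, city TEXT, amount REAL);
INSERT INTO flat VALUES
  (1, 'Ana', 'Lisbon', 30.0),
  (2, 'Ben', 'Oslo', 45.0),
  (3, 'Ana', 'Lisbon', 25.0);

-- Pull the repeated customer attributes out into their own table.
CREATE TABLE customers AS
  SELECT DISTINCT customer AS name, city FROM flat;

-- Keep only the order facts plus a key pointing back to customers.
CREATE TABLE orders AS
  SELECT f.order_id, c.rowid AS customer_id, f.amount
  FROM flat AS f
  JOIN customers AS c ON c.name = f.customer;
""")

# Joining the two derived tables should reproduce the original flat rows.
rebuilt = conn.execute("""
SELECT o.order_id, c.name, c.city, o.amount
FROM orders AS o
JOIN customers AS c ON c.rowid = o.customer_id
ORDER BY o.order_id
""").fetchall()

original = conn.execute("SELECT * FROM flat ORDER BY order_id").fetchall()
n_customers = conn.execute("SELECT COUNT(*) FROM customers").fetchone()[0]
```

Checking that `rebuilt` equals `original` confirms the split lost nothing. Combining two separate public datasets on a shared key, such as a country code or a date, is another common route to the same goal.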

Thanks for any help I can get, greatly appreciated!!

submitted by /u/believeinriven

Hi, Looking For Dataset For Crime Incident Reports With Geographic Information (New York), Arrest Records Dataset In New York And Crime Victimisation Survey Data

Hi, I urgently need three datasets: crime incident reports with geographic information, arrest records in New York, and crime victimisation survey data. The latter two should be JSON files and the first should be a CSV file. Can you please point me to resources where I can find these datasets?

submitted by /u/MalayaleeKL06