submitted by /u/cavedave
Category: Datatards
Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting; I’m interested. Maybe they know where the chix are. But what do they need it for? World domination?
I’m currently working on a cancer research project focusing on analyzing factors influencing cancer outcomes in the UK. As part of my project, I’m in need of datasets containing information related to cancer incidence, demographics, healthcare utilization, socioeconomic factors, environmental variables, and other relevant factors specific to the UK.
I was wondering if anyone in the community is aware of any websites or resources where I can find such datasets? Any leads or suggestions would be greatly appreciated.
submitted by /u/Blue-Croissant
I need datasets with audio files (.wav preferably) of the letters of the English alphabet being pronounced, for a speech processing project. Let me know if you know of any freely available datasets. Thank you!
submitted by /u/karthic2811
We have received a dataset that consists of audio, visual, thermal, and physiological modalities. Upon exploring the dataset, we encountered some challenges in opening the following file types:
.phys with the physiological information
.thermal, .hist and .stat with the thermal information
.pts with the visual information
.ass with the auditory information
We have attempted various approaches to open these files, but none has succeeded so far. We do not recognize these extensions or know which software produced them. Please guide us on how to open files with these extensions.
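One generic first step with unknown extensions is to look at the first bytes of each file and check whether they are plain text or match a well-known binary signature. The sketch below is only an illustration: the signature table is a small, non-exhaustive sample, the 95% printable-byte threshold is an arbitrary assumption, and the names `sniff_bytes` and `sniff_file` are hypothetical helpers, not part of any tool shipped with the dataset.

```python
# Small, illustrative table of well-known file signatures (not exhaustive).
MAGIC_NUMBERS = {
    b"\x89PNG\r\n\x1a\n": "PNG image",
    b"RIFF": "RIFF container (e.g. WAV or AVI)",
    b"PK\x03\x04": "ZIP archive",
    b"\x1f\x8b": "gzip stream",
    b"\x89HDF\r\n\x1a\n": "HDF5 file",
}

# Printable ASCII plus tab, line feed and carriage return.
TEXT_BYTES = set(range(32, 127)) | {9, 10, 13}


def sniff_bytes(head: bytes) -> str:
    """Guess a format from the first bytes of a file (heuristic only)."""
    if not head:
        return "empty file"
    for magic, name in MAGIC_NUMBERS.items():
        if head.startswith(magic):
            return name
    # Assumption: if nearly all bytes are printable, treat the file as text.
    printable = sum(byte in TEXT_BYTES for byte in head)
    if printable / len(head) > 0.95:
        return "probably plain text"
    return "unknown binary format"


def sniff_file(path: str, n: int = 64) -> str:
    """Read the first n bytes of the file at `path` and classify them."""
    with open(path, "rb") as handle:
        return sniff_bytes(handle.read(n))
```

If a file comes back as "probably plain text", opening it in a text editor is usually enough to see its layout. An unknown binary result suggests asking the dataset's authors for the format specification or the reader code.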
submitted by /u/AnupKumarGupta_
Hello everyone, I’m currently working on a project focusing on the scope of construction companies in India and the UAE, and I’m in need of datasets containing information about the top construction companies in these regions. Specifically, I’m looking for datasets that include details such as the names of construction companies, their projects, number of employees, project duration, and any other relevant information. The datasets should cover the last 10 years to provide a comprehensive view of the industry’s scope and trends. I’ve searched various online platforms but haven’t been able to find suitable datasets. If anyone has access to or knows where I can find such datasets, I would greatly appreciate your help. Additionally, if you have any suggestions or advice on where to look, please feel free to share them. Thank you in advance for your assistance.
submitted by /u/Muhzin07
Does anyone know where I can find datasets for the current and past seasons of the English Premier League?
submitted by /u/Beautiful-Area-5356
I am working on a research project in college which requires me to have access to chest x-ray datasets. I am working to optimize pre-trained AI models using a mix of private and public datasets. I would need only a few thousand images at most. Does anyone have any leads or suggestions for private datasets? TIA
submitted by /u/GingerKillerr
Hey there,
I’m pretty new to this sub and am having a hard time finding a good overview of IMF loans (Stand-by Arrangements, Credit Tranche, Extended Fund Facility, Poverty Reduction and Growth Fund) from 2000-2020. The IMF website is completely unhelpful, and for the years 2000-2006 I’ve been gathering the data from the appendixes of the annual reports. However, from 2007 onwards the design and format changed, resulting in less information about loan extension, cancellation, augmentation, specific dates, etc. Does anyone happen to know of a database or dataset where this information can be found? Help would be greatly appreciated! Many thanks in advance 🙂
submitted by /u/Ok_Lettuce2987
For my final project this semester I have to clean, summarize, and visualize a dataset. The professor provided datasets but since I’m graduating I kinda want to go out with a bang. So, any ideas for a very bizarre dataset that will cause my professor to question my sanity/thought process? Or at least things to look up on the interweb. Searching “bizarre datasets” has me questioning why the author thought said dataset is bizarre.
submitted by /u/zora833
Hey folks! 👋 I’m on a mission to find a dataset (or datasets that can be merged) covering as many details as possible about a country’s wealth-at-work landscape (not only money). I’m talking productivity, workplace well-being (including happiness at work and quality of life), entrepreneurship opportunities (like successful startups and investment levels), and sustainability practices within each country’s companies.
Know of any datasets that cover these angles comprehensively? Your expertise would be invaluable!
In particular, the focus is on comparing Germany, Colombia, the US and South Africa.
submitted by /u/jucajagu
Are there any datasets that contain audio files (.wav preferably) of spoken chess moves? I need them for a speech processing project. Thank you!
submitted by /u/karthic2811
Howdy folks,
I’m a data analyst with two years of experience and I’ve been job searching for the last few weeks. I’m trying to find walkthroughs or scenarios in which SQL is used to join different tables (or otherwise transform the data) and the result is then loaded into Tableau and visualized.
I’m aware there are different data sets this could be done with, but I’m looking for anywhere there are walkthroughs of this being done. Although SQL isn’t all that complex, I haven’t used it for a bit, and I have much more experience in Tableau.
I want to run through some scenarios so I can get the hang of making all the queries and transformations in SQL and then outputting the result into Tableau. I’ve already been using the search function, so please don’t ask me to just google it.
I’m just wondering if anyone here has seen a good dataset to do this on, or has worked through a scenario (like a video explainer or walkthrough) that I could follow, so I can then move on to whatever dataset I choose once I get the hang of things. I’d prefer this with PostgreSQL if possible, but it absolutely doesn’t need to be.
Any direction would help greatly.
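As a rough illustration of the workflow described above (join in SQL, then hand the result to a visualization tool), here is a minimal sketch. It uses Python's built-in `sqlite3` as a stand-in for PostgreSQL so that it runs without a server. The `customers` and `orders` tables and all their values are made up for the example, and the result is written as CSV text, which Tableau can open as a text file.

```python
import csv
import io
import sqlite3

# In-memory database with two hypothetical tables to join.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customers (customer_id INTEGER PRIMARY KEY, region TEXT);
    CREATE TABLE orders (
        order_id INTEGER PRIMARY KEY,
        customer_id INTEGER,
        amount REAL
    );
    INSERT INTO customers VALUES (1, 'North'), (2, 'South');
    INSERT INTO orders VALUES (10, 1, 25.0), (11, 1, 15.0), (12, 2, 40.0);
""")

# Join orders to customers and aggregate per region.
query = """
    SELECT c.region,
           COUNT(o.order_id) AS order_count,
           SUM(o.amount) AS total_amount
    FROM orders AS o
    JOIN customers AS c ON c.customer_id = o.customer_id
    GROUP BY c.region
    ORDER BY c.region
"""
cursor = conn.execute(query)
header = [column[0] for column in cursor.description]
rows = cursor.fetchall()

# Write the result as CSV; swap the buffer for a real file to save it.
buffer = io.StringIO()
writer = csv.writer(buffer)
writer.writerow(header)
writer.writerows(rows)
csv_text = buffer.getvalue()
```

The query itself is plain standard SQL, so it should carry over to PostgreSQL unchanged. Only the connection code would differ.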
submitted by /u/WhatsTheAnswerDude
For a project at uni about community finding in a graph, I wish to experiment with the railway connections graph, to see whether stations get grouped into communities by country or something similar.
Do you know of any dataset of European train stations that includes the other stations each one is connected to? I found datasets of stations but not of connections.
Thank you in advance!
submitted by /u/Gogani
Looking for a large dataset that has to do with gaming usage or gaming spending. Anything will do; I’m asking very broadly.
submitted by /u/yesvoid
Hi guys,
I’m currently working on a data analysis portfolio for entry-level jobs, and everyone always says that SQL, and more specifically joins, is a very important skill to know and to demonstrate.
When obtaining datasets, whether from Kaggle, from data publicly available on an official website, through APIs, or wherever you get your data, the one thing I’ve noticed is that all the data is usually already put together in a single table. You can take that data and ‘clean’ it (making rows, columns, and values consistent prior to analysis, etc.) and so forth.
A few questions:
How can you demonstrate joins when most public datasets are already put together and finalized?
How important is showing joins in an entry-level portfolio?
Is it good enough for entry-level roles to find a ready-made dataset on Kaggle, for example, write SQL queries to answer business-related questions (e.g. which features are causing retention rates to decrease?), and then visualize the results in Tableau? Again, no joins would be used, since the datasets are usually already complete.
Thanks for any help I can get, greatly appreciated!!
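On the first question, one possible approach is to split a flat table into related tables yourself and then join them back together. The sketch below shows that idea with Python's built-in `sqlite3`. The `flat_sales` table, its columns, and its values are invented for illustration, and splitting out a `stores` table is just one of many ways to normalize such data.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- A flat, already-joined table, like many public datasets.
    CREATE TABLE flat_sales (
        sale_id INTEGER PRIMARY KEY,
        store_name TEXT,
        store_city TEXT,
        amount REAL
    );
    INSERT INTO flat_sales VALUES
        (1, 'Alpha', 'Leeds', 10.0),
        (2, 'Alpha', 'Leeds', 20.0),
        (3, 'Beta', 'York', 5.0);

    -- Lookup table: one row per distinct store.
    CREATE TABLE stores (
        store_id INTEGER PRIMARY KEY,
        store_name TEXT,
        store_city TEXT
    );
    INSERT INTO stores (store_name, store_city)
        SELECT DISTINCT store_name, store_city FROM flat_sales;

    -- Sales table: store attributes replaced by a key into stores.
    CREATE TABLE sales (
        sale_id INTEGER PRIMARY KEY,
        store_id INTEGER REFERENCES stores (store_id),
        amount REAL
    );
    INSERT INTO sales (sale_id, store_id, amount)
        SELECT f.sale_id, s.store_id, f.amount
        FROM flat_sales AS f
        JOIN stores AS s
          ON s.store_name = f.store_name AND s.store_city = f.store_city;
""")

# The analysis query now needs a join to recover the city.
totals = conn.execute("""
    SELECT s.store_city, SUM(x.amount) AS total_amount
    FROM sales AS x
    JOIN stores AS s ON s.store_id = x.store_id
    GROUP BY s.store_city
    ORDER BY s.store_city
""").fetchall()
```

Here `totals` comes out as `[('Leeds', 30.0), ('York', 5.0)]`, the same answer the flat table would give, but reached through a join.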
submitted by /u/believeinriven
Hi, I urgently need 3 datasets: crime incident reports with geographic information, arrest records in New York, and crime victimisation survey data. The latter 2 should be JSON and the first should be a CSV file. Can you please point me to resources where I can find these datasets?
submitted by /u/MalayaleeKL06
Hello, I’m doing a Data Science bootcamp and for a student project I would like to pull data from Resident Advisor, the event platform.
Any idea how I could scrape the website https://ra.co/events/?
Thank you!
submitted by /u/SnooMacarons7531
Does anyone have CSV or Excel files for the Atlantic Keno lottery from the last 5 years?
submitted by /u/No_Adhesiveness7023
I wanted to do some personal research using current real estate data, but I’m surprised how difficult it is to find datasets to work with.
Does anyone know a good source where I can get real estate sales listing data in the U.S.?
submitted by /u/leapintoblue
Looking for a historical archive of California’s solar power incentive programs (specifically with the date each was enacted). This type of data is available for EV incentive programs in a nice format, and I’m looking for the same thing for solar power incentives in CA. The column names include: title, text (not important), enacted date (important), and expired date if applicable (important).
submitted by /u/UrAvgCollegeStudent
Book summary data from the sites below is available:
– Blinkist
– Shortform
– Instaread
– getAbstract
Data format: text + audio
Text is in EPUB and PDF format for each book. Audio is in MP3 format.
Last updated: March 2024
Update frequency: approximately every 2-3 months.
DM me for access.
submitted by /u/waqarHocain
I am in urgent need of an electric vehicle dataset for a project in which I am developing Tableau visualisation dashboards. I have searched Kaggle and various other sources, but what I found was not very useful. Please suggest some resources I should look into.
submitted by /u/Kingkong99999
I need to find a secondary dataset for analysis. I am most interested in evaluating burnout (or other occupational stressors) in American social workers. A different population of healthcare workers would be fine too! I’m having a hard time finding raw data, and when I do, it’s almost always too old to be relevant. Please help!!
submitted by /u/Deep_Instance2597
This seems like such a simple dataset to have, yet I can’t seem to find it. I’d like a dataset that gives me the “top trending searches” for a given date. Google seems to have one, but it appears to be limited to the last 30 days. I’d like one exactly like that but spanning a longer period (as long as possible).
submitted by /u/EmilianoyBeatriz
I am planning to use the NIS dataset (large separate files) and to load and combine the various files in R. I have rudimentary experience with R. Any help?
submitted by /u/cautionhope
I am looking for a dataset containing a large(!) number of animal sound audio files that I can use to train a generative model. It doesn’t matter which animal it is, as long as it makes a distinct sound (some birds make very short sounds that are hard to learn from). Any help would be appreciated!
submitted by /u/lubbby