Category: Datatards

Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?

Healthcare Mergers And Acquisitions

I’m trying to understand acquisitions and mergers in healthcare and ownership data; so far the resources I’m looking for which have some leads include a NYS DoH and CMS database. I also found a ‘Hospitalogy’ toolkit, but it seems to be more about data visualization and it’s only accessible by paying $250.

Does anyone know any open-source data that tracks this kind of info?

submitted by /u/wasacarpenter
[link] [comments]

Where Is The Spotify Sequential Skip Prediction Dataset?

Hi everyone,

I’m on the hunt for the Spotify Sequential Skip Prediction Challenge dataset. This dataset was part of a competition organized by Spotify, WSDM, and CrowdAI and focused on predicting whether users would skip or listen to the tracks they’re streamed. Unfortunately, it seems the dataset is no longer available on the official link.

Here’s a bit of background about the challenge and dataset:

Organizer: Spotify, WSDM, CrowdAI Dataset Size: Public part – ~130 million listening sessions; Challenge leaderboard – ~30 million listening sessions Features: User interactions, track metadata, acoustic features, etc. Task: Predict if users will skip tracks based on their session history Challenge Details: Challenge Overview

The dataset is crucial for my work on developing a recommender system for my start up.

If anyone has access to this dataset or knows where I can obtain it, I would greatly appreciate your help. This dataset would be incredibly beneficial for my research and development in the field of music recommender systems.

For more details on the challenge and dataset, here’s an overview page.

Thank you in advance!

submitted by /u/Elpiramidonus
[link] [comments]

Directory / Dataset Of Landscape Webcams?

I am looking for datasets / directories of webcams, mainly focused on landscape, cities, etc., not private (streaming/gaming) cams. Ideally this dataset would contain both the page where the image is embedded as well as the image url itself. Does anyone know where I can find this?

submitted by /u/j0nes2k
[link] [comments]

Help Finding Certain Data By Address

No idea where to post this, sorry!

I have a list of about 5,000 addresses. For each one, I want to know the census tract, the voting districts, the region (as defined by my city), and maybe more later on.

How can I set something up where I can match my list of addresses with a list of all addresses in my state (Ohio), cross-reference all of that other data, and have all of that information spit out for each address for me?

Really any way to make this process faster would be appreciated. I’ve found some files online from various government agencies but I’m not sure if they are all relevant or useful. What kind of file types am I looking for? I have some maps overlayed in Google Earth so I can look up addresses and find the information that way, but I’m not doing it one by one. I chatted with my IT guy but he’s part time and didn’t have any standout ideas at the time.

Thank you!

submitted by /u/countesscranberry
[link] [comments]

Does Anyone Know, Where To Find Lactate Test Datasets?

Hi everyone,

I’m seeking for some datasets about lactate tests. They should ideally include the following information: Lactate levels (preferably continuous or regular measurements), Heart rate, Respiratory rate, Other relevant physiological parameters (blood pressure, temperature, etc.), Contextual data (e.g., type of physical activity, duration, intensity)

I’m seeking for a I feel like I went through the whole www, but I just can’t find anything useful.

Does anyone have experience with this topic and can provide tips on where I might find such datasets? Or perhaps someone has access to relevant data and would be willing to share?

I would greatly appreciate any help and guidance Thank you in advance for your support!

Regards, algyier

submitted by /u/Electrical_Present73
[link] [comments]

Does Anybody Know A Place To Download The Rom Graphs Dataset?

Hey, I wanted to ask if someone knows a place the rom graphs dataset can be downloaded from.
I tried to search for it but I only found a german document which cites them.
Using the document I found the paper of the rom graphs “An Experimental Comparison of Four Graph Drawing Algorithms.” ( https://doi.org/10.1016/S0925-7721(96)00005-3 ).
In the paper they mention that the graph dataset can be downloaded from their ftp server ftp://infokit.dis.uniroma1.it/public/ but the domain does not resolve anymore.
So I wondered if anybody knows a place it can still be downloaded from.

submitted by /u/finanzbruch
[link] [comments]

Looking To Share Or Sell A Large Collection Of Stock Prices Stored In MySQL

I have gathered a large set of data that includes the prices of 10,286 different stocks, updated every minute since November 17, 2021. This data is organized and stored using MySQL.

I’m looking for advice on where I might be able to share or sell this data, especially to people who use such information for studying the stock market, building trading software, or conducting research.

Does anyone know of any places or communities where I could do this? Also, if you are interested in talking more about this data and possibly using it together, please let me know!

I’m excited to hear your ideas and talk more about this!

submitted by /u/ScienceNerd2023
[link] [comments]

Cycling Dataset For Different Positions On The Bike

Hey guys, I’m working on a project about cycling and need a dataset to help me out. It should have just two columns: the speed and the average power output of the cyclist maintaining that speed. I want data for different postures on the bike, like the drops or the hoods. Any help would be greatly appreciated! Thanks!

submitted by /u/Anass_Lpro
[link] [comments]

Where Can I ‘sell’ A Potent Dataset?

Hi guys ! Have quite a potent dataset that can be used to further research in the renewable energy sector. The data is from a facility where I’m a stakeholder, so I really don’t wanna put it up for free.

Any leads as to what would be a good website where I can put it up to be used for a small fee?

(Uni student here, so I need the extra income this may generate lol)

submitted by /u/Illustrious_Grass199
[link] [comments]

Need Help Scraping Text From Benefits Websites For AI Project (Python, BeautifulSoup, Selenium)

Hi everyone,

I’m currently taking a course on Python, and I’ve been learning web scraping with BeautifulSoup and Selenium. My situation is a bit unique and time-sensitive, so I’m reaching out to this amazing community for some assistance.

My wife and son are both disabled, and navigating through benefits websites to find the best solutions and information has become quite overwhelming. My goal is to scrape the text from a few key benefits websites and input this data into an AI system to help manage and sift through the information more effectively.

Despite my efforts, I’m still struggling to get the code right. I’m really keen to learn and understand how to do this properly, but given my circumstances, I could really use a bit of a jump start with some working code examples.

If anyone could provide a working script or point me in the right direction, especially using Python with BeautifulSoup or Selenium, I would be incredibly grateful. Here are a couple of specific websites I need to scrape:

https://www.service-public.fr/ However, the main body of content is under the ‘Practical sheets by theme’ drop down if you translate it to English. https://www.aide-sociale.fr/

If it’s easier to share a working code snippet for just one website, that’s perfectly fine too.

Thank you so much for taking the time to read this and for any help you can offer. I really appreciate it!

submitted by /u/myway_thehardway
[link] [comments]

I’m Looking For Koi Fish Dataset At Least 10000 Images

I have this thesis about koi fish counting and classification, the document was accepted, however, I find trouble finding the number of datasets required by our professor for the implementation part. Let’s say around 10,000 images of koi fish would suffice.

I appreciate any help that I can get, since my current dataset only ranges around 1200 which are already classified and annotated which I’ve sourced from Kaggle and Roboflow. Thanks. (I’ll be using YOLOv9 for the model to be trained)

P. S. Don’t mind the link.

submitted by /u/shhty
[link] [comments]

For Anyone Wanting US Weather Observation Station Data

You can find a list of observation station IDs accessible by US NWS API at https://demos.synopticdata.com/meta-lists/#networks

Idk if it’s just me and maybe it is but I had a bit of a hard time trying to find a master list of observation stations and their IDs accessible by the NWS API. I think the link above has most of them.

I only accidentally came across the one from Synoptic.

Not surprisingly I came across a lot of paid services and products but they all get their data from taxpayer funded sources anyway.

If anyone has other sources of free weather APIs or list of observation stations accessible by the NWS API, feel free to comment below. I know MADIS is another source but haven’t checked it out yet.

submitted by /u/Live-Machine4746
[link] [comments]

Data Set Request: CPU Specifications

Hi everyone,

I’m looking for a good CPU dataset – I’m mostly interested in base clock, turbo clock (broken down to single and multi core if possible) & TDP but the more info the better

I read that the Intel ARK database has a CSV that can be dumped from the Android app, but not managed to find a good source for AMD CPUs yet

submitted by /u/aw1cks
[link] [comments]

Data Set Request: Renewable Energy Projects In India

Hey everyone,

I’m looking for detailed datasets on several aspects of renewable energy projects in India:

Investment data in renewable energy projects, especially focusing on foreign investments. Broader economic impacts of these foreign investments on India’s economy. The effects of subsidies on renewable energy revenue.

Any pointers or links to these datasets would be greatly appreciated!

submitted by /u/Interesting_Cause826
[link] [comments]

Where To Find US Trademark Data (have Lookup Database, But Not Sure How To Get Aggregate Data Out)

Hi, I’m looking for granular US trademark data that includes the name of the company that filed the trademark (I’m trying to view summary statistics on trademark filed by company in the US).

I’ve tried: https://tmsearch.uspto.gov/search/search-resultsbut but can’t find out how to get the aggregate data out of this.

I’ve been told that this data should be publicly available, but am stumped on where to find it.

Does anyone have a data set that would have this data? Alternatively, does anyone know how to scrape data from this lookup above?

submitted by /u/Acrobatic_Stay_9221
[link] [comments]

Help Understanding A Dataset About Pancreatic Ductal Adenocarcinoma!

Hi everyone! I’m trying to understand this dataset: (https://www.cancerimagingarchive.net/collection/cptac-pda/) –> it says that the patients have pancreatic ductal adenocarcinoma, but once I downloaded the dataset, it is a complete MESS. It is not organized at all, there are not annotations, the DICOM files don’t make sense, and all the files say NA (which I’m assuming means negative assessment). I don’t have enough time to sit and try to reconstruct/annotate all these DICOM files and it’s honestly just not making sense to me. If anyone has any experience or understand what is going in this dataset it would be greatly appreciated! Thank you so much!

submitted by /u/DiyaRamakrishnan
[link] [comments]

I’m Seeking A Heart Disease Dataset For Training A Model

I’ve been trying to find a dataset of Cornary Artery disease patients, or any Cardiovascular disease that contains a few biomarkers info in the columns. I tried searching quite a lot of sites like kaggle,physionet etc and Its either unavailable or is locked behind a paywall (Im a research student). Is there any free medical datasets around that I can dig in? I’d be so grateful for your time help

submitted by /u/Linus_sex_tipz
[link] [comments]