Deaths Caused By Tsunami In Each Decade Since 1900s

I have been working on project related to tsunami and I wanted a graph on this but I am not able to find anything please help me

submitted by /u/Howl_Beast
[link] [comments]

0

Spotify Comprehensive Dataset On Revenue And Users

Hi Everyone,

Spotify Dataset contains the number of premium users, number of Ad-supported users and total monthly active users (MAUs), Total revenues and Cost of Revenue, Gross Profit.

Use Dataset : https://www.kaggle.com/datasets/mauryansshivam/spotify-revenue-expenses-and-its-premium-users

Listed Spotify Revenue, Expenses and Its Premium Users since 2017

submitted by /u/AsgardiansLoki
[link] [comments]

0

Building A Behavior Classifier For Flies: How To Approach The Problem?

Hey everyone,

I have a dataset that contains the positions of 9 flies for each frame. I want to build a behavior classifier based on this data, but I’m not sure how to approach the problem.

Sample: https://drive.google.com/file/d/1W960Z92f1im80o1l6FveWXBQI5883iRx/view?usp=sharing

My goal is to create an input that takes 9 rows at once, where each row represents the position of one fly, and then learn from it by finding distances between each body part of the flies with each other to determine whether they are touching, grooming, or avoiding.

Additionally, I would like to consider past frames while predicting current frame outputs. Does anyone have suggestions on how to approach this problem? Are there any similar models or approaches that already exist for this?

I’m open to using various machine learning models such as decision trees, support vector machines, or even deep learning models.

If you have any insights or resources that could help me get started, please let me know! Thanks in advance.

submitted by /u/SahilSingh2402
[link] [comments]

0

Creating A News Article Dataset Ready For Latent Semantic Analysis

Can someone please create a dataset of news articles about ‘USA-China relations’. The dataset must have 5-10 years worth of articles and contain the news article name, date and short description.

submitted by /u/raks1811
[link] [comments]

0

Data On Human Beings With Birth And Death Dates?

Hello data people!

I’m interested in any datasets of specific humans (identifying details not needed) but with their specific date of birth AND date of death, i.e. to be able to perform inference procedures on exact ages at death (but with specific dates).

Thank you in advance! I searched a bit but didn’t find exactly those specifics.

submitted by /u/ghabibi
[link] [comments]

0

Datasets For Mushroom Yield Prediction

Hello, everyone! May I know if anyone here has data set for mushroom yield production that includes temperature and humidity data? We need at least 1,500 data for our simulation as part of our capstone project. Thank you.

submitted by /u/Ill-Moose4794
[link] [comments]

0

117th U.S. Congress Bill Full Text Data Scrap?

I’m looking to scrap the full text for all the proposed bills from the 117th Congress. I want to run the data through NVIVO for content analysis. I tried just downloading all the texts individually from Congress.gov, but I am looking to have all 15,224 documents available for analysis so the one-by-one approach is really unrealistic. I haven’t been able to find this data in a pre-existing dataset, but any assistance would be greatly appreciated!

Of note, I have tried utilizing the Congress.gov API but I can’t figure out how to get all proposed texts. I then tried to run a python script in Google Collab, but I kept getting a “gaierror” error that I couldn’t resolve. I’ve also tried ProPublica and govtrack.us but I couldn’t find a bulk data download option — only a bulk data query for view. I would still have to individually download each bill.

Reference Python Script:

#I removed my API key for privacy purposes, but I assure you it was in the script when I ran it

import requests

import json

def get_bill_data(congress_number):

base_url = “https://api.govinfo.gov“

endpoint = “https://api.congress.gov/v3/bill/117/hr/1/text?api_key=DEMO_KEY”.format(congress_number)

api_key = “[SQUATTINGFOX_API_KEY]”

url = base_url + endpoint

headers = {

“X-API-KEY”: api_key,

“Content-Type”: “application/json”

}

response = requests.get(url, headers=headers)

if response.status_code == 200:

data = response.json()

return data

else:

print(“Error retrieving bill data. Status Code:”, response.status_code)

return None

def save_bill_data(data, output_file):

with open(output_file, ‘w’) as file:

json.dump(data, file)

congress_number = “117”

output_file = “bills_data.json”

bill_data = get_bill_data(congress_number)

if bill_data:

save_bill_data(bill_data, output_file)

print(“Bill data saved to”, output_file)

submitted by /u/squattingfox
[link] [comments]

0

OpenSpending.org Is Back Online Bringing More Transparency To The World 🌍 Rebuilt With PortalJS, The Open Data Portal Has Been Updated With New Features – Check It Out! [self-promotion]

submitted by /u/anuveya
[link] [comments]

0

Dataset With Tuition Of Private Education In Brazil (elementary Through High School) Over Time

I am trying to find a dataset with the cost of education in Brazil to look at the demand side effects of the fundef and fundeb program. I have already found the link above but I have not been able to extract the data. Has anyone some experience with that ?

submitted by /u/Cultural-Ad-2470
[link] [comments]

0

Anyone Here Work For One Of The Big Brokers?

Hey all, I am hoping to learn more about how the big data industry works. Like buying datasets, where you go to find them, how much they cost etc

I’d appreciate any advice or even just a direction to head in. I’ve spoken to Snowflake and Datarade already but they don’t have much insight on what kind of data is actually being purchased or why (apparently anyway)

submitted by /u/Crumbedsausage
[link] [comments]

0

How Bad Are STDs In Your City? Do You Part!

Interesting idea. Id give it a clap.

The original site is down. It says the data is from the CDC https://www.cdc.gov/std/statistics/default.htm?CDC_AA_refVal=https%3A%2F%2Fwww.cdc.gov%2Fstd%2Fstats%2Fdefault.htm if you are not on a mobile and can find the right source for the actual data please comment

submitted by /u/cavedave
[link] [comments]

0

Anyone Know If There’s A Way To Get NOAA Data With City Included?

I’ve got a couple NOAA datasets where lat/long we’re provided as well as the weather station names. But I cannot for the life of me get the lat/long converted to city (tried geopy, geopandas, and a slew of other things).

submitted by /u/sureshakerdood
[link] [comments]

0

Looking For: Dataset (Gender) Bias Detection

Hello everyone!

I am planning to test an AI language model for bias – using SHAP and a lexical analyzer – and thus need a dataset I can feed into it. My preferred bias would be gender bias, e.g., a set of statements that are classified as either biased or not biased. However, if such a dataset does not exist I am open for other suggestions or ways how to create such a dataset manually/with AI support.

I am really grateful for any hints/help!

Cheers

submitted by /u/obeseelk
[link] [comments]

0

Need Help Sourcing A Dataset For Psychometrics Class!

I have been on the struggle bus trying to find a dataset on the Morse Fall Risk Assessment tool. I would love some assistance! Thank you all in advance!

submitted by /u/Trabes023
[link] [comments]

0

Looking For Historical Billboard Chart Data

Hi! I’m looking for as much billboard chart data as i can get my hands on – the historical weekly charts are accessible on their website but I was hoping someone in here knows or has access to the information in some sort of pre-organized fashion – going page by page and copy/pasting the each week is not ideal.

Thank you so much!

submitted by /u/wrentrivia
[link] [comments]

0

I Am A Newbie…can Anyone Help With Getting Me Datasets

Can anyone suggest datasets, or link them, i want to try training models, want them to practice for the first time to get to know how it works. i want datasets which can be a little fun to enjoy the output when it works

submitted by /u/Vansh1703
[link] [comments]

0

Any Way To Search For Similar Datasets?

I don’t know if anything exists like this but I have a data set that shows percent change year over year since 2016 in a specific industry. I took the USA GDP percent change over the same time period (just for curiosity) and there was somewhat of a correlation between the two datasets. My question is, are there any tools that search public data sets for similar percentage changes? I understand there is a high percentage of the correlation being “coincidence”, but does anything like this exist?

submitted by /u/lil_cheeks
[link] [comments]

0

Found Some More Simple Csv & Excel Datasets For Business Use Cases

I have been looking for simple public datasets in csv & excel format which does not require me to be a data analyst to draw some charts and understand my data. Usually many of the public ones are super complicated plus usually about public topics on health, economy etc. Found some interesting ones on Kaggle already, but just discovered Maven openly available datasets. Here it is https://mavenanalytics.io/data-playground (I have not affiliation to them btw)
Also I find interesting data on Statista, but all is behind an expensive paywall. If you know of any similar but free or cheaper alternative, pls share. thank you 🙂

submitted by /u/andidia82
[link] [comments]

0

An Open-Source Replica Of FiveThirtyEight Data Portal With The New JavaScript Framework PortalJS | More Upgrades Coming Soon… [self-promotion]

submitted by /u/anuveya
[link] [comments]

0

Mental Health Misinformation Dataset

I’ve been searching for datasets regarding misinformation in mental health but I can’t find any. I understand that for ethical reasons they might not be public but even ones that I can fill out a form/ask an author for access to aren’t showing in my search results. I tried asking Chat GPT and Bard which both listed three datasets but when I simply search for those names nothing about them appears and LLMs don’t provide links to their “knowledge”.

Would be very happy if I got any leads.

submitted by /u/homebutnothome
[link] [comments]

0

Requesting An Images Dataset With Annotated Human Actions To Train Visual Description Model For Accessibility App

Hi everyone, I need help finding a dataset of images annotated with human actions [such as sitting+in-chair, working+on-laptop, etc.]. I found a model capable of generating such tags on Huggingface here, however I was unable to locate its source dataset.

Just for context, I am trying to create a fine-tuned ViT model, that incorporates as broad a set of visual tags as possible. My plan is to optimize this model for edge devices [using Quantization aware training + TFLite model conversion] and open-source the weights. Eventually, I am hoping this can be used for a broad range of visual search/tagging/QnA tasks. Currently, I am training the model on top 2500 Danbooru tags + MIT SUN indoor location tags.

An online demo of the model can be found here. If anyone has any suggestions regarding what other dataset/tags to add, or would like to help with the training efforts, please drop a line. I would really appreciate it.

[Disclosures: I am not affiliated in any way with any of the HuggingFace /Arxiv/Mit.edu links I posted here. The link to the online-demo is maintained by me, but there are no ads or anything else that procures me financial gain on it.]

submitted by /u/DisintegratingBo
[link] [comments]

0

Community-built Hospital Price Database Hits 400 Hospitals

submitted by /u/alecs-dolt
[link] [comments]

0

Where Can I Find Data About Scientific Papers?

Hi,
I am searching for a database of scientific papers, the bigger the better. Like the web of science papers, but I d like to have all the information of the paper; like the abstract text and the introduction (all the paper would be perfect). Information about the authors, academic affiliation, and sector where it has been published

submitted by /u/riegel_d
[link] [comments]

0

Which Dataset Is Good For Food Related Projects. I Want To Work On Inverse Cooking, But Am Confused If I Can Jump Into It Or Not

First of all, i wanted to work on Recipe 1M+ dataset. But i couldnt find it since all links seemed broken. So, now i need some alternatives to that dataset

submitted by /u/IntelligentUse5990
[link] [comments]

0

Dataset With Marking Of Sunflower Seeds

Hello! I am writing a graduation thesis on the topic of calculating the yield of a sunflower. Does anyone help and know where I can get a dataset with sunflower seed markings, or maybe someone has it?

submitted by /u/CardiologistFew3283
[link] [comments]

0

Trying To Find An ‘official’ Premier League Dataset, Covering Disciplinary Events (fouls + Yellow/red Cards) By Matchday

I’m looking for a dataset covering disciplinary events (fouls, yellow cards and red cards) by fixture, stadium and matchday (date) in the Premier League, ideally from season 1992/93 (though I expect this to be super unrealistic) up to 2022/23 (though any dataset covering around ten years would be great). this data is out there online in bits and pieces, but is fan-collected and not from official sources (which would be required for semi-formal research). has anyone had any luck with getting this kind of dataset before, or have any suggestions as to who I could contact? I emailed the premier league a few months ago but haven’t received a replythanks!!

submitted by /u/oof-oofs
[link] [comments]

0

3000 Microwave Ovens From Popular E-commerce Sites

I received about 3000 listings when I played around with the NPM package [ecommerce-scraper-js](https://www.npmjs.com/package/ecommerce-scraper-js). Here’s the resulting dataset, if you’re interested. I tried to get 1000 microwave oven listings from each website. But there are not always so many products in practice. There are:

– 480 listings from Amazon;

– 1000 listings from eBay;

– 180 listings from Google Shopping;

– 299 listings from The Home Depot;

– 1000 listings from Walmart.

In total, I received 2959 microwave oven listings.

With this parser, you can get any listings (or selected listing info). Check the docs for more detail, it’s elementary, like:

“`javascript

import { config, amazon, walmart, ebay, homeDepot, googleShopping } from “ecommerce-scraper-js”;

config.API_KEY = “your_api_key_from_serpApi”;

amazon.getListings().then(console.log);

walmart.getListings().then(console.log);

ebay.getListings().then(console.log);

homeDepot.getListings().then(console.log);

googleShopping.getListings().then(console.log);

“`

You can load the dataset from [Kaggle](https://www.kaggle.com/datasets/mykhailozub/3000-microwave-ovens-from-popular-e-commerce-sites)

submitted by /u/Character_Equal_2732
[link] [comments]

0

[self-promotion] 7500 Hotels From Airbnb, Booking, And Hotels.com

I made a hotel parser on JS (hotels-scraper-js) and checked for usefulness. Here’s the resulting dataset, if you’re interested. For tests, I chose 5 European capitals: Berlin, London, Madrid, Paris, and Rome — 500 hotels from each site for each city. (In theory, there should be 500, but there are not always so many free rooms on the selected dates in practice so the results may be slightly less). You can get the hotel data you need with this parser. Check the docs for more detail, it’s very simple, like:

“`javascript import { airbnb, booking, hotelsCom } from “hotels-scraper-js”;

airbnb.getHotels(“YOUR_SEARCH_PARAMS”).then(console.log); booking.getHotels(“YOUR_SEARCH_PARAMS”).then(console.log); hotelsCom.getHotels(“YOUR_SEARCH_PARAMS”).then(console.log); “`

You can load the dataset from Kaggle

submitted by /u/Character_Equal_2732
[link] [comments]

0

Public Dataset For People Climbing Over Fences/Walls

Hello kind strangers of reddit! I have been trying to find some public dataset for images of people climbing over fences/walls online but so far in vain. I couldn’t even find enough data that I could try to annotate it myself to create a dataset. Does any one have any suggestions or recommendations for me?

Edit: *spelling

submitted by /u/NaumanKhan93
[link] [comments]

0

Individual Anonymized Demographic Features From US Census?

Does the census give out anonymized individual data? Like a dataset where each row is an anonymous person and the columns are income / education / parcel / occupation / religion / age / race / is_homeowner / marital status / gender?

If not, where could I obtain a data set like this?

submitted by /u/BoofThatNug
[link] [comments]

0

Category: Datatards

Deaths Caused By Tsunami In Each Decade Since 1900s

Spotify Comprehensive Dataset On Revenue And Users

Building A Behavior Classifier For Flies: How To Approach The Problem?

Creating A News Article Dataset Ready For Latent Semantic Analysis

Data On Human Beings With Birth And Death Dates?

Datasets For Mushroom Yield Prediction

117th U.S. Congress Bill Full Text Data Scrap?

OpenSpending.org Is Back Online Bringing More Transparency To The World 🌍 Rebuilt With PortalJS, The Open Data Portal Has Been Updated With New Features – Check It Out! [self-promotion]

Dataset With Tuition Of Private Education In Brazil (elementary Through High School) Over Time

Anyone Here Work For One Of The Big Brokers?

How Bad Are STDs In Your City? Do You Part!

Anyone Know If There’s A Way To Get NOAA Data With City Included?

Looking For: Dataset (Gender) Bias Detection

Need Help Sourcing A Dataset For Psychometrics Class!

Looking For Historical Billboard Chart Data

I Am A Newbie…can Anyone Help With Getting Me Datasets

Any Way To Search For Similar Datasets?

Found Some More Simple Csv & Excel Datasets For Business Use Cases

An Open-Source Replica Of FiveThirtyEight Data Portal With The New JavaScript Framework PortalJS | More Upgrades Coming Soon… [self-promotion]

Mental Health Misinformation Dataset

Requesting An Images Dataset With Annotated Human Actions To Train Visual Description Model For Accessibility App

Community-built Hospital Price Database Hits 400 Hospitals

Where Can I Find Data About Scientific Papers?

Which Dataset Is Good For Food Related Projects. I Want To Work On Inverse Cooking, But Am Confused If I Can Jump Into It Or Not

Dataset With Marking Of Sunflower Seeds

Trying To Find An ‘official’ Premier League Dataset, Covering Disciplinary Events (fouls + Yellow/red Cards) By Matchday

3000 Microwave Ovens From Popular E-commerce Sites

[self-promotion] 7500 Hotels From Airbnb, Booking, And Hotels.com

Public Dataset For People Climbing Over Fences/Walls

Individual Anonymized Demographic Features From US Census?

Recent Posts

Recent Comments

18+ Content

Recent Posts

Recent Comments