Category: Datatards

Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?

Percentage Of Land Covered In Flood Water And Flood Water Volume Estimates By States (USA)

I’m trying to find some data on flooding frequency and averages for land covered by flood water annually by state. I can’t seem to find any information other than specific address searches. Is there a data set available that would give me any of the information below? – % of land affected by floods annually by state OR – amount of land covered in flood water annually by state – Water volume estimates for floods I searched everywhere and I can’t seem to find anything that fits the criteria.

submitted by /u/Possibl3DumbQuestion
[link] [comments]

Seeking Dataset To Train A Mental Health Treatment Chatbot

Hey fellow Redditors,
I hope you’re all doing well. I’m reaching out to this amazing community today with a request for assistance. I am currently working on developing a mental health treatment chatbot like Woebot, and I am in need of a suitable dataset to train it effectively.

To create an effective mental health treatment chatbot, it is essential to have a diverse and comprehensive dataset. This dataset should ideally include a wide range of mental health conditions, symptoms, treatment approaches, and relevant conversations between mental health professionals and patients. By training the chatbot on such a dataset, we can ensure that it is equipped with the knowledge and empathy necessary to provide meaningful support to users.
Therefore, I kindly request the assistance of this community in locating or providing a suitable dataset for training my mental health treatment chatbot. If you have access to any relevant resources or know of any existing datasets that could be utilized for this purpose, I would greatly appreciate your input.
Additionally, if you have any suggestions, advice, or experiences related to developing a mental health treatment chatbot, I would love to hear from you. Your insights could prove invaluable in shaping the direction of this project.

submitted by /u/Amans-r
[link] [comments]

Map Instances From Wikidata And DBPedia

Is there any way to map entities from Wikidata and DBPedia.There is a method to map property type using sparql queries (eg date of birth).But is there way to map instances of classes.Lets say Michael Jackson. So given url/id of Michael Jackson from WikiData I need to find the corresponding instance in DBPedia.Can someone help me with this?Please let me know if there anything ambiguous in the question.

submitted by /u/Designer_Ad_6525
[link] [comments]

Manufacturing Dataset For Time Series Classification

Hi,

i am looking for a dataset with specific traits:

Industrial manufacturing domain Sensor data (multivariate time series) The machine is performing different operations Ideally the data is labelled according to those operation so it can be used for time series classification Open source (for purpose of a thesis)

I know there are several repositories with Industrial Datasets, but I havent found one that fits these requirements. Maybe somebody has an idea.

Thank you.

submitted by /u/GetThere2023
[link] [comments]

Lokking For Datasets For Grocery Item Detection

Hi all, I’m trying to build a classifcation model for grocery items and was wondering if anyone would know where I could get labelled grocery item data? I’ve seen a couple on kaggle but they are usually labelled under classes (i.e fruit, vegetable, animal meat) rather than a specific item like broccoli, chicken breast etc.

submitted by /u/Black_God_Ho
[link] [comments]

Median Income By Zip Code And Year In US

Hey all! Anyone know where I can get median household income data by zipcode for like 20 years ago? Trying to calculate median household income based on where people lived when they were born (sample is 18-25 years old). Seems like the US census website only has current information, but I may not be looking in the right spot. Thanks!

submitted by /u/Neurotic-raccoon
[link] [comments]

Looking For Tracking Data For Rugby Union.

Hello everyone,

I’m hoping you all can help. I am looking for a Rugby tracking data set that shows the XY position of players on the field. I know some more things exist for football, both American and European but I am really struggling to find that information for Rugby.

Anything helps if you have an idea or no somewhere I should start my search. Please let me know.

submitted by /u/abrax55
[link] [comments]

[self-promotion] 13F, 10-Q, 10-K, And 8-K Reports + OpenFigi Ids Direct To Your Snowflake Instance

Last night Cybersyn added 13Fs and OpenFigi IDs to Snowflake Marketplace.
You can leverage 13Fs to track institutional investors’ securities holdings and OpenFigi IDs (financial instrument global identifiers) to facilitate easier mapping of securities across data sources.
This release builds on the 8-K, 10-K, & 10-Q reports and attached exhibits originally available in Cybersyn SEC Filings.

submitted by /u/aiatco2
[link] [comments]

Congressional Data, Preferably With Bills Introduced

I’d like a dataset with columns for the name of the bill introduced, date introduced, title, subject, number of co-sponsors, etc.

I want to analyze (in R) congressional action related to Taiwan, so I hope to get a dataset of bills from, say, the last 5-10 congresses and evaluate how many were passed, what share had bipartisan support, and temporal trends.

I’ve researched a couple options but have tun into problems with both:

ProPublicaR Congress API — I have the API working in R, but its functions return lists, the function it suggests to turn the output into a data frame returns an error: “no method for [function] applied to an object of class list”. I’m also unsure how comprehensive the data is from this source.

GovInfo bulk data — this site has data on congressional bills, but the bills come in individual XML files and I don’t know how to get those into R (and then into a format in which I can analyze the bills as I described above)

Thanks!

submitted by /u/Rude_Inside_4089
[link] [comments]

Is There An API Or Daily Dataset For Large, In-person Event Information?

I’m looking for a way to get up-to-date information about large, in-person events happening today or in the near future (hundreds to thousands of attendees), e.g. concerts, festivals/fairs, conferences, sports, etc.

Ideally, the dataset provides simple information, like the time the event starts & ends, and the location of the event. Events could be global, but would be best if it focused on US and/or English-speaking countries.

submitted by /u/coinclink
[link] [comments]

How Can I Find All Companies Of A Specific Category Residing Within My State?

Just a disclaimer, I have zero experience dealing with data and stuff, so please bear with me.

Let’s say I want a list of all plumbing companies in my state. I want the name of their company, e-mail address, phone number, and general location. If this is too much, just their e-mail address is fine. Currently, I’ve been going to each and every business’s website and copying and pasting their contact information and general location. The problem is that doing it this way is that it takes forever. I wonder if there is a better approach or tool I can use to save time and achieve the same goal. Please let me know, thank you.

submitted by /u/Jolivsant
[link] [comments]

Need Help Finding Datasets About European Funds

Hello!

I am writing my master thesis in finance and need to find datasets. Preferably i need information for the last ~ 30 years about mutual fund performance, size and age. Any other information about them is also valued. I am hoping to find a large dataset containing funds from different countries, hopefully withouth having to gather each fund individually. Mainly i am interested in EU/Nordic funds. Can anyone help/ point me in the right direction?

My school gives me access to:

Compustat

CRSP

Bloomberg terminals

(possibly others, so please suggest and i can check)

But I have not been trained in using these at all. Any guides to using these databases or direct help is extremely appriciated!

submitted by /u/J-Stonks
[link] [comments]

Seeking Comprehensive Rugby Datasets Ahead Of The Rugby World Cup

With the Rugby World Cup just around the corner, I’m diving deep into the world of rugby analytics and data. I’m on the hunt for extensive datasets that encompass:

Team Information: Detailed profiles, players, historical matches, and any notable events. Player Information: Career statistics, past games played, performance metrics, and other relevant statistics. Unique Insights: Unconventional data or any other cool tidbits related to rugby.

While I’ve stumbled upon a dataset on Kaggle detailing the International Rugby Union results from 1871-2023, I’m eager to explore more comprehensive and in-depth datasets.

If anyone has come across any resource or can point me in the right direction, I’d be immensely grateful. Let’s gear up for an informed Rugby World Cup experience!

submitted by /u/Snorkel_26
[link] [comments]