Category: Datatards

Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?

Median Income By Zip Code And Year In US

Hey all! Anyone know where I can get median household income data by zipcode for like 20 years ago? Trying to calculate median household income based on where people lived when they were born (sample is 18-25 years old). Seems like the US census website only has current information, but I may not be looking in the right spot. Thanks!

submitted by /u/Neurotic-raccoon
[link] [comments]

Looking For Tracking Data For Rugby Union.

Hello everyone,

I’m hoping you all can help. I am looking for a Rugby tracking data set that shows the XY position of players on the field. I know some more things exist for football, both American and European but I am really struggling to find that information for Rugby.

Anything helps if you have an idea or no somewhere I should start my search. Please let me know.

submitted by /u/abrax55
[link] [comments]

[self-promotion] 13F, 10-Q, 10-K, And 8-K Reports + OpenFigi Ids Direct To Your Snowflake Instance

Last night Cybersyn added 13Fs and OpenFigi IDs to Snowflake Marketplace.
You can leverage 13Fs to track institutional investors’ securities holdings and OpenFigi IDs (financial instrument global identifiers) to facilitate easier mapping of securities across data sources.
This release builds on the 8-K, 10-K, & 10-Q reports and attached exhibits originally available in Cybersyn SEC Filings.

submitted by /u/aiatco2
[link] [comments]

Congressional Data, Preferably With Bills Introduced

I’d like a dataset with columns for the name of the bill introduced, date introduced, title, subject, number of co-sponsors, etc.

I want to analyze (in R) congressional action related to Taiwan, so I hope to get a dataset of bills from, say, the last 5-10 congresses and evaluate how many were passed, what share had bipartisan support, and temporal trends.

I’ve researched a couple options but have tun into problems with both:

ProPublicaR Congress API — I have the API working in R, but its functions return lists, the function it suggests to turn the output into a data frame returns an error: “no method for [function] applied to an object of class list”. I’m also unsure how comprehensive the data is from this source.

GovInfo bulk data — this site has data on congressional bills, but the bills come in individual XML files and I don’t know how to get those into R (and then into a format in which I can analyze the bills as I described above)

Thanks!

submitted by /u/Rude_Inside_4089
[link] [comments]

Is There An API Or Daily Dataset For Large, In-person Event Information?

I’m looking for a way to get up-to-date information about large, in-person events happening today or in the near future (hundreds to thousands of attendees), e.g. concerts, festivals/fairs, conferences, sports, etc.

Ideally, the dataset provides simple information, like the time the event starts & ends, and the location of the event. Events could be global, but would be best if it focused on US and/or English-speaking countries.

submitted by /u/coinclink
[link] [comments]

How Can I Find All Companies Of A Specific Category Residing Within My State?

Just a disclaimer, I have zero experience dealing with data and stuff, so please bear with me.

Let’s say I want a list of all plumbing companies in my state. I want the name of their company, e-mail address, phone number, and general location. If this is too much, just their e-mail address is fine. Currently, I’ve been going to each and every business’s website and copying and pasting their contact information and general location. The problem is that doing it this way is that it takes forever. I wonder if there is a better approach or tool I can use to save time and achieve the same goal. Please let me know, thank you.

submitted by /u/Jolivsant
[link] [comments]

Need Help Finding Datasets About European Funds

Hello!

I am writing my master thesis in finance and need to find datasets. Preferably i need information for the last ~ 30 years about mutual fund performance, size and age. Any other information about them is also valued. I am hoping to find a large dataset containing funds from different countries, hopefully withouth having to gather each fund individually. Mainly i am interested in EU/Nordic funds. Can anyone help/ point me in the right direction?

My school gives me access to:

Compustat

CRSP

Bloomberg terminals

(possibly others, so please suggest and i can check)

But I have not been trained in using these at all. Any guides to using these databases or direct help is extremely appriciated!

submitted by /u/J-Stonks
[link] [comments]

Seeking Comprehensive Rugby Datasets Ahead Of The Rugby World Cup

With the Rugby World Cup just around the corner, I’m diving deep into the world of rugby analytics and data. I’m on the hunt for extensive datasets that encompass:

Team Information: Detailed profiles, players, historical matches, and any notable events. Player Information: Career statistics, past games played, performance metrics, and other relevant statistics. Unique Insights: Unconventional data or any other cool tidbits related to rugby.

While I’ve stumbled upon a dataset on Kaggle detailing the International Rugby Union results from 1871-2023, I’m eager to explore more comprehensive and in-depth datasets.

If anyone has come across any resource or can point me in the right direction, I’d be immensely grateful. Let’s gear up for an informed Rugby World Cup experience!

submitted by /u/Snorkel_26
[link] [comments]

Help Finding A Dataset With WWII Deaths Over Time

I am trying to find data on the number of casualties over time during World War 2: how are deaths distributed over the course of the war? The closest data I could find is for Italy only, but I am interested in the combined, world-wide deaths over time.

Ideally, I am looking for the number of deaths per month over the course of the war. It would be less ideal, but still ok, to have data at lower frequency.

Does anyone know if there is such data somewhere? If not, I could estimate these numbers by calculating the excess deaths over that time period. Any thoughts on that? Thanks!

submitted by /u/matmerda
[link] [comments]

Best Place To Find Data On Real Estate Transactions In Arizona?

Hi r/datasets! I’m looking for an AZ real estate dataset from recent years that contains any or all of the following attributes:

Price: The selling or listing price of the property. Size: Total square footage or square meters of the property. Bedrooms: Number of bedrooms. Bathrooms: Number of bathrooms. Property Type: e.g., Single-family, condo, townhouse. Year Built: The year the property was constructed. City: City where the property is located. ZIP Code: ZIP or postal code of the property. Days on Market: Number of days the property has been listed on the market.

Is scraping Zillow the best option? Would appreciate any advice, thanks!

submitted by /u/abc1203218
[link] [comments]