Category: Datatards

Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?

How Can I Find All Companies Of A Specific Category Residing Within My State?

Just a disclaimer, I have zero experience dealing with data and stuff, so please bear with me.

Let’s say I want a list of all plumbing companies in my state. I want the name of their company, e-mail address, phone number, and general location. If this is too much, just their e-mail address is fine. Currently, I’ve been going to each and every business’s website and copying and pasting their contact information and general location. The problem is that doing it this way is that it takes forever. I wonder if there is a better approach or tool I can use to save time and achieve the same goal. Please let me know, thank you.

submitted by /u/Jolivsant
[link] [comments]

Need Help Finding Datasets About European Funds

Hello!

I am writing my master thesis in finance and need to find datasets. Preferably i need information for the last ~ 30 years about mutual fund performance, size and age. Any other information about them is also valued. I am hoping to find a large dataset containing funds from different countries, hopefully withouth having to gather each fund individually. Mainly i am interested in EU/Nordic funds. Can anyone help/ point me in the right direction?

My school gives me access to:

Compustat

CRSP

Bloomberg terminals

(possibly others, so please suggest and i can check)

But I have not been trained in using these at all. Any guides to using these databases or direct help is extremely appriciated!

submitted by /u/J-Stonks
[link] [comments]

Seeking Comprehensive Rugby Datasets Ahead Of The Rugby World Cup

With the Rugby World Cup just around the corner, I’m diving deep into the world of rugby analytics and data. I’m on the hunt for extensive datasets that encompass:

Team Information: Detailed profiles, players, historical matches, and any notable events. Player Information: Career statistics, past games played, performance metrics, and other relevant statistics. Unique Insights: Unconventional data or any other cool tidbits related to rugby.

While I’ve stumbled upon a dataset on Kaggle detailing the International Rugby Union results from 1871-2023, I’m eager to explore more comprehensive and in-depth datasets.

If anyone has come across any resource or can point me in the right direction, I’d be immensely grateful. Let’s gear up for an informed Rugby World Cup experience!

submitted by /u/Snorkel_26
[link] [comments]

Help Finding A Dataset With WWII Deaths Over Time

I am trying to find data on the number of casualties over time during World War 2: how are deaths distributed over the course of the war? The closest data I could find is for Italy only, but I am interested in the combined, world-wide deaths over time.

Ideally, I am looking for the number of deaths per month over the course of the war. It would be less ideal, but still ok, to have data at lower frequency.

Does anyone know if there is such data somewhere? If not, I could estimate these numbers by calculating the excess deaths over that time period. Any thoughts on that? Thanks!

submitted by /u/matmerda
[link] [comments]

Best Place To Find Data On Real Estate Transactions In Arizona?

Hi r/datasets! I’m looking for an AZ real estate dataset from recent years that contains any or all of the following attributes:

Price: The selling or listing price of the property. Size: Total square footage or square meters of the property. Bedrooms: Number of bedrooms. Bathrooms: Number of bathrooms. Property Type: e.g., Single-family, condo, townhouse. Year Built: The year the property was constructed. City: City where the property is located. ZIP Code: ZIP or postal code of the property. Days on Market: Number of days the property has been listed on the market.

Is scraping Zillow the best option? Would appreciate any advice, thanks!

submitted by /u/abc1203218
[link] [comments]

Looking For Dyslexia Reading Speed And Letter Mix Up Dataset

I’m currently searching for a dataset on the reading speed of persons with dyslexia. I try to find out what letters or letter combinations cause the most problems during reading.

Ideal would be a dataset of a text that has been copied by dyslexic people. (so source text and the same text written by multiple dyslexic people) or a dataset with sample sentences and time required to read them.

I know this is very specific, so suggestions on alternative data sources that I might infer this information from are also very welcome!

submitted by /u/X99p
[link] [comments]

[self-promotion] Kaggle: 16,000+ LinkedIn Job Postings From Last Week

Hello everyone, to pass time during my extra long summer break before starting college I decided to learn SQL through scraping and storing data from LinkedIn. Yesterday, I dumped all the data I collected to Kaggle in a csv format. It contains 27 columns in addition to several detached files containing info such as the benefits, industries, skills associated with each job (that’s right, I discovered what data table normalization is). There’s also a separate folder containing company information (name, desciption, size, employee_count, follower_count, industries).

I plan to run the collection script again next month, allowing for further analysis of trends such as company growth, salary changes, and job demand. Also if anyone wants, I can potentially share the scraper code on GitHub, although keep in mind you may get banned (especially with new accounts).

These are the columns of the main file:

[‘job_id’, ‘company_id’, ‘title’, ‘description’, ‘max_salary’, ‘med_salary’, ‘min_salary’, ‘pay_period’, ‘formatted_work_type’, ‘location’, ‘applies’, ‘original_listed_time’, ‘remote_allowed’, ‘views’,’job_posting_url’, ‘application_url’, ‘application_type’, ‘expiry’, ‘closed_time’, ‘formatted_experience_level’, ‘skills_desc’, ‘listed_time’, ‘posting_domain’, ‘sponsored’, ‘work_type’, ‘currency’, ‘compensation_type’]

Here’s the link to the dataset:

https://www.kaggle.com/datasets/arshkon/linkedin-job-postings

submitted by /u/Armi2
[link] [comments]

Complete Noob Requests Help With BLS Data

Hello all,

I come to you after 100s of google searches, 10s of hours spent squinting at my computer screen, and 1 near breakdown.

Basically, I’m trying to get demographics based on job types. For example, I’d like to know the average age, gender, income, and education level for real estate brokers in the U.S. I *think* the BLS has this data, but I have no idea how to find it. I would be eternally grateful if someone could point me in the right direction.

submitted by /u/starlit_ren
[link] [comments]

M&A Deal Premium On Refinitiv Eikon?

I am currently doing my Master thesis and this could be a huge help if someone could help me out. What is the deal premium called on Eikon Screener because I can’t find it in “add columns” section. Is it Price Premium. I am also trying to map ESG scores of target companies and financials. Should I use the PermID for Target? Pls help, this is kind of urgent especially if you’ve experience with this, pls pls help!

submitted by /u/Thick_Sun2297
[link] [comments]

[self-promotion] Indeed Dataset 730k Records

I’ve got a scraped job postings dataset from Indeed (US). The data is updated daily, with roughly 5-10k new records new every day. The dataset has all the fields in the job offer. Title, description, salary, urgently hiring, etc. Data goes back to early this year.

I can offer it all bulk (as of today) or subscription to you if you’re interested in updates.

submitted by /u/conjecturer_
[link] [comments]

Parking Lots Anomalous Activities Video Dataset

Hey Guys,I am currently working on a project in which I will need a dataset of video clips in parking lots which are anotated with activities being done in that parking lot by humans both normal and anomalous like fighting ,car accidents and others,I would be very grateful if someone could suggest me such a dataset or at least tell me which ones contain such video clips so I can filter through those datasets, i have heard of the UFC crime dataset but it contains many diverse situations and I don’t know if there are any parking lot video clips in that one, thanks in advance for any help!

submitted by /u/Demonking6444
[link] [comments]