Category: Datatards

Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting; I’m interested. Maybe they know where the chix are. But what do they need it for? World domination?

Help Merging Time Series Data Set [voltages]

Hey everyone, I am trying to work on a project. I have three datasets:

Dataset 1: Machine voltage varying over a period of time (continuous, 40,000 rows).
Dataset 2: Machine runtime, downtime and faults (also continuous, 8,000 rows).
Dataset 3: Machine degree of fault, an integer variable that varies between 1 and 3. It is not exactly continuous: it states the time the alarm was triggered and identifies the degree of the machine fault. About 2,000 rows.

How should I combine these datasets for analysis? I would like to find a relationship between voltage and degree of fault.

The end goal is optimizing the machine to minimize machine downtime. One approach is predictive maintenance/forecasting but other approaches are being considered too.
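One possible starting point, as a rough sketch only: attach to each alarm the most recent voltage reading at or before it (an "as-of" join), then compare voltages across fault degrees. The column layout, the sample values and the 5-minute tolerance below are all assumptions, since the post does not show the actual data. With pandas, `pandas.merge_asof` does the same kind of join; the version here uses only the standard library.

```python
from bisect import bisect_right
from datetime import datetime, timedelta


def asof_join(alarms, voltages, tolerance):
    """Attach to each alarm the latest voltage reading at or before it.

    alarms:    list of (timestamp, degree) tuples
    voltages:  list of (timestamp, volts) tuples, sorted by timestamp
    tolerance: timedelta; readings older than this count as missing
    """
    times = [t for t, _ in voltages]
    joined = []
    for alarm_time, degree in alarms:
        i = bisect_right(times, alarm_time) - 1  # index of last reading <= alarm_time
        volts = None
        if i >= 0 and alarm_time - times[i] <= tolerance:
            volts = voltages[i][1]
        joined.append((alarm_time, degree, volts))
    return joined


# Hypothetical sample data, not taken from the post.
t0 = datetime(2024, 1, 1, 8, 0)
voltages = [
    (t0 + timedelta(minutes=m), v)
    for m, v in [(0, 229.8), (1, 230.4), (2, 241.9), (3, 243.2)]
]
alarms = [
    (t0 + timedelta(minutes=2, seconds=30), 3),  # falls between two readings
    (t0 + timedelta(minutes=30), 1),             # no recent reading available
]

rows = asof_join(alarms, voltages, tolerance=timedelta(minutes=5))
for alarm_time, degree, volts in rows:
    print(alarm_time, degree, volts)
```

The tolerance matters: without it, an alarm raised long after the last reading would silently inherit a stale voltage. The runtime/downtime dataset could be joined onto the same rows in the same way.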

Edit: Changed flair

submitted by /u/maskedhypocriter

Looking For Student Feedback Dataset

Hey!
I’m looking for a dataset that has student feedback about faculty. I’ve looked at Kaggle and HuggingFace and found some datasets there.

I wanted to know if there are more places I can check to get more data. Ideally, the dataset should have more than 5k rows.

submitted by /u/shobhitnagpal

Looking For Disability Rates By State

I am trying to find simple disability rates by state from 1998 to now. This website basically has the information I want https://www.disabilitystatistics.org/, but there is not a way to download all the data.

I have looked into grabbing it from the American Community Survey (which is where the website gets the information). I know how to pull this ACS table (https://data.census.gov/table/ACSST1Y2021.S1810), but it is so messy and hard to work with that I wanted to see if anyone knew of a centralized location before I jumped into dealing with the ACS data.

submitted by /u/jyddyj20

Question / Observation About The Group

So I’m new to this group. I’ve worked in corporate “big data” and “data warehousing” for almost 20 years. I also own a small business that sells the CMS/NPPES NPI database in the most popular database flavors.

My observation is as follows…it seems to me that a good portion of the requests in this group are for a very specific set of data.

I’m not an academic, but how do you end up deciding what you want your thesis to be before figuring out if the data is even available?

Many of the assistance requests are for data that sits behind corporate policy, and the industry and its corporations are not going to let it become public.

Another example is medical data, which is protected by HIPAA in the US.

Pardon my wording, but is this a “cart before the horse” situation?

submitted by /u/j_w_g_1

Contact For Metropolitan Police Dataset

I saw on the London Datastore that they released a dataset called MPS Antisocial. I want to contact them to see if they can make a similar one for MPS Violence Offences, or link me to an earlier version of MPS Antisocial. The page says the data started in July 2022, but the dataset contains only 2023 data (unless they meant 2023).

submitted by /u/infinity123248

Scraping Data On Lobbying By US Corporations

Hi,

Can someone help with scraping data? Unfortunately I don’t have the skills to do that. I want to create a dataset of US corporations’ expenditures on lobbying, for each available year.

Example: https://www.opensecrets.org/federal-lobbying/clients/summary?id=D000023883

Here is Amazon’s total expenditure on lobbying in 2023. You can search for any other company that participates in lobbying. I guess there are more sources for such data. If someone can help me collect this data, it will be highly appreciated. Thanks!

submitted by /u/Porcoddio45

Dataset Containing Federal Criminal Charge Labels And Reference Data

I am looking for a list of federal charges that I can use as reference data when extracting mentions of said charges from unstructured text. For example, such a list would include things like:

Possession with Intent to Distribute 50 Grams or More of Methamphetamine
Possession with Intent to Distribute 28 Grams or More of a Mixture or Substance Containing Cocaine Base
Possession with Intent to Distribute Cocaine
Possession with Intent to Distribute Heroin

I know I can get text extracts of the US Code, but what I am looking for is how I could detect something like “Possession with Intent to Distribute 50 Grams or More of Methamphetamine” in freeform text and then ideally crosswalk it over to a reference in the USC (example: “50 grams or more of methamphetamine, its salts, isomers, and salts of its isomers or 500 grams or more of a mixture or substance containing a detectable amount of methamphetamine, its salts, isomers, or salts of its isomers;”).
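A minimal sketch of the detection step, assuming a reference list already exists: normalize the text, slide a window of words over it, and score each window against each known charge with `difflib.SequenceMatcher`. The charge names come from the examples above; the citation values are placeholders, not real U.S. Code references, and the 0.9 threshold is an arbitrary assumption that would need tuning on real documents.

```python
import re
from difflib import SequenceMatcher

# Charge names are the examples from the post. The citation strings are
# placeholders for illustration only, NOT real U.S. Code references.
CHARGES = {
    "Possession with Intent to Distribute 50 Grams or More of Methamphetamine": "CITATION_PLACEHOLDER_1",
    "Possession with Intent to Distribute Cocaine": "CITATION_PLACEHOLDER_2",
    "Possession with Intent to Distribute Heroin": "CITATION_PLACEHOLDER_3",
}


def normalize(text):
    """Lowercase and collapse punctuation and whitespace to single spaces."""
    return re.sub(r"[^a-z0-9]+", " ", text.lower()).strip()


def find_charges(text, charges=CHARGES, threshold=0.9):
    """Return (charge, citation, score) for each charge detected in the text.

    For every charge, a window with the same number of words is slid over
    the text and compared to the charge using difflib's similarity ratio.
    """
    words = normalize(text).split()
    hits = []
    for charge, citation in charges.items():
        target = normalize(charge)
        n = len(target.split())
        best = 0.0
        for i in range(max(len(words) - n + 1, 0)):
            window = " ".join(words[i:i + n])
            best = max(best, SequenceMatcher(None, window, target).ratio())
        if best >= threshold:
            hits.append((charge, citation, round(best, 2)))
    return hits


# Hypothetical sentence with a deliberate misspelling ("posession").
sample = ("The defendant was indicted for posession with intent to "
          "distribute 50 grams or more of methamphetamine.")
matches = find_charges(sample)
print(matches)
```

Fuzzy scoring tolerates typos and punctuation differences, but near-identical charges (for example ones differing only in the drug name) score close to each other, so the threshold has to stay high and borderline hits are worth reviewing by hand.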

submitted by /u/thegrif

Are Digital Metrics Data Available Online?

Hey all!
I’m a marketing master’s student and for one of my assignments, I have to interpret the digital metrics of a certain campaign or company using GA4. The demo account offers only data from the Google Merchandise Store or Flood-It. I want to find more interesting campaign data that I can use!
I understand that most companies keep their metrics confidential, but is there any online resource that hosts digital metrics data from different companies for educational use?
I would really appreciate any help you can provide.

submitted by /u/obnoxiouschatterbox

Any Daily Datasets For London Related To Crime Or Similar

I really needed a daily count of violent crime in London but I don’t think it exists.

I decided to try other related datasets and see if there is a correlation when I aggregate them into months and check them against my monthly violent crime data. If one correlates well, I’ll use that dataset as a way to split the monthly violent crime counts into daily estimates.
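The splitting step described here can be sketched as a proportional allocation: each day receives a share of the monthly total equal to its share of the proxy series within that month. This is only one simple way to do it, and the numbers below are made up for illustration.

```python
from collections import defaultdict
from datetime import date


def disaggregate(monthly_totals, daily_proxy):
    """Split each monthly total across days in proportion to a daily proxy.

    monthly_totals: {(year, month): total}
    daily_proxy:    {date: non-negative proxy value}
    """
    # Sum the proxy within each month so daily shares can be computed.
    proxy_sum = defaultdict(float)
    for day, value in daily_proxy.items():
        proxy_sum[(day.year, day.month)] += value

    daily = {}
    for day, value in daily_proxy.items():
        key = (day.year, day.month)
        if key in monthly_totals and proxy_sum[key] > 0:
            daily[day] = monthly_totals[key] * value / proxy_sum[key]
    return daily


# Hypothetical numbers; a real proxy would cover every day of the month.
monthly_violent_crime = {(2023, 1): 300}
proxy = {
    date(2023, 1, 1): 10,
    date(2023, 1, 2): 30,
    date(2023, 1, 3): 20,
}

estimate = disaggregate(monthly_violent_crime, proxy)
for day, value in sorted(estimate.items()):
    print(day, value)
```

By construction the daily estimates add back up to the monthly total, but the day-to-day pattern is inherited entirely from the proxy, so the result is only as good as the correlation being assumed.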

Any dataset with a daily count of something in London that may correlate well with violent crime, or that comes from a related domain, would be appreciated.

Thank you

submitted by /u/infinity123248

Dataset For Past Insurance Claims Of Car

Hello everybody, I’m trying to create a model that detects damage on a car and estimates the repair cost, and I need data for the cost-estimation part. I need fields like car brand and model, damage area, damage severity, location, and claim amounts. It would be helpful if someone could point me to a dataset like this.

submitted by /u/Filthygamer11