Category: Datatards

Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?

Dataset For Predictive Maintenance On Raspberry PI

Hi all. I am doing a research for centralized management of raspberry pi with one control node (raspberry pi) with ansible for automation in terms of first-time boot configuration, updating the OS with customization layer on it and few additional configurations. So, with that, I was planning to have a AI model in ansible to gather the data from the raspberry pi with sensors to collect the data and lead to predictive maintenance. I can develop algorithms to train the AI model, but I need help on getting the dataset. I have done my search but couldn’t manage to get it. Seeking out here, perhaps to find anyone have dataset or have done similar projects or no. Thank you

submitted by /u/Agreeable_Choice9980
[link] [comments]

Monetizing A Curated Database Of Useful Information

What useful data/knowledge:

– do you either own or require, but is cumbersome to assemble/not easily accessible?

– has the potential to be curated & packaged for ease of access?

– requires real-time/regular updates (optional) ?

– can be put behind a paywall for either 1-time purchase or subscription (subs only if regular updates are required) ?

– is valuable to a significant and accessible audience who are willing to pay for it (either low price/high volume or high price/low volume)?

If you are in possession of such a holy grail, then let us combine our powers. You bring the data, I’ll bring the infrastructure.

submitted by /u/tapinda
[link] [comments]

Requesting Twitter Followers Dataset For University Project

Hello everyone,

I’m currently working on a university project that requires data on the Twitter followers of the top 50 currencies over the past two years. I believe this data could provide valuable insights for my research, and I’m hoping someone might be able to assist me in obtaining it.
The dataset could but does not need to include information about the followers, such as their profiles, activity, and any relevant engagement with the currency-related content. But the followers count over time would be enough as of right now.
Your help would be greatly appreciated, and it will contribute to advancing my academic work. If you have access to or can point me in the right direction to obtain this data, please feel free to reach out via DM or comment below.

Thank you.

submitted by /u/Ruffi-
[link] [comments]

YouTube Trending Videos Data (US) – Views, Comments, Likes, Etc

This spreadsheet contains data on trending YouTube videos in the US, including video details, channel info, view counts, likes, dislikes, and comments. Useful to analyze trends, understand audience engagement, and research various factors of video popularity. Also, it’s just generally interesting.

https://app.gigasheet.com/spreadsheet/Youtube-trending-videos-in-US/9e2691d7_a524_43c0_b825_604e60e97c91

submitted by /u/n1nja5h03s
[link] [comments]

US Consumer Sales Affected By COVID Data Set Help!

Hi everyone, I have to track down a large dataset for a class project to examine COVID’s effects on consumer shopping/sales in the US. I haven’t been able to find anything large enough or that included the required info. I’m at the point where I’ll literally take any industry/focus/anything, I just need help 🙁

Thank you in advance for pointing me in the right direction!!

submitted by /u/laceyreed
[link] [comments]

Looking For A Dataset On Online Privacy Concerns And How People Navigate Online Privacy Tools

Hey as part of some regulatory advisory work I’m doing on the ‘privacy paradox’ (In a nutshell: people opting in to allow sites to have personal information online despite wanting more control over their data). I’m interested in the relationship between behaviours online: e.g., if people adjust privacy settings, and the levels of concern that they might have over their data.
I’m looking for a dataset that I can run some regressions on. Haven’t had much success with finding anything recent. Would appreciate any pointers – best I’ve got so far is Pew from 2019

submitted by /u/WanderingATM
[link] [comments]

Can’t Find Datas To Investigate Discrimination

Hi, I would like to ask you for advice.
How could we analyze the evolution of discrimination in recent years with data? I’m talking about discrimination based on race, religion, sex, gender, sexual orientation, disability…
For now I have thought about some indices, such as gender equality and hate crimes, looking at the trend of recent years. But I can’t find much relevant data. Do you have any advice? Thanks in any case.

submitted by /u/Pozzascu
[link] [comments]

Panel On US Population By State (1980-now)

This should be the most basic of the basic. Still, I cannot find an appropriate dataset nor an actually solved post in this subreddit (unless I’m blind).

People refer to the US census, but that is survey data and doesn’t actually count the total population. There does appear to be some sort of count every 10 years. But is that really all? Should I just assume linear growth between each 10-year measuring point?

This seems weird for a country known for good data.

submitted by /u/AtkinsonStiglitz
[link] [comments]

Does Anybody Know How To Obtain DMV Data? We’re Fine With Purchasing If Need Be.

Hi all. I’m spinning up a data driven automotive startup and on top of what we already have DMV data would really take us to the next level. Specifically I’m thinking name of registrant, year, make, model, VIN number, mileage. It’s pretty clear that this is out there based on all the mailers tailored to the type of vehicles I have but where to request or purchase it is less so. Does anyone know where I could find something like this? We’re specific to Colorado and Georgia, if that helps.

submitted by /u/Tamalelulu
[link] [comments]

Need A 5 GB+ Structured Labelled Dataset For Machine Learning (regression Or Classification). No Time-series Data. Where Can I Find One?

Hi, I need a labelled structured dataset for building regression or classification models that’s more than 5 GB for my big data class. I looked over Kaggle and other places but can’t seem to find one. They are mostly time-series, image, or text datasets that I don’t want to use. May I know where can I find one?

submitted by /u/_aKiRa_26
[link] [comments]

Are There Datasets About Healthcare For Doing Regression?

Hello there, I’m doing a project about how to solve healthcare prediction problems (like regression or binary classification) with machine learning, specifically tree-based models.

I just can find binary classification problems (like, does this person have cancer or not), but any about predicting a numerical value.

Is there any dataset, preferibily educational, related with medicine/healthcare, whose target is numerical? Also whose relation between features and targets are not too simple like a linear one that with the right tools like XGBoosRegressor I can make good predictions (that is, that not all features are non-informative)?

Thanks so much.

submitted by /u/SameItem
[link] [comments]

UK GDP Time-series Data From 1970 Onwards

I am trying to find annual time series data for current and constant price GDP for the United Kingdom from 1970 to 2019. I am also looking for current price gross investment data for the same time period. Can anyone point me to resources or databases where one can find this data? The ONS website only hosts data from 1997 onwards, which is not particularly helpful.

submitted by /u/sankalpsharmaa
[link] [comments]

Parasocial Relationships, Maybe Social Media Interaction Anxiety

Hiya, I been trying to find a decent dataset that contains data about social media & affects on folks – whether they develop anxieties while online or suffer with anxieties & us online as an outlet, anything social media related inc gaming. Found loads of literature docs on line about that topic but finding it difficult to locate a dataset, this is for a end of year project – part time course – so very rusty with all things ML related after the summer. Tks

submitted by /u/How_thehell7799
[link] [comments]