Category: Datatards

Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?

Ml Sentiment Analysis Project For Mental Health Monitoring

Hi, so straight to the point me and my team chose a project idea for the machine learning course “social media mental health monitoring” basically Mental Health Monitoring: Collect data on social media posts, online forums, or surveys. Develop a sentiment analysis model to monitor and identify signs of mental health issues, and i think it’s gonna be fun and all but the first issue to face us is the lack of usable dataset, we looked into it alot but most of what we found was papers and sources and even the datasets we found (barely a handful) were not exactly aligned with how our project should go or unavailable, our professor told us she’d prefer a dataset that’s not from Kaggle for some reason, but I’d really appreciate it if someone could help me link a similar dataset that can be used for this project be it on kaggle or not and if there was a project implementation that’s close to what we’re trying to achieve here.
Thank you.

submitted by /u/_-_VIK_-_
[link] [comments]

[self-promotion] Issue With Dataset Promotion On Kaggle

I have what I think is an interesting and unique dataset with github accounts, but during its entire existence (20 days) there were only 90 views and not a single download. This is very strange considering that I have a dataset on a similar topic and it collected hundreds of downloads in a first week.

I wanted to know if this could be due to the fact that the dataset may contain user information (this information is available to all github users) or because I accidentally installed 6 tags when 5 were allowed (today I removed one).

Do you know any pitfalls in promoting datasets on kaggle that I should take into account?

submitted by /u/donBarbos
[link] [comments]

Comprehensive Criminal Sentencing Dataset

I am searching the internet for a comprehensive, case-by-case dataset describing criminal convictions and sentencing. I know that information for guilty convictions is made publically available, but I’m not sure if a case-by-case dataset has been aggregated in any form and made available to the public. Does anyone know of any existing sources for this information or have any suggestions for aggregating my own dataset from historical criminal/sentencing data?

submitted by /u/eastbay_jae
[link] [comments]

Can You Help Me Find Datasets For My Final Year Research Project Topic – “Android Malware Detection From User-generated Content – A Comparison Using CNN And NLP” Dataset”

Can you help me find datasets for my Final Year Research Project topic – “Android Malware Detection from User-generated content – A Comparison using CNN and NLP”. I am planning to use 2 machine learning techniques: CNN and NLP, for this comparative study. Please help me find datasets that have relevant variables, analysis and will be apt for a comparison.

submitted by /u/Silver_Hour_9963
[link] [comments]

Datasets On EU City Happiness Or Quality Of Life?

Hey there, people of Reddit! 🌍🏙️

I’m specifically interested in datasets focused on European cities that break down happiness or quality of life indicators. Can’t find anything. If you’ve stumbled upon such data or have any leads, please do share. Your insights could help shed light on some interesting trends! Thanks a million! 📊😄

#EuropeanCities #QualityOfLifeData #DataQuest

submitted by /u/BlueEurope
[link] [comments]

Do You Have Any Tips On Where I Can Find Data On The Airline Industry? I Need The Revenue Figures Of The Most Important And Largest Airlines In Each Respective Region (North America, Europe, Etc.). The Data From The Last Five Years Would Be Most Preferable.

Do you have any tips on where I can find data on the airline industry? I need the revenue figures of the most important and largest airlines in each respective region (North America, Europe, etc.). The data from the last five years would be most preferable.

submitted by /u/fndkkxnx
[link] [comments]

Dataset For Predictive Maintenance On Raspberry PI

Hi all. I am doing a research for centralized management of raspberry pi with one control node (raspberry pi) with ansible for automation in terms of first-time boot configuration, updating the OS with customization layer on it and few additional configurations. So, with that, I was planning to have a AI model in ansible to gather the data from the raspberry pi with sensors to collect the data and lead to predictive maintenance. I can develop algorithms to train the AI model, but I need help on getting the dataset. I have done my search but couldn’t manage to get it. Seeking out here, perhaps to find anyone have dataset or have done similar projects or no. Thank you

submitted by /u/Agreeable_Choice9980
[link] [comments]

Monetizing A Curated Database Of Useful Information

What useful data/knowledge:

– do you either own or require, but is cumbersome to assemble/not easily accessible?

– has the potential to be curated & packaged for ease of access?

– requires real-time/regular updates (optional) ?

– can be put behind a paywall for either 1-time purchase or subscription (subs only if regular updates are required) ?

– is valuable to a significant and accessible audience who are willing to pay for it (either low price/high volume or high price/low volume)?

If you are in possession of such a holy grail, then let us combine our powers. You bring the data, I’ll bring the infrastructure.

submitted by /u/tapinda
[link] [comments]

Requesting Twitter Followers Dataset For University Project

Hello everyone,

I’m currently working on a university project that requires data on the Twitter followers of the top 50 currencies over the past two years. I believe this data could provide valuable insights for my research, and I’m hoping someone might be able to assist me in obtaining it.
The dataset could but does not need to include information about the followers, such as their profiles, activity, and any relevant engagement with the currency-related content. But the followers count over time would be enough as of right now.
Your help would be greatly appreciated, and it will contribute to advancing my academic work. If you have access to or can point me in the right direction to obtain this data, please feel free to reach out via DM or comment below.

Thank you.

submitted by /u/Ruffi-
[link] [comments]

YouTube Trending Videos Data (US) – Views, Comments, Likes, Etc

This spreadsheet contains data on trending YouTube videos in the US, including video details, channel info, view counts, likes, dislikes, and comments. Useful to analyze trends, understand audience engagement, and research various factors of video popularity. Also, it’s just generally interesting.

https://app.gigasheet.com/spreadsheet/Youtube-trending-videos-in-US/9e2691d7_a524_43c0_b825_604e60e97c91

submitted by /u/n1nja5h03s
[link] [comments]

US Consumer Sales Affected By COVID Data Set Help!

Hi everyone, I have to track down a large dataset for a class project to examine COVID’s effects on consumer shopping/sales in the US. I haven’t been able to find anything large enough or that included the required info. I’m at the point where I’ll literally take any industry/focus/anything, I just need help 🙁

Thank you in advance for pointing me in the right direction!!

submitted by /u/laceyreed
[link] [comments]

Looking For A Dataset On Online Privacy Concerns And How People Navigate Online Privacy Tools

Hey as part of some regulatory advisory work I’m doing on the ‘privacy paradox’ (In a nutshell: people opting in to allow sites to have personal information online despite wanting more control over their data). I’m interested in the relationship between behaviours online: e.g., if people adjust privacy settings, and the levels of concern that they might have over their data.
I’m looking for a dataset that I can run some regressions on. Haven’t had much success with finding anything recent. Would appreciate any pointers – best I’ve got so far is Pew from 2019

submitted by /u/WanderingATM
[link] [comments]