Category: Datatards

Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?

Seeking Advice On Customer Segmentation For E-commerce

I’m currently embarking on a project to revamp customer segmentation for an e-commerce company.
We’ve got lots of data already, but I’m not sure what exactly I need to make this work well. Figuring out customer groups helps us make shopping better for everyone.
Here’s what I’m wondering:
1. Important Data Stuff: What kind of information should we have in our data to understand our customers better?
2. Fixing Data: How can we make sure the data we have is good enough to help us understand our customers?
3. Good Ways to Sort Customers: Do you know any good tricks or tools to help us figure out what groups our customers belong to?
4. Checking if it Works: Once we have our groups, how can we tell if they’re helping us make shopping better?
We’ve got loads of data, but making sense of it all is tough. I’d really appreciate any advice you can give. Whether it’s from your job, what you’ve learned, or just good ideas, I’m all ears. Thanks a bunch for your help!

submitted by /u/Appropriate_Union_58
[link] [comments]

Seeking Help: FIVB Volleyball Men’s World Cup 2022 Attendance Data In Slovenia

Hey r/datasets community!

I hope this post finds you all well. I’m reaching out to this amazing community because I’m currently working on a sports analysis project focused on the FIVB Volleyball Men’s World Cup 2022, specifically looking into the attendance figures for matches held in Slovenia.

I’ve been scouring various sources for this data, but unfortunately, information on the number of people who attended each match in Slovenia seems to be quite elusive. The limited availability of this data is proving to be a significant challenge for my analysis.

If any of you were fortunate enough to have access to reliable sources, I would greatly appreciate your help. It would be fantastic to get accurate attendance figures for every match played in Slovenia during the FIVB Volleyball Men’s World Cup 2022.

Whether you have personal experiences, know someone who attended, or have stumbled upon some hidden gems of data, any information you can provide would be incredibly valuable for my project.

Additionally, if you have tips on where I could potentially find this data or if there are any local sources in Slovenia that might have compiled such information, please let me know.

Thank you so much for taking the time to read this, and I truly appreciate any assistance or guidance you can offer. Let’s work together to make this analysis a slam dunk!

Looking forward to your responses! 🏐🌍

submitted by /u/nejcGo3
[link] [comments]

Where Can I Find Datasets Relating To Genetics And Diseases?

For instance, data on how changes in a certain genetic locus impacted the rates of Alzheimer’s disease, or any other disease. Or– how a certain non-genetic lifestyle factor, ie: omega 3 in the diet, related to rates of Alzheimer’s disease. I’m doing a project for a statistics class where we use the program R to calculate summary statistics and analyze the data. The problem is, I have no idea where to actually find data! I’m pretty new to this. Does anyone have any suggestions? It doesn’t have to be this specific, either. It can be about anything, really. I mostly just want to know some good sources.

submitted by /u/Relevant_Engineer442
[link] [comments]

Help Finding Messy Stock Market Data

A friend and I are doing a data analysis and manipulation project using Python. We need to find data in three different formats. Also, the data should be preferably messy because part of the project is cleaning it. Where can we find this data, preferably free?
PS: Our project is based on the Stock Market and outside factors. But we are having trouble finding messy Stock Market data.

submitted by /u/AcanthocephalaOk4489
[link] [comments]

Seeking Doctor-Patient Conversation Audio (200 Hours, US/UK English, WAV Format)

I’m not sure if this is the right place.

Anyway, I’m currently on the lookout for doctor-to-patient conversation audio recordings. Specifically, I’m in need of approximately 200 hours of audio in US or UK English, and it must be in WAV format.

Also, if anyone has access to Arabic, Spanish, or Malay call center data, I’d be interested in those as well. The audios are required for various fields including banking, insurance, finance, medical care, telecommunications, and automobiles.

Please share your best rates as well.

If anyone can point me in the right direction or has any leads, I would greatly appreciate it. Thank you in advance!

submitted by /u/Disastrous_Piano7831
[link] [comments]

Looking For Datasets On US Automotive Advertising

Anyone know where I can find data on advertising by the automotive industry in the US? Right now I’m just trying to see what’s out there, so there’s some flexibility in the kind of data I use. Importantly, though, I need data that has information on advertising by region, which automakers are running the ad, and the language of the advertisement. It’s fine if I have to pay for it, but free is always nice.

Thanks.

submitted by /u/BigPenisMathGenius
[link] [comments]

Computer Vision Approach For Liver Tumor Classification Using CT Dataset

Hey guys. Iam studying deep learning, and Iam in desperate need of this dataset. I’ve come across a research paper with this title but can’t find the dataset. Please help me find this dataset.

Details: Mubasher Hussain, Najia Saher & Salman Qadri (2022) Computer Vision Approach for Liver Tumor Classification Using CT Dataset, Applied Artificial Intelligence, 36:1, DOI: 10.1080/08839514.2022.2055395

submitted by /u/Page_Future
[link] [comments]

Looking For A Soccer Penalty Kick Dataset

Hey everyone. I am looking for datasets that include data on penalty kicks taken in soccer matches over a large span of years. It would be ideal for it to be a major league, like the Premier or Champions League, or the World Cup, or just all international Soccer play. Ideally the data would include if the shot was made, which foot was used to kick, where the keeper dove, etc. Essentially any helpful data for running analysis to determine the best place to shoot the ball. Thanks!

submitted by /u/CheesyPanther
[link] [comments]

Looking For Bio Datasets On Textmining With Gene Gene Interactions.

Hey I am new to this sub and i just thaught i would join in asking for data sets. Hopefuöly and probably in ill contribute a few in the comming year during my phd. At the moment i am looking for exsisting bio datasets on gene gene interactions.

Like the title says. I am interested in datasets that provide a excerpt of a paper and the genes or proteins mentioned in it and their interactions.

Do any such data sets exsist ?

submitted by /u/Noxusequal
[link] [comments]

I Am A Researcher, And I Am Analyzing R/EnglishLearning.

“Please help me. I am a researcher, and I am analyzing r/EnglishLearning. My research is qualitative, and I must admit my ignorance of statistical data methods. I don’t have much time to delve into data collection methods. Still, I desperately need information about this subreddit to support my findings (my research spans one year, from January 2023 to January 2024).

Which are the most used flairs?

How many Redditors label themselves as ‘native’?

Are there any Redditors who are part of /r/EnglishLearning but have never posted?

Who has the most posts?

I know I am asking for a lot, but I would love it if somebody could help, even if only partially. Please, if you do, also tell me the methodology and tools you applied and how you arrived at the results without being too specific. I will definitely cite you in my bibliography if you help, and you will also be happy to help a desperate soul 🙂

submitted by /u/aaagggaaaiiinnn_88
[link] [comments]

Sample A/B Test Dataset Sources? Preferably Web Product Based

I’ve reviewed the sub and couldn’t find specific resources. I am looking

I am looking for:

preferably datasets around utilization of a web product (e.g. adding items to cart, visiting specific web pages, etc.) preferably has previous behavior data of the users prior to the experiment preferably metadata about the dataset e.g. what was the experiment hypothesis when did the experiment start and when did it end columns needed unique identifier for user date treatment was assigned to user (could be stored in a different dataset where the grain might be users and their account metadata vs. behavior) date interacting with event the label for the event group label that identifies if a user is part of the control or treatment group

Happy to answer any clarifying questions. Thanks for the help.

edit: changed flair to request

submitted by /u/shoeobssd
[link] [comments]

Are Lucid Dreamers Different From Us? (Also Welcome 18+ Non Lucid Dreamers With English Reading Skills) (Academic) (All Countries)

Hello everyone!

I’m excited to invite you to participate in my lucid dream research project, if you’re interested in exploring the world of lucid dreaming and contributing to scientific research. I’d love for you to participate in our study.

https://show.forms.app/research-survey/creative-problem-solving-and-metacognition-form

Hope everyone can join and if you have friends and family who’ll be interested to take part, please share the link. The more diverse perspectives we gather, the better!

Thank you in advance for your participation and support, I’m relying on you. 😇

submitted by /u/ManeeJ
[link] [comments]

Hi, I Have Made A Table With Data, Can You Look At The Columns And Write Whether They Look Like Real Ones?

I need it for studying, and I can’t find similar data on which I can practice calculations

‘user_id’: user_ids, ‘age’: age, ‘gender’: gender, ‘location’: location, ‘session_duration’: session_duration, ‘num_sessions’: num_sessions, ‘level’: level, ‘tasks_completed’: tasks_completed, ‘revenue’: revenue, ‘purchases’: purchases, ‘average_revenue_per_user’: average_revenue_per_user, ‘game_version’: game_version, ‘platform’: platform, ‘os_version’: os_version, ‘ad_campaign_cost’: ad_campaign_cost, ‘new_users_from_ads’: new_users_from_ads, ‘retention_rate’: retention_rate, ‘game_rating’: game_rating, ‘player_reviews’: player_reviews, ‘event_date’: event_dates, ‘event’: events

submitted by /u/101impossible
[link] [comments]