submitted by /u/alecs-dolt
[link] [comments]
Category: Datatards
Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?
Hey, Reddit community!
I stumbled upon a game-changer for businesses striving to harness the full potential of their data – Data Management Analytics Services! SG Analytics has put together an insightful article shedding light on how these services can revolutionize the way organizations handle and utilize their data.
π Link: Data Management Analytics Services
In this comprehensive blog post, you’ll explore:
ποΈ The key components of robust data management strategies. π How analytics-driven data management can optimize decision-making processes. πΌ Real-life examples of companies benefiting from data-driven insights. π The role of data management in enhancing overall business efficiency.
Whether you’re a data enthusiast, a business owner, or an aspiring analyst, this read will undoubtedly provide valuable knowledge and fresh perspectives.
Let’s engage in a discussion about the significance of data management in today’s fast-paced world. Share your thoughts, questions, and experiences in the comments below. Don’t forget to upvote if you find this topic as exciting as I do – let’s bring this valuable information to more people’s attention!
Stay curious and data-driven! ποΈπ
submitted by /u/David_starc150
[link] [comments]
I’m looking for medicine dataset which is publically available. Preferred if it is from tata 1mg/
submitted by /u/Majestic-Peach-9177
[link] [comments]
I am inviting you to for using ML knowledge to on image datasets which are :
submitted by /u/AsgardiansLoki
[link] [comments]
I want to know what percent of benign fetishists β like foot fetishists β also have more harmful fetishes, like pedophilia. Men who are into BDSM claim that this is harmless, but I suspect that theyβre lying.
Does anyone have a dataset on paraphilias?
submitted by /u/3amorange
[link] [comments]
I need a dataset which can be used for regression or classification. It can also be over 50mb. Don’t care about no. of rows and columns.
submitted by /u/Luffykent
[link] [comments]
I’d personally like the Google full scale historical cache dataset.
Google caches everything, fully backed up with every change to every website covering the last 20 years. Imagine the insight and knowledge you could gain processing that. Every lost website, every forum comment, every tweet, old reddit deleted posts. We have archive but a searchable time backtrackable complete Google cache dataset would be magical.
And you know they have it.
Keeps me up some nights just thinking about it.
What are some datasets that you can only dream of getting access to?
submitted by /u/omgsoftcats
[link] [comments]
The Netflix prize dataset and the AOL dataset.
Are there any other datasets that have been banned or removed from existence?
submitted by /u/omgsoftcats
[link] [comments]
Iβm at the end of my data science course, I need to find a dataset with 80 to 100 columns, in order to start a final project for the course and get my certificate. Is there a way to make the search but only by how many columns in the datasets ? Please help
submitted by /u/jeremydavid2
[link] [comments]
I need the monthly churn rate for twitter. How do I get the number of annual users from the number of Monthly Active Users for a social media site? Is there some general formula or some percentage that is used? I am guessing the churn rate would help.
submitted by /u/itzSwain_
[link] [comments]
Hi everyone, is there any suggestion public dataset websites other than data.world and Kaggle, since my lecturer does not allow to use Kaggle for my work (Prohibit). My requirement is minimum range size 450mb to 500mb with the 40 to 50 columns in my desired dataset. If you guys have any suggestion please comment below here. Thankss π
submitted by /u/Sweet_Impact6880
[link] [comments]
Iβm trying to find a dataset that will show that I can do joins but every dataset I find has simply one table with everything in it rather then information split across two or more tables. Id rather have info split and be connected via some key so that I could show that I can do joins.
Thank you for any help
submitted by /u/fhdjnjcj
[link] [comments]
I just need some ideas for my project. Have done pelenty of health and bank related problems. And want something new and different
submitted by /u/iwasagnes
[link] [comments]
I would like to get the historical temperatures for U.S. cities, specifically the southwest, over the past fifty years in a CSV. I tried NOAA, and selected the date range, but only got weather data for 2023. Other sites I found either charge for this data or do not make it available to download. I thought climate data was readily available for public use, but it is proving surprisingly difficult to find. Are there publicly available resources or APIs available?
submitted by /u/sch0lars
[link] [comments]
Iβm primarily interested in trucks light duty – medium duty, but Iβm struggling to find much data on this specific topic in general other than a few write ups saying that there is a correlation, but lacking any reference actual data.
Iβm interested as I have a client that delayed replacing a large swath of their fleet in 2021-2022 and seen a massive uptick in maintenance costs in 2022. Iβd like to provide some actual insight into that with data and their historical data is lacking.
submitted by /u/Pragmegatronic
[link] [comments]
The data base is based on discord conversations from multiple servers, it contains roughly 46 million messages in the right order based on conversational relevance if I understood it correctly, if not then my mistake, anyway here is the link:
submitted by /u/JamesAibr
[link] [comments]
Hey, Redditors! ππ‘
I just discovered a thought-provoking blog post that delves into the cutting-edge world of Big Data strategies! π
SG Analytics has an insightful piece on how businesses are evolving their approaches with two key concepts: Data Lakehouses and Data Mesh! π’π»
Data Lakehouses combine the benefits of Data Warehouses and Data Lakes, bridging the gap between structured and unstructured data. This approach simplifies data management, enabling easier access, and promoting data-driven decision-making. ππ
On the other hand, Data Mesh advocates a decentralized data architecture, empowering individual teams to manage their data domains. This democratized system fosters agility, scalability, and collaboration within organizations. πΈοΈπ
The integration of these innovative strategies marks a significant shift in how companies harness the power of data. Let’s discuss their potential implications on data-driven insights and the future of analytics! π¬π€
Check out the blog post here: SG Analytics –https://us.sganalytics.com/blog/evolving-big-data-strategies-with-data-lakehouses-and-data-mesh/
Stay informed, fellow data enthusiasts! ππ
submitted by /u/annas01s
[link] [comments]
I have a confession to make: I suck at small talk. I’m good at big talk. Like, existential crisis big. But I want to make people laugh, not cry.
That’s why I’m working on mastering small talk. My idea is to have a statistically derived list of of frequently asked questions in casual conversations, and witty responses for each one.
But how do I get this list?
For this, I need a dataset of real conversations, especially the ones that are about small talk. It should be big enough to show me what kind of questions and topics people usually chat about. I don’t want any artificial or synthetic datasets for this project.
By the way, do you know if someone has already made something like this? If there is no existing solution, I’ll use the dataset to make my own. But if it already exists, I can skip the hassle.
COCA, LDC, and BNC, seem to be either paid or restricted. I’ve also seen some related posts on this subreddit.
https://www.reddit.com/r/datasets/comments/u8etiq/spoken_conversation_datasets_transcripts_needed/ https://www.reddit.com/r/datasets/comments/mcwldg/conversational_datasets/ https://www.reddit.com/r/datasets/comments/6bjzgl/i_put_together_a_few_conversational_datasets_if/
submitted by /u/8ta4
[link] [comments]
Any 2022 or 2023 datasets of Michelin guide and Michelin star restaurants with addresses available in a tabular format? Interested in doing some spatial analysis with the data. Thanks!
submitted by /u/teriyakinori
[link] [comments]
Hello everyone!
So I found a real dataset on bike rentals between 12/01/2017 – 12/31/2018. It has fields such as date, hour of the day, bike rentals in that hour, temperature, season, rainfall, snow, wind speed.
The only thing that I was curious about is if it’s even a good idea to include the data on the last month of 2017. Or would it be best to simply do an annual analysis of bike rentals for just 2018 since it includes every day sales for the whole year.
I wouldβve liked to include 2017 but I feel as if the month of December 2017 might skew some results if I do, such as season or even weather analysis on rentals.
Iβm trying to answer business needs questions such as factors affecting bike rentals (weather conditions) to suggest possible solutions.
submitted by /u/htxastrowrld
[link] [comments]
Hey,
I have found loads of data concerning US flights, but googling hasn’t gotten me anywhere concerning data about flights within Europe. Any good open-source data sources?
Thanks in advance!
submitted by /u/ChallengeAccepted83
[link] [comments]
Hey, before the API got restricted I collected a bunch of data that I’ve uploaded on Kaggle. I have pretty nice About section that explains the contents of the dataset.
Let me know what you think!
Link – https://www.kaggle.com/datasets/rohitrajesh/reddit-dataset
submitted by /u/04RR
[link] [comments]