submitted by /u/adjectivenounnr
Category: Datatards
Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting; I’m interested. Maybe they know where the chix are. But what do they need it for? World domination?
Nice idea to use ChatGPT. It would be great if someone took on the task of creating an open dataset, so that resources wouldn’t be wasted on work that has already been done.
submitted by /u/KMiNT21
Hi, I am currently a student and need a dataset that I can use to practice my CFA knowledge. Do you guys have any dataset that I can use? I would appreciate it if it were a real-world dataset so that I can research the topic further. Thank you!
submitted by /u/Flazh722
Hi, I created a simple dashboard with public data from the following sources:
– https://www.visualcrossing.com/weather-data
Some of you might find it useful. The result is the following (a short article with a link to a public demo): https://medium.com/gooddata-developers/showcase-will-we-run-out-of-gas-183eadb29e75
submitted by /u/AmphibianInfamous574
I am looking for data that shows the sales of bidets pre-pandemic, during the pandemic, and maybe after it too.
submitted by /u/slapm3withit
I know this is a long shot. All exportable sunrise/sunset data is specific to one location, but I need thousands of locations in the same dataset for several years. Would my best bet be to write a loop in R to download it from the Naval Observatory and append it all?
submitted by /u/string_of_letter_s
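For what it's worth, the "loop and append" idea from the sunrise/sunset post could look roughly like the sketch below. It is written in Python rather than R, and it makes no real network calls: `fetch` is a hypothetical stand-in for whatever per-location download ends up being used, and the location names, coordinates, and fixed times are made up for illustration.

```python
import csv
import io
from datetime import date, timedelta
from typing import Callable, Iterable, Iterator

# One output row: (location id, ISO date, sunrise, sunset).
Row = tuple[str, str, str, str]


def date_range(start: date, end: date) -> Iterator[date]:
    """Yield every date from start to end, inclusive."""
    day = start
    while day <= end:
        yield day
        day += timedelta(days=1)


def collect(
    locations: dict[str, tuple[float, float]],
    start: date,
    end: date,
    fetch: Callable[[float, float, date], tuple[str, str]],
) -> list[Row]:
    """Call fetch once per (location, day) and append each result."""
    rows: list[Row] = []
    for name, (lat, lon) in locations.items():
        for day in date_range(start, end):
            sunrise, sunset = fetch(lat, lon, day)
            rows.append((name, day.isoformat(), sunrise, sunset))
    return rows


def to_csv(rows: Iterable[Row]) -> str:
    """Serialize all rows into one CSV table with a header line."""
    buf = io.StringIO()
    writer = csv.writer(buf)
    writer.writerow(["location", "date", "sunrise", "sunset"])
    writer.writerows(rows)
    return buf.getvalue()


def fake_fetch(lat: float, lon: float, day: date) -> tuple[str, str]:
    """Hypothetical placeholder for a real download; returns fixed times."""
    return ("06:00", "18:00")


# Made-up locations, purely for illustration.
locations = {"site_a": (40.0, -75.0), "site_b": (34.0, -118.0)}
rows = collect(locations, date(2020, 1, 1), date(2020, 1, 3), fake_fetch)
table = to_csv(rows)
```

Passing the download step in as a function keeps the loop testable offline. With thousands of locations over several years, the real version would probably also want rate limiting and a way to resume after a failure.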
Looking for someplace that has data on Spotify track popularity, etc.
I realize there is an API, but I’m after historical data, so I’m looking for someone who has already been tracking it.
submitted by /u/mithi26
Hey everyone,
I hope you’re all doing well. I’m currently working on a startup in the gaming industry and I’m looking for some specific data that is available on Statista. However, I don’t have a premium subscription and unfortunately, the data I need is not available with the free version.
So, I was wondering if anyone here has a Statista Premium subscription and would be willing to help me out. I know it’s a long shot, but I thought I’d give it a try.
I don’t want to take up too much of your time, but if you’re able to help, I would be extremely grateful.
Thank you for reading this far, and I hope you have a great day!
submitted by /u/saltpeppermint
I need to do some data visualization work with books, and the dataset from Goodreads is almost perfect for what I need to do.
However, it doesn’t have any genre(s) listed. Is there an existing dataset, which I can use in conjunction with this one, that also has a list of genres? I don’t need it to line up with all 10,000 books in the Goodreads set, but a decent amount.
Any help would be greatly appreciated
Edit: An English equivalent of this is what I’m trying to find.
submitted by /u/jakehenderson01
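If a separate genre list does turn up, combining it with the Goodreads set is essentially a left join on a shared key. Below is a minimal sketch that assumes the only common field is the title. The column names (`book_id`, `title`, `genres`), the `|` separator, and the sample rows are all placeholders, not the layout of any real dataset.

```python
import csv
import io

# Tiny placeholder files; real column names will differ.
BOOKS_CSV = """book_id,title
1,Example Book One
2,Example Book Two
3,Example Book Three
"""

GENRES_CSV = """title,genres
example book one,Fantasy|Adventure
Example  Book Two ,Classics
"""


def normalize(title: str) -> str:
    """Lowercase and collapse whitespace so near-identical titles match."""
    return " ".join(title.lower().split())


def load(text: str) -> list[dict[str, str]]:
    """Parse CSV text into a list of row dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def attach_genres(books, genres):
    """Left join: keep every book, add a genre list where a title matches."""
    lookup = {normalize(g["title"]): g["genres"].split("|") for g in genres}
    return [
        {**book, "genres": lookup.get(normalize(book["title"]), [])}
        for book in books
    ]


joined = attach_genres(load(BOOKS_CSV), load(GENRES_CSV))
matched = sum(1 for row in joined if row["genres"])
```

Books without a match keep an empty genre list, so `matched` shows how much of the set was covered. Matching on title alone will miss editions and subtitles; a shared ISBN or ID, if both sets have one, would be a safer key.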
Good evening, I hope all is well!
I am looking for a dataset with information about video game users: race, gender, time spent playing, etc.
I see a couple different datasets on game sales which is great but not what I’m interested in.
Thank you so much.
submitted by /u/Dwall_208
Looking for datasets with data on how effective natural medicine methods are, such as herbs and supplements.
submitted by /u/Raccoon_1131
There was a list posted in this thread with some dead links from 8 years ago for the locations of Home Depot and Lowe’s stores but I need help finding an updated list or a way to create a list of all locations by zip code.
Any help or a point in the right direction would be extremely helpful! TYIA
submitted by /u/ilovemarketresearch
I’m looking for a dataset/corpus containing Linux shell command inputs and system outputs. I’d really appreciate any kind of help.
submitted by /u/JamesAntoni
Hello,
I am looking for a dataset of historical gold prices, going back as far as records are available (~1980), at daily resolution or finer, in both EUR and USD. So far I have only found ones that give an average for the whole year.
submitted by /u/God_Enki
Is there a place that has compiled standardized test score information for all public schools in the U.S.? I know that each state has this information, but I am wondering if there’s a central place where all of this might be available.
submitted by /u/jyddyj20
I’m participating in a study on ways in which different people write their thoughts, lecture notes, reminders, and other short-form texts that are usually not meant to be shared.
Does anyone know of datasets that could be helpful here? One of our goals is to do some clustering analysis and determine the main “forms” of notes people use. We also want to find out how often people write multiple notes related to the same topic and obtain other interesting results.
Any suggestions are appreciated!
submitted by /u/smthamazing
Hi all, could I please get some help finding the LCOE for different generating technologies (both fossil and renewable) in Europe for the past decade?
Thank you!
submitted by /u/one100eyes
Does anyone know the underlying data source for the website Allbiz.com?
submitted by /u/HereToLearnArt
As a novice in the field of Artificial Intelligence and Machine Learning, I would appreciate some guidance on the various platforms that professionals use to acquire datasets for training/fine tuning their models with images and videos.
submitted by /u/akanshtyagi
I’m looking for any dataset that contains social network texts labeled as political or non-political.
Any help?
submitted by /u/bigbrainjune
Hi, I just wanted to ask if you know any datasets that would be suitable for the models I wrote about in the title. I would want R² to be high, and also, for example, in the moderation model, both the moderator and the predictor should affect Y.
submitted by /u/9Black_Rabbit8
With book bans rising in popularity, The Marshall Project compiled a list of 50,000 titles that are banned in 19 states. They’re currently cleaning some additional lists from other states to add to the data.
– (Un)surprisingly, Florida bans the most titles, at over 20,000
– Georgia bans the fewest, at 28
– If a reason is given, it’s hard to wrap your head around how something like Coding for Parents could pose a threat to security (Wisconsin)
Source: https://www.themarshallproject.org/2022/12/21/prison-banned-books-list-find-your-state
View the Data: https://app.gigasheet.com/spreadsheet/Banned-Books-in-U-S–Prisons/7b6b282b_a6d1_48bc_9df2_71b27f9ab107
submitted by /u/Adorable-Kitchen-919
I am trying to replicate the model in this paper, and to make sure it works I would like to apply it to the same data as in the original paper.
There are two datasets used. One is the weekly prices of 5 NYMEX crude oil futures contracts from 1/2/1990 to 2/17/1995. The paper says that these were made public by Knight-Ridder Financial, a company that has since ceased to exist.
The other one is a set of crude oil prices by Enron Capital, a company that has also since ceased to exist.
I doubt I could obtain the second dataset, but I was wondering if anyone had any suggestions on where I could find the first dataset by Knight-Ridder Financial. I have tried accessing their website through the Internet Archive, but I wasn’t able to find anything there, nor was I able to locate the original publication.
Bloomberg is not an option for me right now either.
Full reference: Schwartz, E. and Smith, J.E., 2000. Short-term variations and long-term dynamics in commodity prices. Management Science, 46(7), pp.893-911.
submitted by /u/horux123
My math class has an end-of-year coding project that uses the basic plotting tools in pandas to analyze and review a dataset of my own choosing. I’m pretty okay at coding and I won’t struggle to set everything up once I have it planned out. The problem is, all my classmates have picked the cool stuff like weather patterns, temperature changes correlated with CO2 increase, and other easy targets.
I would like to stand out a bit. Do you have an interesting dataset that I can use in Python without doing any sorting, outside of the obvious x and y values? I am not an expert at dataset analysis, so I can only use pandas, and I can only use datasets stored as .csv files.
I’m getting slightly stressed over this project as the deadline creeps closer and closer. So if you have an old coding project from a class where you learned about comparing graphs and looking for correlations, sharing it would be a huge help.
submitted by /u/RevolutionaryAd4161
I need a data set for loading into QGIS to plot the coverage area of different cellular network standards in a particular area. (Chennai, India)
Any idea where to look?
submitted by /u/Awkward_Smile7
Where can I find traffic data that gives things like the amount of traffic on a route? I need data spanning at least 1-2 years, preferably 2021-22 or 2023.
I need the dataset for a traffic prediction model.
submitted by /u/akkaaaaaashhhh
Hey guys, stumbled upon this sweet dataset the other day. You can export it to KML for some serious parsing and analysis. It’s the crowd-sourced geolocation of every damn corn on the cob vendor in Mexico City! How cool is that? I challenge y’all to train a neural network on it and see what kind of insights you can get. Let’s get cracking, folks!
submitted by /u/JulieJas
I’m interested in datasets containing number or percent of Black voters, registered voters, voting rates, etc.
submitted by /u/captainschnarf
Hi everyone.
I was wondering if anyone on this sub has experience working with or downloading solo queue and duo queue League of Legends data. Is it possible to export it from na.op.gg, or maybe Riot has an API I can get it from?
Ideally I would like to wrangle the data in a way where I could separate my soloq games from my duoq games to get some stats and expose my duoq partner.
Does anyone have experience with this or think it’s possible?
EDIT: I can use Python with things like pandas, numpy, etc. for some simple data wrangling and analysis.
submitted by /u/ebscodingjourney
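Once the match history is available in some form, the soloq/duoq split itself is simple. Here is a rough sketch in plain Python, assuming each match has already been reduced to a dict listing your teammates and the result. The field names (`match_id`, `teammates`, `win`), the player names, and the records themselves are hypothetical and do not reflect any real API schema.

```python
from collections import defaultdict

# Hypothetical, already-downloaded match records (made-up data).
matches = [
    {"match_id": "m1", "teammates": ["PartnerName", "a", "b", "c"], "win": True},
    {"match_id": "m2", "teammates": ["d", "e", "f", "g"], "win": False},
    {"match_id": "m3", "teammates": ["h", "PartnerName", "i", "j"], "win": False},
    {"match_id": "m4", "teammates": ["k", "l", "m", "n"], "win": True},
]


def split_by_partner(matches, partner):
    """Group matches into 'duo' (partner on my team) and 'solo' (not)."""
    groups = defaultdict(list)
    for match in matches:
        key = "duo" if partner in match["teammates"] else "solo"
        groups[key].append(match)
    return groups


def win_rate(games):
    """Fraction of games won, or None when the list is empty."""
    if not games:
        return None
    return sum(1 for g in games if g["win"]) / len(games)


groups = split_by_partner(matches, "PartnerName")
solo_rate = win_rate(groups["solo"])
duo_rate = win_rate(groups["duo"])
```

Treating "partner was on my team" as "we queued together" is only a heuristic, since the two of you could land on the same team by chance. The same grouping carries over to pandas with a boolean column and `groupby`.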