They say that your data analytics are only as good as the data you have to analyze.
Where do I go to find sources of data that I can do analytics on?
Is there a directory for this or something?
submitted by /u/sanman
[link] [comments]
They say that your data analytics are only as good as the data you have to analyze.
Where do I go to find sources of data that I can do analytics on?
Is there a directory for this or something?
submitted by /u/sanman
[link] [comments]
Hello all, just wondering if I have a massive set of data that I want to compare or analyze the set for trends, would there be a good way to do this through a website or should I manually look for these trends myself. Another question would be how could I easily spot trends or important data figures within my set of data. Thanks!
submitted by /u/floppy11
[link] [comments]
Hello, I am trying to find a data set that looks like the following:
consumer good consumption or price (as dependent variable) predicted by demographics like gender, race, income, education, etc.
I’m also interested in any other factors that could impact consumption or price.
submitted by /u/AssignmentOk1408
[link] [comments]
They have escaped the goat matrix. I think this is very important to know for all who have nothing left to lose.
There are also mountain GOAT’s (greatest of all times). These are usually mountain Buddha niggas located on the peak of a mountain who practice transcendence.
I am looking for historical data sets on top 10 luxury brands sales worldwide over the years, something like comparison over the years of brands like Hermes, Gucci, Chanel based on their sales number. Please help.
submitted by /u/gtrivedi47
[link] [comments]
Hi everyone,
I am fairly new, learning Python since December 2022, and coming from a non-tech background. I took part in the DataTalksClub Zoomcamp. I started using these tools used in the project in January 2023.
Project link: GitHub repo for Magic: The Gathering
Project background:
I used to play Magic: The Gathering a lot back in the 90s I wanted to understand the game from a meta perspective and tried to answer questions that I was interested in
Technologies used:
Infrastructure via terraform, and GCP as cloud I read the scryfall API for card data Push them to my storage bucket Push needed data points to BigQuery Transform the data there with DBT Visualize the final dataset with Looker
I am somewhat proud to having finished this, as I never would have thought to learn all this. I did put a lot of long evenings, early mornings and weekends into this. In the future I plan to do more projects and apply for a Data Engineering or Analytics Engineering position – preferably at my current company.
Please feel free to leave constructive feedback on code, visualization or any other part of the project.
Thanks 🧙🏼♂️ 🔮
submitted by /u/binchentso
[link] [comments]
Hi everyone! For the past couple of weeks, I’ve been helping some fellow community members with some data requests and I’m wondering which other channels can you find people requesting for specific datasets? Seems like r/datasets is the most active forum online for data request!
submitted by /u/nobilis_rex_
[link] [comments]
Hello everyone,
I am currently pursuing a career as a Senior Business Analyst, and I know that having a strong understanding of SQL is essential for this role. However, there are so many aspects of SQL to learn, and I’m not sure where to focus my attention.
I would like to know from those who work as Senior Business Analysts, or those who have experience working with them, what are the best aspects of SQL to learn for this position? Which SQL skills do you use the most in your day-to-day work, and which ones have been the most valuable for you?
I appreciate any insights or advice you can offer, and I look forward to learning from your experiences. Thank you!
submitted by /u/LampRunner
[link] [comments]
Hello everyone,
I am currently working on creating a chatbot that can recommend solutions to log errors that occur in Java applications. To do this, I need a dataset that contains examples of log errors along with their corresponding solutions. I am hoping to find a dataset that is large enough to train a machine learning model to accurately suggest solutions based on the log error message.
If anyone knows of a dataset that would be helpful for this project or has any suggestions on where to find one, I would greatly appreciate it. Any information or assistance would be extremely valuable to me.
Thank you for your time and consideration.
submitted by /u/Farjou69
[link] [comments]
Looking for data that can help me compare how covid may have encouraged more people to take hobby flying lessons. I could use either: – # of people that signed up for classes – # take offs/landings of smaller aircrafts like Cessnas – # of PPLs/CPLs issued as a proxy for seeing the impact
submitted by /u/Eeshoo
[link] [comments]
Financial thematic data package, pertaining to banking
https://app.snowflake.com/marketplace/listing/GZTSZAS2KF7/cybersyn-inc-financial-data-package
Includes data from:
Federal Deposit Insurance Corporation (FDIC) Federal Reserve Economic Data (FRED) Federal Financial Institutions Examination Council (FFIEC) Consumer Financial Protection Bureau (CFPB)
submitted by /u/aiatco2
[link] [comments]
Here is a simple spreadsheet of several thousand battles. I am working (slowly) to get a ton of information on each battle. Please critique and notify me of errors. Cheers.
submitted by /u/UnlimitedRed
[link] [comments]
Hello there, I have a medical dataset in which some features are numeric, while others are categorical. With “categorical” I mean that these features are natively encoded with ordinal integer encoding, such that every possible value is represented as an incremental integer value. It is important for you to know that this dataset has been obtained as part of a survey, so that every categorical value is referred to different types of answers such as “never”, “sometimes”, “a lot of the time” and so on. I have to apply a MLP to this kind of data and I know that in order to do it I first need to scale data. Question is, do I have to scale all features without regard to categorical ones or do I need to scale only numerical variables applying One-hot encoding to the others? I was also wondering if it is necessary to apply one-hot encoding to categorical columns or if I can leave them as they are, applying standardization only to the numerical variables.
submitted by /u/NathanDrake27
[link] [comments]
Came across this pretty popular dataset on Maryland Crashes from 2016-2022. Check it out here:
From these findings, it’s pretty clear that:
Baltimore county (not city) has the highest number of crashes at 156K incidents, with 2018 being the highest year for accidents. The Baltimore Beltway seems to be the highest place for these incidents, with 2.2K incidents occurring over the course of 2016-2022. Yikessss. The Capital Beltway has the highest # of incidents, sitting at 22K Marylanders tend to hit other cars and objects on the road the most but have the least amount of incidents at U-turns (surprising!) The lowest county with crashes is Kent County
Source: https://opendata.maryland.gov/Public-Safety/Maryland-Statewide-Vehicle-Crashes/65du-s3qu
submitted by /u/sheetheadd
[link] [comments]
I am looking for good source to get historical intraday stock data for individual stocks (Norwegian). Maximum timeframe 30min. Any good databases/APIs
submitted by /u/waleed3011
[link] [comments]
Interesting dataset pulled from Boston’s Official Government Site. I definately heard about the spike of crimes that occurred during height of Covid, so I decided to merge the two CSVs from 2021 and 2020. It also helps depict/infer the safest streets in Boston.
Curious, is anyone else interested in a specific location/city and it’s crime data? I see tons of datasets like this online. Would love to share and see some interesting ones!
Click here to view the dataset: https://app.gigasheet.com/spreadsheet/2020-2021-Covid-Crime-in-Boston/94982770_3c8c_48fb_9176_efeb72becdd8
submitted by /u/sheetheadd
[link] [comments]
With the bias of most face detection algorithms against marginal groups such as people of color I’m working a project for skin disease for people of color and would like to know where I can find dataset for people of color. Thanks
submitted by /u/Think_Huckleberry299
[link] [comments]
Data from pulsating stars explained in this data story: https://www.marpledata.com/data-stories/disco-balls-in-space-how-pulsating-stars-work
Data itself is available at https://archive.stsci.edu/
submitted by /u/mbaerto
[link] [comments]
Hey everyone,
I’m working on a map project that needs a list of all airports worldwide along with their geolocation coordinates. I’ve searched online, but I’m having trouble finding a reliable/up to date source.
I was wondering if anyone here knows of a dataset that has it? It would be great if the data included the airport IATA code, and latitude/longitude coordinates.
If anyone has any suggestions or recommendations, I’d greatly appreciate it.
Thank you in advance!
submitted by /u/px07x
[link] [comments]
I’m a beginner to data analytics, I’m applying for data analyst trainee position in Dubai, i need to create a portfolio showcasing skills in SQL, Excel & Tableau. the company I’m applying for focus luxury goods consumer behavior and shopping in middle east and GCC countries.
all help will be much appreciated. Thanks
submitted by /u/Fit-Bird-1601
[link] [comments]
I was wondering if anybody knows of a location I could get some form of dataset with the structure aforementioned in the above. I’m looking to create a supervised learning classification model that takes a set of poker hands (hold-em style I think) that predicts raise, check or fold based on the cards presented. If it were trained on a dataset from professional poker players I’d imagine it would make plays very similar to them, as such it could be rather successful.
My only other option for gathering this data, I thought, would be to host a simple web app that shows the user 5 cards and asks them whether they want to raise, check or fold, and post it on forums (here?) and and gather the data from the responses into a large database. This however may result in bad plays from users that don’t know how to play poker, and bogus answers, so I’d rather stay away from that.
submitted by /u/ryanward02
[link] [comments]
Hi, I’m looking for a space dataset about a specific galaxy. Any galaxy will do. It needs to have spacial information for each celestial body (planet, star, black hole) for a snapshot in time, so I’m thinking an x, y, z value. I want to know each object’s location in the galaxy. It would also be nice if the dataset contained what each object is (star, planet, black hole). It could also go into more specifics about the class of the type of object it is like dwarf star, gas planet, etc & the size of the object or its radius. I’m planing on using this dataset for an art project for one of my classes. Thank you.
submitted by /u/michaelbschulte21
[link] [comments]
https://goo.gl/maps/RLEF9jKtPNvNG7SPA
HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK HOLY FUCK
“the one and only true authentic god nigga”