submitted by /u/growth_man
[link] [comments]
Category: Datatards
Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?
I would prefer it to have aerial photos of military tanks (from above). I was looking into movie datasets or something similar but can’t find anything. Any suggestions?
submitted by /u/stratos1st
[link] [comments]
Hello. Grad student here. I’m looking for the US population male to female ratio by age group. Specifically looking for the sex ratio for 18-24, for current and past years. Can anyone guide on how to retrieve this data from census? Or are there other datasets that would have this info? Thank you in advance for any suggestions.
submitted by /u/blksquare
[link] [comments]
Hey everyone, I am working on a mobile app and need a Book dataset with the following information: Title, ISBN, Author and Price. Extras like Edition, release date and Publisher would be great but those four are the big ones. I have found a lot of datatsets but none with the 4 required columns, some are missing ISBN while others are missing the proce. Please let me know if you know where I could find any dataset with a good amount of books and this information. Thank you so much
submitted by /u/Ironlad2045
[link] [comments]
Using Tobin’s Q as a variable – does anyone know how to compute on datastream at an individual company level within a specific exchange? Thx
submitted by /u/ArchieMoon
[link] [comments]
Hi guys, I need help with finding datasets on Social network analysis for my project but so far no luck in finding the one I need. I did found a couple of websites which had those datasets like in Standford Large Network Dataset Collection but I’m not too sure how the datasets are supposedly used from this website. I also tried various websites such as Kaggle, data.gov, data.world. Still could not find it although I specifically typed in social datasets or social networking datasets or network datasets and other keywords related to social network. My topic is suppose to be on related to social phenomenon such as public health or politics or environmental. Could anyone please provide some helpful websites? Thanks in advance 🙇♂️
submitted by /u/Alternative-Oil2132
[link] [comments]
I need a dataset with all the free agent transfers with their new contract from the past years. I’m doing a proyect where I try to predict the new contract for free agents based on their performance from the last season, I’ve already found a dataset for the performances, but I can only find the dataset of free agents from the last season, and I need at least 3 or 4 seasons to have enough training volume
submitted by /u/-sarx2-
[link] [comments]
Working on an ml project and need to train the model with a dataset of photos of lacerations through a medical database but I cant really find any sets of lacerations specifically.
submitted by /u/Imaballofstress
[link] [comments]
So far I’ve been using google image search, yandex image search, and some stock photo websites. But it seems to be really hard to find high quality images of people having facial expressions other than “default look” or “smiling”. For example, finding images of people with facial expression “biting lip” seems very difficult. I was hoping to get some ideas or pointers how I could do this more efficiently?
submitted by /u/belladorexxx
[link] [comments]
I am looking to use R to access real-time or daily summary precipitation data. Rnoaa package will be retired soon and the NCDC and NCEI are both non-functional. I have no idea where to find other sources. Are there any that can give precipitation data by selecting specific coordinates and using the closest station?
Thanks!
submitted by /u/wateriscrisp
[link] [comments]
Hello all –
Does anyone know where I could find county-level unemployment data by age or, more generally speaking, for individuals between the ages of 16-24? Looking for Pennsylvania specifically.
Many thanks!
submitted by /u/Puzzleheaded-Gas2140
[link] [comments]
I’m trying to compile a report on how much a bunch of publicly traded companies are spending on R&D as a percent of revenue each year for the last couple of decades.
All of the data is in the 10k stock filings that companies are required to make and I feel like someone must parse it and turn into structured data. But I can’t find anyone for this particular information.
Any suggestions? Ideally free ones.
submitted by /u/MarketMan123
[link] [comments]
Hey r/datasets,
I’m looking for a dataset of different alcohol brands to ingest.
So far I have found: https://www.alcoholcontents.com/
Which has been relatively sufficient for beers and liquors, but the wine department is pretty lackluster.
Does anyone know of any additional datasets for this information? Preferably with a robust wine dataset.
Thanks
submitted by /u/ryan_s007
[link] [comments]
Hi!
I’m a university student, and for a project, I need to find a relational database to normalize (3NF) and optimize. I need it to have 10 tables, and at least 2 of those have to have between 100k – 1M rows. After I find a workable database, I can divide it into more tables, to make up to the 10 minimum table count, and also can make the primary key, foreign key relations between them, but I’m having a bit of a difficulty when finding my data set.
Since I’m quite new to this stuff, I’m hoping to find a little help here.
submitted by /u/actual_tsukuyomi
[link] [comments]
I’m looking for a dataset that has photos of basic trash items and if it is or isn’t recyclable, it can include compost too but I’m having trouble finding something online, thanks in advance.
submitted by /u/charixander
[link] [comments]
I’m not sure if i’m in the right sub but I thought i’d ask anyway. I work at a zoo and we have membership passes that are all entered into a google sheet. We have one for current and expired. We keep things like addresses, phone numbers and emails for each one. It’s getting difficult to keep track of everything and I was wondering if there was a better software or website(preferably free) that can manage the vast amount of data. If google sheets really is the best option let me know.
submitted by /u/thatmunchiemunch
[link] [comments]
Hi, I am planning to create a data science-related portfolio project, and I want it to be focused on finance. So, I am considering using a free Python API where I can access OHLC data, volume, etc., enabling me to create indicators, conduct modeling, perform price prediction, sentiment analysis, and more. It can be stocks, options, or cryptocurrencies; I am indifferent, as long as the API is reliable. A few months ago, I utilized the yfinance Python library, but it appears that Yahoo Finance is reluctant to share their data, as I encountered numerous issues with blocked requests, etc. Currently, I am contemplating the Binance API. Although I have not yet used it, I have heard that it provides an extensive amount of data. Can anyone confirm this? Thanks in advance.
submitted by /u/-Oake
[link] [comments]
Hello data experts! I recently graduated as an analytical chemistry and started working for a system integrating company as an R&D specialist. I test and validate instrumentation, and develop applications for specific analyses among other activities.
In my latest project I collect data every ten seconds 24/7 from multiple inputs which at the end of the week leaves me with hundreds of thousands of data point. Graphing these data sets with Excel has become almost impossible even after reducing the number of points. What programs/procedures would you recommend to make these graphs and analyse trends without the program crashing on me every time I change anything? I haven’t used anything else other than Excel up to this point and my experience with programming is non existent. Definitely willing to explore options if it means fast and efficient data analysis. Help is much appreciated, A starting data analyst
submitted by /u/Leading-Click-7558
[link] [comments]
I’m searching for a dataset of collections of photos that people took during their trips and journeys. It is preferred if the photos feature people, not just places.
submitted by /u/Alternative_Card_989
[link] [comments]
Do you know where I could find any free dataseet for the light curves of asteroids or lunar occultations. I need to filter them.
submitted by /u/Unable-Speed-5263
[link] [comments]
Hello, can you advice me a couple dataset which I can use for all these purpeses:
-Naive,
-Moving Average (MA),
-Exponential Smoothing for your data
(ES)
– Regression methods
submitted by /u/Unlikely_Start9464
[link] [comments]
I have a new startup company that is using Zapier and i am searching for other small business owners and startup clients
I came across this post on https://www.usesignhouse.com/blog/zapier-stats which breaks down the top industries that use Zapier and it lead me here
I will like to ask if you can share the dataset you used for the analysis or if anyone can point me in the right direction so i can get the list and distribution of the various types of companies that use Zapier so i can target similar companies for my marketing.
I am looking for datasets in a csv format i can further analyze industries or companies using data analytics to find a good niche that is underserved but needs Zapier automations so i can find clients.
Any help would be appreciated.
submitted by /u/cool-pop
[link] [comments]
Hi everyone.
Not sure if this exist since things are usually cleaned up quite a bit before going public, but are there any data sources that could be used to study common numeral errors?
Mainly interested in instances of leading-digit bias (i.e. reading 9.88 as 9 instead of 10), but that’s even weirder and harder to track down in speech. No way of filtering out ‘misspeaks’ in major corpora like ANC or COCA, AFAIK. Any recommendations or leads?
submitted by /u/dennu9909
[link] [comments]
looking for a data set of 100 observations that measures weight or height of something and a data set with inter arrival times of a sequence of 100-150 events.
submitted by /u/spaceninja7707
[link] [comments]
Hi everyone, I am doing a project about traffic prediction so I was looking for traffic flow related database. I came across Irland Traffic flow Data from SDDC but I have an issue, I can’t find “site” names and location, wherever i look i only find incomplete data like this one.
Does anyone know where i can find a complete one?
submitted by /u/Salato32
[link] [comments]
Hi,
I’m trying to create a dataset of exam questions from the A-level Edexcel Physics question papers.
Here’s a sample paper%2520QP.pdf) for example.
Ideally, I’d want to extract all the text, equations properly and the images (mainly graphs and diagrams) through just uploading the file but I assume this isn’t feasible as far as I know.
What I’m doing right now is just using PyPDF to extract the text alone and I’m ignoring possible errors in the format in which equations may be extracted in (which puts me in a difficult position, when there are more complex equations involved that just straight one line formulas). I’m then just manually cleaning it up, using regular expressions where I can to simplify the process. After that, I plan on just manually ‘snipping’ the images out and put all of this into a MySQL database.
The project I’m working on rn is a question suggestion system based on content and question difficulty and I’m using a very specific subset of questions, as I mentioned earlier, just because I’m not too committed atm to tediously creating a dataset. I’m not even sure if storing this in MySQL is a good idea and I’ve personally never worked on any ML projects that don’t involve .csv files or aren’t image datasets, so I am pretty lost on this.
Any advice would be super highly appreciated! Wish you a great day 🙂
submitted by /u/cakeandflowers2202
[link] [comments]
Hello, I am trying to find Fixed Wing UAV dataset. I searched some websites but there are only drones and planes like Boeing-737. Are there any suggestions?
Thanks.
submitted by /u/hikmet_li
[link] [comments]