Category: Datatards

Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting; I’m interested. Maybe they know where the chix are. But what do they need it for? World domination?

Looking For A Book Dataset For A Mobile App Project

Hey everyone, I am working on a mobile app and need a book dataset with the following information: Title, ISBN, Author, and Price. Extras like Edition, Release Date, and Publisher would be great, but those four are the big ones. I have found a lot of datasets but none with all four required columns; some are missing the ISBN while others are missing the price. Please let me know if you know where I could find a dataset with a good number of books and this information. Thank you so much!

submitted by /u/Ironlad2045

Dataset For Social Network Analysis Project

Hi guys, I need help finding datasets for a social network analysis project, but so far I’ve had no luck finding the one I need. I did find a couple of websites that have such datasets, like the Stanford Large Network Dataset Collection, but I’m not sure how the datasets from that website are supposed to be used. I also tried various websites such as Kaggle, data.gov, and data.world. I still could not find anything, although I specifically searched for social datasets, social networking datasets, network datasets, and other keywords related to social networks. My topic is supposed to be related to a social phenomenon such as public health, politics, or the environment. Could anyone please suggest some helpful websites? Thanks in advance 🙇‍♂️

submitted by /u/Alternative-Oil2132

Nba Free Agents Dataset From The Past Few Years

I need a dataset with all the free agent transfers and their new contracts from the past few years. I’m doing a project where I try to predict the new contract for free agents based on their performance in the previous season. I’ve already found a dataset for the performances, but I can only find a free agent dataset for the last season, and I need at least 3 or 4 seasons to have enough training volume.

submitted by /u/-sarx2-

I’m Trying To Create Datasets For Different Facial Expressions

So far I’ve been using Google image search, Yandex image search, and some stock photo websites. But it seems to be really hard to find high-quality images of people with facial expressions other than “default look” or “smiling”. For example, finding images of people with the facial expression “biting lip” seems very difficult. I was hoping to get some ideas or pointers on how I could do this more efficiently.

submitted by /u/belladorexxx

Methods To Access Precipitation Data In R.

I am looking to use R to access real-time or daily summary precipitation data. The rnoaa package will be retired soon, and the NCDC and NCEI sources are both non-functional. I have no idea where to find other sources. Are there any that can give precipitation data by selecting specific coordinates and using the closest station?

Thanks!
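
For the closest-station part of the question, the lookup itself does not depend on any particular data source. Here is a minimal sketch in Python (the same logic ports directly to R), assuming you already have a list of stations with latitude and longitude. The `STATIONS` list, its ids, and its field names are hypothetical placeholders, not taken from any real service.

```python
import math

def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two (lat, lon) points in degrees."""
    r = 6371.0  # mean Earth radius in km
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * r * math.asin(math.sqrt(a))

def nearest_station(lat, lon, stations):
    """Return the station dict closest to the requested coordinates."""
    return min(stations, key=lambda s: haversine_km(lat, lon, s["lat"], s["lon"]))

# Hypothetical station metadata; a real list would come from whichever provider you use.
STATIONS = [
    {"id": "A", "lat": 40.0, "lon": -105.0},
    {"id": "B", "lat": 41.0, "lon": -104.0},
    {"id": "C", "lat": 35.0, "lon": -100.0},
]

closest = nearest_station(40.1, -105.1, STATIONS)
print(closest["id"])
```

Once the closest station id is known, it can be passed to whatever precipitation source ends up replacing rnoaa.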

submitted by /u/wateriscrisp

Dataset That Shows How Much Publicly Traded Companies Spend On R&D

I’m trying to compile a report on how much a bunch of publicly traded companies are spending on R&D as a percent of revenue each year for the last couple of decades.
All of the data is in the 10-K filings that companies are required to make, and I feel like someone must parse them and turn them into structured data. But I can’t find anyone who does it for this particular information.
Any suggestions? Ideally free ones.
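
For reference, the metric itself is simple once the two figures are in hand. Here is a minimal Python sketch of R&D as a percent of revenue per company and year. The company names and amounts are made up for illustration, and the field names are assumptions, not the labels used in any actual filing.

```python
def rd_intensity(rd_expense, revenue):
    """R&D spending as a percentage of revenue; None when revenue is missing or zero."""
    if not revenue:
        return None
    return 100.0 * rd_expense / revenue

# Hypothetical figures (same currency unit for both columns), not real filings data.
rows = [
    {"company": "ExampleCo", "year": 2021, "rd_expense": 120.0, "revenue": 1000.0},
    {"company": "ExampleCo", "year": 2022, "rd_expense": 150.0, "revenue": 1200.0},
    {"company": "OtherCo", "year": 2022, "rd_expense": 30.0, "revenue": 0.0},
]

# One value per (company, year), ready to pivot into a report table.
table = {
    (r["company"], r["year"]): rd_intensity(r["rd_expense"], r["revenue"])
    for r in rows
}

for key, pct in sorted(table.items()):
    print(key, "n/a" if pct is None else f"{pct:.1f}%")
```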

submitted by /u/MarketMan123

Looking For Dataset For University Project

Hi!
I’m a university student, and for a project, I need to find a relational database to normalize (3NF) and optimize. It needs to have 10 tables, and at least 2 of those have to have between 100k – 1M rows. Once I find a workable database, I can split it into more tables to reach the 10-table minimum and set up the primary key and foreign key relations between them, but I’m having a bit of difficulty finding my data set.
Since I’m quite new to this stuff, I’m hoping to find a little help here.

submitted by /u/actual_tsukuyomi

Data Management For Memberships Help

I’m not sure if I’m in the right sub, but I thought I’d ask anyway. I work at a zoo, and our membership passes are all entered into Google Sheets. We have one sheet for current passes and one for expired. We keep things like addresses, phone numbers, and emails for each one. It’s getting difficult to keep track of everything, and I was wondering if there is better software or a website (preferably free) that can manage this amount of data. If Google Sheets really is the best option, let me know.

submitted by /u/thatmunchiemunch

Good APIs For Financial/trading Data (OHLC, Volume Etc.)

Hi, I am planning to create a data science-related portfolio project, and I want it to be focused on finance. So, I am considering using a free Python API where I can access OHLC data, volume, etc., enabling me to create indicators, conduct modeling, perform price prediction, sentiment analysis, and more. It can be stocks, options, or cryptocurrencies; I am indifferent, as long as the API is reliable. A few months ago, I utilized the yfinance Python library, but it appears that Yahoo Finance is reluctant to share their data, as I encountered numerous issues with blocked requests, etc. Currently, I am contemplating the Binance API. Although I have not yet used it, I have heard that it provides an extensive amount of data. Can anyone confirm this? Thanks in advance.

submitted by /u/-Oake

Make Graphs With Large Data Sets In Excel?

Hello data experts! I recently graduated as an analytical chemist and started working for a system integration company as an R&D specialist. I test and validate instrumentation and develop applications for specific analyses, among other activities.
In my latest project I collect data every ten seconds, 24/7, from multiple inputs, which at the end of the week leaves me with hundreds of thousands of data points. Graphing these data sets with Excel has become almost impossible, even after reducing the number of points. What programs/procedures would you recommend to make these graphs and analyse trends without the program crashing on me every time I change anything? I haven’t used anything other than Excel up to this point, and my experience with programming is nonexistent. I’m definitely willing to explore options if it means fast and efficient data analysis. Help is much appreciated, A starting data analyst
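
For a sense of how the point count can be cut before plotting, here is a minimal Python sketch that averages consecutive readings into fixed-size buckets. One input sampled every ten seconds produces 60,480 readings per week; averaging each hour (360 readings) leaves 168 points. The function name and the stand-in data are illustrative assumptions, not a recommendation of a specific tool.

```python
from statistics import mean

def downsample(values, bucket_size):
    """Average consecutive readings in buckets of bucket_size (last bucket may be shorter)."""
    if bucket_size < 1:
        raise ValueError("bucket_size must be at least 1")
    return [
        mean(values[i:i + bucket_size])
        for i in range(0, len(values), bucket_size)
    ]

# Stand-in data: one week of 10-second readings from a single input.
week = [float(i % 100) for i in range(60480)]

hourly = downsample(week, 360)  # 360 readings of 10 s each = 1 hour
print(len(week), "->", len(hourly))
```

Averaging smooths out short spikes, so if spikes matter for validation, taking the minimum and maximum of each bucket instead of the mean is a reasonable variation.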

submitted by /u/Leading-Click-7558

Looking For Zapier Datasets On Industries Or Companies That Use Zapier

I have a new startup company that uses Zapier, and I am searching for other small business owners and startup clients.

I came across this post on https://www.usesignhouse.com/blog/zapier-stats which breaks down the top industries that use Zapier, and it led me here.

I would like to ask if you can share the dataset you used for the analysis, or if anyone can point me in the right direction, so I can get the list and distribution of the various types of companies that use Zapier and target similar companies in my marketing.

I am looking for datasets in CSV format so I can further analyze industries or companies and find a good niche that is underserved but needs Zapier automations, which would help me find clients.

Any help would be appreciated.

submitted by /u/cool-pop

Speech Datasets That Capture Numeral Errors (e.g., 57 > 75)?

Hi everyone.

Not sure if this exists, since things are usually cleaned up quite a bit before going public, but are there any data sources that could be used to study common numeral errors?

I’m mainly interested in instances of leading-digit bias (e.g., reading 9.88 as 9 instead of 10), but that’s even weirder and harder to track down in speech. There’s no way of filtering out ‘misspeaks’ in major corpora like ANC or COCA, AFAIK. Any recommendations or leads?

submitted by /u/dennu9909

Looking For Advice On Creating A Dataset Of Exam Questions From A Set Of Exam Papers

Hi,

I’m trying to create a dataset of exam questions from the A-level Edexcel Physics question papers.

Here’s a sample paper, for example.

Ideally, I’d want to extract all the text, the equations (properly formatted), and the images (mainly graphs and diagrams) just by uploading the file, but as far as I know this isn’t feasible.

What I’m doing right now is just using PyPDF to extract the text alone, and I’m ignoring possible errors in the format in which equations are extracted (which puts me in a difficult position when there are more complex equations involved than just straightforward one-line formulas). I’m then manually cleaning it up, using regular expressions where I can to simplify the process. After that, I plan on manually ‘snipping’ the images out and putting all of this into a MySQL database.
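
To make the regex step concrete, here is a minimal sketch of splitting already-extracted text into numbered questions and storing them. It assumes each question starts on its own line with a one- or two-digit number, which is a guess about the layout and will need adjusting for the real papers. It also uses Python’s built-in SQLite rather than MySQL purely so the example runs on its own; the table layout and the `SAMPLE` text are hypothetical.

```python
import re
import sqlite3

# Assumption: a question begins at the start of a line with a number, e.g. "1 " or "12 ".
QUESTION_START = re.compile(r"^(\d{1,2})[ \t]+(?=\S)", re.MULTILINE)

def split_questions(text):
    """Return a list of (number, body) tuples from raw extracted text."""
    matches = list(QUESTION_START.finditer(text))
    questions = []
    for i, m in enumerate(matches):
        end = matches[i + 1].start() if i + 1 < len(matches) else len(text)
        body = " ".join(text[m.end():end].split())  # collapse line breaks and extra spaces
        questions.append((int(m.group(1)), body))
    return questions

def store_questions(conn, paper, questions):
    """Insert questions into a simple table keyed by paper name and question number."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS questions ("
        "paper TEXT, number INTEGER, body TEXT, PRIMARY KEY (paper, number))"
    )
    conn.executemany(
        "INSERT OR REPLACE INTO questions VALUES (?, ?, ?)",
        [(paper, number, body) for number, body in questions],
    )
    conn.commit()

# Made-up stand-in for text extracted from one paper.
SAMPLE = """1 State the unit of force.
(1 mark)
2 A ball is dropped from rest.
Calculate its speed after 2 s.
"""

conn = sqlite3.connect(":memory:")
store_questions(conn, "sample-paper", split_questions(SAMPLE))
```

Image file paths for the snipped diagrams could go in an extra column or a second table keyed by the same paper and question number.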

The project I’m working on rn is a question suggestion system based on content and question difficulty and I’m using a very specific subset of questions, as I mentioned earlier, just because I’m not too committed atm to tediously creating a dataset. I’m not even sure if storing this in MySQL is a good idea and I’ve personally never worked on any ML projects that don’t involve .csv files or aren’t image datasets, so I am pretty lost on this.

Any advice would be super highly appreciated! Wish you a great day 🙂

submitted by /u/cakeandflowers2202