Category: Datatards

Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?

[self-promotion] Git Version Controlled Datasets In S3

Ever wanted to use Git to version control datasets or large files but Github LFS turned out to be too expensive and now you have a bunch of hacky scripts put together to use S3 for storage but there’s no version control?

We’re here to help you with that. You can use your own S3 buckets or our Free LFS Storage with Github.

Try out: https://underhive.in (please use on Desktop, the mobile version is broken right now)

Dashboard Screenshot: https://i.imgur.com/eYwGGjw.png

submitted by /u/kaisoma
[link] [comments]

Open Source Data Set To Reprocess Data

I am trying to find a data set that takes data from a database and reproccesses it by removing noise and other irrelevant information from it. I have been looking at REDD (A public data set for energy disaggregation), but the hardware for it seems outside my budget. I am not sure if this is the right place to put this but if anyone knows of something, please let me know!

submitted by /u/labib0910
[link] [comments]

Looking For Data Sets Covering California Buildings And HVAC Systems

Hello, so I am working on a research project that is looking to understand the relationships between buildings in California and the unique locations and environments they are located in, specifically pertaining to which types of HVAC (heating, ventilation, and air conditioning) systems (e.g. air conditioner, heat pump, etc.) are in use. I am essentially trying to map a building footprint of California buildings with an attribute denoting the type of HVAC system attributed to each building. I would be mapping this data against a map of California using GIS software, on top of other data layers showing various demographic, economic, and environmental factors. Would anyone here perhaps know if such a data set exists and is available? This could be a shapefile with building geometry, or honestly just a file showing a specific HVAC system for a given building address. Or would anyone know how I could assemble such a data set? Any recommendations, guidance, or help would be much appreciated. Thank you!

submitted by /u/teledude_22
[link] [comments]

Turn Your Google Sheet Of Data Into A Sellable API

Hello, I’m a professional web developer planning to build a service that does exactly what the title says. I’m looking for feedback from you, what features are must haves. I’m thinking the flow would essentially be like this:

1) You create and fill the google sheet that has the dataset in it you wish to sell API access to. Probably you’ve already done this

2) You login to my site, give the API a name and define pricing. I think what makes the most sense here are rate limits (i.e. x number of GET requests per minute/hour/day), so you define 3 rate limits, for your 3 pricing tiers. You provide credentials that provide access to only that google sheet, my service automatically generates a REST API for your sheet.

3) you can either manually generate API keys for your customers through my website, or you can use an internal API with my site to automatically provision them for your customers.

4) Your customers pay you directly for API usage, if they stop paying you revoke their keys (again either through the UI or through one of my internal API’s).

I’m thinking of charging $30, $50 and $100 per month for 3 tiers that will allow you to provision your API based on requests/hour and number of API keys.

What concerns do you have about that? The beauty of integrating with google sheets is that I suspect many of you already have many datasets within google sheets. I’m just providing a way to give programatic access. You can also update the google sheet and the API will automatically provide that updated data. You could also programatically write updates to your google sheet as well, your customers would not have write access.

submitted by /u/AnyPrize
[link] [comments]

Project Help Regarding Power Bi Dashboard

Hello everyone, this is my first post here. I am currently in second year pursuing ai and data science as an undergraduate degree. I am currently working on a student complaint dashboard using Power bi. I was thinking of collecting the data through google form using various questionnaire to provide to my classmates and then performing ETL on it in power bi. But i am having trouble regarding the questions that would provide me with valuable insights. So far i have categorised the complaints in category for example:- 1) hostel/mess 2) infrastructure 3) fee etc And then i would further ask questions regarding hygiene of the mess for ex and then provide a rating for their answer.

What measures/ questions should i incorporate to draw some good insights from this?

submitted by /u/Lazy_Telephone6759
[link] [comments]

NEED HELP FOR RESEARCH WORK PERTAINING TO THE INDIAN AGRICULTURAL SECTOR

hello, I am studying *”Crowding in the Effects of Public Sector Investments in the Agricultural Sector on Private Sector Investments in Agriculture”*and to run some statistical analysis **I need time series data relating to ‘Govt expenditure on the agricultural sector’ and ‘private sector gross fixed capital formation in agriculture’**so where can I find credible data ? please pinpoint some authentic free databases regarding this if you are aware of any.Thank you already 🙂 (and if anyone wants to help with research work then most welcome )

submitted by /u/wholelottathisnthat
[link] [comments]

[Request] Looking For Datasets For Emoji Prediction Based On Text

Hello there,

I’m currently working on an individual class project for machine learning.

Project: Discord bot that replies with emojis to user’s entered text.

Example: User enters: “A man goes fishing” and it will respond with 👨🎣

If you have or know of any datasets related to this topic, I would greatly appreciate your help. The data will be used for training and improving the bot’s emoji prediction capabilities. Any relevant datasets, no matter how big or small, would be incredibly valuable for this project. Please feel free to share the dataset sources or offer any advice related to this task.

Best regards,

Zero

submitted by /u/ZeroCreations
[link] [comments]

Video Game FPS Dataset With Firing Audio + Animation

Hey ya’ll!! Just wondering if there is a data set out there already with clips of video game guns and firing animations with audio. Wanted to start a project where I train a machine learning algorithm on the clip animation and audio to eventually make generative audio by feeding the system with video alone. Would be greatly appreciated if there was already a dataset out there with clips of firing animations with audio!!

submitted by /u/GamesAndTrains
[link] [comments]