Category: Datatards

Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?

Data On AI Startups – Number Of Employees, Revenue, Etc.

Dear dataset community,

I am currently in the process of writing my Master’s Thesis in Business Analytics. I have been desperately looking for data related to startups and AI startups that contain aspects such as revenue and the number of employees. I am trying to investigate productivity gains in AI startups.

I tried going on platforms such as Crunchbase, however, they don’t have revenue data and the data on employees seems to be quite broad. Do you have any suggestions on where I could find this data? Or does anyone have access to this data that might help me?

Thank you very much!

submitted by /u/susanin76
[link] [comments]

Aggregated (40k Records So Far) Smart Meter Data Available, More To Come

Hi all,

Just something I figured worth sharing. UK power networks have started publishing aggregated smart meter data on their open data portal. This dataset currently has 40,000 smart meters aggregated within it, with all that their region covers expected to be in the dataset by the end of the month. (Need to log in to download/get API details)

The rest of the distribution networks in the UK need to publish the same data for their region by the end of the month too. Expect it’ll be a real treasure trove for those of you wanting to do stuff with energy data.

Enjoy!

https://ukpowernetworks.opendatasoft.com/pages/smart-meters/

submitted by /u/BackpackingScot
[link] [comments]

Data For Pet Care Business By Zip Codes And Neighborhoods

Hi there! I have recently started a pet-sitting and pet-care business that provides high-end services. As our target audience is high-income earners, I am wondering if there is a way to access income data by zip code or even by specific neighborhoods. Additionally, I would like to know if there is a way to find out which areas or neighborhoods have a higher number of pets or dogs. This information would be very helpful for me to conduct further research. Thank you!

submitted by /u/MannyOchoaVC
[link] [comments]

Looking For A Low-cost Hosted Data Portal

Hello. Nonprofit here with public and private datasets. We’d love to host a portal to make them available based on user permissions. Looking at CKAN, Taylor, Datahub… what’s available at a lower cost for our small org that is somewhat plug and play? Am I asking for too much?

submitted by /u/rudeamy
[link] [comments]

Show: Codeplot – A Interactive Canvas For Python Data Exploration

Github: https://github.com/codeplot-co/codeplot App: https://codeplot.co Discord: https://codeplot.co/discord

Hey Datasets community,

I’m excited to introduce codeplot, a tool I’ve been working on that’s designed to revolutionize the way we interact with data visualizations in Python.

What is codeplot?

codeplot is an interactive spatial canvas that allows for dynamic data exploration. It’s built to move beyond static images and fixed layouts, giving your data the interactive, engaging platform it deserves. With codeplot, you can easily integrate live data visualizations directly from your Python code or REPL into a flexible, interactive canvas hosted at codeplot.co.

Key Features:

Dynamic Visualization: Say goodbye to static charts. Visualize your data in real-time on an interactive canvas. Easy Integration: Seamlessly plot from Python with just a few lines of code. Varied Visualizations: Support for a wide range of data representations, from basic charts to complex widgets. Flexible Layouts: Customize your data exploration space with draggable and resizable plots. Open Community: Whether you’re a data scientist or a hobbyist, codeplot is designed for anyone passionate about data. Getting Started is Simple:

Install codeplot with pip, connect to a room, and start plotting right away. We even support usage in Jupyter Notebooks for an integrated development experience.

Docker Support:

For those who prefer self-hosting, codeplot is Docker-ready, allowing you to run your own server and client locally with ease.

Join Our Community:

We’re building a community of data enthusiasts and professionals on Discord. It’s a place to share insights, ask questions, and collaborate on data visualization projects.

I’d love to get your feedback, suggestions, and hear about the visualizations you create with codeplot. Let’s make data exploration more interactive and engaging together!

Thanks for checking out codeplot!

– @antl3x (Creator of codeplot)

submitted by /u/nthypes
[link] [comments]

Can’t Find A Big Website With Datasets That I Used Before – Maybe Someone Knows?

I remember I browsed it a year ago quite extensively, but used a different source for my project in the end. Now I don’t remember what it was called and can’t find it anywhere now – can anyone help?

The site had different datasets uploaded by random users, some of them were just, for example, bulks of documents about a particular issue for the last 10 years. No clean data. Maybe it was a bit sketchy, I don’t know. I think it had a red icon/layout and a short name… pretty sure something with a letter “A”

submitted by /u/merlarchenemy
[link] [comments]

Food & Drink Carbon Footprint Dataset

I am studying data science and looking for a dataset that lists food and drink products and their associated carbon footprint. For my project the carbon figures do not need to be super accurate.

Branded (e.g. Nutella®, Coca‑Cola) or unbranded (e.g. banana, litre of milk), either would be useful.

submitted by /u/AchillesFirstStand
[link] [comments]

Does Anyone Have A Dataset With Images Of Ingredients?

I need a dataset with images of food ingredients, from vegetables to fruit, meat and whatnot. It can be multiple datasets specializing in different kinds of ingredients. I’m making a program to identify ingredients from a picture and suggest a recipe that uses the available food, but can’t find a dataset with what I’m looking for, as most only have already prepared meals.

submitted by /u/Accomplished-Level-1
[link] [comments]

What Tools And Tech Should I Use To Build An Open Source Dataset?

Hey everyone,

I want to build an open source dataset in the clinical trial space. I’m looking for some tech/tools recommendations that make building an open source dataset easy.

I guess the easiest would just be to set up a Google Sheet and Google Form to get new data submissions. I also came across: https://github.com/dolthub/dolt, but this seems to be quite expensive.

Some requirements that need to be fulfilled:
– The core dataset should be public, but we want to restrict access to contact information such as email or phone numbers to avoid that people get spammed
– People should be able to submit new data or submit updates to existing data points, but this data should be verified before it’s written to the public dataset – The final dataset could become quite large (10-20GB). Google Sheet won’t work with this – Users and contributors are non-technical. So it needs to be easy for them to user

Would be curious to learn more about how other people have built their datasets.

Thanks a lot!

submitted by /u/Affenbob123
[link] [comments]

Where Can I Find Survey Data About Immigrants’ Opinions/attitudes In The US Or Other Countries?

I need three pieces of information: 1. Country of origin of the survey participants, 2. When did they move to their current country and 3. An opinion, belief, attitude, behavior… about a topic; it would be ideal to measure their level of openmindedness or trust.

I couldn’t find a specific survey oriented exclusively to measure immigrants’ opinions, so I’m now looking for surveys oriented to the general public (immigrants and natives) and then I would just filter for immigrant cases. Unfortunately, I have been successful in finding a survey that has the three points mentioned above.

Currently, I’m fully focused on the US. The General Social Survey (GSS) has very interesting questions on attitudes but I couldn’t find variables indicating the country of origin or the moving year. Data from IPUMS, census.gov and American Community Surveys (ACS) include information on the country of origin and moving year, but no interesting data about attitudes/opinions, the data from these sources is pretty much technical.

Does anyone know where could I find a survey that fulfills the three requirements?

submitted by /u/Puzzleheaded_Steak54
[link] [comments]