Also, are there any pre-trained models that can do this which I can get access to for free or for not a lot of money?
submitted by /u/ProblemGupta
[link] [comments]
Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting; I’m interested. Maybe they know where the chix are. But what do they need it for? World domination?
Hi, I am currently building a dataset indexing platform. The purpose is to enable users to list and find datasets more easily than with existing options such as Kaggle and Google Dataset Search. As a dataset owner, you can freely list your valuable data; as a dataset user, you can have an effective and exploratory search experience.
I would love to get feedback from this community and/or schedule a 1:1 session to find out more about how you currently list or search for datasets, and to share our idea with you, which is to tokenize the dataset and store the dataset’s attributes as metadata for easy indexing. I am also looking for early adopters: anyone who has data or is searching for data!
Anyone who is keen to explore further, please let me know. Thank you.
submitted by /u/bdx_cbtan
Hi guys,
I’ve never posted here before, but I’m pretty desperate right now. I’m doing a research project where I’m using ML algorithms to classify sites for renewable energy potential. I’ve searched everywhere and even tried writing an API request script in Python, but with the amount of data I need (50-100k rows) it would take way too long. So I’m here to ask if anyone has a dataset with, at minimum, lat and lon, wind speed, and wind direction. Pressure and temperature would be nice as well if possible. For solar, GHI and DNI, and maybe lateral tilt. But I want it to have random lat and lon coordinates, not all in one spot.
Please guys, I need your help.
DM me if you need more info.
submitted by /u/phoenixducky1
Hey, I’m wondering if anyone has seen data with labeled habitats for malaria mosquitoes, built from Google Earth images or other satellite data. Thank you.
submitted by /u/thecuh1
I am doing research on the energy emissions of cement plants and I need data on this. Where can I find it?
I need energy emissions data suitable for any sectoral distribution. When I searched the subreddit, I found only one website, but if there is a higher-quality data set, I would like to obtain it as well.
submitted by /u/hyyperi
Recently, I’ve been exploring the area of 3-dimensional data in machine learning. By that, I mean arrays with shape (x, x, x). As an example:
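A minimal sketch of what such an array could look like, assuming x = 3 and plain nested Python lists filled with random integers. The helper name `random_cube`, the size, the value range, and the seed are all illustrative assumptions, not taken from the post.

```python
import random


def random_cube(x, low=0, high=9, seed=None):
    """Return a nested list of shape (x, x, x) filled with random integers.

    `low`, `high`, and `seed` are illustrative parameters (assumptions).
    """
    rng = random.Random(seed)
    return [
        [[rng.randint(low, high) for _ in range(x)] for _ in range(x)]
        for _ in range(x)
    ]


cube = random_cube(3, seed=0)

# Print the cube one 2-D layer at a time.
for layer in cube:
    for row in layer:
        print(row)
    print()
```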
All the numbers are randomized, but hopefully, this will give you a gist of what I’m looking for
I have only encountered image datasets in my search, which I am not looking for. In addition, I want to find data already in three dimensions instead of two-dimensional time series data that can be made into three-dimensional data. Where could I find datasets like the ones I’m looking for?
Links or search terms would be greatly appreciated.
submitted by /u/Figsups
As the title suggests, I’m looking for a dataset that provides the grocery item and maybe the most common aisle it’s found in, followed potentially by the next most common aisle.
Ideally it’s something like item, category, image, aisle_1, aisle_2.
If something like that doesn’t exist, an acceptable alternative would be in paragraph form like the example below.
Tahini
In most grocery stores, tahini is either in the aisle with other condiments like peanut butter or in the aisle with international foods. You can also find it at a specialty or Middle Eastern grocery. It is sold shelf-stable in glass or plastic jars and is not refrigerated.
submitted by /u/yankpat9
Doing some research for a project I am working on and started thinking:
What are the different types of proprietary data that can be accessed more cheaply in other geographies?
Why is it hard to access that data in the US/UK and not anywhere else? Is it because the data creator has a monopoly? Or are there regulatory issues? Is the cost too high to gather and store?
Any advice, leads, or tips would be greatly appreciated!!
submitted by /u/young-litty
For a university project, I want an allergy dataset that includes the symptoms, cause, region, country, and the foods or conditions under which the allergy is likely to occur.
submitted by /u/Shining-bright
Hey, Reddit community!
I stumbled upon a game-changer for businesses striving to harness the full potential of their data – Data Management Analytics Services! SG Analytics has put together an insightful article shedding light on how these services can revolutionize the way organizations handle and utilize their data.
🔗 Link: Data Management Analytics Services
In this comprehensive blog post, you’ll explore:
🗝️ The key components of robust data management strategies.
📊 How analytics-driven data management can optimize decision-making processes.
💼 Real-life examples of companies benefiting from data-driven insights.
🌐 The role of data management in enhancing overall business efficiency.
Whether you’re a data enthusiast, a business owner, or an aspiring analyst, this read will undoubtedly provide valuable knowledge and fresh perspectives.
Let’s engage in a discussion about the significance of data management in today’s fast-paced world. Share your thoughts, questions, and experiences in the comments below. Don’t forget to upvote if you find this topic as exciting as I do – let’s bring this valuable information to more people’s attention!
Stay curious and data-driven! 🗝️📊
submitted by /u/David_starc150
I’m looking for a publicly available medicine dataset, preferably from Tata 1mg.
submitted by /u/Majestic-Peach-9177
I am inviting you to use your ML knowledge on image datasets, which are:
submitted by /u/AsgardiansLoki
I want to know what percent of benign fetishists — like foot fetishists — also have more harmful fetishes, like pedophilia. Men who are into BDSM claim that this is harmless, but I suspect that they’re lying.
Does anyone have a dataset on paraphilias?
submitted by /u/3amorange
I need a dataset that can be used for regression or classification. It can also be over 50 MB. I don’t care about the number of rows and columns.
submitted by /u/Luffykent
I’d personally like the Google full scale historical cache dataset.
Google caches everything, fully backed up with every change to every website covering the last 20 years. Imagine the insight and knowledge you could gain processing that. Every lost website, every forum comment, every tweet, every old deleted Reddit post. We have archive, but a searchable, time-backtrackable, complete Google cache dataset would be magical.
And you know they have it.
Keeps me up some nights just thinking about it.
What are some datasets that you can only dream of getting access to?
submitted by /u/omgsoftcats
The Netflix prize dataset and the AOL dataset.
Are there any other datasets that have been banned or removed from existence?
submitted by /u/omgsoftcats
I’m at the end of my data science course, and I need to find a dataset with 80 to 100 columns in order to start a final project for the course and get my certificate. Is there a way to search for datasets only by how many columns they have? Please help.
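One way to apply such a column-count filter locally, sketched under assumptions: the candidate datasets have already been downloaded as comma-delimited CSV files whose first row is the header. The function names and the 80-100 defaults are illustrative, not part of any dataset site's search API.

```python
import csv
from pathlib import Path


def column_count(csv_path):
    """Return the number of columns in the header row of a CSV file.

    Assumes a comma-delimited file whose first row is the header;
    returns 0 for an empty file.
    """
    with open(csv_path, newline="", encoding="utf-8") as f:
        header = next(csv.reader(f), [])
    return len(header)


def filter_by_columns(folder, min_cols=80, max_cols=100):
    """Return (file name, column count) pairs for CSV files in `folder`
    whose header width falls within [min_cols, max_cols]."""
    matches = []
    for path in sorted(Path(folder).glob("*.csv")):
        n = column_count(path)
        if min_cols <= n <= max_cols:
            matches.append((path.name, n))
    return matches
```

Calling `filter_by_columns("datasets")` on a hypothetical `datasets` folder would list only the files that meet the 80 to 100 column requirement.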
submitted by /u/jeremydavid2
I need the monthly churn rate for Twitter. How do I get the number of annual users from the number of Monthly Active Users for a social media site? Is there some general formula or percentage that is used? I am guessing the churn rate would help.
submitted by /u/itzSwain_
Hi everyone, can anyone suggest public dataset websites other than data.world and Kaggle? My lecturer prohibits using Kaggle for my work. My requirement is a dataset between 450 MB and 500 MB in size with 40 to 50 columns. If you have any suggestions, please comment below. Thanks 🙂
submitted by /u/Sweet_Impact6880
I’m trying to find a dataset that will show that I can do joins, but every dataset I find has one table with everything in it rather than information split across two or more tables. I’d rather have the info split and connected via some key so that I can show that I can do joins.
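To make the target shape concrete, here is a minimal sketch of two tables connected by a key and joined with Python's built-in `sqlite3` module. The `customers` and `orders` tables and all of their rows are made-up illustrations, not a real dataset.

```python
import sqlite3

# In-memory database with two hypothetical tables linked by customer_id.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE customers (
    customer_id INTEGER PRIMARY KEY,
    name TEXT
);
CREATE TABLE orders (
    order_id INTEGER PRIMARY KEY,
    customer_id INTEGER REFERENCES customers(customer_id),
    amount REAL
);
INSERT INTO customers VALUES (1, 'Ada'), (2, 'Grace');
INSERT INTO orders VALUES (10, 1, 25.0), (11, 1, 40.0), (12, 2, 15.5);
""")

# Inner join on the shared key, then aggregate orders per customer.
rows = conn.execute("""
    SELECT c.name, COUNT(o.order_id), SUM(o.amount)
    FROM customers AS c
    JOIN orders AS o ON o.customer_id = c.customer_id
    GROUP BY c.customer_id
    ORDER BY c.name
""").fetchall()

for name, order_count, total in rows:
    print(name, order_count, total)
```

Any dataset published as several files that share an ID column can be loaded into tables like these and joined the same way.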
Thank you for any help
submitted by /u/fhdjnjcj