Looking for statistics on men's pant sizes.
Waist and inseam.
Looking for a discrete table.
submitted by /u/Many-Wasabi9141
Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting; I’m interested. Maybe they know where the chix are. But what do they need it for? World domination?
submitted by /u/xshopx
Hey everyone,
I want to build an open source dataset in the clinical trial space. I’m looking for some tech/tools recommendations that make building an open source dataset easy.
I guess the easiest would just be to set up a Google Sheet and Google Form to get new data submissions. I also came across: https://github.com/dolthub/dolt, but this seems to be quite expensive.
Some requirements that need to be fulfilled:
– The core dataset should be public, but we want to restrict access to contact information such as email addresses or phone numbers so that people don’t get spammed
– People should be able to submit new data or updates to existing data points, but this data should be verified before it’s written to the public dataset
– The final dataset could become quite large (10-20GB), so Google Sheets won’t work
– Users and contributors are non-technical, so it needs to be easy for them to use
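To make the requirements above concrete, here is a minimal Python sketch of a submit-then-verify workflow that keeps contact details out of the public records. Everything in it is an assumption for illustration: the `Dataset` class, the `PRIVATE_FIELDS` names, and the in-memory storage are hypothetical stand-ins, not a recommendation of a specific tool, and a 10-20GB dataset would need a real database or file store behind it.

```python
from dataclasses import dataclass, field

# Assumed names for the contact fields that must stay out of the public data.
PRIVATE_FIELDS = {"email", "phone"}


@dataclass
class Dataset:
    public: dict = field(default_factory=dict)   # record_id -> public fields
    private: dict = field(default_factory=dict)  # record_id -> contact fields
    pending: list = field(default_factory=list)  # submissions awaiting review

    def submit(self, record_id, data):
        """Queue a new record or an update; nothing is published yet."""
        self.pending.append((record_id, dict(data)))

    def review(self, approve):
        """Publish each pending submission for which approve(...) returns True.

        Rejected submissions are simply dropped in this sketch.
        """
        for record_id, data in self.pending:
            if not approve(record_id, data):
                continue
            contact = {k: v for k, v in data.items() if k in PRIVATE_FIELDS}
            rest = {k: v for k, v in data.items() if k not in PRIVATE_FIELDS}
            self.public.setdefault(record_id, {}).update(rest)
            if contact:
                self.private.setdefault(record_id, {}).update(contact)
        self.pending.clear()
```

A form front end (Google Forms or similar) could feed `submit`, while a maintainer runs `review` with whatever verification rule fits; only `public` would ever be exported.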
Would be curious to learn more about how other people have built their datasets.
Thanks a lot!
submitted by /u/Affenbob123
I want a dataset where the “Voltage at maximum power point (Vmp)” of a solar cell is measured for different “irradiance” and “temperature” values. Thanks a lot in advance. 🙏
submitted by /u/faruquei
All, I’m looking for a dataset that has monthly consumer spend volumes. If availability is a problem, something about consumers and their opinions about the economy would also work. It can be hard data or a survey. Appreciate the help.
submitted by /u/HangryChef
Does anyone have access to a dataset listing UK suburbs by city, covering the whole UK? I have found one dataset, but it is massive and of poor quality.
submitted by /u/R4eWclan
I need three pieces of information: 1. the country of origin of the survey participants, 2. when they moved to their current country, and 3. an opinion, belief, attitude, or behavior about a topic; it would be ideal to measure their level of open-mindedness or trust.
I couldn’t find a survey aimed exclusively at measuring immigrants’ opinions, so I’m now looking for surveys of the general public (immigrants and natives), which I would then filter for immigrant cases. Unfortunately, I have not been successful in finding a survey that has all three points mentioned above.
Currently, I’m fully focused on the US. The General Social Survey (GSS) has very interesting questions on attitudes, but I couldn’t find variables indicating the country of origin or the year of the move. Data from IPUMS, census.gov and the American Community Survey (ACS) include the country of origin and year of the move, but nothing interesting about attitudes or opinions; the data from these sources is mostly demographic.
Does anyone know where I could find a survey that fulfills the three requirements?
submitted by /u/Puzzleheaded_Steak54
Can anyone please suggest some websites for healthcare data with about 10k records and 20 independent variables?
submitted by /u/anil_6
Greetings everyone! I am looking for a body dataset that contains images and their measurements. Please let me know if you find one.
P.S.: I already know about the Kaggle and Hugging Face body measurement datasets. I am looking for a bigger dataset, not just 20 rows.
Thank you!
submitted by /u/Neither-Bag-696
Hi everyone, I appreciate any help you can give here. If you think there’s a better place to post this, that would be incredibly helpful, as well. Thanks in advance!
The Ask
Now, here’s the situation: I’m working on an analysis of the workloads of some real estate-specific “groups” in NYC. I’m trying to find datasets that might help me better understand and quantify the time and effort put into menial and repetitive tasks that could be automated.
I’m particularly interested in the apartment rental market, but sales data would also be helpful here.
The Problem
Here are some of the questions I’m trying to answer:
– How many inquiries does a typical listing receive across all channels (phone, email, text, listing sites such as StreetEasy or Zillow, etc.) from listing to close? (An “inquiry” being defined as a potential lead doing something to initiate communication with an agent or the property, like asking a question, requesting a tour, etc.)
– How many back-and-forths does each channel receive, i.e. how many messages or phone calls, on average, does each inquiry (lead) yield? (This is essentially a follow-up to the first question. For example, how many emails are sent by both parties before a showing schedule is confirmed?)
– How much time is spent on non-transaction-related communications like answering basic questions, coordinating with owners to set up a showing (scheduling), sending requests for information or follow-up, etc.? (This is essentially just the former questions represented in a different dimension: time instead of volume.)
Target Cohorts and Parameters
As I mentioned, I’m looking for data specific to New York City and the rental market; however, I’m open to data from other markets if there’s a chance I might be able to infer something about NYC from it. For example, national data could be helpful in at least setting some anchors/context.
The data I’m seeking should be specific to:
– Real estate agents (brokerage-level stats could be subbed in potentially)
– Buildings/Property Groups (loosely, could be property managers or management companies)
What I’ve tried
This is an opaque industry, at best. While I’ve found some indirect data from sources like the NAR and StreetEasy (see below), I really haven’t had any luck finding anything direct.
Here are some of the closest examples I’ve found (with some notes roughly outlining relevant data I could try to use):
– 2022 Home Buyers and Sellers Generational Trends Report (NAR; industry report): Exhibit 4-8 (contact methods and response rate for home buyers), maybe. It’s somewhat unclear how to read, but there might be value in this.
– 2023 Home Buyers and Sellers Generational Trends Report (NAR; industry report): same as for the 2022 report.
Most of the other sources I’d have thought would have something (StreetEasy, REBNY) have turned up woefully inadequate. It just doesn’t seem to be something they track (or at least release publicly).
Asks (summary)
– Recommendations on datasets that might be helpful to me.
– Ideas about how I might reframe this and find data indirectly, e.g. the number of inquiries an average renter sends before finding an apartment.
Thanks again for any help here! Really appreciate it.
submitted by /u/humanatwork
I’ve been looking on the internet for a while and only got 150 plus for both of the classes, so I am trying my luck here in case anyone has a link to more that I can download. It is for thesis purposes, not commercial.
submitted by /u/leanzky
As I said in the title, I need a dataset about cargo logistics processes. I want to examine cargo delivery performance in terms of delivery time, correct delivery location, etc. I looked for datasets on Kaggle, GitHub, etc. but have not found one yet.
Thank you in advance.
submitted by /u/data_sapien
Hi! Does anyone know of a publicly available dataset that includes a debt collection variable?
submitted by /u/MomentsOfHope
Anyone know where to find property-level data at the county/zip code level? Hoping for sale price, # of beds/baths, sqft, etc. Any help is greatly appreciated!
submitted by /u/88dontrape
In today’s AI-driven world, data reigns supreme, fueling innovation and propelling technological advancements. However, a pressing challenge persists: the fragmented nature of data sources. Despite the abundance of data generated daily, accessing high-quality and diverse datasets remains a daunting task, impeding progress in AI/ML training and development.
The current situation of data sources is characterized by siloed datasets, proprietary restrictions, and limited accessibility. While large corporations and tech giants may have access to extensive datasets, smaller organizations and researchers often struggle to find relevant and comprehensive data for their projects. This scarcity of data not only impedes innovation but also exacerbates inequalities in the AI landscape, favoring those with access to privileged data sources.
Compounding this issue is the lack of compensation for data contributors, creating a lose-lose situation for all parties involved. However, platforms like Ocean, Streamr, and the emerging Nuklai are changing the game by offering compensation for data contributors and providing decentralized marketplaces for data enthusiasts.
Ocean Protocol leads the charge with its decentralized data exchange protocol, enabling secure and privacy-preserving data sharing. Through Ocean Market, users can discover, publish, and consume data assets transparently and in a decentralized manner, addressing the challenge of fragmented data by facilitating seamless data exchange across ecosystems.
On the other hand, Nuklai emerges as a disruptive force, leveraging blockchain technology to create a transparent and inclusive ecosystem for data storage, sharing, and monetization. By empowering data contributors to retain control over their data and receive fair compensation, Nuklai fosters more interaction and metadata availability, especially within data consortiums.
Meanwhile, Streamr stands out for its emphasis on real-time data monetization, providing a decentralized marketplace where users can stream and sell their data streams. With a focus on IoT (Internet of Things) data, Streamr enables devices to securely share data and receive instant compensation. Its data marketplace fosters innovation by providing a platform for buyers and sellers to engage in data transactions, thereby addressing the growing demand for timely and actionable data insights.
While all of these platforms offer unique features and strengths, they collectively contribute to the broader goal of democratizing data access and driving innovation in the AI/ML space. By fostering collaboration, transparency, and fair compensation, these decentralized data protocols are reshaping the data landscape and paving the way for a more inclusive and equitable data economy.
submitted by /u/kuonanaxu
Hello, this is my first time in the subreddit. I’m looking for a data set that I find interesting to use for a project, and I’m pretty into fitness (more so on the muscle gaining / bodybuilding side). My idea is to work on a data set with data on the results / success of different training programs. I’ve been on Kaggle and Awesome Public Datasets, but haven’t found anything yet. If anyone has any recommendations I would really appreciate it!
submitted by /u/noeffortnoreward
I’ve been looking all over but have not been able to find it anywhere. Best I can find is List of S&P 500 companies sub-industry GICS classification. Other than that, the Sector and Industry classification of thousands of stocks is readily obtainable.
Have you found a free resource that has the list of everything GICS classified? If not free, a paid resource is fine as long as it’s not crazy.
Thanks!
submitted by /u/AceDenied
Book summaries data available from Blinkist, Shortform, getAbstract and Instaread.
Text is converted to EPUB/PDF format and audio is in MP3 format.
Price is 25 USD. DM me for more info.
submitted by /u/waqarHocain
I’m trying to create an odds model for darts. Can anyone recommend a dataset, or some variables I should consider if I try to create a synthetic dataset?
submitted by /u/lightuponlight99
Is there anyone who can assist me with obtaining datasets on patient-physician relationships, focusing on cardiovascular diseases and the risk of readmission? I’m having the most difficult time securing the deidentified information needed for my class. Please help.
submitted by /u/Curious-Mind-SG71
I am struggling with how they applied this method with binary variables or with datasets that contain just matrix X without the target variables Y as in classification aims.
submitted by /u/StrongCollection1687
Need to figure out how much UK banks invest in technology from 2010 – 2017.
submitted by /u/prototype101z
Our developers have just created this amazing plugin called “MassiveMark” that allows users to input any markdown and render it to HTML.
So you no longer have to spend hours formatting and editing the content which you directly copied from ChatGPT/Bard/Bing etc.
It also renders equations and formulae (mathematics, physics, chemistry), tables, code blocks, quotes, headings, bold, italics, underline and whatever other formatting you get.
Please check it out on MassiveMark playground at https://www.assignmenthelp.net/massivemark and provide us your feedback, thank you.
(update: We now allow you to download the output as a .Docx file for convenience)
submitted by /u/Professional-Dig-669
Hi everyone, I need any dataset that can be queued for decision making. Orders from a store/restaurant, restaurant seating, etc.
Most everything on kaggle is geared towards machine learning and I don’t know where else to look.
Thanks so much!
submitted by /u/puffball400
I need help finding a data set on new religious movements for my capstone project. I’ve looked on the GSS and couldn’t find anything that relates to what I’m looking for. Any suggestions are appreciated!
submitted by /u/StarvingShark
Hi, we are a small group of three students trying to train an AI to detect this specific kind of nest with cameras. Does anyone have a lot of photos of the nests of Solenopsis invicta (fire ant)? This project is for educational purposes only.
Any dataset containing ant nests would also fit our needs.
We have already tried to contact the authors of some papers from China who have trained an AI on this specific nest, but we have not yet been able to obtain the images.
Thank you all, any help is welcome.
submitted by /u/Beksito
RedditMods is a dataset that anonymously lists the moderators of the 25,834 largest and most popular communities on Reddit. The dataset is ideal for studying Reddit as a bipartite graph, where a moderator-node and a community-node are connected if the associated user moderates that subreddit. Clustering can then be performed to identify groups of subreddits with a particular leaning, or to recommend similar communities.
The data was publicly available and collected on 06 Feb 2024. All usernames were anonymised by hashing with SHA256, so that they cannot be linked to the moderators’ Reddit accounts.
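As a rough illustration of the bipartite view described above, the following Python sketch hashes usernames with SHA256 and scores two communities by the overlap of their moderator sets (Jaccard similarity). The moderator/subreddit pairs are made up, and the helper names `anonymise` and `jaccard` are hypothetical; the actual RedditMods file layout and hashing details may differ.

```python
import hashlib
from collections import defaultdict


def anonymise(username: str) -> str:
    """Return the SHA256 hex digest of a username (unsalted, as a simple assumption)."""
    return hashlib.sha256(username.encode("utf-8")).hexdigest()


# Made-up (moderator, subreddit) edges standing in for the real data.
pairs = [
    ("alice", "r/datasets"),
    ("alice", "r/statistics"),
    ("bob", "r/datasets"),
    ("carol", "r/aww"),
]

# Bipartite graph stored as: subreddit -> set of hashed moderator IDs.
mods_by_sub = defaultdict(set)
for user, sub in pairs:
    mods_by_sub[sub].add(anonymise(user))


def jaccard(sub_a: str, sub_b: str) -> float:
    """Share of moderators two subreddits have in common (0.0 to 1.0)."""
    mods_a = mods_by_sub.get(sub_a, set())
    mods_b = mods_by_sub.get(sub_b, set())
    union = mods_a | mods_b
    return len(mods_a & mods_b) / len(union) if union else 0.0
```

Pairwise scores like these could feed a clustering step or a "similar communities" ranking; other similarity measures would work just as well.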
Visualisations using this data have garnered interest. Other examples: 1, 2.
submitted by /u/OmOshIroIdEs
I’m obviously thinking of ChatGPT, but it has limits on row count. I found alternative projects like datasetGPT, which seems to use multiple OpenAI requests to fill large sets.
Do any of you know of a tool that makes this pretty trivial? Thanks!!
submitted by /u/underbrownmaleroad
Hello! I’m not sure if this is the right place to ask, but I was given some feedback on my dashboard (https://public.tableau.com/views/UFOSightingsintheUS_17069361456020/Dashboard3?:language=en-US&publish=yes&:display_count=n&:origin=viz_share_link) to incorporate a metric that accounts for the population of each state to show sightings per capita instead of just highlighting areas with larger populations. I’m trying to get population data for the years 1990-2014, so I can create a map of the populations by state and then layer the number of sightings on top of this map.
However, I’ve been having an extremely difficult time doing this. I think I may be overthinking it, but I’ve tried to look for the data (Population by State) on the US Census website and haven’t been able to get any dataset for any of the years I want. I did find this dataset on GitHub, which I believe I can use (https://github.com/aaronpenne/data_visualization/blob/master/population/data/USA_Population_of_States_US_Census_Intercensal_Tables_1917-2017.csv) but from here, how do I create a map out of it and connect it to my UFO sightings data? This dataset also doesn’t get properly imported when I try to upload it in Tableau, so I’m also having that issue.
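For the per-capita metric itself, the join is just "sightings count divided by population for the same state and year". Here is a minimal pure-Python sketch of that calculation; the sample rows and population figures are invented for illustration, the function name `sightings_per_100k` is hypothetical, and the real CSV would need to be reshaped into (state, year) keys first.

```python
from collections import Counter

# Invented sample data: one (state, year) tuple per reported sighting.
sightings = [("CA", 1990), ("CA", 1990), ("WY", 1990), ("CA", 1991)]

# Invented population figures keyed the same way as the sightings.
population = {
    ("CA", 1990): 30_000_000,
    ("WY", 1990): 450_000,
    ("CA", 1991): 30_500_000,
}


def sightings_per_100k(sightings, population):
    """Return {(state, year): sightings per 100,000 residents}."""
    counts = Counter(sightings)
    rates = {}
    for key, n in counts.items():
        pop = population.get(key)
        if pop:  # skip state-years that have no population figure
            rates[key] = n / pop * 100_000
    return rates


rates = sightings_per_100k(sightings, population)
```

With these invented numbers, Wyoming ends up with a higher rate than California despite fewer sightings, which is the effect the per-capita view is meant to surface. In Tableau, the equivalent would be a join or relationship on state and year plus a calculated field.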
Sorry if any of this sounds confusing; I can clarify if needed. I just don’t know what to do. I’ve tried asking ChatGPT and looking through Reddit and the Tableau Community, but I’m still lost and need to submit this dashboard today :/
Thank you!
submitted by /u/communityboyfriend