I’m doing a project for class and looking for some data ragarding digital anonymity to analyze.
submitted by /u/kab9713
[link] [comments]
Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?
I’m doing a project for class and looking for some data ragarding digital anonymity to analyze.
submitted by /u/kab9713
[link] [comments]
Hi! I am looking for a data set containing decisions by Indian and English Courts till date. Any leads will be helpful.
submitted by /u/thefifthelemental
[link] [comments]
I’m currently working on a research project focusing on the fascinating world of Over-The-Top (OTT) streaming platforms and how they’re reshaping the entertainment industry. 🍿📺
Specifically, I’m diving into Survival Analysis to understand how various events and industry changes impact user retention over time. 📈⌛
However, here’s where I could use your expertise and assistance! 🙏 I’m on the hunt for a suitable dataset that contains comprehensive information related to OTT platforms, user behavior, subscription details, and relevant industry events. 📦💼
If any of you have access to or know of a dataset that aligns with these criteria, I would be immensely grateful if you could share it with me. Your support and collaboration would be a significant contribution to my research project.
Feel free to drop me a message here if you have any leads or datasets to share.
submitted by /u/Gemperle00
[link] [comments]
I need help finding a dataset concerning geology that I can do my project over. I am taking a rudimentary geology lab and I have a final project where all I need is to discuss a cool geologic finding. I am a stats major and want to use my skills to better this project and potentially add to my resume.
Does anybody have any cool datasets or even topics I should research that I can potentially use for my project? I don’t have a big background on geology but I’m confident enough in my data analysis skills to help me by.
submitted by /u/PersonalityHonest729
[link] [comments]
Hello everyone,
We are currently working on a crucial project that requires access to a specific dataset available on the Answer ALS Data Portal. However, we have encountered an error on the website which is preventing us from accessing the dataset. We have attempted to contact the administrators through the provided email address, but unfortunately, we have not received any response so far.
We are reaching out to this community in hopes that someone might have the required dataset or can assist us in procuring it. The dataset is critical for our ongoing research, and any help would be greatly appreciated.
If you have access to the dataset or know of any alternative way to obtain it, please feel free to reach out. We are willing to discuss any necessary arrangements or collaborations that can help us move forward with our project.
Thank you in advance for your assistance and understanding. Your help could significantly contribute to the progress of our research.
submitted by /u/Weird_Cockroach963
[link] [comments]
Hello everyone,
We are currently working on a crucial project that requires access to a specific dataset available on the Answer ALS Data Portal. However, we have encountered an error on the website which is preventing us from accessing the dataset. We have attempted to contact the administrators through the provided email address, but unfortunately, we have not received any response so far.
We are reaching out to this community in hopes that someone might have the required dataset or can assist us in procuring it. The dataset is critical for our ongoing research, and any help would be greatly appreciated.
If you have access to the dataset or know of any alternative way to obtain it, please feel free to reach out. We are willing to discuss any necessary arrangements or collaborations that can help us move forward with our project.
Thank you in advance for your assistance and understanding. Your help could significantly contribute to the progress of our research.
submitted by /u/Weird_Cockroach963
[link] [comments]
I am taking a regression analysis course this semester. We have a project to do simple linear regression analysis on a data set and in a few weeks another project to do multiple linear regression analysis.
I’ve been searching online for a good data set that I could use for both my SLR and MLR projects. Does anyone have any recommendations on where I could find a data set that I could use?
submitted by /u/Prince_Alizadeh
[link] [comments]
Not sure if this is the right place to ask this.. but anyways
I’m working on a report that presents and analyzes survey results.
One of the survey questions requires of the respondent to pick 3 things out of a list. What type of graph on excel would best illustrate this? I’ve been using horizontal and vertical bar graphs, but those were for data that requires a choice of one thing among a range (Strongly agree— Strongly disagree..)… should I use the same style of graph?
Many thanks in advance
submitted by /u/alsi3dy
[link] [comments]
The original Bert model gives similarities scores of
.5736 between vue.js and react
.6389 between vue.js and k8s
Vue.js and react are both frontend frameworks and kubernetes (also called k8s) is a server orchestrator. Therefore it”s odd that the first score is higher than the second.
Do you know of any pretrained model that can catch this type of tech jargon better ?
submitted by /u/Throweuway
[link] [comments]
I need to turn a bunch of academic PDFs (with tables) into neat JSON files for data extraction. I’m searching for a Python OCR tool that can: do text and table recognition in scholarly papers; spit out well-structured JSON with the extracted info. If you’ve got recommendations, please let me know! Open-source is awesome, but I’m open to anything that does the job well.
Thanks a for your help!
submitted by /u/Apprehensive_View366
[link] [comments]
Do you guys have access or do you have datasets for banana pest and diseases? Do you mind sharing them with me? I am currently working on a mobile app that can detect pests and diseases in banana plants but I only found very few available ones online. Please help.
submitted by /u/killakusha
[link] [comments]
I want to perform analysis on reasons for loan rejection. And specifically need data on number of partial loan offers given. By partial loan I mean, if an individual requested for $100 and they get $80.
Any good sources or methods to access/collect the data is appreciated.
submitted by /u/ajeenkkya
[link] [comments]
I am trying to make a mental health analysis chatbot , where 2 models will be used, one will talk to the user and other model will do the analysis on the user’s inputs. I have enough dataset for the 2nd bot but I am not able to get a good dataset for my chatbot, I am using blenderbot (Facebook) and I need a dataset which has friendly conversation so that the chatbot can learn to talk like a friend and provide consoling outputs. (I am a beginner to this, any would be really really helpful ✨)
submitted by /u/G_Wriath
[link] [comments]
Dear Dear Data People!
Now that Twitter and Reddit APIs are paywalled and pretty much unaffordable for amateur projects, are there some other good social network APIs that you can use for similar projects? I’m quite into NLP and always thought of these two APIs as a steady option for experiments, it’s really devastating to see them go.
Cheers!
submitted by /u/deiteorg
[link] [comments]
I’ve been searching for a while now and I cannot seem to find any pre-built datasets out there that are free. I would like to avoid using the API and doing it myself. I’ll take any large trends data set you got, provided that it is not industry/category specific and more general in nature!
Thank you in advanced!
submitted by /u/Official_Forsaken
[link] [comments]
As the title says, how would I go about creating a dataset for SQL Query generation of a specific language? What steps would I need to follow to create a clean and vibrant dataset? Any resources will be a great plus. Thank you!
submitted by /u/The-Inevitable-One
[link] [comments]
Hi
I am looking for a dataset of YAML format files. Not able to find much through search as its redirecting to individual yaml config files. Suggestion on source through which YAML files can be scrapped would be welcome.
submitted by /u/AshSaxx
[link] [comments]
Does anyone know where I can find RE Sales Data for NYS. I am looking for granular detail: condion, days on mkt, that kind of detail. What years? All that I can get
submitted by /u/Snoo752
[link] [comments]
I can export my Whatsapp messages and Heart Rate data from Garmin.
I’m looking for software that can easily import data from a variety of sources and clean it up quickly and easily.
I’d appreciate ideas
submitted by /u/OculoDoc
[link] [comments]
Delaware Business Licenses 2023: https://app.gigasheet.com/spreadsheet/list-of-delaware-business-licenses—2023/15072fd3_4cb9_4460_b20c_e950b60cb2c2
Delaware Business Licenses 2007-2022: https://app.gigasheet.com/spreadsheet/list-of-delaware-business-licenses-2007-2022/68ab986b_b593_4274_8dd8_6a73f0247261
Source: https://data.delaware.gov/Licenses-and-Certifications/Delaware-Business-Licenses/5zy2-grhr
submitted by /u/n1nja5h03s
[link] [comments]
As the title says, how would I go about creating a dataset for code generation of a specific language? What steps would I need to follow to create a clean and vibrant dataset? Any resources will be a great plus. Thank you!
submitted by /u/RAIV0LT
[link] [comments]
I am playing around with some EPA monitor data. I was wondering if the EPA has a dataset that indicates when their monitor is classified as “nonattainment”, “unclassifiable/attainment”, or “unclassifiable”? Here is what this is defined as from the EPA’s website
“If the air quality in a geographic area meets or is cleaner than the national standard, it is called an attainment area (designated “attainment/unclassifiable”); areas that don’t meet the national standard are called nonattainment areas.”
I feel like I have seen this before in a dataset, but I am having a hard time finding it now. Any help is appreciated!
submitted by /u/jyddyj20
[link] [comments]
I’m looking for the data set which has data about heat sealing inspection. I’ve tried searching everywhere on Google but couldn’t find it. It’s my university project . If this data set is not found , will it just be better to create my own ? Thanks
submitted by /u/Zealousideal-Card747
[link] [comments]
Hi, I am looking for a dataset of English typing users (for biometrics approach). The most interesting for me is text (original typing) and assigning this text to the specific user. Does anyone know or has access to such dataset and would like to share it to me?
submitted by /u/Purple_Vehicle_1983
[link] [comments]
Hello, I want to build an app similar to PC part picker for my end of degree project. Does anyone know of a legal API that contains the data I would need? I saw that there’s repositories out there from people doing data scraping on PC part picker, but I read that it’s a violation against their TOS, so I’d love to do things legally. Thanks!
submitted by /u/Jonthepug
[link] [comments]
So i’m a marine biologisy trying to teach myself data analysis. I figured looking at stuff related to my field would help. Any datasets on fishing/aquaculture. Maybe species, production, prices over time, stuff like that?
submitted by /u/JustaSimpleFisherman
[link] [comments]
Hello, Fellow Data Enthusiasts!
I am looking for a dataset about electric grid congestion for a potential research project.
The central objective is to devise a robust methodology for estimating local voltage levels. This estimation process should consider historical data including voltage level, local supply, and local demand, as well as forecasted demand and power generation data. The product will be a risk classification, supplemented by a confidence level, indicating the likelihood of grid congestion based on the forecasted voltage level.
🔍 Dataset Details:
Ideally, the data should contain information about grid-level 7, but data about other levels would also be beneficial. A proposed structure for the dataset would be as follows:
Time-Series Data
Timestamp Generator ID Node ID House ID Solar Generation (optional) Other Generation (optional) Residential Demand (kW) 2023-10-23 12:00:00 G1 N1 H1 2.0 0.0 6.0 2023-10-23 12:15:00 G2 N2 H2 1.7 0.0 9.0 …. … … … … … …
Node Data
Node ID Type of Node Demand (kW) Generation (kW) Voltage Data (V) Equipment Data N1 Load Node 12.0 16.0 400 NA N3 Transformer 0 0 230 e.g. Rating of the Transformer … … … … … …
Time-Series Data of the Nodes
Timestamp Node ID Voltage Level (V) Demand (kW) Generation (kW) Status Information 2023-10-23 12:00:00 N1 234.6 12.2 16.0 Active 2023-10-23 12:15:00 N2 229.6 9.0 NA Active … … … … … …
🚧 Challenges Faced:
While I’ve tried sourcing data locally, DSOs in Switzerland have proven to be protective of their data. Thus, turning to this knowledgeable community in hopes of discovering alternative avenues or potential sources where such data might be available.
🤖 Synthetic Data:
I’ve examined synthetic data like SimBench, yet securing real data would be a genuine game-changer for this project.
🙏 Your Help:
If anyone is aware of where to find real data that aligns with the aforementioned structure or any related data source that could be helpful, it would genuinely make my day 🙂
Thank You in advance to anyone who takes the time to read this and for any guidance or pointers that you may be able to provide! Feel free to DM if you have data or information you prefer not to share publicly.
submitted by /u/Ok-Environment-3431
[link] [comments]
Please help, if somebody had already parsed this data for any purpose. I’m interested in events from 2022/01/01 till now, all distances up to half-mararhon.
submitted by /u/suharkov
[link] [comments]