Where can i find numberic value only datasets for simple random forest classifier on colab
Need help asap plz!
submitted by /u/Odd-Programmer-9413
[link] [comments]
Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?
Where can i find numberic value only datasets for simple random forest classifier on colab
Need help asap plz!
submitted by /u/Odd-Programmer-9413
[link] [comments]
Hello everyone!
I’m working on a computer vision task that involves detecting broken fences, but the dataset I have is quite small.
I was thinking of generating synthetic data to overcome this issue. Since it’s easier to find images of intact fences, I thought about using an image-to-image model to artificially “break” parts of the fence in those images.
Do you think this approach is feasible? Any suggestions or recommendations on how to implement this?
Thanks in advance!
submitted by /u/_Enf
[link] [comments]
Hello, I am conducting an undergraduate thesis study and am looking for (preferrably) video datasets of Romanian Deadlifts and Squats. I will be performing something involving computer vision models such as MediaPipe and YOLOv8, and I require videos for my study. Thank you in advance!
submitted by /u/imPriv
[link] [comments]
Is there any available dataset of job description and resumes that secured the job based on the job description?
This is for a college project that I’m doing. If anybody knows anything about this help me.
submitted by /u/Adhil-Roshan
[link] [comments]
This might be a bit of a stretch, but I’m hoping to find a dataset of completed project management artefacts, things like schedules, project charters/briefs, RAIDD logs, reports etc. hopefully categorised by types of projects (development work, platform adoption, infrastructure work). I realise that a lot of this work would be proprietary to organisations so I might not have much luck.
submitted by /u/denzmilk
[link] [comments]
I want to reach a data set for labeled data of SNPs or microarray gene expression for Alzheimer’s Disease to train a model.
submitted by /u/Fast-Cry-3438
[link] [comments]
I am looking for carbon emission dataset from India coal mines in recent years to calculate carbon footprint
And appreciate suggestions for machine model to train the dataset
submitted by /u/Devansh_Durgapal
[link] [comments]
I am looking for dataset of videos that scan different items like objectron ?
No need for object detection, segmentation or pose estimation data. Just videos of scanned different items.
submitted by /u/Puzzleheaded_Mall546
[link] [comments]
For a pet project, I want to build a robot that collects fallen apples and clears dog mess from the lawn and garden areas. To identify the items to clear and collect I will need images of the subject items in various poses and scenarios. Whilst I do have both dogs and apples trees, it will take me a while to collect images and also generate variations of those images for training. I thought the best way (maybe not the most sensible) was to ask Reddit. Please people of Reddit, please can you send me images of the requested items from about a metre (3ft) away where possible. email: ozoid at proton dot me
Thank you.
submitted by /u/Ozoid
[link] [comments]
Anyone knows where should I try finding them? We are ready to pay for it. Thankyou so much
submitted by /u/Desperate_Parking_29
[link] [comments]
where can i get datasets related to “agriculture commodities production like for onions, potatoes etc” and their price trends
submitted by /u/nave_en04
[link] [comments]
I need data about MOF material for CO2 capture for post-combustion?
submitted by /u/WildBarracuda4570
[link] [comments]
I’m looking for data on public support for infrastructure projects, particularly in the energy sector. Do you have any recommendations for where to look? I’m new to data science and having a tough time figuring out where to start. All help is appreciated 🙂
submitted by /u/Equivalent-Sorbet-63
[link] [comments]
My company is currently creating a synthetic facial dataset (a 3D geometry head set, based on real human scans). Our set strives to be more diverse with respect to ethnicity, age, body type and gender. Additionally, we have the ability to create an infinite number of facial variations (ie, blended percentages of differing people, thus creating many unique resulting faces)
All of our input source subjects have consented (via a robustly worded model release), to ensure fairness as well as adherence to all current and any future legislation pertaining to facial datasets. 🙂)
My question is: What elements would data scientists like to have, to make their training sets more effective and usable? For example, we currently have 3D and 2D facial tracking points, plus occlusion identifiers. Also, we can completely randomize any aspect of the face (skin, eyes, hair, clothing, etc) and also the rotation of the head, camera view, lighting, background image, etc.
What other things would be useful?
submitted by /u/suzipolklittle
[link] [comments]
Hello, I’m studying statistics using the handbook mentioned above, unfortunately the companion website is no longer available so I don’t know how to access the the data set to perform the example and do some of the end of chapter exercises. So I don’t know if anyone has the said data set and could hand it over to me or has a valid link where I can download it. Thank you in advance for your kind reply.
submitted by /u/Hot-Pay-8850
[link] [comments]
I have downloaded version 18 of Mozilla common voice for Pashto language. There are a lot of folders in the file I downloaded and I am unable to understand what they mean.
Someone help me understand how can I use this data to finetune my whisper model.
submitted by /u/williamsuck
[link] [comments]
Does anybody know if this is available/where can I find this?
submitted by /u/SamiAmjidKhan
[link] [comments]
Hi everyone I have special requests from everyone if anyone can provide me
Dataset :Urdu handwritten Doctor prescription for machine learning project
Thank you Regards Danial Afridi
submitted by /u/Hidanial
[link] [comments]
I have a project in mind for my big data course. I have always been interested in films and movie culture. I currently have a minor in Film Studies as well. I want to predict movie success based on the people associated with each movie. Movie success can be defined either by box office success or critical success such as Oscar nominations. Obviously, it is always an unpredictable thing because a lot of factors lead to the success or failure of a movie. I want to look at if a movie was a success what factors led to that success and if it is a failure what led to that failure. I believe in both “buckets” there will be patterns that show up. For example, does the social media following of an actor have an impact on the box office success of a movie. The idea applies for newer movies more than older movies. There are many data sources where I can retrieve data such as IMDB. Please let me know your thoughts.
My prof. responded by saying that IMDB while being around 5GB may not be enough to be called “big data.” He suggested I look at datasets with text reviews as they can be pretty lengthy and can lead to a larger size.
Is there any way I can get a dataset for this project? I was thinking about web scraping movie reviews as well. If I web scrape, I would use IMDB, Rotten Tomatoes, Letterboxd, etc.
Appreciate all the help!
submitted by /u/Sakaburu
[link] [comments]
Hello there.
I’m part of a team of four Data Analitics’ students and we are searching for a useable dataset to make our capstone. We are searching for a sales dataset of a retail shop. We tried in places like Kaggle and saw in horror that some of the ones that could work for us are the same previous years’ teams had already used or criminally non-updated ones. Trying to search in several places only make us to hit our faces against paywals, some of them extremely high.
The main idea is simple, the registry of sales of that retail shop over time.
If any of you could give some insights of where we could find something workable. There is any company that gives that kind of information for free?
submitted by /u/Most_Breadfruit_2388
[link] [comments]
Hi everyone!
My team and I are studying how different organizations manage their data quality.
This poll is 5 questions and takes <1 min. Take the poll here and get exclusive access to the in-depth report: https://qkbg47fsj9g.typeform.com/to/D6qL7hfB
Confidentiality Notice: Your responses will be kept confidential and won’t be associated with your name or company’s likeness.
Thank you for providing your time and participation!
submitted by /u/BlueStreetDataTeam
[link] [comments]
I am looking for any kind of dataset I am currently conducting research on Contract Lifecycle Management (CLM) and I am looking for datasets related to the management of contracts within CLM systems. Specifically, I am interested in any datasets that provide insights into how contracts are handled, monitored, or executed within CLM platforms.
Additionally, I would like to know if there are any available datasets focused on dispute resolution, especially concerning contractual disputes. Any information or guidance on where to find such data would be highly appreciated.
Thank you in advance for your assistance.
submitted by /u/lahaine93
[link] [comments]
hello guys, ive been looking for a dataset like this for a study im conducting trying to use Neural ODES to make consumption predictions, do any of you know where to get something like this?
submitted by /u/Kaneko_BS
[link] [comments]
Hello!
Does anyone know any good sources of music statistics? I am studying sound production at uni and part of the course requires us to do research on marketing and promotion.
I thought that looking at statistics and weaving that into the report would be a good idea but i cant find anything that’s specific enough and if it is it will be behind a pay wall.
the genre we are researching is punk but I can find a way to tie in a wider genre if punk is too specific.
Edit: mostly looking for demographic statistics and what medium music is consumed
submitted by /u/_callumstewart_
[link] [comments]
My teammates and I are looking for large datasets, ideally revolving around marketing and/or sustainability as that’s where our interests lie. Datasets must be free to download and not be synthetically generated. Thanks in advance!
submitted by /u/LawnMowerMassacre
[link] [comments]
I need a dataset
1) it has to have multiple waves/ be longitudinal .
2) Needs to be easy enough to use I’ve been deemed by a statistics professor as not being “capable enough” to use quantitative data. If it’s not easy to use that is fine. I’ve had to hire a tutor before.
3) looking at hospitalizations, reasons for hospitalization, age, and cause/mode of death.
4) or half of these variables
5) for a human geography population project.
6) our professor wants it to be a public dataset that is national for the states if it is not national it needs to include the United States.
submitted by /u/Rajah_1994
[link] [comments]
Hi everyone,
I’m currently working on my PhD, focusing on reconstructing and creating patient stories and clinical narratives for clinicians using Large Language Models (LLMs). I’m looking for open, unstructured medical notes, ideally related to Remote Patient Monitoring. If the dataset also includes some quantitative data, that would be even better!
I’ve already looked into MIMIC and am considering applying for access, but I’m wondering if there are any other datasets or sources that might be useful for my research. Any recommendations or pointers would be greatly appreciated!
Thanks in advance!
submitted by /u/Stealthy_Nachos
[link] [comments]
Hey all,
This is my first post in this sub. I am looking for a dataset that I would’ve assumed would be easy to find but I’m having no luck 🙁 As the US politics has been a recent fixation for me, a small project I would like to start involves looking at currently tipped occupations (ie waiters, cashiers, hair salons etc) and comparing the income that comes from tips currently to what we will observe in the future due to both parties (Dem and Rep) committing to a tax free tip policy. So far the closest dataset I have found is this from the US bureau of labor stats however it only details their gross pay (I’m assuming this means pre tax) and includes the tips. This doesn’t help much because as a part of this project I would like to answer the questions;
(i) Will these occupations force more tips onto consumers due to the policy change?
(ii) Will other occupations that don’t currently get tipped begin to take tips in order to get more tax free income?
I unfortunately don’t see how I can answer these questions if the tips are included and the numbers are pre tax 🙁
Any help or suggestions is welcome and appreciated.
submitted by /u/Girgis99
[link] [comments]