submitted by /u/xshopx
Category: Datatards
Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting; I’m interested. Maybe they know where the chix are. But what do they need it for? World domination?
Just as the title says. I am in search of raw data on the marijuana industry, ideally broken down by state: demographics, product categories, amount spent within each category, brands, that sort of thing.
Any ideas where I can find something like this?
submitted by /u/Sckeet
I am looking for a list of federal charges that I can use as reference data when extracting mentions of said charges from unstructured text. For example, such a list would include things like:
Possession with Intent to Distribute 50 Grams or More of Methamphetamine
Possession with Intent to Distribute 28 Grams or More of a Mixture or Substance Containing Cocaine Base
Possession with Intent to Distribute Cocaine
Possession with Intent to Distribute Heroin
I know I can get text extracts of the US Code, but what I am looking for is a way to detect something like “Possession with Intent to Distribute 50 Grams or More of Methamphetamine” in freeform text and then, ideally, crosswalk it to a reference in the USC (for example, “50 grams or more of methamphetamine, its salts, isomers, and salts of its isomers or 500 grams or more of a mixture or substance containing a detectable amount of methamphetamine, its salts, isomers, or salts of its isomers;”).
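To make the detection-plus-crosswalk idea concrete, here is a minimal sketch that matches a small reference list against freeform text, ignoring case and extra whitespace. The charge phrases come from the examples in this post; the citation attached to each phrase is an illustrative assumption and would need to be checked against the actual US Code, and the names `CHARGES` and `find_charges` are made up for this example.

```python
import re

# Reference list: phrases from the examples above. The citation values are
# illustrative assumptions, not a verified crosswalk.
CHARGES = {
    "Possession with Intent to Distribute 50 Grams or More of Methamphetamine": "21 U.S.C. § 841",
    "Possession with Intent to Distribute Cocaine": "21 U.S.C. § 841",
    "Possession with Intent to Distribute Heroin": "21 U.S.C. § 841",
}

def _pattern(phrase):
    # Allow any run of whitespace between words and ignore letter case.
    words = [re.escape(w) for w in phrase.split()]
    return re.compile(r"\b" + r"\s+".join(words) + r"\b", re.IGNORECASE)

# Try longer phrases first so the most specific charge wins when spans overlap.
PATTERNS = sorted(((_pattern(p), p) for p in CHARGES), key=lambda t: -len(t[1]))

def find_charges(text):
    """Return (charge, citation) pairs in the order they appear in the text."""
    hits, taken = [], []
    for pattern, phrase in PATTERNS:
        for m in pattern.finditer(text):
            # Skip a match that overlaps a span already claimed by a longer phrase.
            if any(m.start() < end and start < m.end() for start, end in taken):
                continue
            taken.append((m.start(), m.end()))
            hits.append((m.start(), phrase, CHARGES[phrase]))
    return [(phrase, cite) for _, phrase, cite in sorted(hits)]
```

Exact phrase matching like this will miss paraphrased charges; a fuzzy matcher or a trained entity recognizer would be the next step, still keyed on the same reference list.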
submitted by /u/thegrif
Hey all!
I’m a marketing master’s student and for one of my assignments, I have to interpret the digital metrics of a certain campaign or company using GA4. The demo account only offers data from the Google Merchandise Store or Flood-It!. I want to find more interesting campaign data that I can use!
I understand that most companies keep their metrics confidential but is there any resource online that hosts digital metrics data of different companies to use for educational purposes?
I would love to get all the help I can!
I really appreciate any help you can provide.
submitted by /u/obnoxiouschatterbox
Anybody who has verified emails, please let me know. I’m in desperate need of data. Thanks in advance (:
submitted by /u/Tabasco4realtho
I really needed a daily count of violent crime in London but I don’t think it exists.
I decided to try any other related datasets and see if there is a correlation when aggregating them into months and checking against my monthly violent crime data. If it correlates well, I’ll use the dataset as a way to split the monthly violent crime into daily.
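The plan described here can be sketched in a few lines: aggregate a daily proxy series to months, check its correlation with the monthly crime counts, then split each monthly total across days in proportion to the proxy. This is only a sketch under the assumption that dates are ISO strings (`YYYY-MM-DD`); the function names are made up for illustration.

```python
from collections import defaultdict
from math import sqrt

def monthly_totals(daily):
    """Sum a {"YYYY-MM-DD": value} series into {"YYYY-MM": total}."""
    totals = defaultdict(float)
    for day, value in daily.items():
        totals[day[:7]] += value
    return dict(totals)

def pearson(xs, ys):
    """Pearson correlation between two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

def split_monthly(monthly_crime, daily_proxy):
    """Allocate each monthly total to days in proportion to the daily proxy."""
    proxy_month = monthly_totals(daily_proxy)
    return {
        day: monthly_crime[day[:7]] * value / proxy_month[day[:7]]
        for day, value in daily_proxy.items()
        if day[:7] in monthly_crime  # skip days with no monthly total
    }
```

The daily estimates sum back to the monthly totals by construction, but they inherit the proxy's day-to-day pattern, so they are only as good as the correlation suggests. A month whose proxy total is zero would need separate handling.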
Any dataset with a daily count of something in London that may correlate well with violent crime would be appreciated.
Thank you
submitted by /u/infinity123248
Hello everybody, I’m trying to create a model that detects damage on a car and estimates the repair cost, and I need data for the cost-estimation part: car brand and model, damage area, damage severity, location, and claim amounts. It would be helpful if someone could point me to a dataset like this.
submitted by /u/Filthygamer11
Hello,
Looking for data similar to that used in this paper https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10648304/.
The dataset is “openly available” at https://www.aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&aihubDataSe=realm&dataSetSn=138, but I am unable to access it.
Any similar acoustic data for pipeline conditions (e.g., leaks) would be appreciated.
submitted by /u/-_ll_-
I’m looking for average monthly temperature and precipitation datasets so I can do a Köppen climate classification by country. It would be best if the data went up to 2022, though this isn’t a strict requirement. I haven’t had any luck finding a dataset for this, so if anyone finds one and is willing to share, please do!
submitted by /u/PaleontologistOk3713
Hi everyone,
I have a project where I need to know the daily number of crimes in London. The police datasets I’ve come across record crime monthly, which doesn’t work for me.
Can anyone help please?
submitted by /u/infinity123248
Hello!
I am interested in analysing weather trends using images. Therefore, I am in search of a dataset of images which are taken at the same location every hour or every day (timestamped). The image doesn’t need to be exclusively of the sky, but the sky must be partially visible. Ideally I would like 1 year’s worth of data.
I have looked into FAA weather cams and other live web cam sites, but I could not ascertain whether these sources had historical data archived. Currently I am scheduling a python script to take a screenshot of these sites at specified times, but the job is only executed when my computer isn’t asleep.
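For reference, the capture step can be as small as the sketch below. It assumes the camera exposes a direct image URL, which may not be true of every site; the URL would be supplied by the caller, and the function names and file layout are made up for illustration.

```python
from datetime import datetime, timezone
from pathlib import Path
from urllib.request import urlopen

def snapshot_path(out_dir, when):
    """Build a sortable, timestamped file name such as 20240102T030405Z.jpg."""
    return Path(out_dir) / when.strftime("%Y%m%dT%H%M%SZ.jpg")

def save_snapshot(url, out_dir="snapshots"):
    """Download the current camera frame and store it under a UTC timestamp."""
    when = datetime.now(timezone.utc)
    path = snapshot_path(out_dir, when)
    path.parent.mkdir(parents=True, exist_ok=True)
    with urlopen(url, timeout=30) as resp:  # url is whatever image link the camera offers
        path.write_bytes(resp.read())
    return path
```

Using UTC in the file name keeps frames in order across daylight-saving changes, which matters when a year of hourly images is compared.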
submitted by /u/mineralwatercritic
Hi! I’m an Applied Stats major and for our capstone we have to choose a statistical topic and go in depth on the learning of it and use it on a dataset. If you couldn’t tell by the title I’m doing bootstrap stats.
My issue is that since it’s simulating missing/limited population data I don’t know where I could find datasets to use for it and even more so what type.
Would anyone be willing to point me in the right direction?
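For context on what kind of data fits: the bootstrap resamples an observed sample with replacement, so any dataset with a numeric column of modest size can work. Below is a minimal sketch of a percentile bootstrap confidence interval using only the standard library; the function name and default settings are assumptions chosen for illustration.

```python
import random
import statistics

def bootstrap_ci(sample, stat=statistics.mean, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval for a statistic of one sample."""
    rng = random.Random(seed)  # fixed seed so the result is reproducible
    n = len(sample)
    # Recompute the statistic on resamples drawn with replacement.
    estimates = sorted(stat(rng.choices(sample, k=n)) for _ in range(n_resamples))
    lower = estimates[int((alpha / 2) * n_resamples)]
    upper = estimates[int((1 - alpha / 2) * n_resamples) - 1]
    return lower, upper
```

Passing `stat=statistics.median` gives an interval for the median instead, which is a case where the bootstrap is handy because no simple formula exists.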
submitted by /u/No-Parfait6484
Hello! I’m looking for a “large” dataset for a class project. It needs to have providers with multiple products and consumer behavior information (basically consumers interacting with different providers). An example of this would be something like Amazon or eBay where users buy from different sellers. Thanks!
submitted by /u/AnasBuhayh
If anybody is interested, I have 15 MB of artist data, which comes to approximately 1,107,600 artists. It’s sorted from A to Z, and I can share a portion for a specific letter if you need proof. DM me for more information.
submitted by /u/alpingo232
I was wondering if there is some website that shows me all subreddits by member count or by the date the sub was created from oldest to newest.
submitted by /u/WoReddi
I am looking for a dataset that contains these variables for my economics capstone project. I’ve been looking for hours and I don’t know if it’s what I’m searching or what but I cannot find anything related to this.
I’m not really too worried about what the other variables are, just as long as it has those two and the other variables aren’t “weird”, I guess.
submitted by /u/gonelikewind
I am currently doing research regarding natural language processing and have no idea where to start for gathering data. Mainly, I need a way to get both Big 5 scores and writing samples from the user in order to train the model. Any ideas where to look?
submitted by /u/BrxuJrg
Unfortunately, most of my searches are cluttered with info regarding the 2023 strikes. If anyone has or knows where to find date and location data for the 2010 demonstrations, it would be much appreciated.
submitted by /u/Colonia_
Need a good dataset for performing simple preprocessing methods, data visualization, and model creation and prediction.
submitted by /u/Terrible-Ad-1079
Hello /r/datasets community!
I came across an interesting fact about Zyla Labs and thought it might be of interest here, especially for those in search of comprehensive datasets for various projects. It appears that Zyla Labs, primarily through its API hub, has amassed a substantial collection of datasets, now boasting over 2000 different datasets.
Zyla Labs might not be the most talked-about name in the data industry, but the scale of their dataset repository is quite impressive. For researchers, analysts, and anyone in need of diverse data for analysis, the variety offered by Zyla could prove to be a valuable resource.
Has anyone here utilized Zyla’s datasets for their projects? It would be great to hear about your experiences or any insights on the quality and usability of the data they provide.
Cheers to more data and better insights!
submitted by /u/alejandrobrega
I’ve been looking for a dataset to create an anime chatbot for my discord server. I haven’t had any luck on Kaggle and I don’t want to resort to web scraping off Crunchyroll. Do you have any recommendations?
submitted by /u/SeniruSan13
I’m a graduate student at UT. I’m working on my research related to prediction of EV charging demands on the grid. I have a dataset from a company called FleetCarma which recorded charging session information of about 850 EVs in Canada from 2017-2019 (Real-world dataset).
I also want to assess the cost-effectiveness of managed charging on distribution lines by comparing managed charging demand curves with unmanaged charging demand curves.
Now I need a dataset from a utility or charging point operator (CPO) showing the change in load curves due to managed charging programs. If any relevant dataset comes to mind, please share the resource link. Your guidance can help me complete my master’s thesis in time.
submitted by /u/Positive_Interest402
Hey Reddit community,
I’m embarking on a research project that focuses on understanding the impact of Artificial Intelligence (AI) on e-commerce platforms. To this end, I am in search of datasets that provide insights into how AI influences key performance metrics. I am particularly interested in data related to:
Conversion rate
Customer satisfaction
User experience
Site visibility in search engines
Site loading speed
The aim is to analyze these metrics across major online retail platforms (like Amazon, eBay, Shopify, Etsy, Walmart) where AI plays a significant role in shaping their strategies and operations.
If anyone here has access to such datasets, knows where they might be available, or can point me towards resources or communities that could help in this regard, I’d be immensely grateful. This information is crucial for my research, as it will enable a comprehensive understanding of AI’s real-world effectiveness in e-commerce.
Any leads, advice, or guidance you can provide would be invaluable to my project.
Thanks so much for your help!
submitted by /u/Lilia_HA
My friends and I are discussing what the best investments would be if we had a time machine and opened a lemonade stand. We’re assuming that remembering exact days or weeks to trade could be hard enough that we want approximates (so summary statistics for buckets of time are totally okay). I wanted to write a little script that takes in two time points and returns the top N stocks by return. Bonus if total volume/market cap is available so that we can also calculate what % ownership each would constitute.
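The script itself is short once the prices are in hand. Here is a minimal sketch, assuming prices are already loaded as a mapping of ticker to `{date: adjusted close}`; that layout and the function name are assumptions for illustration, not a particular data provider's format.

```python
def top_n_by_return(prices, start, end, n=5):
    """Rank tickers by simple return between two dates.

    prices: {ticker: {date_string: price}} (assumed layout).
    Tickers missing either date, or with a non-positive start price, are skipped.
    """
    returns = {}
    for ticker, series in prices.items():
        if start in series and end in series and series[start] > 0:
            returns[ticker] = series[end] / series[start] - 1
    # Highest return first, truncated to the top n.
    return sorted(returns.items(), key=lambda kv: kv[1], reverse=True)[:n]
```

With approximate time buckets, `start` and `end` could simply be bucket labels (for example a month) mapped to that bucket's average price. Percentage ownership would additionally need shares outstanding at the start date.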
Edit: the title should say early 2010s, not 2010.
submitted by /u/Ok-Needleworker-6595