submitted by /u/cavedave
[link] [comments]
Category: Datatards
Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?
Unfortunately most of my searches is cluttered with info regarding the 2023 strikes. If anyone has or knows where to find date and location data for the 2010 demonstrations it would be much appreciated.
submitted by /u/Colonia_
[link] [comments]
Need a good data set for performing simple preprocessing methods,data visualization and model creation and prediction
submitted by /u/Terrible-Ad-1079
[link] [comments]
Hello /r/datasets community!
I came across an interesting fact about Zyla Labs and thought it might be of interest here, especially for those in search of comprehensive datasets for various projects. It appears that Zyla Labs, primarily through its API hub, has amassed a substantial collection of datasets, now boasting over 2000 different datasets.
Zyla Labs might not be the most talked-about name in the data industry, but the scale of their dataset repository is quite impressive. For researchers, analysts, and anyone in need of diverse data for analysis, the variety offered by Zyla could prove to be a valuable resource.
Has anyone here utilized Zyla’s datasets for their projects? It would be great to hear about your experiences or any insights on the quality and usability of the data they provide.
Cheers to more data and better insights!
submitted by /u/alejandrobrega
[link] [comments]
I’ve been looking for a dataset to create an anime chatbot for my discord server. I haven’t had any luck on Kaggle and I don’t want to resort to web scraping off Crunchyroll. Do you have any recommendations?
submitted by /u/SeniruSan13
[link] [comments]
I’m a graduate student at UT. I’m working on my research related to prediction of EV charging demands on the grid. I have a dataset from a company called FleetCarma which recorded charging session information of about 850 EVs in Canada from 2017-2019 (Real-world dataset).
I also want to see cost-effectiveness of managed charging on distribution lines by seeing how different managed charging demand curves look v/s unmanaged charging demands curves.
Now I need a dataset from a utility or charging point operator (CPO) showing change in load curves due to managed charging programs. If any relevant dataset comes to your mind please share the resource link. Your guidance can help me complete my masters thesis in time.
submitted by /u/Positive_Interest402
[link] [comments]
Hey Reddit community,
I’m embarking on a research project that focuses on understanding the impact of Artificial Intelligence (AI) on e-commerce platforms. To this end, I am in search of datasets that provide insights into how AI influences key performance metrics. I am particularly interested in data related to:
Conversion rate
Customer satisfaction
User experience
Site visibility in search engines
Site loading speed
The aim is to analyze these metrics across major online retail platforms (like Amazon, eBay, Shopify, Etsy, Walmart) where AI plays a significant role in shaping their strategies and operations.
If anyone here has access to such datasets, knows where they might be available, or can point me towards resources or communities that could help in this regard, I’d be immensely grateful. This information is crucial for my research, as it will enable a comprehensive understanding of AI’s real-world effectiveness in e-commerce.
Any leads, advice, or guidance you can provide would be invaluable to my project.
Thanks so much for your help!
submitted by /u/Lilia_HA
[link] [comments]
My friends and I are discussing what the best investments would be if we had a time machine and opened a lemonade stand. We’re assuming that remembering exacy days or weeks to trade could be hard enough that we want appoximates (so summary statistics for buckets of time are totally okay). I wanted to write a little script that takes in two time points and returns the top N stocks by return. Bonus if total volume/market cap is available so that we can also calculate what % ownership each would consistute.
Edit: should say early 2010s in title, not 2010#
submitted by /u/Ok-Needleworker-6595
[link] [comments]
I’m taking a data analytics course and I’m trying to find at least one free dataset that has the right information about minerals, rocks, and/or crystals. I’m hoping to analyze the data to determine which minerals/stones hold the most potential for further scientific research. Factors that could determine whether or not there is research potential for a mineral/stone include:
Piezoelectric properties Electromagnetic shielding capabilities The ability to filter water
I’m also open to suggestions of other factors to look for; those three are just the things I thought of off the top of my head as a rock/crystal enthusiast.
I can find plenty of mineral/stone datasets, but I’m struggling to find any that include the particular information I’m looking for. Is anyone able to suggest any datasets that contain this information?
submitted by /u/Angel_dot_exe
[link] [comments]
Hi, I’m looking for a FREE dataset of all flights (either global or from/to UK) starting from March 2022 up until today (January 29, 2024).
Ideally the dataset will include the Datetime of the flight (departure or arrival), airline, departure/arrival airport.
If you guys know sources where I could get this data that would be very helpful. Thanks!
submitted by /u/Tiny-Magician-9125
[link] [comments]
As many folks are, I am looking for work. I am in search of a resource for companies headquartered by state or even region. Will someone point me in the right direction? TIA
submitted by /u/SociallyAwkwardLibra
[link] [comments]
Hello, I’ve recently been looking through this subreddit for data on video game sales and the first few I’ve looked at show significantly different information. I know that sales data is not public so information must be collected through things like press releases and may not be fully accurate, but I was wondering:
How would you get over these inconsistencies if you were doing a project and finding a lack of coherence between different datasets?
Does anyone know of any source for video game sales data that is regarded as the most reliable or widely used?
Here are some that I’ve looked at so far for reference:
https://www.kaggle.com/datasets/gregorut/videogamesales
https://www.kaggle.com/datasets/baynebrannen/video-game-sales-2020
https://www.kaggle.com/datasets/thedevastator/global-video-game-sales-ratings/discussion
submitted by /u/VinceTheCat02
[link] [comments]
Hello ,
I am currently working on a data science project that involves analyzing invoices (factures) in the French language. Unfortunately, I have been unable to find a suitable dataset for this specific task. If anyone has access to or knows of a dataset containing French invoices, your help would be greatly appreciated.
Specifically, I am looking for data that includes information such as invoice amounts, dates, itemized details, and any other relevant fields. The dataset should be in French, as it is crucial for my analysis.
This is quite urgent, so any assistance or guidance on where to find such datasets would be incredibly valuable. If you have any leads or suggestions, please feel free to share them.
Thank you in advance for your help!
submitted by /u/Personal_Ad3341
[link] [comments]
Looking for information on U.S Health Insurance Providers and the number of members they have. Is this available somewhere?
submitted by /u/artfuldawdg3r
[link] [comments]
I am currently writing a research project that aims to analyse the uptake of vaccination by brand in Indonesia compared to another country, however I cannot find a time series based dataset that shows the administration of vaccines by brand, on a daily basis.
Does this information publicly exist for Indonesia? The World Health Organisation omits Indonesia for daily vaccines by brand but there are websites in Indonesian that provide more data but I am struggling to read them due to the language! (https://vaksin.kemkes.go.id/#/detail_data)
If anyone knows how I could access this data via public datasets or health APIs etc, please help.
submitted by /u/silver_89
[link] [comments]
I need to scrape content of a online pharmacy site. I need the medicine name, composition and price of all the products present on that site. Any pointers on how this can be achieved?
btw 1mg.com is the site
submitted by /u/Real_Cut_9360
[link] [comments]
Im new in DL projects. Ive been trying to search a dataset that should have atleast three columns sentence1, sentence2, their semantic similarity. So far i found SICK dataset and snli but something else would be more suitable for my task so do you know any datasets like this.
basically im trying to build a system that searches for most similar sentence to the query in a video transcript. suppose u have a podcast video you take its subtitles and do a query and it will give u timestamps of the most similar sentence so for that ill grab a bert model and fine tune on some semantic similarity dataset. it will be good if the dataset is based upon a certain style, topic or domain. like for example, sentences on technology or animal documentary or some human conversation or anything basically
submitted by /u/Deferfire
[link] [comments]
Hi! I am looking for a dataset for a research course I am currently taking. Ideally the dataset would be:
– In the field of housing, poverty, employment/labour, finance or something else Economics-related
– Machine Learning applicable: so far this is the area I have struggled with. Most of the datasets that I find present lots of GDP data, or housing data, etc. instead of many predictor variables and then a target variable
– Following on from above, a somewhat balanced dataset in terms of categorical variables and continuous variables
– Panel Data/Pooled cross-sectional data would be a bonus
– Canadian-applicable data would be a bonus
If anyone has come across a dataset that fits some of these criteria, please let me know!
submitted by /u/Own_Application9253
[link] [comments]
I’m doing research on defence integration in the EU. I’m looking for some data about military exchanges between member states. If I can’t directly get that information, I’ve thought about possibly the number of foreign officers working at ministries of defence in member states, but I can’t seem to find this information anywhere. Thanks for the help.
submitted by /u/NotSoSilentRob
[link] [comments]
I would like a recommendation for a service or dataset where I would be able to easily access free worldwide city historical weather data.
Ultimately I would like to create a database with columns for: City, Country, month-year, temp high, temp low, days of rain, humidity, wind. I want to build my own database to easily run queries against it (rather than always making an API call against a service any time I want data for a city).
Requirements: 1 full year of data (either by day or by month) City, country, temperature, precipitation Scrape-able or downloadable dataset
Nice to have: Free Humidity, wind, days of sun, days of rain, temp high, temp low As many cities from around the world as possible
Any help would be appreciated!
submitted by /u/Asleep_Parsley_4720
[link] [comments]
Hello, I’m currently working on a project that requires historical order book data for the BTC/USDT pair from Binance. I’ve checked various public data repositories and forums but haven’t found anything beyond basic price and volume history. If anyone knows where I could get this specific dataset, please let me know. Thank you!
submitted by /u/Bayes_T
[link] [comments]
This competition was organized by the Imperial College in London, UK. I can’t seem to find where did they collect the data from, or when was it done… the metadata documentation basically.
Link: https://www.kaggle.com/competitions/loan-default-prediction/overview
submitted by /u/uchat24
[link] [comments]
Hello, I’m a PhD student and am working with the THOR data released by the US Department of Defence. It’s quite detailed, giving me the geolocation of the bombs dropped on Vietnam by the US between 1965 and 1975. It also has details of the date of mission, among many other information. However, it does not have casualties data. I was wondering if there was a publicly available dataset where I can link casualties (both Vietnamese and US) to the missions. Thank you!
submitted by /u/mamil2608
[link] [comments]
Looking into PitchBook , we def can’t afford the price… I’m wondering, we only want it for the Emerging Industries section, is there something that competes with it for that???
We’re just looking to do way deeper market research than we can do currently.
submitted by /u/Salt-Resolution2113
[link] [comments]
I am currently completing my capstone project regarding data analytics which requires me to find a data set with 50 K rows if anyone could please help me as I haven’t been able to find any data sets this large, I am looking for something finance related as data set s regarding banks or stocks would do thank you in advance.
submitted by /u/Own_Ad_7041
[link] [comments]