Category: Datatards

Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?

UK Bank Branches Location Dataset / Api

Hey, just trying to do some analysis on banks in the UK. However, I’m finding that a lot of available datasets are outdated. Any chance someone could help me find an up to date data set with location and number of banks? I’m aware that each each bank has their own locator API, but I’m unfamiliar with APIs.

Thanks in advance.

submitted by /u/FrostyJozoid
[link] [comments]

MENA Region Restaurant/food Consumption Data

Hi all,

I’m involved in a food project in the Middle East – specifically Iraq/Kuwait. I was wondering if you know any good market research tools/websites/resources/reports on these countries/Middle East region – specifically related to restaurants/food/franchises/what works vs. what doesn’t etc. Or things like average spend per person etc. The closer it relates to food/restaurants the better.

I”m in the beginning stages of working with franchises and need to research to assess the market in terms of needs and what could work vs what doesn’t. Having an insight into regional countries would give me something to think about, as there seems to be a real dearth of research/data.

Appreciate any info/tips/help. Thank you.

submitted by /u/RevolutionaryWalk592
[link] [comments]

Looking For Hours/minutes Of Precipitation Data (not Amount Or % Chance)

For a travel-related project, I need a dataset that contains the total duration of rainfall, as this is the most relevant measure of whether you can spend time outside in a given location. For example: Miami might average 5cm of rainfall, but it may all happen in 30 minutes. Seattle might average only 1cm, but spread out over 8 hours of a day.

Has anyone come across a weather dataset that has precipitation duration metrics? Haven’t found anything like this on WeatherSpark, wunderground, etc.

Thanks!

submitted by /u/uberdev
[link] [comments]

Event-Based, Fine-Grained Tennis Matches Dataset?

Does anyone know of a tennis matches dataset that’s event-based? I mean a dataset listing every event in a tennis match with timestamps, categories, etc.

For instance, I wanted to answer some questions which I think are only answerable with such a dataset, such as:

In the last set of the game, how often does the person who starts win? Does it matter to start a set? What about a tiebreak? How often do reversals happen? On best 3s? On best of 5s? How beneficial is it to have contested games vs cleanly won games? Does that percentage change when you get to the top when compared to lower ranked players? Can game duration predict winners? Out of those players who got injured during a game, how many end up winning? Out of those, how many manage to keep on winning?

I think having something like this would be so valuable for bettors…

I suspect the ATP does have a dataset like this, but I think they do not intend on sharing it.

submitted by /u/Fanaro009
[link] [comments]

Multilingual Corpus With Text Data And Coordinates

Hi guys!
We have collected a multilingual corpus with text data and coordinates. The dataset is divided into the 123 most populated regions of the world: ~500,000 messages from social media + their coordinates, each in a separate json file according to the region. The dataset is suitable for tasks such as geotagging text data. Use it, share your opinion 🤗
PS we also have a similar dataset with timestamps, let me know if you need it 👾

submitted by /u/robvbar
[link] [comments]

Does Anyone Know Where I Can Find Price Discrimination Data?

I want a dataset of prices where there is an increasing price for higher Consumption bands. An example could be Consumption caps in the covid pandemic panic (on toilet papers, masks or Alcohol gel). Other example could be residential energy Consumption bands, where heavy spenders are penalized, or other environmental tax is applied.

I want specifically transacional data, longitudinal per consumer, but time series panel data with average price per location that shows this higher price by higher Consumption Pattern would also be Nice.

I can find only limited/paid datasets. I would be of great help!

submitted by /u/TomSargent
[link] [comments]

Looking For Repetitive Islamic Patterns Dataset

Hi Guys
I am embarking on a new project where I would like to train a GAN to create Islamic patterns for doors, roofs, and other furniture. I’ve attached a few examples of what I have in mind as I am looking for a dataset I can use for training.
Any directions or recommendations will be appreciated, and advise on the model are welcome as well.

submitted by /u/Prestigious_Virus_33
[link] [comments]

Looking For Repetitive Islamic Patterns Dataset

Hi Guys
I am embarking on a new project where I would like to train a GAN to create Islamic patterns for doors, roofs, and other furniture. I’ve attached a few examples of what I have in mind as I am looking for a dataset I can use for training.
Any directions or recommendations will be appreciated, and advise on the model are welcome as well.

submitted by /u/Prestigious_Virus_33
[link] [comments]

US Midwestern City Populations By Year

Hello friends, I am doing some exploration with central place theory, and I would love a dataset where all cities in a midwestern state are listed by population per year (or decade), from the date of founding (or as far back as possible). Where might I find such a thing? I could scrape “historical population” on Wikipedia but I am sure more streamlined data exists somewhere

submitted by /u/bc_951
[link] [comments]

Need A Dataset For Survival Analysis

I’m currently working on a research project focusing on the fascinating world of Over-The-Top (OTT) streaming platforms and how they’re reshaping the entertainment industry. 🍿📺

Specifically, I’m diving into Survival Analysis to understand how various events and industry changes impact user retention over time. 📈⌛

However, here’s where I could use your expertise and assistance! 🙏 I’m on the hunt for a suitable dataset that contains comprehensive information related to OTT platforms, user behavior, subscription details, and relevant industry events. 📦💼

If any of you have access to or know of a dataset that aligns with these criteria, I would be immensely grateful if you could share it with me. Your support and collaboration would be a significant contribution to my research project.

Feel free to drop me a message here if you have any leads or datasets to share.

submitted by /u/Gemperle00
[link] [comments]

Are There Any Cool Geology Datasets?

I need help finding a dataset concerning geology that I can do my project over. I am taking a rudimentary geology lab and I have a final project where all I need is to discuss a cool geologic finding. I am a stats major and want to use my skills to better this project and potentially add to my resume.

Does anybody have any cool datasets or even topics I should research that I can potentially use for my project? I don’t have a big background on geology but I’m confident enough in my data analysis skills to help me by.

submitted by /u/PersonalityHonest729
[link] [comments]

Rgent Help Needed: Unable To Access Dataset On Answer ALS Data Portal Due To Site Error

Hello everyone,

We are currently working on a crucial project that requires access to a specific dataset available on the Answer ALS Data Portal. However, we have encountered an error on the website which is preventing us from accessing the dataset. We have attempted to contact the administrators through the provided email address, but unfortunately, we have not received any response so far.

We are reaching out to this community in hopes that someone might have the required dataset or can assist us in procuring it. The dataset is critical for our ongoing research, and any help would be greatly appreciated.

If you have access to the dataset or know of any alternative way to obtain it, please feel free to reach out. We are willing to discuss any necessary arrangements or collaborations that can help us move forward with our project.

Thank you in advance for your assistance and understanding. Your help could significantly contribute to the progress of our research.

https://dataportal.answerals.org/request-access

submitted by /u/Weird_Cockroach963
[link] [comments]

Rgent Help Needed: Unable To Access Dataset On Answer ALS Data Portal Due To Site Error

Hello everyone,

We are currently working on a crucial project that requires access to a specific dataset available on the Answer ALS Data Portal. However, we have encountered an error on the website which is preventing us from accessing the dataset. We have attempted to contact the administrators through the provided email address, but unfortunately, we have not received any response so far.

We are reaching out to this community in hopes that someone might have the required dataset or can assist us in procuring it. The dataset is critical for our ongoing research, and any help would be greatly appreciated.

If you have access to the dataset or know of any alternative way to obtain it, please feel free to reach out. We are willing to discuss any necessary arrangements or collaborations that can help us move forward with our project.

Thank you in advance for your assistance and understanding. Your help could significantly contribute to the progress of our research.

https://dataportal.answerals.org/request-access

submitted by /u/Weird_Cockroach963
[link] [comments]

Need Help Finding Dataset For SLR And MLR Project

I am taking a regression analysis course this semester. We have a project to do simple linear regression analysis on a data set and in a few weeks another project to do multiple linear regression analysis.

I’ve been searching online for a good data set that I could use for both my SLR and MLR projects. Does anyone have any recommendations on where I could find a data set that I could use?

submitted by /u/Prince_Alizadeh
[link] [comments]

Best Graph To Visualize A Dataset Where Respondents Have To Choose 3 Things Out Of A List

Not sure if this is the right place to ask this.. but anyways

I’m working on a report that presents and analyzes survey results.

One of the survey questions requires of the respondent to pick 3 things out of a list. What type of graph on excel would best illustrate this? I’ve been using horizontal and vertical bar graphs, but those were for data that requires a choice of one thing among a range (Strongly agree— Strongly disagree..)… should I use the same style of graph?

Many thanks in advance

submitted by /u/alsi3dy
[link] [comments]

I’m Looking For An Embedding Fine Tuned For Tech Words.

The original Bert model gives similarities scores of

.5736 between vue.js and react

.6389 between vue.js and k8s

Vue.js and react are both frontend frameworks and kubernetes (also called k8s) is a server orchestrator. Therefore it”s odd that the first score is higher than the second.

Do you know of any pretrained model that can catch this type of tech jargon better ?

submitted by /u/Throweuway
[link] [comments]

Need Help Finding Python OCR Tool For Scholarly PDFs To JSON

I need to turn a bunch of academic PDFs (with tables) into neat JSON files for data extraction. I’m searching for a Python OCR tool that can: do text and table recognition in scholarly papers; spit out well-structured JSON with the extracted info. If you’ve got recommendations, please let me know! Open-source is awesome, but I’m open to anything that does the job well.

Thanks a for your help!

submitted by /u/Apprehensive_View366
[link] [comments]