Category: Other Nonsense & Spam

Music Charts: AT40, Country, Latin, British Charts

I am looking for a comprehensive list of songs and their positions on the American Top 40 (or 100), Country, Latin, R&B, and British music charts since inception. Ideally it would have weekly rankings showing current position, previous week’s position, song title, and artist. Does anyone know where I can find this, if it is available?

submitted by /u/TexasBound1973

Auto Claims Dataset [Insurance Dataset]

I am looking for a text + columnar dataset related to auto insurance claims; an ideal dataset would include customer data and insurance claim data. For the insurance data, the lifecycle would run from the first notice of loss made by the customer to the insurance company paying out or rejecting the claim.

It need not be real; a synthetic dataset would also do.

submitted by /u/willing-Stres

Mobile Vs Desktop/Laptop Internet Traffic? [Looking For A Dataset]

Hi all,

I’m looking for a dataset that details mobile vs desktop (or laptop) internet traffic. This can be global or specific to a country (global would be best, but I’m being a bit of a beggar with this, so anything would do).

I’d like to use it to try and do some sort of time-series forecasting.

If anyone knows where I could find a dataset like that, I’d massively appreciate it!

submitted by /u/EddieDemo

Databoutique.com, A Marketplace For Web Data

Hi 👋 all! We’re building a marketplace for web data (https://www.databoutique.com).

If you need web data for training models or app development, you can ask the community for it. The goal is to save time and cut down on scraping costs.

The basic idea is that most of the time you’ll need data that someone is already scraping, so it’s faster and easier to ask for it than to run the scrape again yourself.

We’re in an early phase, so any feedback is welcome. We hope this helps lower the barriers to data.

submitted by /u/Pigik83

Dealing With Missing Standard Deviation Due To Only 1 Observation

Hi,

I have the following problem: I need the standard deviation as part of my regression, so I restrict the data to at least 3 observations per category for a specific year. However, I also want to include the data with only 1 or 2 observations. Of course, with 1 observation there is no standard deviation, and with 2 it is kind of pointless. The standard deviation is only a control variable, but it is vital for the result.

Does anyone know how I could handle this so that I can still include these years for the categories with only 1 or 2 observations and not ruin my regression?
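
To make the problem concrete, here is a minimal sketch using only the Python standard library. The rows, category names, and the `sd_missing` flag are hypothetical and only for illustration. It computes the sample standard deviation per category and year where it is defined and marks the single-observation groups instead of dropping them. Whether a flag like this is appropriate as a regression input is an assumption that depends on the model; the sketch only shows where the value is undefined.

```python
from collections import defaultdict
from statistics import stdev

# Hypothetical rows: (category, year, value)
rows = [
    ("A", 2019, 4.0), ("A", 2019, 6.0), ("A", 2019, 8.0),
    ("B", 2019, 5.0), ("B", 2019, 7.0),
    ("C", 2019, 3.0),
]

# Collect the observed values for each (category, year) group.
groups = defaultdict(list)
for category, year, value in rows:
    groups[(category, year)].append(value)

summary = {}
for key, values in groups.items():
    n = len(values)
    # The sample standard deviation divides by n - 1,
    # so it is undefined when a group has a single observation.
    sd = stdev(values) if n >= 2 else None
    summary[key] = {"n": n, "sd": sd, "sd_missing": sd is None}

for key, stats in sorted(summary.items()):
    print(key, stats)
```

Keeping `n` next to each standard deviation also makes it easy to see which estimates rest on only 2 observations.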

submitted by /u/Basilis988

Chinese Outward Foreign Direct Investment Data

Hi!

My first post here asked whether anyone knew how to access Chinese OFDI data sorted by country, which some researchers seem to use frequently. I can now finally share where exactly the data comes from, and I hope this saves another poor soul from spending hours looking for it.

Unsurprisingly, you have to search in Chinese to find the data:

XX 年度中国对外直接投资统计公报 (Statistical Bulletin of China’s Outward Foreign Direct Investment; replace XX with the year you are searching for)

sample: http://images.mofcom.gov.cn/hzs/201810/20181029160118046.pdf

Hope this helps and have fun!

submitted by /u/BlueApple12

Looking For M&A And Series Funding Rounds Datasets

Hi All,

I am looking for dataset options that would provide the following:

For M&A:

Buyer, acquisition target, announcement date, close date, deal value

For Series funding rounds:

Company, series round, closing date, round amount secured

So far I have been using Google, but I feel that is not the best way to get the data I need.

submitted by /u/malaya100

How Would You Go About Populating Your Own Data Set Similar To Yelp And Google Maps?

I’m trying to build an app for travel. I’ve looked at the APIs and they’re very expensive and don’t allow for long-term caching. OSM is interesting but also requires you to abide by ODbL, which isn’t great if you don’t want to share proprietary info. Are there any approaches or alternatives to using an API or OSM? I haven’t been able to find any great data sets to bulk purchase.

submitted by /u/marvinshkreli

Subnational COVID-19 Vaccination Data (Europe)

Hello, I am looking to assemble a subnational data set on COVID-19 vaccinations in Europe (preferably at the level of NUTS-2 or NUTS-3 regions; mixed would also be okay). I have seen some maps for individual countries, so the data seems to be somewhere out there, at least for some. Does anyone know of any resources in that direction?

I’d like to focus on the EU27, but other European countries are also fine. I appreciate any help, be it aggregate data sets or data for individual countries. Thanks!

submitted by /u/lu2idreams

NCAA Men’s Basketball March Madness Historical Results

I’m looking for some historical results in order to do some analysis and submit my bracket. Essentially, I’m looking for something like this, but more up to date:

https://blog.softartisans.com/2013/03/19/march-madness-predict-your-ncaa-basketball-brackets-with-excel/

https://blog.softartisans.com/wp-content/uploads/2013/03/NCAATournamentBracket.xlsx (like the first “data” tab; just the seeds, team names, round, and score are good enough, but more is better)

Does anyone have anything like this? Thanks!

submitted by /u/JlwRfwkm

Questions Regarding The ADNI Dataset

Hey, so I am working on a project related to Alzheimer’s detection and intend to publish a paper. I used a dataset of MRIs that I found on Kaggle. The author of the dataset says he took the images from ADNI and some other sources. Can I use that data in my paper, and if so, how do I cite it?

Also, I got permission to access the ADNI dataset yesterday and downloaded the MRIs, but the images have no labels indicating whether the subject is demented or not. If anyone could help me with this, I would really appreciate it, as I need to prepare the project ASAP. Thanks

submitted by /u/pg_blue

Similarity Semantic Search Sentences Or Paragraphs

Hi! I am doing experiments in semantic similarity search. Given a sentence, I need to find the most similar sentence in a data set that consists of sentences or paragraphs, using semantic search. This means I need sentences that I know are similar. How would I go about finding similar sentences and compiling the data set?

submitted by /u/MultiTiger

Moving Mean Of A Moving Mean. Is This OK?

Hello, I have a very noisy data set that I am plotting. Even after applying a moving average over the nearest 1,000, or even up to 10,000, neighbors, the result is still not smooth enough to give a reasonable plot.

But after applying a moving average to the initial moving average, the plot looks pretty good and a trend is clear.

Is this ever OK? Is there a name for this? I see online that there is something called a “double moving average”, but this is mostly posted on stock-trading sites.

My data set is 200,000 values measured at 2000 Hz.
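
For reference, applying a simple moving mean twice with the same window is the same as applying one weighted moving average with triangular weights (a box convolved with a box gives a triangle). The sketch below checks that numerically with the standard library only. The function names, the made-up `data`, and the window of 5 are assumptions for illustration, not anything from the post.

```python
def moving_mean(values, window):
    """Simple moving mean over full windows only (no edge padding)."""
    if window < 1 or window > len(values):
        raise ValueError("window must be between 1 and len(values)")
    running = sum(values[:window])
    out = [running / window]
    for i in range(window, len(values)):
        # Slide the window: add the new value, drop the oldest one.
        running += values[i] - values[i - window]
        out.append(running / window)
    return out


def triangular_weights(window):
    """Weights equivalent to applying a moving mean of size `window` twice."""
    # A box of length w convolved with itself gives 1, 2, ..., w, ..., 2, 1.
    raw = list(range(1, window + 1)) + list(range(window - 1, 0, -1))
    total = window * window  # the raw weights sum to w squared
    return [r / total for r in raw]


def weighted_moving_mean(values, weights):
    """Weighted moving average over full windows only."""
    n = len(weights)
    return [
        sum(w * v for w, v in zip(weights, values[i:i + n]))
        for i in range(len(values) - n + 1)
    ]


# Made-up values standing in for the noisy signal.
data = [float((i * 37) % 11) for i in range(50)]

twice = moving_mean(moving_mean(data, 5), 5)
once = weighted_moving_mean(data, triangular_weights(5))

# Largest difference between the two results (floating-point noise only).
max_diff = max(abs(a - b) for a, b in zip(twice, once))
print(len(twice), len(once), max_diff)
```

At 2000 Hz, 200,000 values cover 100 seconds, so a 1,000-sample window spans 0.5 seconds, and the second pass widens the effective window to 1,999 samples.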

submitted by /u/2059097