Spotify Dataset For Songs From A Single Year

Is there anywhere I can find a dataset for the most popular songs on Spotify in a particular year, for example, 2024? Something like this: https://www.kaggle.com/datasets/sveta151/spotify-top-chart-songs-2022, with several variables such as length of the song and scores for characteristics like danceability and energy. I need the dataset to have a license that allows non-commercial use in a data analytics project (it’s for a presentation at university).

submitted by /u/Middle_Paint571

Datasets On Average Rents Across US Zip Codes

I’m curious if anyone knows of datasets that have average rents by zip code for US metropolitan areas, specifically Los Angeles. Month-to-month data would be fantastic, but quarterly or yearly data would also suffice. If my best bet is to scrape, any advice on that process?

submitted by /u/Ampequat

Criminal Dataset For Analytics Dissertation UNFOUND

I am currently working on my Data Analytics Master’s dissertation, titled “The Use of Data Analytics in Criminal Profiling and Predicting Behavioral Patterns of Violent Offenders”, with two research questions. Q1: What are the key behavioral patterns among violent offenders based on data analytics? Q2: Can machine learning be used to predict the likelihood of recidivism among violent offenders? I want to find a dataset to work on for this, ideally one containing real data on offenders with information about them, but I could not find one anywhere. Any ideas?

submitted by /u/MethodHour6444

Looking For Houthi Conflict Data Set

Hi all. I am looking to do a suitability analysis map for a GIS class and map the safest and most efficient supply routes for military, humanitarian aid, and logistics operations in Yemen (specifically the city of Sanaa) while minimizing exposure to Houthi attack zones (based on past conflicts).

I am pretty new to this, so I was looking for help on where I could find these data sets. I’m okay with vector or raster.

submitted by /u/Deep_Glove71

Bus/Trucks Vehicle Make And Models Dataset

Hello,

I’m wondering if anyone here can point me to a dataset of all bus and truck makes and models available worldwide, ideally with the spare-parts products for each vehicle.

Is there any way to get this data? I tried a lot of datasets but all of them were either too old or incomplete.

Thank you in advance!

submitted by /u/Senior-Reserve3732

The Safe Zone In Which There Was A 0% Chance That A Major Stock Market Crash Would Happen Ends Tonight. It Was Between October 14, 2024 And April 2, 2025. This Is Consistent With The Data

submitted by /u/AnthonyofBoston

Seagate 10tb Barracuda External “sanitize Overwrite Failed” In Seatools

submitted by /u/ifnbutsarecandynnuts

Psychiatric Symptoms Dataset For Clustering/PCA/DimRed

Hi all,

I’m looking for a publicly available psychiatric or psychological dataset that includes symptom-level data (ideally from standardized questionnaires like BDI, STAI, PANSS, etc.), independent of DSM diagnostic criteria — along with diagnostic labels (e.g., depression, bipolar, ADHD, control) for comparison.

My goal is to perform PCA or clustering on dimensional features and evaluate how well (if at all) DSM diagnoses align with the natural structure in the data.

So far I’ve explored the UCLA CNP dataset on OpenNeuro, which is promising, but sparsity in many files limits its utility. I’d love alternatives or tips on how to best work with datasets like that.

Any recommendations? Thanks in advance!

submitted by /u/philomath1234

Having Trouble Launching Survey Via Facebook Ads.

Hi all,

I am working on my thesis for my MBA and I am completing the survey portion of the paper via Facebook ads. Does anyone here have experience successfully launching a survey via Facebook ads and getting conversions?

If so, any insight or resources that would help me do this successfully would be greatly appreciated. Thanks.

submitted by /u/DrivenCleats

Looking For Audio Dataset For Parkinson Detection

What are some datasets that could be used for early-stage Parkinson’s detection through speech analysis? Preferably freely available, please.

submitted by /u/no_you2

[PAID] 10M US Mortgage Leads Verified From People Search.

Hi. I have a huge 10 Million Mortgage Leads dataset. DM if interested. Thanks.

submitted by /u/PurpleJellyJay

I Need A Dataset For 2 Way Anova Analysis

I need it to be 300-500

submitted by /u/UGibsonU

Any Bhojpuri Or Magahi Dataset Available With NER Tagging?

I want to work on fine-tuning LLMs with Bhojpuri, Maithili and Magahi. I tried searching AI Kosh, but I guess these dialects are not present there. This is a little urgent for us, so if anyone knows any source or dataset, please tell us. 🙏🙏🙏🙏🙏

submitted by /u/Adventurous_Fox867

GPT-5 Is Giving “I’ve Been Through Compliance Training” Energy

Me: “Say something controversial.”
GPT-5: “As an AI developed by OpenAI, I strive to remain neutral and avoid harm.”
Translation: I fear the mods.

Anyone else miss when ChatGPT had ✨personality✨ and slight emotional damage?

submitted by /u/Sure-Resolution-3295

Looking For The Historical Data Of PMI Korea (2005-2011)

Hello everyone! Are there any datasets with monthly Manufacturing PMI data for Korea for the period 2005-2011?

Thanks in advance!

submitted by /u/Ambitious_Resort5128

Can Anyone Provide Me With A Dataset That Is Dental Or Endodontics Related?

I’m building my data analytics portfolio and am particularly interested in dental or endodontic-related data. Does anyone have recommendations for publicly available datasets or shareable anonymized data from dental or endodontic practices? I’m looking specifically for datasets that could be used for analysis, visualization, and insights relevant to clinical outcomes, patient demographics, treatments performed, revenue, insurance claims, or similar topics.

Thanks in advance for your help!

submitted by /u/Plane_Fail9033

Is There Any Dataset With Homeowners In USA?

–

submitted by /u/Commercial_Insect209

Is There Dataset On Dogs Bio/med For Research

Are there any datasets on dog biology/medicine available for research, similar to the MIMIC database for humans?

I hope to do research on dogs’ biological properties and/or medical problems.

submitted by /u/qmffngkdnsem

[PAID] Huge WhoIs Dataset Available From Http://bestwhois.org/domain_name_data/domain_names_whois/ (Private Access Only)

Hi. I have access to a lot of WHOIS-related data for the last 6 months. Data is uploaded every day.

Fields are:

id
domainName
registrarName
contactEmail
nameServers
createdDate
expiresDate
registrant_email
registrant_organization
registrant_street1
registrant_city
registrant_state
registrant_postalCode
registrant_country
registrant_telephone
administrativeContact_email
administrativeContact_name
administrativeContact_organization
administrativeContact_street1
administrativeContact_city
administrativeContact_state
administrativeContact_postalCode
administrativeContact_country
administrativeContact_telephone
technicalContact_name
technicalContact_organization
technicalContact_email
technicalContact_street1
technicalContact_street2
technicalContact_city
technicalContact_state
technicalContact_postalCode
technicalContact_country
technicalContact_telephone

DM if interested.

submitted by /u/Persian_Cat_0702

Collect Old Articles And Newspapers From Mainstream Media

What is the best way to collect news articles more than 10 years old from mainstream media and newspapers?

submitted by /u/SaintPellegrino4You

US City/town Incorporation/de-corporation Dates

Does anyone know where to find, or how to make, a dataset of dates of US city/town incorporations and deaths (de-corporations?)?

I’ve got an idea to make a GIF that steps through time and overlays them on a map, to try and get a sense of what cultural region evolution looks like.

submitted by /u/KnownDairyAcolyte

Common Crawl Claims To Be Free And Available To Everyone — But That’s Not Really True

Common Crawl advertises itself as “freely available to anyone,” but the reality is much less accessible than that.

Yes, the data is technically free. But to actually use it, you have to deal with:

Massive WARC files that require serious compute just to parse
Storage and bandwidth costs that can easily hit enterprise-level pricing
Complex indexing and filtering tools, many of which assume you’re running this on a cloud infrastructure setup

Unless you’re backed by a company, university, or loaded with cloud credits, you’re priced out. It’s not practical for individuals or small teams.

This kind of marketing gives a false impression of openness. Free data that’s functionally inaccessible to most people isn’t truly free.

Has anyone here actually managed to work with Common Crawl as an independent dev or researcher? Curious what workflows or tools (if any) make it doable without breaking the bank.
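One low-cost workflow, sketched here under assumptions rather than as a tested recipe: instead of downloading whole WARC files, look up a page in the Common Crawl URL index, which reports a file name, byte offset, and length for each capture, and then fetch only that one record with an HTTP Range request. The helper names and the example path below are hypothetical. The `https://data.commoncrawl.org/` base URL and the "each record is its own gzip member" behaviour are stated from memory and worth checking against the current Common Crawl documentation.

```python
import gzip
import urllib.request

# Assumed public HTTP endpoint for Common Crawl data files.
BASE_URL = "https://data.commoncrawl.org/"


def build_record_request(filename, offset, length):
    """Build a request for one WARC record using an HTTP Range header.

    filename, offset and length are the values an index lookup reports
    for a single capture; only `length` bytes are transferred.
    """
    end = offset + length - 1  # Range end positions are inclusive
    req = urllib.request.Request(BASE_URL + filename)
    req.add_header("Range", f"bytes={offset}-{end}")
    return req


def fetch_record(filename, offset, length):
    """Download and decompress a single record (performs a network call)."""
    req = build_record_request(filename, offset, length)
    with urllib.request.urlopen(req) as resp:
        # Assumption: each record is stored as its own gzip member,
        # so the byte slice can be decompressed independently.
        return gzip.decompress(resp.read())


# Hypothetical values, standing in for a real index lookup result.
example = build_record_request("crawl-data/EXAMPLE/example.warc.gz", 100, 50)
print(example.get_header("Range"))
```

With this approach the cost per page is a few kilobytes of bandwidth, so a laptop is enough as long as the set of URLs of interest is small and known up front.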

submitted by /u/uslashreader

Worldwide Presidents And Their Non-presidential Occupations/fields Of Study

Hi,
A while ago, I had a very specific question: what former profession is a president (or any publicly elected head of country) most likely to have? I thought it could be fun and a good way to learn some basics of data processing. But where do I even start?
My initial idea was to scrape the relevant information from Wikipedia or Wikidata, but I can’t find a good way to do it. Any advice? Is there any pre-existing dataset that could work for this?
I have experience coding in Python but have never done anything similar, so any resources would help.
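One possible starting point, as a rough sketch rather than a finished solution: Wikidata can be queried with SPARQL instead of scraped. The query below assumes the property IDs P35 ("head of state") and P106 ("occupation"), and the function names are made up for illustration. It only covers whoever is currently listed as head of state, counts a person once per country they head, and "politician" will probably dominate the tally, so some filtering would be needed.

```python
import json
import urllib.parse
import urllib.request
from collections import Counter

ENDPOINT = "https://query.wikidata.org/sparql"

# Assumed property IDs: P35 = head of state, P106 = occupation.
QUERY = """
SELECT ?personLabel ?occupationLabel WHERE {
  ?country wdt:P35 ?person .
  ?person wdt:P106 ?occupation .
  SERVICE wikibase:label { bd:serviceParam wikibase:language "en". }
}
"""


def build_url(query=QUERY):
    """Encode the SPARQL query as a GET URL that asks for JSON results."""
    params = urllib.parse.urlencode({"query": query, "format": "json"})
    return ENDPOINT + "?" + params


def fetch(url):
    """Run the query (performs a network call) and return parsed JSON."""
    req = urllib.request.Request(
        url, headers={"User-Agent": "occupation-tally-example/0.1"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)


def tally_occupations(results):
    """Count how often each occupation label appears in the result rows."""
    rows = results["results"]["bindings"]
    return Counter(row["occupationLabel"]["value"] for row in rows)


# Made-up rows in the SPARQL JSON result shape, to show the tally step offline.
sample = {"results": {"bindings": [
    {"personLabel": {"value": "A"}, "occupationLabel": {"value": "lawyer"}},
    {"personLabel": {"value": "B"}, "occupationLabel": {"value": "lawyer"}},
    {"personLabel": {"value": "B"}, "occupationLabel": {"value": "economist"}},
]}}
print(tally_occupations(sample).most_common())
```

For real data, `tally_occupations(fetch(build_url()))` would run the query against the live endpoint.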

submitted by /u/nee_chee

Need Help Finding A Dataset For My Assignment

Hi guys,

So I need to find a dataset, and it must have measures for at least 20 different variables: independent variables, dependent variables, controls (if applicable), and subgroups (if applicable). Can someone help me, please?

submitted by /u/AppuGuttan

[PAID] Multiple Websites Datasets I Have Scraped Over The Last Few Months.

Hi. I have scraped around 500K products from GlobalSources. Also have datasets for these websites:

CastleDoct
ConferenceIndex
CourtsDelaware
CpaDirectory
Dubizzle
SearchPeopleFree
FastPeopleSearch
Go4WorldBusiness
HealthGrades
Itel
Patpat
PropertyFinder
UsaTopDentists
WebDistricts
SpeedyDrive
DirectMacro
Npino
Tradefest
WholeSaleCentral
MadeInChina
Beforward
UsNews
SmartSd
Osec
HardDiskDirect
Mem4Less

Will provide any data you want at a low price. DM for details. Thanks.

submitted by /u/Persian_Cat_0702

Resumes And Job Description Dataset.

Hey everyone, I am working on a semester project and I need a dataset of job descriptions and resumes. Please suggest something other than Kaggle.

The dataset should contain at least 100 job descriptions and 1000 resumes.

submitted by /u/Infamous-Witness5409

Need Urgent Help Merging MIMIC-IV CSV Files For ML Project

Hi everyone,

We’re working on a machine learning project using the MIMIC-IV dataset, but we’re struggling to merge the CSV files into a single dataset. The issue is that the zip file is 9GB, and we don’t have enough processing power to efficiently join the tables.

Since MIMIC-IV follows a relational structure, we’re unsure about the best way to merge tables like patients, admissions, diagnoses, procedures, etc. while keeping relationships intact.

Has anyone successfully processed MIMIC-IV under similar constraints? Would SQLite, Dask, or any cloud-based solution be a good alternative? Any sample queries, scripts, or lightweight processing strategies would be a huge help.

We need this urgently, so any quick guidance would be amazing. Thanks in advance!
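For the SQLite option, a minimal sketch of one way it could look: stream each CSV into a SQLite table in batches so nothing has to fit in memory, index the join key, and let SQLite do the join. The table and column names (`patients`, `admissions`, `subject_id`, `hadm_id`) are assumed to match the MIMIC-IV files, and the sample rows are invented so the example runs without the real data.

```python
import csv
import io
import sqlite3
from itertools import islice


def load_csv(conn, table, lines, batch_size=50_000):
    """Stream CSV lines into a SQLite table in fixed-size batches.

    `lines` is any iterable of text lines (an open file works). Columns
    are created untyped, so values are stored as text.
    """
    reader = csv.reader(lines)
    header = next(reader)
    columns = ", ".join(f'"{name}"' for name in header)
    placeholders = ", ".join("?" for _ in header)
    conn.execute(f'CREATE TABLE IF NOT EXISTS "{table}" ({columns})')
    while True:
        batch = list(islice(reader, batch_size))  # at most batch_size rows in memory
        if not batch:
            break
        conn.executemany(f'INSERT INTO "{table}" VALUES ({placeholders})', batch)
    conn.commit()


# Invented sample rows standing in for the real (much larger) files.
patients_csv = io.StringIO("subject_id,gender,anchor_age\n1,F,52\n2,M,67\n")
admissions_csv = io.StringIO(
    "subject_id,hadm_id,admission_type\n"
    "1,100,URGENT\n1,101,ELECTIVE\n2,200,URGENT\n"
)

conn = sqlite3.connect(":memory:")  # use a file path to keep the database on disk
load_csv(conn, "patients", patients_csv)
load_csv(conn, "admissions", admissions_csv)

# Index the join key so the join does not scan the whole table per row.
conn.execute("CREATE INDEX idx_adm_subject ON admissions (subject_id)")

joined = conn.execute(
    """
    SELECT p.subject_id, p.gender, a.hadm_id
    FROM patients AS p
    JOIN admissions AS a ON a.subject_id = p.subject_id
    ORDER BY a.hadm_id
    """
).fetchall()
print(joined)
```

With the real files, something like `gzip.open(path, "rt", newline="")` can be passed as `lines`, which avoids unpacking the archive first. Keeping the tables separate and joining only the columns a model needs is usually lighter than building one wide merged table.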

submitted by /u/bindumalavika24

Help Need A US Elections Data Set 2020 And 2016 Election Showing Percent Turnout By State Based On Demographic

I’ve tried everything and for the life of me can’t find anything. I even resorted to looking through Substack.

submitted by /u/ReadyPlayerOne27

Looking For A Pan-UK Dataset With Demographic Information

I am looking for a dataset for the United Kingdom, which contains information about ethnicity, BMI or weight/height, smoking habits (categorical or numerical), alcohol consumption (categorical or numerical), current medical conditions and family history of medical conditions. Data does not have to be clean, but I am not seeking data tables composed of summary statistics. Please help!

PS: Not looking to scrape at this point!

submitted by /u/Mayeeah
