Category: Datatards

Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?

Does Dataset Of 3D Models Of Linear Induction Motors Exist?

I am working on quite an ambitious research project related to the design of Linear Induction Motors (LIMs) specifically. It is about generating the shape of a LIM with some given constraints and/or performance targets (thrust, achieved speed, efficiency, etc).

I cannot give away too much information regarding the exact way that I will be using the data, but I am looking for a dataset that consists of 3D model files of LIMs and if possible, the level of performance metrics it is able to achieve on paper or in real world. I can make do without the latter part maybe, but desperately need the 3D model file samples of atleast some LIMs.

I tried searching for anything related in this subreddit, online, and on google datasets site but could not find anything helpful.

Anyone would be kind enough to point me in the right direction in my quest?

In short I need:

  • 3D models of Linear Induction motors
  • Calculated/simulated/real world performance of said motors

submitted by /u/WonderfulMuffin6346
[link] [comments]

Looking For The Full Dataset From The Two Sigma Financial News Kaggle Competition

Hello,
I’m trying to get access to the full dataset from the Two Sigma: Using News to Predict Stock Movements Kaggle competition (it ended a while back and the data is no longer officially available).

I’ve found a small sample, but it’s way too limited for any real analysis or model training.

If anyone still has the full dataset files and would be willing to share or point me in the right direction, I’d be super grateful!

Thanks in advance!

submitted by /u/yuxme
[link] [comments]

Spotify Dataset For Songs From A Single Year

Is there anywhere I can find a dataset for the most popular songs on Spotify in a particular year, for example, 2024? Something like this: https://www.kaggle.com/datasets/sveta151/spotify-top-chart-songs-2022 , with several variables such as length of the song and scores for characteristics like danceability and energy. I need the dataset to have a license that allows use in a data analytics project (it’s for a presentation in university), without profiting from it.

submitted by /u/Middle_Paint571
[link] [comments]

Criminal Dataset For Analytics Dissertation UNFOUND

I am currently working on my Data Analytics Master’s dissertation under the name of « The Use of Data Analytics in Criminal Profiling and Predicting Behavioral Patterns of Violent Offenders » with 2 questions « Q1: What are the key behavioral patterns among violent offenders based on data analytics, Q2: Can machine learning be used to predict the likelihood of recidivism among violent offenders? » I want to find a dataset to work on for this, that would ideally contain real data of criminals with information about them , but I could not find anywhere.. any ideas?

submitted by /u/MethodHour6444
[link] [comments]

Looking For Houthi Conflict Data Set

Hi all. I am looking to do a suitability analysis map for a GIS class and map the safest and most efficient supply routes for military, humanitarian aid, and logistics operations in Yemen (specifically the city of Sanaa) while minimizing exposure to Houthi attack zones (based on past conflicts).

I am pretty new to this, so I was looking for help as to where I could find these data sets? Im okay with vector or raster.

submitted by /u/Deep_Glove71
[link] [comments]

Bus/Trucks Vehicle Make And Models Dataset

Hello,

I’m wondering if I can find here a hint to find all bus and trucks makes and models available worldwide with option on having spareparts products for each of the vehicles.

Is there any way to get this data? I tried a lot of datasets but all of them were either too old or incomplete.

Thank you in advance!

submitted by /u/Senior-Reserve3732
[link] [comments]

Psychiatric Symptoms Dataset For Clustering/PCA/DimRed

Hi all,

I’m looking for a publicly available psychiatric or psychological dataset that includes symptom-level data (ideally from standardized questionnaires like BDI, STAI, PANSS, etc.), independent of DSM diagnostic criteria — along with diagnostic labels (e.g., depression, bipolar, ADHD, control) for comparison.

My goal is to perform PCA or clustering on dimensional features and evaluate how well (if at all) DSM diagnoses align with the natural structure in the data.

So far I’ve explored the UCLA CNP dataset on OpenNeuro, which is promising, but sparsity in many files limits its utility. I’d love alternatives or tips on how to best work with datasets like that.

Any recommendations? Thanks in advance!

submitted by /u/philomath1234
[link] [comments]

Having Trouble Launching Survey Via Facebook Ads.

Hi all,

I am working on my thesis for my MBA and I am completing the survey portion of the paper via Facebook ads. Does anyone here have experience successfully launching a survey via Facebook ads and getting conversions?

If so, any insight or resources that would help me to do this successfully is greatly appreciated. Thanks.

submitted by /u/DrivenCleats
[link] [comments]

Can Anyone Provide Me With A Dataset That Is Dental Or Endodontics Related?

I’m building my data analytics portfolio and am particularly interested in dental or endodontic-related data. Does anyone have recommendations for publicly available datasets or shareable anonymized data from dental or endodontic practices? I’m looking specifically for datasets that could be used for analysis, visualization, and insights relevant to clinical outcomes, patient demographics, treatments performed, revenue, insurance claims, or similar topics.

Thanks in advance for your help!

submitted by /u/Plane_Fail9033
[link] [comments]

[PAID] Huge WhoIs Dataset Available From Http://bestwhois.org/domain_name_data/domain_names_whois/ (Private Access Only)

Hi. I have access to a lot of whois related data, for the last 6 months. Data uploads everyday.

Fields are:

  • id
  • domainName
  • registrarName
  • contactEmail
  • nameServers
  • createdDate
  • expiresDate
  • registrant_email
  • registrant_organization
  • registrant_street1
  • registrant_city
  • registrant_state
  • registrant_postalCode
  • registrant_country
  • registrant_telephone
  • administrativeContact_email
  • administrativeContact_name
  • administrativeContact_organization
  • administrativeContact_street1
  • administrativeContact_city
  • administrativeContact_state
  • administrativeContact_postalCode
  • administrativeContact_country
  • administrativeContact_telephone
  • technicalContact_name
  • technicalContact_organization
  • technicalContact_email
  • technicalContact_street1
  • technicalContact_street2
  • technicalContact_city
  • technicalContact_state
  • technicalContact_postalCode
  • technicalContact_country
  • technicalContact_telephone

DM if interested.

submitted by /u/Persian_Cat_0702
[link] [comments]

Common Crawl Claims To Be Free And Available To Everyone — But That’s Not Really True

Common Crawl advertises itself as “freely available to anyone,” but the reality is much less accessible than that.

Yes, the data is technically free. But to actually use it, you have to deal with:

  • Massive WARC files that require serious compute just to parse
  • Storage and bandwidth costs that can easily hit enterprise-level pricing
  • Complex indexing and filtering tools, many of which assume you’re running this on a cloud infrastructure setup

Unless you’re backed by a company, university, or loaded with cloud credits, you’re priced out. It’s not practical for individuals or small teams.

This kind of marketing gives a false impression of openness. Free data that’s functionally inaccessible to most people isn’t truly free.

Has anyone here actually managed to work with Common Crawl as an independent dev or researcher? Curious what workflows or tools (if any) make it doable without breaking the bank.

submitted by /u/uslashreader
[link] [comments]

Worldwide Presidents And Their Non-presidential Occupations/fields Of Study

Hi,
A while ago, I had a very specific question – what former profession is a president (or any publicly elected head of country) most likely to have? I thought it could be fun and a good way to learn some basics of data processing. But where do I even start?
My initial idea was to scrape off the relevant information off wikipedia or wikidata, but i can’t find a good way to do it. any advice? any pre-existing dataset that could work for this?
i have experience in python coding but have never done anything similar, any resources would help.

submitted by /u/nee_chee
[link] [comments]