Category: Datatards

Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?

How To Get A GDP Breakdown For Sub-industries?

Hi guys,

I need for a project to get the data of the GDP of countries by sub-industries and the best would be to have it breakdown using the Global Industry Classification Standard (or an other advanced standard that shows sub-industries).

I wasn’t able to found data that was that much precise (most GDP by sectors or some big sectors by not going into industries & sub). So maybe the data needed is on a special website that I don’t know or is hardly accessible by a simple Google search.

Thanks for any response / upvote / help.

submitted by /u/Haunting_Taste6349
[link] [comments]

Is There A Longitudinal Dataset On US Newspaper Ownership Such That I Can Track Changes In The Ownership Of Any Given US Newspaper/daily Over A Period Of Time?

I want to look at how change in ownership affects the type of information conveyed by a newspaper, especially in cases where the acquirer may have a vested commercial motive. For example, there has been a significant uptick in the number of US newspapers acquired by private equity players. I’d like to see if such acquisitions affect the choice and delivery of content that may have direct commercial implications for the private equity owner.

submitted by /u/Charming-Incident600
[link] [comments]

Anyone Have/know Where To Find A Dataset For The Following:

Hi, so for my AP statistics project, I have to analyze two quantitative variables (separately). I am trying to answer the following question: How does the annual enrollment rate in STEM courses at educational institutions correlate with the annual increase in women pursuing careers in STEM fields over the past years?

Additionally, here are more specifications:

Response Variable: The number of women in STEM increased every year.

Explanatory Variable: The enrollment rate of STEM courses in different American Institutes.

Parameter: The population correlation coefficient between the annual enrollment rate in STEM courses and the number of women pursuing careers in STEM fields over the past years.

Null Hypothesis (H0):

“There is no significant correlation between the annual enrollment rate in STEM courses and the annual increase in the number of women pursuing careers in STEM fields over the past years (ρ = 0).”

Alternative Hypothesis (Ha):

“There is a significant correlation between the annual enrollment rate in STEM courses and the annual increase in the number of women pursuing careers in STEM fields over the past years (ρ ≠ 0).”

Please comment on any links to places where I can find raw quantitative datasets that are CSV files.

submitted by /u/Aloeiq
[link] [comments]

Seeking Assistance: Categorizing 500K Food Products Into Specific Categories

I’m currently faced with the task of categorizing a massive inventory of 500,000 food products into specific categories such as meat, dairy, pastry, and more. Despite extensive searches, I haven’t been able to locate a dataset that provides products with their corresponding categories.
I’ve scoured various sources, including old posts on this Subreddit, but unfortunately, I found nothing. If anyone could point me in the right direction or share a relevant dataset, I would greatly appreciate the help. Thank you in advance!

submitted by /u/omar_zr
[link] [comments]

Looking For A Set With OTC/RX Medications, Recommended Dosages, And Safe Dosing Intervals

I’m brainstorming a project and while I’m sure a set like this exists in the world, I imagine the risk of misuse and liability makes it difficult for someone without a doctorate to get their hands on. Looking to you folks for even a mock dataset/csv that would have something like

Ibuprofen, 200-400mg, 4-6 hours

Eventually, I would like to work towards a more complete dataset that factors body mass where applicable but something to this effect with even just OTC recommendations would be a huge boon

TIA!

submitted by /u/Life-Particular-9708
[link] [comments]

Where Can I Get A Zillow Rentals Dataset?

I need a zillow dataset of rentals, along with all their details, for a research project. I know zillow is very possessive of their data, but it needs not be current – is there a way to get a dataset of old rental listings from somewhere?
Alternatively, is there a different dataset that I could use that would provide a similar level of details on rentals? I know there are probably a lot of sources where I could get a footage, bedrooms/bathrooms and a price, but zillow provides data such as laundry machine/drier unit availability, pet policy and pet rent, etc. Are there any datasets like that available?

Thank you in advance

submitted by /u/SofisticatiousRattus
[link] [comments]

NIST Ballistics Toolmark Research Database

The NIST Ballistics Toolmark Research Database (NBTRD) is an open-access research database of bullet and cartridge case toolmark data. The development of the database is sponsored by the U.S. Department of Justice’s National Institute of Justice. The database is being developed to:

foster the development and validation of measurement methods, algorithms, metrics, and quantitative confidence limits for objective firearm identification

improve the scientific knowledge base on the similarity of marks from different firearms and the variability of marks from the same firearm, and ease the transition to the application of three-dimensional surface topography data in firearms identification.

The database contains traditional reflectance microscopy images and three-dimensional surface topography data acquired by NIST or submitted by database users. The goal is a collection of data sets that:

-represents the large variety of ballistic toolmarks encountered by forensic examiners, and

-represents challenging identification scenarios, such as those posed by consecutively manufactured firearm components.

submitted by /u/lurklord_
[link] [comments]

Princeton University ML Great Datasets

Princeton University ML Datasets

contents [8puzzle.zip – aol.zip – assign.zip – autocomplete-tst.zip – autocomplete.zip – backtrack.zip – bacon.zip – baseball.zip – batcher.zip – bins.zip – bottle.zip – burrows.zip – circle.zip – collinear.zip – factor.zip – goldberg.zip – kdtree.zip – linksort.zip – location.zip – map.zip – markov.zip – model.zip – moviedb-3.24.zip – netflix.zip – paths.zip – percolation.zip – puzzle.zip – queues.zip – redundant.zip – rogue.zip – seamCarving.zip]

Link 1

https://www.up-4ever.net/pskmv8n6p3p4

link 2

https://www.file-upload.org/wa9xtfas8fd1

submitted by /u/DataExpx
[link] [comments]

How To Sift Through Papers More Accurately Using These Search Terms?

Hi everyone

I’m trying to create a search for an analysis that I’m doing in rural health australia but I’m unable to sift through anymore of the papers and my current search is yielding 10,938.

how can i imrpove my mesh search term?

(((((((australia*[Title/Abstract] OR victoria*[Title/Abstract] OR tasmania*[Title/Abstract] OR western australia*[Title/Abstract] OR south australia*[Title/Abstract] OR northern territor*[Title/Abstract] OR queensland*[Title/Abstract] OR new south wales[Title/Abstract] OR australian capital territory[Title/Abstract]) AND (2013:2024[pdat])) OR (((australia or victoria or tasmania or western australia or south australia or northern territory or queensland or new south wales or australian capital territory[MeSH Terms]) AND (2013:2024[pdat])) OR (((australia[Affiliation] OR wa[Affiliation] OR sa[Affiliation] OR nsw[Affiliation] OR vic[Affiliation] OR nt[Affiliation] OR act[Affiliation] OR qld[Affiliation] OR tas[Affiliation])) OR (western australia[Affiliation] OR south australia[Affiliation] OR new south wales[Affiliation] OR victoria[Affiliation] OR northern territory[Affiliation] OR australian capital territory[Affiliation] OR queensland[Affiliation] OR tasmania[Affiliation]) AND (2013:2024[pdat])))) AND (rural health OR rural health services OR rural population OR rural nursing OR hospitals, rural[MeSH Terms] AND (2013:2024[pdat]))) AND (rural*[Title/Abstract] OR regional[Title/Abstract] OR remote*[Title/Abstract] AND (2013:2024[pdat]))

submitted by /u/Efficient_Mud_5072
[link] [comments]