Category: Datatards

Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?

Business Transformation Assets And Artefacts

🚀 Business Transformation Assets Sale: Premium Guides & Reference Materials 🚀

Unlock the secrets behind successful business transformations with exclusive assets from top-tier consultancy firms like Accenture, JPMorgan & Chase, EY, PwC, Deloitte, and KPMG!

📂 What’s Included? Business Transformation Assets for 18 Key Business Functions:

Commerce Cyber Data & Analytics Finance Global Business Service Human Resources Information Technology Internal Audit Legal Marketing Procurement Resilience Risk Sales Service Service Management Framework Supply Chain Management Sustainability

📊 Assets Provided:

Target Operating Models Guides Reference Materials (Process Taxonomies, Maturity Model Scale, etc.) Engagement Artefacts

🔧 Supported Technological Platforms:

Tech Agnostic Ivalua Coupa SAP Salesforce Workday Microsoft ServiceNow Okta

🌟 Why Buy?

Lifetime Access: One-time purchase with lifetime access to a Google Drive containing all the assets.

Comprehensive Coverage: All the tools and guides you need to revolutionize your business across multiple functions.

Proven Success: Backed by the methodologies and frameworks from leading consultancy firms.

Price: 0.05 BTC

PM if interested

submitted by /u/OrganicGoo
[link] [comments]

Constrained Faces With Ages Datasets

Hello,

I’m looking for datasets that contains faces of people with their age. Ideally the photos should be constrained, like in passports for instance, and should contain a wide range of ages, from 10 or even lower to at least 40. I would be really interested in constrained videos too instead of simple photos. Do you have any suggestions ?

Thanks.

submitted by /u/bastmed
[link] [comments]

Dream Data Set? Mine Would Be Local Traffic Data

every time i drive i find myself wondering what kind of data goes into decisions like stoplight vs stop sign, roundabout, etc. Or like how much collective time is wasted due to an accident. as a kid i used to think about how if an accident caused a 30 minute delay for 500 cars, that was collectively 250 hours of waste. never knew what to do with that data, lol. but anyway yeah i’ve always wanted to get access to data like this.

anyone got any other dream data sets? or even just something that’s super inaccessible if it does technically exist

submitted by /u/bhousecjs
[link] [comments]

How To Compare Two Data Sets From The Same Time And Proximate Location

Hi there, my first post not sure if this is the sub for it,

So I am working on a weather datasets (taken from stats can:https://climate.weather.gc.ca/index_e.html), The dataset I am working with has some missing values that I wish to fill using another dataset from a similar location. For this I found two other datasets from similar location, but both report slightly different numbers (as expected).

I wanna figure out if these differences are significant enough for me to not choose these datasets. How do I go about this? Do I use t test individually on each column? or ANOVA?

submitted by /u/Nepoleon_bone_apart
[link] [comments]

Looking For Researchers And Members Of AI Development Teams

We are looking for researchers and members of AI development teams who are at least 18 years old with 2+ years in the software development field to take an anonymous survey in support of my research at the University of Maine. This may take 20-30 minutes and will survey your viewpoints on the challenges posed by the future development of AI systems in your industry. If you would like to participate, please read the following recruitment page before continuing to the survey. Upon completion of the survey, you can be entered in a raffle for a $25 amazon gift card.

https://docs.google.com/document/d/1Jsry_aQXIkz5ImF-Xq_QZtYRKX3YsY1_AJwVTSA9fsA/edit

submitted by /u/wildercb
[link] [comments]

BIC (Bank Identifier Code) To Bank Name?!

Hi! I have a dataset of BIC and am doing a master data template. The template also wants me to put in the banks name. Is there any resource where I can get a table of BIC codes with bank names I can then use to fill in the name slots via lookups?

I’ve found sites that convert the BIC codes, unfortunately one by one and I have cca 2k entries…

Any help would be appreciated! Thx

submitted by /u/Gregib
[link] [comments]

Recommendations For Extensive Datasets In Process Engineering And Optimization For End-to-End DS/DE Projects

Hi everyone,

I’m a data science researcher focusing on process engineering and optimization, and I’m looking to further strengthen my knowledge through different use cases. I’m reaching out for recommendations on extensively large datasets that can be processed using cloud platforms.

My goal is to create an end-to-end Data Science/Data Engineering project that involves ingesting these large datasets and applying domain knowledge to derive insights. I’m particularly interested in **time series** modeling, which is crucial for capturing temporal trends.

Some areas I’m considering include:

Oil and gas unit operations datasets Carbon Capture, Utilization, and Storage (CCUS) datasets FMCG manufacturing datasets, such as edible oil or biomass production Water treatment units, especially where time-sensitive data is key

To give you an idea of my background, I’ve worked on modeling and optimization in amine treating, sulfur recovery, and carbon capture datasets. I’ve also successfully developed an anomaly detection model for the Tennessee Eastman process. However, I’m eager to dive deeper into time series modeling for my next project.

Major requirements:

Focus on time series data Can involve classification or regression tasks Comparatively large datasets with many columns (variables) and datapoints

I would greatly appreciate any suggestions or pointers to datasets that align with what I mentioned.

Thanks in Advance!

submitted by /u/ryanroy0698
[link] [comments]

Value Of Historical Freight Transaction Dataset?

Hi all,

Several new partnerships/doors have opened up and allowed my business to aggregate historical (road) freight transactions. They are mostly lane/rate confirmations, and include information such as route, $ rate, shippers, carriers, brokers, etc.. They are all PDFs, but we’re working on building out a pipeline to start structurizing them.

This data is not free for us to collect, so we were debating whether or not it’s worthwhile to continue to collect this data. Are there any businesses/places this data might be useful?

submitted by /u/Interesting_Law_9138
[link] [comments]

Help Deciphering Data Sets From NCEI

I am pulling data from NCEI for some annual average temperature etc and the csv it is giving me for the local sites has a weird format I cannot figure out for temperature. What in the heck are these numbers and why is it not in Celsius?

TMP

|| || || |-0017,5| |-0028,5| |-0033,5| |-0044,5| |-0056,5| |-0067,5| |-0078,5| |-0078,5| |-0094,5| |-0089,5|

submitted by /u/agonzal7
[link] [comments]

125k LinkedIn Job Postings From 2024

Hey everyone! I created a dataset of ~125k job postings from LinkedIn with attributes like job title, description, company, compensation, benefits, zip code etc. All the postings are from the United States and over a period of ~1 week, but you can fork the repo and modify it for a specific location/keyword for real-time data.

It was originally intended both to extract some insights about the job market and help me filter live postings. Published the code to save time for anyone pursuing a similar goal.

Dataset link

Scraper link

submitted by /u/Armi2
[link] [comments]

Best Way/place To Find Specific Datasets?

Hi All, I’m currently in a bootcamp and need to find a applicable data set for the problem we are trying to solve. I’m having a hard time finding something suitable so I’m here to ask for some advice. I’m looking for a data set that has sensor data recorded at varying intervals (this part is easy) but the issue is finding a data set that also contains operational cost data as well. Any pointers on where or how to find a dataset would be very appreciated!

submitted by /u/Jeromes-in-the-House
[link] [comments]

Regression Project For Portfolio, Sugestions Please

Hi guys, I am starting to build mt DS portfolio, i already work wih DS and ML but i cannot use my job project on my portfolio due to NDA. I am having a bad time to finding some dataset or even have some ideas on ML projects such as regression, classification, etc. Do you have any sugestion of dataset or projects? (I didnt want to use kaggle datasets because some say companies dont lime projects fone with kaggle datasets too much) Aprecciate your help!

submitted by /u/pdrmrtn
[link] [comments]

Historical Loan-to-value Ratios For USA

Hi!

As part of my thesis, I am conducting an econometric analysis of the housing market in the US.

For this I really need historical LTV data, I am however having a hard time finding it for a longer time period.

The closest I have come is FRED, where they have data back to 2012.

Preferably I would need it back to year 2000 or earlier.

Any help would be greatly appreciated!

submitted by /u/NielsSm0ker
[link] [comments]

I’m Looking For The Unique Datasets For Multiple Modalities

Hello guys. I’m looking for a datasets (free only) for multiple stuff (on HF, or just Reddit subs to scrape):

Labeled music: a dataset with songs and corresponding descriptions, like tempo, key signatures, or just the way the general mood feels Discussions of super controversial, NSFW, and unethical ideas about everything from conspiracy theories to the meaning of life Role-play dialogs. Or just general dialogs but not just texting World knowledge Q&As Grammarly-like datasets, with bad and good sentences

Thanks.

submitted by /u/yukiarimo
[link] [comments]

Legally Acquired Footage Of Football Games

Hi!

As part of my thesis I would like to combine AI and football. To achieve this I would need whole match recordings of some team’s previous season. Maybe someone has recordings of their local team that I could legally use, or knows where I could get such materials(also legally pls). Thanks in advance for any help and suggestions 🙂

submitted by /u/G1b0
[link] [comments]