Hey, I wanted the list of top 2000 companies in Forbes with the employee count of each. It’s available on data Bahn but it’s paid. Any free version of the same available?
submitted by /u/party_1234567
[link] [comments]
Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?
Hey, I wanted the list of top 2000 companies in Forbes with the employee count of each. It’s available on data Bahn but it’s paid. Any free version of the same available?
submitted by /u/party_1234567
[link] [comments]
I have a dataset composed by IT ticket logs from 2020 to 2023. I have structured the columns as it follows: day, month, year, holiday(0 if its not a holiday and 1 if it is) name of the day(1 to 7), hour of the day(0 to 23), bank campaign (just for July and December, bonus and finally the number of tickets per day and hour. When I organize the logs only by date, the dataset is composed by 1014 logs. If I add the hour attribute, the dataset ends with 6000 logs. I want to train ML algorithms (random forest and lstm) to forecast the number of IT tickets for a certain time (hour) and date but my metrics are underperforming. I’d like to know if there’s a way to improve my metrics? Could it be related to the algorithms? How could I improve the quality of my dataset?(if that’s even possible)
Thanks in advance for your help!
submitted by /u/CheisonVS
[link] [comments]
Need to find companies by a given industry. Specific company name and address. For example:
All fast food restaurants in the US All tire manufacturers… Need to find companies by a given industry. Specific company name and address. For example:
submitted by /u/jpschaffer
[link] [comments]
Does anyone know how to get lottery data for the powerball and mega millions? I specifically want to get the numbers pulled, the Megaplier/Powerball and the dates of the drawings. I can get the numbers online, but I can only download them as a PDF. I’m not sure where exactly to find it.
submitted by /u/DancingSingingVirus
[link] [comments]
I have a data set of hospitalized patients with their zip codes, we’re trying to determine which of them live in Appalachian counties. I have been unable to find a doc that includes all the zip codes of the Appalachian counties, but I did find this list of the county names however.
Anyone have any insight on where to find that info? Thank you!
submitted by /u/PA1999
[link] [comments]
I’m exploring the possibility of having a basic chatbot for customer service. I need some data for this to train a simple text chatbot.
Are there any datasets available for this? Ideally I’d like each data point to be a textual conversation between a customer and a representative trying to resolve customer’s issues.
The actual topic/domain if conversation can be anything – Pharma, ecommerce, telecom, etc. I’m not restricted to any particular domain.
Let me know if anything like this is publicly available.
submitted by /u/stlo0309
[link] [comments]
Hi everyone, I’d like to share a dashboard I developed showcasing Zimbabwe’s economic performance for the last 5 years using official sources. I have developed a methodology page, where there is a link to every data source I used to calculate each of my metrics.
Please let me know what you think
submitted by /u/BigIntroduction4586
[link] [comments]
Hello, I’m learning Tableau right now and I want some fake/old data(excel sheets or similar) in order to learn and manipulate data, You can either share me the link of the old data OR you can give me some website to fetch the data so I can use it ! I’m a super beginner here and I’ll be really happy if anyone give me some advice/insights that I should follow while learning tableau, Thanks in advance !
submitted by /u/venom_holic_
[link] [comments]
Please can anvone assist me on this or advise me on how to go about getting some data… I need to perform Cost-Benefit Analysis and Predictive Modeling, I’ll be needing a comprehensive seaweed dataset that includes information on environmental conditions, farming practices, labour records, production outputs, costs, and market data.
Some of the parameters lIl need are, labour records, such as labour hours, tasks performed, associated costs. Cost data such as farming process costs, operational expenses, maintenance costs and marketing expenses. #datasets #questions #seaweed
submitted by /u/Sindarel
[link] [comments]
Hey! I’m a rising senior in high school and its college month (August). I’m starting to look at potential colleges I want to apply to and their cost. I realized that I’m decent at SQL (been writing queries for two years) and I want to do an SQL project relating to analyzing costs for different colleges. By comparing the costs across different institutions, it can narrow down my tedious college search haha. Is there a dataset for this?
submitted by /u/TwistLow1558
[link] [comments]
Hi everyone! I’m new in the data field. So, I’m confused quite alot, where to find datasets? ik about kaggle nd Google dataset search engine BUT what are the other resources where u guys get datasets?
submitted by /u/Emergency_Island_668
[link] [comments]
We’re exploring partnerships with companies that already have massive video data catalogues, covering everything from news content to home shopping, YouTube influencers, and sports. People-oriented content is our focus – think talking to the camera and engaging conversations.
submitted by /u/C0tt
[link] [comments]
I refuse to believe there is no dataset for the titles of reddit posts, caveat, they should have been contemporaneously captured.
Is there a dataset that shows, day by day, or hour by hour, the titles in context for some top subs?
submitted by /u/OH-YEAH
[link] [comments]
Big data is an extensive data volume containing texts, images, sounds, audiovisuals, and program-specific files. Businesses manage these ever-expanding databases using data warehouses or lakes. Big data analytics involves determining a recurring pattern based on those repositories’ structured, semi-structured, and unstructured data objects.
Meanwhile, Educational technology, or Edtech, encompasses all the software and hardware innovations that teachers, corporate trainers, and students employ to streamline academic or professional training activities. Therefore, brands working in automated translation, virtual reality (VR) laboratories, or e-libraries are also EdTech businesses.
Still, integrating edtech tools and big data analytics trends varies from company to company. For instance, a smartboard developer will likely use analytical models to explore how educators and learners interact with their hardware. If a company offers remote skill development opportunities, it can employ marketing analytics to attract more students. https://us.sganalytics.com/blog/top-edtech-companies-using-big-data-analytics/
submitted by /u/Beautiful-Ad-7743
[link] [comments]
Think of a kitchen counter with salt, butter, chicken, lettuce, etc on it. Looking to train an object detection model that can recognize common ingredients, not complete dishes.
submitted by /u/d_Milt
[link] [comments]
I am creating synthetic data by stitching images of a ship with lights on or off for a LSTM model that can decode morse. I can’t find any publicly available dataset, please help
submitted by /u/BANANATHEGREAT
[link] [comments]
I have created a free Centralized Lightweight Digital software program that lets you to save your favorite internet video links so that you may easily retrieve them later. You can Use it to keep track of any type of link that you need to keep track of. That is, you do not need to have separate accounts and playlists for each tube site you visit. Using the url you specify, this software takes the image of the website you wish to preserve, you may then provide a searchable title. For simpler navigation, the stored videos can be shuffled and reversed. You may name the videos whatever you like, making it simpler to find them than depending on the original title. Furthermore, the playlist is superior even if you only use it for one site.
submitted by /u/Luktred
[link] [comments]
I’m trying to access the CFEE Dataset to classify compound facial emotions from this website: http://cbcsl.ece.ohio-state.edu/dbform_compound.html. I have filled up the request form, and have been granted access. However, when I click the download dataset link, the page times out.
I’ve tried contacting the people mentioned on their website, but to no avail. Is there something I can do?
submitted by /u/grumpyowlgirl
[link] [comments]
Does someone knows where I could find the drug sales by quantity sold with prices in United States.
Thanks a lot.
submitted by /u/fuseraga
[link] [comments]
Hi guys how are you doing?
last week I share my first version of this simple Languaje model training with php.
For thoose who missed, it use a simple Markov Chain for calculate the probabilities for the next word based on the previous words.
Now I have improved the training dataset and the next word selector.
Here’s is the link:
https://github.com/AcidBurn86/LM-nGram-with-php/
is a good way to start understand how big LLM works. And of course I know this could never perform like GPT or Llama.
Is just an educational code for php fans.
Shares and github stars are welcome!
submitted by /u/OficialPimento
[link] [comments]
I’m currently searching for a website that provides user reviews and data on the accessibility of places for people with disabilities. I’m interested in finding information about the accessibility of places like airlines, hotels, and restaurants.
Are there any websites that offer comprehensive reviews, ratings, or details about the accessibility of various locations? If the website has an API to access the data, that would be great.
Thank you in advance.
submitted by /u/19datascientist
[link] [comments]
Apologies for cross-posting but can’t seem to find answers anywhere…I work as a researcher and have been wondering if anyone knew of big data providers/sources which are nationally representative (country agnostic) and work within ethical collection parameters.
We use YouGov a lot and outside of panel providers, we’ve tried to do market mapping but find that most companies have ML/AI aided profile scraping which classifies people by gender/race etc. which is problematic, as it’s just physiognomy/face reading and subject to its own biases and reproduces the inequalities we’re working hard to lessen. I’ve been searching but can’t seem to find anything conclusive in the way of providers or datasets.
Anyone know of anything that could help?
TL;DR – There’s a lot of structural inequalities around collecting data (as per Data Feminism) and was wondering how I can collect data with greater sensitivity to ethics, but at relative scale?
submitted by /u/Mundane-Mark2403
[link] [comments]
Today data is becoming an integral part of a company’s operations. And trust in data is becoming an increasingly fraught issue, especially while gathering more information.
The first step in building trust in data is ensuring that the data supply chain is aware of the critical role data plays in every operation. The second step is to focus on accuracy and eliminate the restrictions on the volume of data required for accuracy and performance needed for actions. The third step involves building trust to ensure that all analytics and actions that are taken can be replicated and proven.
The power of a data-driven world is likely to outweigh the risks and encourage trust in the technology that makes innovation possible. For organizations to walk the walk on data, they should start trusting their analytics, promoting the kind of data-focused operation that every business needs.
Continue Reading – https://us.sganalytics.com/blog/what-is-data-trust-and-how-do-you-build-trust-with-data/
submitted by /u/Beautiful-Ad-7743
[link] [comments]
0
I’m trying to access the CFEE Dataset to classify compound facial emotions from this website: http://cbcsl.ece.ohio-state.edu/dbform_compound.html. I have filled up the request form, and have been granted access. However, when I click the download dataset link, the page times out. I am fairly new to these kinds of things, so I apologize if my question comes off as kind of lame.
I’ve tried contacting the people mentioned on their website, but to no avail. Is there something I can do?
submitted by /u/grumpyowlgirl
[link] [comments]
does anyone have a list? other than r/datasets of course.
submitted by /u/bdx_cbtan
[link] [comments]