Is there any dataset of books which contains , Title, ISBN, Author , Ratings, No of sales and some other details which i can use for a project?
submitted by /u/Key_Investment_6818
[link] [comments]
Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?
Is there any dataset of books which contains , Title, ISBN, Author , Ratings, No of sales and some other details which i can use for a project?
submitted by /u/Key_Investment_6818
[link] [comments]
I’ve been tasked with finding a list of current athletic directors at private schools (elementary, middle, and high schools) in the US. I’m having a hard time finding anything close – is this an unreasonable ask? Thanks in advance!
submitted by /u/DoleraWedding2023
[link] [comments]
MATCH(MatchID, Season, MatchDate, MatchStartTime, Field#, Park, HomeTeamID, AwayTeamID, RefereeName, MatchScore)
FIELD(Field#, Park, FieldName)
PLAYER(PlayerID, JerseyNumber, PlayerFirstName, PlayerLastName, PlayerGender, PlayerAge, TeamID, Position, CaptainStatus)
PLAYERSTATS(MatchID, PlayerID, MatchDate, MatchStartTime, HomeTeamName, AwayTeamName, JerseyNumber, PlayerName, Goals, Assists, PossessionPercent, PassCount, PassingChain#)
TEAM(TeamID, TeamName, CoachID, AssistantCoachID, SponsorID)
COACH(CoachID, CoachFirstName, CoachLastName, TeamID, CoachAge, CoachGender, CoachRole)
SPONSOR(SponsorID, SponsorName, SponsorEmail, SponsorAddress)
submitted by /u/volkxx
[link] [comments]
I am doing a project in abnormal event detection in ATM counters. For training purposes, I need videos from people behaving ‘normally’ in ATM counters and from people showing abnormal behavior.
With ATM counter I mean a (small) room with one or more ATM machines built in a wall.
Normal event: A person walks into the room, puts his card into the ATM, enters pincode, retrieves his card, take his money, maybe a receipt, then leaves.
Abnormal events: Someone hitting the ATM, attacking and robbing a customer at an ATM, fiddling with the machine in unexpected ways, taking photographs inside, etc.
Thank you so much!
submitted by /u/ThreshLaSquale
[link] [comments]
Hello, i am trying to train a model based on pictures from social media,
Do you know where i could find a dataset or a service that sells them?
many thanks
submitted by /u/Global_Gas_6441
[link] [comments]
If you guys need a mock data generator, me and my team got you covered!
Our product core features are:
Ouptut support for json, yaml, psql, sql, and xml (with more formats and application support coming soon) Code gen for various languages with language specific settings (rust, typescript, go, dart, c, c++, c#, java, swift, protobuf (syntax3) more on the roadmap) Nested object generation, with array, and null controls Seeded generation possible too for reproducible results
Let me know what you guys think, or if you want us to add more features
Try it out here https://www.dataconstruct.io/organizations/playground/schemas
No sign up required!
submitted by /u/originalchuan
[link] [comments]
Hi, I am a student now doing research for LLM content moderation. Does anyone know where I can find a dataset that only contains comments that do not violate any macro norms of Reddit (the dataset doesn’t need to be super big)? Thank you in advance for the help!
submitted by /u/Cecelia1Chen
[link] [comments]
I’m in search of job listing data in English speaking countries, preferably USA that spans back through 10 years.
My purpose is to evaluate the advertised pay for positions and how it compares to positions today, my project would consist of measuring change over time and that sort of thing.
If anyone knows of anything please give me a shout! Thanks.
submitted by /u/applesauce566
[link] [comments]
Hello,
I’m building a SAF global market size model as a side project to decompress outside of work, and I have plenty of experience building market models.
My question is the data – I have disparate, non-granular, sketchy data sources. I have no problem scraping public data and creating my own databases, and often that is for the best (not propagating upstream calculation errors from a 3p source). But the market is quite opaque.
Ideally, i am looking for a dataset like this:
https://www.resourcewise.com/platforms/prima-carbonzero
I would like to mix my first-party data and this third party data to build a more robust model.
Is there any place I can buy this data? Or access something similar to it?
submitted by /u/TableConnect_Market
[link] [comments]
anybody know where to find a recent dataset of car images? i found this dataset but is over 4 years old.
https://www.reddit.com/r/MachineLearning/s/hqJ4j2AGZX
i have a bunch of video driving around town. my friend and i want to do image recognition on it. thank you in advance!
submitted by /u/calebowen
[link] [comments]
as example, A 1min stock candle can see 1M volume, the only thing i can deduce from this data is the amount of shares traded. i would like to gain insight on how many participants where involved. and for what size. Do you know any data provider that has this info?
submitted by /u/MercyFive
[link] [comments]
Curious why I would ever use R instead of python for data related tasks.
submitted by /u/Nickaroo321
[link] [comments]
I’m trying to find out the relative probabilities of the relationship between the abuser and the victim in CSA cases. Since no one seems to extract the data in ways that will help, I need to find a source of anonymous data that has relevant fields.
A culled data set where specific records have been removed because they could lead to identifying the people is fine. I do not need to know the location, nor the economic class. Because I’m looking for low probability events, it needs to be a substantial size. I’d like a hundred thousand records of events, where the following fields are known for each event.
Specifics:
For victim: Victim’s age when abuse started, sex, nature of the abuse.
For abuser Abuser’s age when abuse started. sex, relationship to victim
Relationship: 1st degree relative (Parent, brother, sister), 2nd degree relative (uncle/aunt, cousin, grandparent) Neighbour, family friend, authority figure (coach, minister, teacher, scoutmaster, employer, etc)
So far attempts to find data on places like pubmed have resulted in:
Only an abstract is available without payment. Papers are only summaries of other papers/reports. Datasets are not open to the general public Datasets have substantial price tags on them. Datasets are extremely selective. Data does not link abuser and victim in a 1:1 manner.
I come from a CSA background. I was molested at age 3. My sister says, that mom said it was a neighbour two doors down.
I think even then my mom was covering up. It happened multiple times over a period of I think at least a couple months, and less than a year and a half. The multiple times poses a logistical problem. The two people who had the best access was my mother during the day (stay at home mom) and my brother (age 13) during the night. (separate bedroom opposite end of house, in basement.)
I can’t confront them. Mom is dead. Brother is deep in Alzheimers.
While stats on this won’t form an absolute answer, they form grist for the mill.
submitted by /u/Canuck_Voyageur
[link] [comments]
Greetings. I have an upcoming project that involved using Pyspark. The guidelines for the project indicate a dataset about 7~8 GB at the minimum. The project is graded lesser on analysis and more on the complexity of data and range of methods and techniques we employ for data manipulation and processing. looking for a flexible and large financial dataset. can be anything from stock market data, economic indicators, consumption data, text data etc. Please direct me towards such datasets
submitted by /u/ComprehensiveAd1629
[link] [comments]
Similar to this dataset on F1, but a dataset for GT3 racing. I’m trying to break into the field of analytics so any help on sourcing such data would be extremely helpful.
submitted by /u/thepunnman
[link] [comments]
Hey everyone, suggest any dataset where i can learn KPI creation. Like i want to learn how to create Growth Percentages, Last year’s sales, last month’s sales, Net sales, gross sales for the present year, and also for last year and other similar KPIs, and to learn i need those types of dataset where i can do calculation.
I tried to find it on Kaggle but there are some simple dataset, till now all i do is drag and drop on dashboard.
submitted by /u/Akhand_P_Singh
[link] [comments]
Hi, anyone have an excel of the data of International Country Risk Guide (ICRG). It’s really urgent, I need it for my thesis and I would appreciate if someone has it. Thanks
submitted by /u/jasmn1
[link] [comments]
Hi all! This is probably a stupid question, but please someone just clarify for me without shaming me lol. If I am attempting to complete a research project (for a quantitative research methods class) on fear of crime and media consumption can I take two different datasets and combine them in order to run analysis on the variables? Or, does anyone know of any good survey datasets that contains variables I could use for this (or recode and use)? I apologize for the dumb question, but my professor has been no help. Thanks in advance!
submitted by /u/laurenmarie184_
[link] [comments]
Hello all.
I have spent the entire year of 2023 collecting data on my day-to-day life. I have collected everything I could think of, including quantitative variables like exercise, sleep amount, sex, etc., and qualitative ones like my own feelings and overall happiness. It is my ultimate goal to determine what in my life makes me happier, but there are plenty of other analyses that could be done with this dataset. Please feel free to take a look! If anyone does any interesting analysis please comment the results and/or DM me.
The dataset is pretty extensive… take a look.
https://docs.google.com/spreadsheets/d/1mi1vzfOQ2CpddAQQI25ACBixot2Xs5z-nO5qx91L12c/edit?usp=sharing
submitted by /u/tsawsum1
[link] [comments]
Are there any opensource datasources or APIs that could be used to pull the data related to environmental factors and information of sustainabilty from the cordinates.
submitted by /u/Fast_Whole_8492
[link] [comments]
Hello All,
As per the title, I am looking to pull data on all stocks that traded on the NASDAQ in 2023: I can get only partial attributes from Yahoo. I need
– Outstanding Shares per day (can’t get this from Yahoo; Bloomberg is asking for a fee)
– For each ticker, opening, closing, high, low price daily
-Industry
submitted by /u/BBjayjay
[link] [comments]
I am in dire straits and I need help.
I haven’t had any luck finding any sources that have both the datasets and the author’s user information, I’ve only found tweets that have been identified as real and fake news without the user’s information. I want to know if such a dataset exists before I go and purchase a developer account at X. I’m a student right now and 100$USD would make things pretty tight for me.
Thank you all in advance
submitted by /u/Ok-ButterscotchBabe
[link] [comments]
The database has entries like this, each one of them being a full chess game:
[‘e2e4’, ‘g8f6’, ‘d2d4’, ‘g7g6’, ‘c2c4’, ‘f8g7’, ‘b1c3’, ‘e8g8’, ‘e2e4’, ‘d7d6’, ‘f1e2’, ‘e7e5’, ‘e1g1’, ‘b8c6’, ‘d4d5’, ‘c6e7’, ‘c1g5’, ‘h7h6’, ‘g5f6’, ‘g7f6’, ‘b2b4’, ‘f6g7’, ‘c4c5’, ‘f7f5’, ‘f3d2’, ‘g6g5’, ‘a1c1’, ‘a7a6’, ‘d2c4’, ‘e7g6’, ‘a2a4’, ‘g6f4’, ‘a4a5’, ‘d6c5’, ‘b4c5’, ‘f5e4’, ‘c4e3’, ‘c7c6’, ‘d5d6’, ‘c8e6’, ‘c3e4’, ‘d8a5’, ‘e2g4’, ‘e6d5’, ‘d1c2’, ‘a5b4’, ‘e4g3’, ‘e5e4’, ‘c1b1’, ‘b4d4’, ‘b1b7’, ‘a6a5’, ‘g3f5’, ‘f8f5’, ‘e3f5’]
e2e4 means the piece on e2 (the pawn) moved to e4. Problem is, I have no way of knowing which piece is moving somewhere. For example, “g7h8” means the piece on g7 moved to h8 but unless I run all the previous moves I have no way of knowing which piece is that.
How can I transform this into a more understandable dataset?
I’m not sure this is the sub to ask this, if it isn’t I’d appreciate if you could tell me where to ask it
PD: I’ve checked the chess library in python but I haven’t found anything
submitted by /u/Aston28
[link] [comments]
I am doing my final year project , and it’s a KNN predictive model for prediction of tomato growth rate . When I was doing my proposal I proposed use of cm/week as growth metric . I am open to changing the growth metric .
I’m kindly looking for collaborators and mentors, I’m looking for a data set that has values temp, soil moisture, ph , EC , humidity and or NPK values against either yield or any growth rate metric.
submitted by /u/RMGwinji
[link] [comments]
I’m trynna train a model off of those images using 3 classes, green(not riped) riped and over-riped, the catch is banans should be in farms, like in their trees and stuff not picked up yet. If someone has an idea please let me know! Thank you kindly in advance!
submitted by /u/LoanNice
[link] [comments]
I’m working on a dynamic access control policy generation model using machine learning and I need a real dataset to evaluate the performance. Is there any access control policy dataset consisting entities (device, users), operations, roles, attributes, and permission policies?
I only could find the datasets having network traffic and access logs without any actual permission policies. If the dataset is related IoT (Internet of Things) or WSN (Wireless Sensor Networks) that would be ideal. If anyone knows a good dataset that would be a big help. Also, please mention the source paper (if any) so I can cite it.
submitted by /u/binodmx
[link] [comments]
I am working on a side project and I am looking for some dataset/database where i can search all animals by size and/or weight.
right now im not that picky on how complete it is… but having those attributes is key.
thanks!!
I’m reviewing links here: https://www.doi.gov/library/internet/animals
but so far none seem to be what i want
submitted by /u/thisfunnieguy
[link] [comments]
Hi,
I am looking for pressure transient data set for pipelines or facilities. Is there an online source I could use?
TIA
submitted by /u/jph1022
[link] [comments]