submitted by /u/alecs-dolt
[link] [comments]
Category: Datatards
Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting, i’m interested. Maybe they know where the chix are. But what do they need it for? World domination?
Hi !
I want to find the Data of twitter about the numbers of Users in France between 2010 and 2022 and if possible the % of tweets per years of politicals users (Like Macron) and their numbers of tweets per year too.
Is it possible ? Thanks for your awnsers
submitted by /u/Notizz
[link] [comments]
I found other types of medical datasets with bounding box annotations (eg. images of pills), but I can’t find any such datasets containing images of medical injections.
Please let me know if you know of any such dataset. Any help is appreciated.
Note: By bounding box annotations, I mean some way of indicating the size and location of the injection within the whole image. YOLOv5 style annotations are preferred, but any other style of annotations is also ok.
submitted by /u/sohang-3112
[link] [comments]
Hi everyone, I am trying to get percent under the poverty line data for all counties in the US (S1701) from the US Census Bureau. However, not every county is available. Is there any work around to get all counties? I know there is an issue because of the defunding…
submitted by /u/shutthefucupcake
[link] [comments]
Collected from smartwatch as 51 test subjects perform 18 activities for 3 minutes each. I did some light data prep and merged the 51 individual subject files and joined in the activity descriptions. You can browsed the unified data below, or get the raw files from the source.
Prep’d data: https://app.gigasheet.com/spreadsheet/Smartwatch-Activity-Biometrics—Accel/6e2f9653_9590_4e1a_b794_b0b85fe57ec1
Raw Source: https://archive.ics.uci.edu/ml/datasets/WISDM+Smartphone+and+Smartwatch+Activity+and+Biometrics+Dataset+
submitted by /u/n1nja5h03s
[link] [comments]
Hello! I am desperately looking for district level data on development outcomes in India, such as health and diseases, number of hospitals/hospital beds, educational levels, GDP per capita, or anything else that can be related to development really.
Ideally, I am looking for the period 1991-2019, but I would settle for less.
Can anyone help? I was hoping to download data from http://microdata.gov.in, but I’m not able to register due to not receiving the confirmation email.
Thank you so much in advance to anyone who may be able to help. 🙂
submitted by /u/anythingusynthesize
[link] [comments]
Does anyone have a source which provides the number of stocks listed on exchanges over years? I’m interested to see how many stocks were listed on e.g. NYSE in January 1998 or something like that.
submitted by /u/brucebrowde
[link] [comments]
Not sure if this belongs here or is allowed. I’m not specifically looking to make a transaction, just looking for some information.
We’re closing down our range of hearing aid companies soon after around 3 years. How would we go about selling the data to interested parties? Would we need lawyers involved? Is there a market for this? Our data is comprised of full name, email, billing address, and phone number, IP, hearing aid (style purchased), time of purchase and $ amount spent.
submitted by /u/burna057
[link] [comments]
Hello, I am searching for a dataset with I-can-do-sentences (and I-can-not-do-sentences) to train a ML-Model for intention detection (it is supposed to determine whether some can do something or can’t do). Is such a dataset existent or where could I maybe find one? I’m looking for either English or German language datasets.
submitted by /u/Tim-orius
[link] [comments]
Looking for an API or data download that contains name, location, type, date of creation etc? There was a thread 5 years ago which covered this but curious if there have been any new data discoveries. Cheers.
submitted by /u/Coup1
[link] [comments]
I need a dataset of python code lines labeled so I can train my model to predict code line from a given label
submitted by /u/Vrspii
[link] [comments]
I am writing to request photos of wind turbine blades in poor condition for a university project that I am working on. Our research aims to analyze the types of damage that can occur to wind turbine blades, I would appreciate it if you share it with me.
submitted by /u/zo1el
[link] [comments]
Does anyone know where I can find a comprehensive directory of farms in the UK and what their revenue and profits were for certain crops, e.g tomatoes or carrots? Thank you.
submitted by /u/DPingIt6981
[link] [comments]
Is there a Data set That Provide Metadata from the disc Info. Like:
Format : Blu-ray Playlist File size : 3.81 KiB Duration : 2 h 1 min Overall bit rate mode : Variable Overall bit rate : 4 b/s
Video #1 ID : 4113 (0x1011) Menu ID : 1 (0x1) Format : HEVC Format/Info : High Efficiency Video Coding Format profile : Main 10@L5.1@High HDR format : SMPTE ST 2094 App 4, Version 1, HDR10+ Profile A compatible Codec ID : 36 Duration : 35 s 952 ms Width : 3 840 pixels Height : 2 160 pixels Display aspect ratio : 16:9 Frame rate : 23.976 (24000/1001) FPS Color space : YUV Chroma subsampling : 4:2:0 (Type 2) Bit depth : 10 bits Color range : Limited Color primaries : BT.2020 Transfer characteristics : PQ Matrix coefficients : BT.2020 non-constant Mastering display color primaries : Display P3 Mastering display luminance : min: 0.0001 cd/m2, max: 1000 cd/m2 Maximum Content Light Level : 233 cd/m2 Maximum Frame-Average Light Level : 63 cd/m2 format_identifier : HDMV Source : 00687.m2ts
Video #3 ID : 4113 (0x1011) Menu ID : 1 (0x1) Format : HEVC Format/Info : High Efficiency Video Coding Format profile : Main 10@L5.1@High HDR format : SMPTE ST 2094 App 4, Version 1, HDR10+ Profile A compatible Codec ID : 36 Duration : 1 h 55 min Width : 3 840 pixels Height : 2 160 pixels Display aspect ratio : 16:9 Frame rate : 23.976 (24000/1001) FPS Color space : YUV Chroma subsampling : 4:2:0 (Type 2) Bit depth : 10 bits Color range : Limited Color primaries : BT.2020 Transfer characteristics : PQ Matrix coefficients : BT.2020 non-constant Mastering display color primaries : Display P3 Mastering display luminance : min: 0.0001 cd/m2, max: 1000 cd/m2 Maximum Content Light Level : 737 cd/m2 Maximum Frame-Average Light Level : 130 cd/m2 format_identifier : HDMV Source : 00688.m2ts
Video #5 ID : 4113 (0x1011) Menu ID : 1 (0x1) Format : HEVC Format/Info : High Efficiency Video Coding Format profile : Main 10@L5.1@High HDR format : SMPTE ST 2094 App 4, Version 1, HDR10+ Profile A compatible Codec ID : 36 Duration : 2 min 1 s Width : 3 840 pixels Height : 2 160 pixels Display aspect ratio : 16:9 Frame rate : 23.976 (24000/1001) FPS Color space : YUV Chroma subsampling : 4:2:0 (Type 2) Bit depth : 10 bits Color range : Limited Color primaries : BT.2020 Transfer characteristics : PQ Matrix coefficients : BT.2020 non-constant Mastering display color primaries : Display P3 Mastering display luminance : min: 0.0001 cd/m2, max: 1000 cd/m2 Maximum Content Light Level : 1000 cd/m2 Maximum Frame-Average Light Level : 18 cd/m2 format_identifier : HDMV Source : 00674.m2ts
Video #7 ID : 4113 (0x1011) Menu ID : 1 (0x1) Format : HEVC Format/Info : High Efficiency Video Coding Format profile : Main 10@L5.1@High HDR format : SMPTE ST 2094 App 4, Version 1, HDR10+ Profile A compatible Codec ID : 36 Duration : 3 min 28 s Width : 3 840 pixels Height : 2 160 pixels Display aspect ratio : 16:9 Frame rate : 23.976 (24000/1001) FPS Color space : YUV Chroma subsampling : 4:2:0 (Type 2) Bit depth : 10 bits Color range : Limited Color primaries : BT.2020 Transfer characteristics : PQ Matrix coefficients : BT.2020 non-constant Mastering display color primaries : Display P3 Mastering display luminance : min: 0.0001 cd/m2, max: 1000 cd/m2 Maximum Content Light Level : 505 cd/m2 Maximum Frame-Average Light Level : 13 cd/m2 format_identifier : HDMV Source : 00689.m2ts
I am Primars interested und this section as this is Most missing in all Data i habe Access to.
Color primaries : BT.2020 Transfer characteristics : PQ Matrix coefficients : BT.2020 non-constant Mastering display color primaries : Display P3 Mastering display luminance : min: 0.0001 cd/m2, max: 1000 cd/m2 Maximum Content Light Level : 233 cd/m2 Maximum Frame-Average Light Level : 63 cd/m2
submitted by /u/Weak_Ad9730
[link] [comments]
Does anyone know of any sites or lists that track remote/hybrid policies across organizations?
I remember a few sites and open-sourced lists popping up during the pandemic, but can’t seem to find anything now.
submitted by /u/Particular_Stable
[link] [comments]
Hi
Im looking for a Formula E dataset that is still getting updated frequently with the most detail possible!
Thanks in advance!
submitted by /u/AkMega
[link] [comments]
Processed NPPES data on all US healthcare providers, along with mapped taxonomies.
How many cardiologists in Philadelphia? Who are this year’s batch of medical students? How has the number of nurse practitioners changed over time?
We have a 30 day free trial. If you want to use this for academic reasons, just email/DM me and we can make it available for free. The cost is to cover our compute/effort in cleaning this up.
https://app.snowflake.com/marketplace/listing/GZTSZAS2KFN/cybersyn-inc-us-healthcare-providers
submitted by /u/aiatco2
[link] [comments]
I am looking for a dataset on DMV vehicle registrations. Specifically on the fees for registering at vehicle in each state
submitted by /u/Zestyclose-Eagle1938
[link] [comments]
I just put up another dataset and accompanying notebook on Kaggle. It’s the USGS Core Sample Catalog.
I’d love feedback on either but if you want to answer the burning questions of what keeps you up at night such as “What is the easternmost sample well drill in the US?”, “Why are the 64 well drilled in the pacific ocean?”, or “Why does US Geological Survey have nine wells samples that aren’t in the US? It’s not like we’re going to invade Canada and take their oil — or are we?” well, I’d understand.
submitted by /u/hrokrin
[link] [comments]
So a few months back for our professor asked for topic for thesis. I was absent for a few days beforeso i didnt know it. He started asking for everyone’s topic which could be changed. Everyone were saying complex ML projects or data analysts topcis before me, So i just panicked and choose this topic. Fastfoward a few months i procrastinated all my projects so when the time came i just gave a rough proposal and turns out you cannot change your topic anymore. I searched in kaggle but just cant seem to get the dataset. I literally have no clue where and how to search for it, so even if i cant find it here where should i begin to search.
sorry for the poor english
submitted by /u/a_non_weeb
[link] [comments]
I keep looking for written text training data but only find English. Are there any dataset that have Spanish/Mexican Spanish as the main language.
submitted by /u/theprestige3811
[link] [comments]
I submitted a project proposal for detecting and analyzing posts with malicious intent like scam, phishing, etc on Reddit. But later I realized that Reddit is very well moderated platform(atleast the most popular subreddits) where there are usually no such posts. So is there any dataset which contains any subreddits where I can find such posts? I dont want to change the topic for proposal now
submitted by /u/psbankar
[link] [comments]
Where can I get a dataset on cybersecurity/ cyber attacks/ internet threats or anything similar, by country and year?
submitted by /u/Pleasant_Savings_256
[link] [comments]
Need urgent help on converting data
I’m doing a project using UCR crime data from this source https://www.icpsr.umich.edu/web/ICPSR/series/57?start=0&SERIESQ=57&ARCHIVE=ICPSR&PUBLISH_STATUS=PUBLISHED&sort=score%20desc&rows=50&q=County%20level%20arrest
The data from 2003-2008 is only available in a strange format while 1994-2002 and 2009-2016 is available as complete datasets in either R or STATA. Can someone please help with that.
submitted by /u/ItsRickDalton
[link] [comments]