Category: Datatards

Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting; I’m interested. Maybe they know where the chix are. But what do they need it for? World domination?

Where Can I Find Some Image Datasets Of Medical Injections With Bounding Box Annotations?

I found other types of medical datasets with bounding box annotations (e.g., images of pills), but I can’t find any such datasets containing images of medical injections.

Please let me know if you know of any such dataset. Any help is appreciated.

Note: By bounding box annotations, I mean some way of indicating the size and location of the injection within the whole image. YOLOv5 style annotations are preferred, but any other style of annotations is also ok.
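For anyone converting from another annotation style, here is a minimal sketch of the YOLO-style label line: one line per object, `class x_center y_center width height`, with all four box values normalized by the image size. The function names and the sample box are made up for illustration, and the pixel box is assumed to be given as (x_min, y_min, x_max, y_max).

```python
def to_yolo(box, img_w, img_h, class_id=0):
    """Convert a pixel box (x_min, y_min, x_max, y_max) to a YOLO-style label line."""
    x_min, y_min, x_max, y_max = box
    # YOLO labels store the box centre and size as fractions of the image size.
    x_center = (x_min + x_max) / 2 / img_w
    y_center = (y_min + y_max) / 2 / img_h
    width = (x_max - x_min) / img_w
    height = (y_max - y_min) / img_h
    return f"{class_id} {x_center:.6f} {y_center:.6f} {width:.6f} {height:.6f}"


def from_yolo(line, img_w, img_h):
    """Convert a YOLO-style label line back to (class_id, pixel box)."""
    class_id, x_center, y_center, width, height = line.split()
    x_center, width = float(x_center) * img_w, float(width) * img_w
    y_center, height = float(y_center) * img_h, float(height) * img_h
    box = (
        x_center - width / 2,
        y_center - height / 2,
        x_center + width / 2,
        y_center + height / 2,
    )
    return int(class_id), box


# Hypothetical example: one injection at pixels (100, 50)-(300, 250) in a 640x480 image.
label = to_yolo((100, 50, 300, 250), img_w=640, img_h=480)
print(label)
```

Datasets annotated in corner-coordinate styles can usually be converted this way, so they need not be ruled out.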

submitted by /u/sohang-3112

Smartwatch Biometrics: 3.8M Accelerometer Time-series Sensor Data Points

Collected from a smartwatch as 51 test subjects performed 18 activities for 3 minutes each. I did some light data prep: I merged the 51 individual subject files and joined in the activity descriptions. You can browse the unified data below, or get the raw files from the source.
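The prep described above (merge the per-subject files, then join in the activity descriptions) can be sketched roughly as follows. This is only an illustration, not the exact prep that was run: the row layout (subject, activity code, timestamp, x, y, z, with a trailing semicolon) and the two activity codes in the lookup table are assumptions, so check them against the raw files and the activity key shipped with the source.

```python
import csv
import io

# Hypothetical activity key; the real dataset ships its own code-to-name table.
ACTIVITY_NAMES = {"A": "walking", "B": "jogging"}


def merge_subjects(subject_files, activity_names):
    """Merge per-subject files into one list of rows with activity names joined in."""
    merged = []
    for handle in subject_files:
        for row in csv.reader(handle):
            if not row:
                continue  # skip blank lines
            # Assumed layout: subject, activity code, timestamp, x, y, z;
            subject, code, timestamp, x, y, z = (f.strip().rstrip(";") for f in row)
            merged.append({
                "subject": subject,
                "activity_code": code,
                # Unknown codes are kept but labelled, rather than dropped.
                "activity": activity_names.get(code, "unknown"),
                "timestamp": int(timestamp),
                "x": float(x),
                "y": float(y),
                "z": float(z),
            })
    return merged


# Two tiny in-memory stand-ins for the per-subject files (made-up values).
subject_1600 = io.StringIO("1600,A,100,0.1,9.8,0.3;\n1600,B,200,1.2,8.7,0.4;\n")
subject_1601 = io.StringIO("1601,A,100,-0.2,9.6,0.1;\n")

rows = merge_subjects([subject_1600, subject_1601], ACTIVITY_NAMES)
print(len(rows), rows[0]["activity"])
```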

Prep’d data: https://app.gigasheet.com/spreadsheet/Smartwatch-Activity-Biometrics—Accel/6e2f9653_9590_4e1a_b794_b0b85fe57ec1

Raw Source: https://archive.ics.uci.edu/ml/datasets/WISDM+Smartphone+and+Smartwatch+Activity+and+Biometrics+Dataset+

submitted by /u/n1nja5h03s

Looking For District-level Data On Development Outcomes In India

Hello! I am desperately looking for district-level data on development outcomes in India, such as health and diseases, number of hospitals/hospital beds, educational levels, GDP per capita, or really anything else that relates to development.

Ideally, I am looking for the period 1991-2019, but I would settle for less.

Can anyone help? I was hoping to download data from http://microdata.gov.in, but I’m not able to register due to not receiving the confirmation email.

Thank you so much in advance to anyone who may be able to help. 🙂

submitted by /u/anythingusynthesize

Selling ECommerce (Hearing Aid Customers) Data – Over 500,000 Customers And Leads.

Not sure if this belongs here or is allowed. I’m not specifically looking to make a transaction, just looking for some information.

We’re closing down our range of hearing aid companies soon, after around 3 years. How would we go about selling the data to interested parties? Would we need lawyers involved? Is there a market for this? Our data comprises full name, email, billing address, phone number, IP, hearing aid (style purchased), time of purchase and $ amount spent.

submitted by /u/burna057

Blu-ray Film Disc Metadata Info: Color Primaries

Is there a dataset that provides metadata from the disc info, like this:

Format : Blu-ray Playlist
File size : 3.81 KiB
Duration : 2 h 1 min
Overall bit rate mode : Variable
Overall bit rate : 4 b/s

Video #1
ID : 4113 (0x1011)
Menu ID : 1 (0x1)
Format : HEVC
Format/Info : High Efficiency Video Coding
Format profile : Main 10@L5.1@High
HDR format : SMPTE ST 2094 App 4, Version 1, HDR10+ Profile A compatible
Codec ID : 36
Duration : 35 s 952 ms
Width : 3 840 pixels
Height : 2 160 pixels
Display aspect ratio : 16:9
Frame rate : 23.976 (24000/1001) FPS
Color space : YUV
Chroma subsampling : 4:2:0 (Type 2)
Bit depth : 10 bits
Color range : Limited
Color primaries : BT.2020
Transfer characteristics : PQ
Matrix coefficients : BT.2020 non-constant
Mastering display color primaries : Display P3
Mastering display luminance : min: 0.0001 cd/m2, max: 1000 cd/m2
Maximum Content Light Level : 233 cd/m2
Maximum Frame-Average Light Level : 63 cd/m2
format_identifier : HDMV
Source : 00687.m2ts

Video #3
ID : 4113 (0x1011)
Menu ID : 1 (0x1)
Format : HEVC
Format/Info : High Efficiency Video Coding
Format profile : Main 10@L5.1@High
HDR format : SMPTE ST 2094 App 4, Version 1, HDR10+ Profile A compatible
Codec ID : 36
Duration : 1 h 55 min
Width : 3 840 pixels
Height : 2 160 pixels
Display aspect ratio : 16:9
Frame rate : 23.976 (24000/1001) FPS
Color space : YUV
Chroma subsampling : 4:2:0 (Type 2)
Bit depth : 10 bits
Color range : Limited
Color primaries : BT.2020
Transfer characteristics : PQ
Matrix coefficients : BT.2020 non-constant
Mastering display color primaries : Display P3
Mastering display luminance : min: 0.0001 cd/m2, max: 1000 cd/m2
Maximum Content Light Level : 737 cd/m2
Maximum Frame-Average Light Level : 130 cd/m2
format_identifier : HDMV
Source : 00688.m2ts

Video #5
ID : 4113 (0x1011)
Menu ID : 1 (0x1)
Format : HEVC
Format/Info : High Efficiency Video Coding
Format profile : Main 10@L5.1@High
HDR format : SMPTE ST 2094 App 4, Version 1, HDR10+ Profile A compatible
Codec ID : 36
Duration : 2 min 1 s
Width : 3 840 pixels
Height : 2 160 pixels
Display aspect ratio : 16:9
Frame rate : 23.976 (24000/1001) FPS
Color space : YUV
Chroma subsampling : 4:2:0 (Type 2)
Bit depth : 10 bits
Color range : Limited
Color primaries : BT.2020
Transfer characteristics : PQ
Matrix coefficients : BT.2020 non-constant
Mastering display color primaries : Display P3
Mastering display luminance : min: 0.0001 cd/m2, max: 1000 cd/m2
Maximum Content Light Level : 1000 cd/m2
Maximum Frame-Average Light Level : 18 cd/m2
format_identifier : HDMV
Source : 00674.m2ts

Video #7
ID : 4113 (0x1011)
Menu ID : 1 (0x1)
Format : HEVC
Format/Info : High Efficiency Video Coding
Format profile : Main 10@L5.1@High
HDR format : SMPTE ST 2094 App 4, Version 1, HDR10+ Profile A compatible
Codec ID : 36
Duration : 3 min 28 s
Width : 3 840 pixels
Height : 2 160 pixels
Display aspect ratio : 16:9
Frame rate : 23.976 (24000/1001) FPS
Color space : YUV
Chroma subsampling : 4:2:0 (Type 2)
Bit depth : 10 bits
Color range : Limited
Color primaries : BT.2020
Transfer characteristics : PQ
Matrix coefficients : BT.2020 non-constant
Mastering display color primaries : Display P3
Mastering display luminance : min: 0.0001 cd/m2, max: 1000 cd/m2
Maximum Content Light Level : 505 cd/m2
Maximum Frame-Average Light Level : 13 cd/m2
format_identifier : HDMV
Source : 00689.m2ts

I am primarily interested in this section, as it is what is most often missing from all the data I have access to.

Color primaries : BT.2020
Transfer characteristics : PQ
Matrix coefficients : BT.2020 non-constant
Mastering display color primaries : Display P3
Mastering display luminance : min: 0.0001 cd/m2, max: 1000 cd/m2
Maximum Content Light Level : 233 cd/m2
Maximum Frame-Average Light Level : 63 cd/m2
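In case it helps anyone building such a dataset from their own discs, here is a rough sketch of pulling just those color fields out of a text report like the one above. It assumes the report has one `Key : Value` pair per line, with a bare line such as `Video #1` starting each section and a blank line ending it; the function names and the shortened sample are made up for illustration.

```python
COLOR_FIELDS = (
    "Color primaries",
    "Transfer characteristics",
    "Matrix coefficients",
    "Mastering display color primaries",
    "Mastering display luminance",
    "Maximum Content Light Level",
    "Maximum Frame-Average Light Level",
)


def parse_report(text):
    """Parse a 'Key : Value' text report into a list of (section name, fields dict)."""
    sections = []
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            current = None  # a blank line ends the current section
            continue
        key, sep, value = line.partition(" : ")
        if not sep:
            # No separator: treat the line as a section header, e.g. "Video #1".
            current = {}
            sections.append((line, current))
            continue
        if current is None:
            # Fields that appear before any header go into an unnamed section.
            current = {}
            sections.append(("", current))
        current[key.strip()] = value.strip()
    return sections


def color_metadata(sections):
    """Keep only the color-related fields of each video section."""
    return {
        name: {k: v for k, v in fields.items() if k in COLOR_FIELDS}
        for name, fields in sections
        if name.startswith("Video")
    }


# Shortened sample using values from the report above.
SAMPLE = """\
Video #1
ID : 4113 (0x1011)
Color primaries : BT.2020
Transfer characteristics : PQ
Mastering display luminance : min: 0.0001 cd/m2, max: 1000 cd/m2
Maximum Content Light Level : 233 cd/m2

Video #3
Color primaries : BT.2020
Maximum Content Light Level : 737 cd/m2
"""

colors = color_metadata(parse_report(SAMPLE))
print(colors["Video #1"]["Color primaries"])
```

Splitting on the first ` : ` only is deliberate, so values that themselves contain colons (`16:9`, `min: 0.0001 cd/m2, ...`) stay intact.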

submitted by /u/Weak_Ad9730

[self-promo] All US Healthcare Providers On Snowflake

Processed NPPES data on all US healthcare providers, along with mapped taxonomies.

How many cardiologists in Philadelphia? Who are this year’s batch of medical students? How has the number of nurse practitioners changed over time?

We have a 30 day free trial. If you want to use this for academic reasons, just email/DM me and we can make it available for free. The cost is to cover our compute/effort in cleaning this up.

https://app.snowflake.com/marketplace/listing/GZTSZAS2KFN/cybersyn-inc-us-healthcare-providers

submitted by /u/aiatco2

New Dataset On Holes Drilled Into The Earth For Fun, Science, And Profit — But Mostly Profit.

I just put up another dataset and accompanying notebook on Kaggle. It’s the USGS Core Sample Catalog.

I’d love feedback on either, but if you would rather answer the burning questions that keep you up at night, such as “What is the easternmost sample well drilled in the US?”, “Why are there 64 wells drilled in the Pacific Ocean?”, or “Why does the US Geological Survey have nine well samples that aren’t in the US? It’s not like we’re going to invade Canada and take their oil. Or are we?”, well, I’d understand.

submitted by /u/hrokrin

Dataset To Measure How Frequently Vehicular Parts Are Subjected To Wear And Tear For A Specific Brand / Specific Model (ANY WILL DO).

So a few months back our professor asked for thesis topics. I was absent for a few days beforehand, so I didn’t know about it. He started asking for everyone’s topic, which could be changed. Everyone before me was naming complex ML projects or data analysis topics, so I just panicked and chose this topic. Fast forward a few months: I procrastinated on all my projects, so when the time came I just gave a rough proposal, and it turns out you cannot change your topic anymore. I searched on Kaggle but just can’t seem to find the dataset. I literally have no clue where and how to search for it, so even if I can’t find it here, where should I begin to search?

Sorry for the poor English.

submitted by /u/a_non_weeb

Dataset For Malicious Posts On Reddit

I submitted a project proposal for detecting and analyzing posts with malicious intent (scams, phishing, etc.) on Reddit. But later I realized that Reddit is a very well moderated platform (at least the most popular subreddits), where there are usually no such posts. So is there any dataset that covers subreddits where I can find such posts? I don’t want to change the topic of my proposal now.

submitted by /u/psbankar

Need Help Making My UCR Data Readable. Got It From ICPSR But 5 Years Are In A Strange Format I Can’t Do Anything With

Need urgent help on converting data

I’m doing a project using UCR crime data from this source https://www.icpsr.umich.edu/web/ICPSR/series/57?start=0&SERIESQ=57&ARCHIVE=ICPSR&PUBLISH_STATUS=PUBLISHED&sort=score%20desc&rows=50&q=County%20level%20arrest

The data from 2003-2008 is only available in a strange format, while 1994-2002 and 2009-2016 are available as complete datasets in either R or STATA. Can someone please help with that?

submitted by /u/ItsRickDalton