Category: Other Nonsense & Spam

Dataset Of Medical Case Scenarios And Appropriate Diagnosis

I’m looking for a dataset that contains a medical case examples and the diagnosis presented in the case.

Example of what I’m talking about:

[“Bob has been having issues with excessive thirst and blurry vision. He has elevated levels of glucose in the urine and blood.”, “Diabetes”]

I’m not too picky about the format as long as the diagnosis is seperate from the scenario and the formatting is consistent.

Artificial datasets are okay, maybe even preferred, as long as they’re accurate.

submitted by /u/flavorfulcherry
[link] [comments]

Moving Data From Excel To Something Online

So I’m in a college alumni group with chapters around the country. Our central office has a speaker’s list that chapters can look at to contact speakers for their own events. But it’s a little out of date and because it’s an Excel workbook, many chapter people don’t like using it and want to see speaker photos.

So is there a good web-based platform this data can be exported to that has a good UX, easily designed (maybe a dozen fields), have a search function, can be password -protected, and easily updated?

Thanks for any help!

submitted by /u/DelcoPAMan
[link] [comments]

Statista Reports – Full Availability

Good Morning to all, if some of you need reports from Statista official Web site, don’t hesitate to contact me.

This is not a scam and the reports are true genuine. If you need we can issue a regular invoice. How it is possible? My company paid the full commercial licence to provide such reports.

Thank you very much for your attention

submitted by /u/satchurated
[link] [comments]

Where We Actually Buy Big Data For Company?

Hi

I’m wondering where I can buy machine learning data directly for my project/product. Let’s say it’s a music or allergy app. I would like to connect a chat/predictor which, based on a few data, is able to indicate a certain percentage of something. However, large amounts of data are needed to train such algorithms. Where can you actually buy them?

submitted by /u/jackoborm
[link] [comments]

The Largest Dataset Of Graded Diamonds On Kaggle

Hi there!

I just put up a new dataset on Kaggle. It’s cryptically titled The largest diamond dataset currently on Kaggle

It has just under 220,000 diamonds and 25 columns of data making it about 3x larger than next largest. I think it’s perfect for regression models and there is an attached notebook.

This is my first submission to Kaggle so I’d be very much interested in any feedback you might have.

Thanks!

submitted by /u/hrokrin
[link] [comments]

[Synthetic] DatasetGPT – A Command-line Tool To Generate Datasets By Inferencing LLMs At Scale. It Can Even Make Two ChatGPT Agents Talk With One Another.

GitHub: https://github.com/radi-cho/datasetGPT

It can generate texts by varying input parameters and using multiple backends. But, personally, the conversations dataset generation is my favorite: It can produce dialogues between two ChatGPT agents.

Possible use cases may include:

Constructing textual corpora to train/fine-tune detectors for content written by AI. Collecting datasets of LLM-produced conversations for research purposes, analysis of AI performance/impact/ethics, etc. Automating a task that a LLM can handle over big amounts of input texts. For example, using GPT-3 to summarize 1000 paragraphs with a single CLI command. Leveraging APIs of especially big LLMs to produce diverse texts for a specific task and then fine-tune a smaller model with them.

What would you use it for?

submitted by /u/radi-cho
[link] [comments]

Suggestions For Ecology Dataset For Classification

I’m looking for a dataset similar to the Amphibians dataset from UCI for an undergraduate data science project. It should be a classification problem, i.e. presence/absence of a species dependent on habitat characteristics such as temperature, type of vegetation, size of water reservoir, amount of rainfall, distance to roads/civilisation, etc.

It should include

>15 numerical and categorical features >300 observations temporal and/or spatial data if possible, so I can play around with some heat maps or time series analysis.

Any hints are highly appreciated as I’m a beginner and I’ve been scrolling my eyes out on kaggle etc. all weekend.

submitted by /u/apex—-predator
[link] [comments]

Finding Datasets For Computer Vision

Hello! I’m a senior electronics engineering student. My friend trying to make a blind-assistant that helps blind people to differentiate same form-objects as like Coca-Cola vs Sprite. He design a hardware with esp8266 and uses a cloud for storing datasets. We create a dataset with taking photos of cokes however its hard to creating for all stuff. Is there any solution or resource for finding daily life datasets? We had dive a lot of open datasets CIFAR, Berkley, Kaggle, COCO, MNIST but we required 224×224 pixels for our ML model.

submitted by /u/yagmurxyildiz
[link] [comments]

Chinese Outward Foreign Direct Investment Data

Hi!

Since my first post here was a request if someone knew how to access Chinese OFDI Data sorted by country which some researchers frequently seem to use, I can now finally share where exactly the data comes from and hope that I thereby maybe save another poor soul from spending hours to find it.

Unsurprisingly, you have to search in Chinese to find the data:

XX 年度中国对外直接投资统计公报 (XX for the year you are searching)

sample: http://images.mofcom.gov.cn/hzs/201810/20181029160118046.pdf

Hope this helps and have fun!

submitted by /u/BlueApple12
[link] [comments]

Looking For M&A And Series Funding Rounds Datasets

Hi All,

I looking for datasets options that would provide me:

For M&A:

Buyer Acquisition Target announcement date close date deal value

For Series funding rounds:

Company Series round Closing series round amount secured

So far, I have been using google but feel like that is not the best way to get the data I need

submitted by /u/malaya100
[link] [comments]

How Would You Go About Populating Your Own Data Set Similar To Yelp And Google Maps?

I’m trying to build an app for travel. I’ve looked at the APIs and they’re very expensive and don’t allow for long-term caching. OSM is interesting but also requires you to abide by ODbL, which isn’t great if you don’t want to share proprietary info. Are there any approaches or alternatives to using an API or OSM? I haven’t been able to find any great data sets to bulk purchase.

submitted by /u/marvinshkreli
[link] [comments]

Subnational COVID-19 Vaccination Data (Europe)

Hello, I am looking to assemble a subnational data set on COVID-19 vaccinations in Europe (preferrably at the level of NUTS-2 or NUTS-3 regions; mixed would also be okay). I have seen some maps for individual countries, so the data seems to be somewhere out there – at least for some. Does anyone know of any resources in that direction?

I’d like to focus on the EU27, but other European Countries are also fine. I appreciate any help, be it aggregate data sets or data for individual countries. Thanks!

submitted by /u/lu2idreams
[link] [comments]

NCAA Men’s Basketball March Madness Historical Results

I’m looking for some historical results in order to do some analysis and submit my bracket. Essentially, I’m looking for something like this, but more up to date:

https://blog.softartisans.com/2013/03/19/march-madness-predict-your-ncaa-basketball-brackets-with-excel/

https://blog.softartisans.com/wp-content/uploads/2013/03/NCAATournamentBracket.xlsx (like the first “data” tab. just the seeds, team names, round and score is good enough, but more is better)

Does anyone have anything like this? Thanks!

submitted by /u/JlwRfwkm
[link] [comments]

Questions Regarding The ADNI Dataset

Hey so I am working on a project related to Alzheimers detection and intend to publish a paper. I have used a dataset that I found on Kaggle having MRIs. The author of the dataset has said that he has taken the images from ADNI and some other sources. Now can I use that data on my paper and if yes how do i cite it?

Also, I got permission to access the ADNI dataset yesterday and I downloaded the MRIs. But there are no labels on the images like if they are demented or not. If anyone could help me with it then that would be really helpful as I need to prepare the project asap. Thanks

submitted by /u/pg_blue
[link] [comments]