Category: Datatards

Here you can observe the biggest nerds in the world in their natural habitat, longing for data sets. Not that it isn’t interesting; I’m interested. Maybe they know where the chix are. But what do they need it for? World domination?

How To Price Image Data For Data Monetization?

I’m currently researching how satellite imagery data (or any other type of image data), especially hyperspectral and multispectral data, is priced by different companies. I’m particularly interested in how these companies determine the cost for various sectors like agriculture, mining, and environmental monitoring.

Here’s some context:

Service Tiers: Companies often offer different service tiers (e.g., tasking, archive access, subscription models).

Resolution and Coverage: Pricing seems to vary based on image resolution (e.g., 5-meter vs. sub-meter) and the area covered.

Applications: Different use cases might influence pricing (e.g., crop health monitoring, yield prediction, soil analysis).

Technology: Advances in satellite technology, such as deployable optics, might impact cost.

I’ve seen companies like Wyvern Space, Planet Labs, and Pixxel offering these services but haven’t found detailed public pricing information.

Could anyone share insights or resources on:

– General pricing strategies for satellite imagery (and image data in general), with any approximate numbers?

– How factors like resolution, coverage area, and application affect pricing?

– Any case studies or examples from companies in this field?

Thanks in advance for your help!

submitted by /u/sidhulogy

Dataset For Training Models For Detecting Levels Of Depression

Hi everyone! I wish to create a dataset with phrases depicting various levels of depression.

I am aware that I could easily scour Reddit posts and build a dataset that way, but I wish to create it using a model that could give me an endless supply of “human-like” phrases that mimic actual people describing their depression.

I was thinking of scraping some medical journals for symptoms of depression and related issues, and then creating a model that takes these symptoms and generates “human-like” phrases related to them, but I am not sure how to implement this.

Any help would be appreciated. Thanks a lot!

submitted by /u/CutDangerous127

I’m Having Trouble Finding Economic Data About The Democratic People’s Republic Of Korea (North Korea) – Bachelor Thesis

Hi, I’m Paula

I’m working on my bachelor’s thesis and need to find some reliable economic data on North Korea. It’s pretty tricky to locate good sources for this, so I thought I’d ask if you have any suggestions on where to look or who to talk to. I’m looking for data spanning from 1960 to 2023, covering the following indicators:

GDP at constant prices

Investment (Gross Fixed Capital Formation, GFCF)

State intervention: public spending as a percentage of GDP

Country openness: the sum of exports plus imports divided by GDP ((X+M)/GDP)

Real exchange rate

Economic structure (GDP by sector)

Sorry if this is not the right place to post this, but I’m quite lost and don’t know where else to look. I already have some of the data, but it either doesn’t cover all years or is incomplete. I’ve also checked the Bank of Korea and World Bank data, but most of it only covers a few years or doesn’t go back very far.
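For the openness indicator listed above, the arithmetic can be sketched as follows. This is only a minimal illustration: the function name `trade_openness` and the sample figures are made up, and it assumes exports, imports, and GDP are all expressed in the same currency and price basis.

```python
from typing import Optional


def trade_openness(exports: float, imports: float, gdp: float) -> Optional[float]:
    """Return (X + M) / GDP, or None when GDP is missing or zero."""
    if not gdp:
        return None
    return (exports + imports) / gdp


# Hypothetical figures, purely to show the calculation (not real data).
sample = {"exports": 2.0, "imports": 3.0, "gdp": 20.0}
openness = trade_openness(sample["exports"], sample["imports"], sample["gdp"])
print(openness)  # 0.25
```

Returning None for years with no GDP figure keeps gaps in the series visible instead of silently producing a division error or a misleading zero.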

submitted by /u/Fluffy-Advice4967

Seeking Dataset For Internet Traffic Analysis (Malicious Vs. Legitimate)

I’m currently working on my bachelor’s thesis, which aims to build a classification model that differentiates between malicious and legitimate internet traffic. I’m trying to gather the data on my own, but I’m unable to get the amount of data needed to train a decent model. I need a dataset containing internet traffic labeled as either malicious or legitimate (binary classification).

The dataset should ideally include features commonly associated with internet traffic analysis, such as IP addresses, timestamps, protocols, packet sizes, etc. Any additional contextual information would be highly beneficial.

If you know of any publicly available datasets or have access to such data, including well-done synthetic datasets, please let me know.

submitted by /u/Ortzadar

Search Engine And Dataset For Local Government Meetings In US And Canada [self-promotion]

I wanted to share a new search engine called CivicSearch. You can type in a keyword like “pickleball” or “affordable housing” and get a list of mentions in government meetings from 600+ US and Canadian cities: civicsearch.org

For an example of what’s possible with this data, we’ve written (and are writing) a series of newsletters that explore specific topics in detail, like Black History Month, school absenteeism, and bus rapid transit. You can subscribe to receive these updates by email, as well as personalized alerts for any location or keyword.

I created this tool, and I hope you find it useful. I’m here if you have any questions or suggestions.

submitted by /u/CivicSearch

What Exactly Is Clickstream Data And Where To Find It?

Several analytics companies that offer “competitor analysis” can get data on website visits, direct traffic, referral traffic, app downloads, app searches, time on site, bounce rate, etc.

When I contact them to ask where they source the data, they all say “from Clickstream” but refuse to elaborate.

What is Clickstream? Is it a single data provider, or multiple? Where can I find them?

A Google search hasn’t really revealed much. I guess it is a very niche B2B area where you need connections and good sources…

submitted by /u/semlowkey

Anyone Into Data Science? Need Some Career Advice

I’m a 20-year-old statistics student (2nd year) at BHU. Now that second year is here, I’ve been feeling the need to get serious about my career. Lately I’ve been wanting to get into data analytics, data science, and AI, but I have absolutely no idea how to go about it. As for skills, I am learning Python these days. Is there anyone already in this field who can help me out? Maybe with which courses I can take online, or a rough road map. I hope to eventually bag an internship by 3rd year.

submitted by /u/Outrageous-Truth-756

Million Song Dataset Help (Bachelor Thesis)

Hi everyone, I am currently doing my bachelor’s thesis and I need to use the Million Song Dataset. I can’t download it from the MSD website, and from what I heard it’s because I’m in the wrong region.

Anyway, I can’t download a 300GB dataset due to hardware limitations. I only need the dataset with the following features (to hopefully knock down the file size):

Title, artist_name, track_id, duration, key, mode, tempo, loudness, segments_pitches and segments_timbre

If anyone knows how to help me out with this, it would be an amazing help! I can’t afford AWS.

submitted by /u/HairNo5183

World Wide Cell Towers Dataset: Geographic Coordinates & Network Info

Description:

Hey Reddit! 📡 Check out this extensive dataset containing detailed geographic coordinates and network information for cell tower locations worldwide, organized by continent. It’s a treasure trove for spatial analysis, telecommunications research, and network planning enthusiasts!

Key Features:

Coverage: Over 46 million records of cell tower locations.

Columns: Includes data like Radio technology, MCC (Mobile Country Code), MNC (Mobile Network Code), LAC (Location Area Code), CID (Base Transceiver Station ID), Longitude, Latitude, Range, Samples, Changeable status, Created and Updated timestamps, AverageSignal strength, Country, Network owner, and Continent.

Use Cases:

Explore global distribution and characteristics of cell towers.

Analyze network coverage patterns and trends.

Dive into telecommunications research.

Note: The dataset’s AverageSignal column mostly displays zero values due to data aggregation methods.

Check out the dataset on Kaggle.

Feel free to dive into this dataset and share your insights! Let me know if you need more details or have questions. 😊

submitted by /u/Miozaki0

Research About Data Platform For University Thesis

Hello guys and girls 🙂

My name is Augustin, and I’m currently studying and researching how data professionals, like you, can maximize the impact of data platforms.

I’m working on a concept for a marketing data platform for an esports team. The goal would be to provide a platform that simplifies complex data sets and transforms them into actionable insights.

I’d love to hear your thoughts on the following questions:

What are the biggest challenges you currently face with data platforms?

What features do you find most useful in existing platforms, and what do you wish they could improve?

How important are predictive analytics for your work, and what predictive features do you find valuable?

Your input will directly contribute to refining my research, and I’d greatly appreciate your insights! If you have any questions about it, feel free to ask; I will gladly answer!

Thanks a lot for your time 🙂

Augustin

submitted by /u/Phacebyaz

Looking For Data On Country Population By Income Brackets

I’m looking for datasets that break down the population by income brackets. E.g.:

Annual income          Percentage of population
Less than $10,000      3%
$10,000 to $15,000     7%
$15,000 to $20,000     11%
$20,000 to $25,000     30%
etc…                   etc…

I would like to find this data for various countries across the world. I don’t need every country, but the majority of the more economically developed countries (e.g. Western Europe, USA, Canada, etc.).

For example, here is one I found for the U.S. on https://data.census.gov/table?q=income

Is there any database where I can find this data for other countries? Thank you!

submitted by /u/fcbasel9995

Need Help Finding Open Online Games Dataset

Hi,

I am running a project for which I need to analyse player performance histories for lots of different kinds of online games.

Thus, the minimum requirement is that the dataset should have playerID, match outcomes, and time stamps.

I have found datasets for chess, CSGO, DOTA, League of Legends, Scrabble and sports betting. However, I want help finding more games.

For example:

Variants of poker, fantasy sports, board games played online, card games like bridge, solitaire (klondike), minesweeper, any racing games, puzzles…

And so on. Is there a place where I can find these?

I feel like I have either exhausted Kaggle or cannot come up with the right keywords.
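When screening candidate datasets against the minimum requirement above (player ID, match outcome, time stamp), a quick header check can save some manual inspection. This is only a rough sketch: the column names in `REQUIRED_COLUMNS` are hypothetical, and real datasets will label these fields differently, so the set would need adjusting per source.

```python
import csv
import io

# Hypothetical column names; real datasets will use their own labels.
REQUIRED_COLUMNS = {"player_id", "outcome", "timestamp"}


def missing_columns(csv_text, required=REQUIRED_COLUMNS):
    """Return the required columns absent from the CSV header, sorted."""
    reader = csv.reader(io.StringIO(csv_text))
    header = next(reader, [])  # an empty file yields an empty header
    present = {name.strip().lower() for name in header}
    return sorted(required - present)


# Made-up sample row, purely to show the check.
sample = "player_id,outcome,timestamp\np1,win,2024-01-01T12:00:00\n"
print(missing_columns(sample))  # []
```

An empty list means the file has all three fields; anything else names what is missing.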

submitted by /u/the_cogsci_guy

Help Improve Social Media: Your Opinion Matters!

Dear Friends,

I am working on an important project for my probability and statistics course that aims to address the issue of social media bots. Your input is invaluable in shaping this research and potentially influencing social media platforms to reduce the presence of bots, leading to a better online experience with reliable information.

How You Can Help:

Kindly spare a few moments from your busy schedule to fill out this survey: https://forms.gle/uk2czZkAh4cmH2DEA

Your contribution will have a significant impact on creating a more authentic online environment.

Why It Matters:

By participating, you are contributing to a cause that can enhance the quality of online interactions and promote the spread of genuine information.

Your support means the world to me, and I am grateful for your participation in this endeavor.

Thank you for being part of this initiative.

submitted by /u/skinnyytallboy