I Got Tired Of Checking Kaggle, HuggingFace, Data.gov, And Other Sites Every Time I Needed A Dataset, So I Built A Tool That Searches All Of Them At Once

Disclosure: I’m one of the creators of this tool.

Hi all,

I do ML research at Berkeley and the most tedious part of every project is dataset discovery. I’d spend hours opening tabs across Kaggle, HuggingFace, data.gov, Census, WHO, Semantic Scholar, and a dozen other platforms just to find the right data. Then I’d have to manually check licenses, preview columns, and figure out citations.

So my friend and I built Mobus, an open-source MCP server that lets you do all of that from inside Claude or Cursor. You describe what you need in natural language and it searches across 20 platforms, lets you preview the actual data, checks licenses, and generates citations.

It’s free and open source: https://github.com/mobus-ai/Mobus

Quick demo on the site if you want to see it in action: https://mobus.ai

Would love feedback from anyone who deals with this pain point. What data sources are missing that you’d want to see added?

submitted by /u/Swimming_Outside_988
[link] [comments]

Leave a Reply

Your email address will not be published. Required fields are marked *