2 Million Messy → Clean Addresses. What Would You Build With This?

Hello fellow developers,

I have a dataset containing 2 million complete Brazilian addresses, manually typed by real users. These addresses include abbreviations, typos, inconsistent formatting, and other common real-world issues.

For each raw address, I also have its fully corrected, standardized, and structured version.

Does anyone have ideas on what kind of solutions or products could be built with this data to solve real-world problems?

Thanks in advance for any insights!

submitted by /u/Hour-Dirt-8505
[link] [comments]

Leave a Reply

Your email address will not be published. Required fields are marked *