Hi everyone,
I’m working on a project where I need a dataset that contains numbers (like 4–8 digit sequences, phone numbers, PINs, etc.) along with some measure of how easy they are to remember.
For example, numbers like 1234 or 7777 are obviously easier to recall than something like 9274, but I need structured data where each number has a “memorability” score (human-rated or algorithmically assigned).
I’ve been searching, but I haven’t found any existing dataset that directly covers this. Before I go ahead and build a synthetic dataset (based on repetition, patterns, palindromes, chunking, etc.), I wanted to check:
- Does such a dataset already exist in psychology, telecom, or cognitive science research?
- If not, has anyone here worked on generating similar “memorability” metrics for numbers?
- Any tips on crowdsourcing this kind of data (e.g., survey setups)?
Any leads or references would be super helpful
Thanks in advance!
submitted by /u/abel_maireg
[link] [comments]