Just released the largest open-source behavioral dataset for CAPTCHA research on huggingface. Most existing datasets only provide the solution labels (image/text); this dataset includes the full cursor telemetry.
Specs:
- 30,000+ verified human sessions.
- Features: Path curvature, accelerations, micro-corrections, and timing.
- Tasks: Drag mechanics and high-precision object tracking (harder than current production standards).
- Source: Verified human interactions (3 world records broken for scale/participants).
Ideal for training behavioral biometric models, red-teaming anti-bot systems, or researching human-computer interaction (HCI) patterns.
Dataset: https://huggingface.co/datasets/Capycap-AI/CaptchaSolve30k
submitted by /u/SilverWheat
[link] [comments]