“OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text”, Paster et al 2023 (14.7B tokens of Internet HTML/LaTeX math text), submitted by /u/gwern