Foundation models for mathematics

As part of a multi-lab collaboration, we developed Llemma, a foundation model for mathematics, and ProofPile II, a 55-billion-token dataset that includes 1.5 billion tokens of formal code.

Related publications

2023

  1. Llemma: An Open Language Model For Mathematics
    Zhangir Azerbayev, Hailey Schoelkopf, Keiran Paster, and 6 more authors
    arXiv preprint arXiv:2310.06786, 2023