Scalable evaluation
We aim to reduce the human effort needed to evaluate large language models. For example, Mauve enables automatic evaluation of text at a distributional level using information divergences.
Related publications
2023
2021
- In Advances in Neural Information Processing Systems, Nov 2021
- In Advances in Neural Information Processing Systems, Nov 2021