Albert - A-lite-BERT

A guide to understand how pretraining transformer models work. It’s basically the prestage, before usage with Huggingface is possible. See the full guide at https://github.com/arrrrrmin/albert-guide.

11 min · arrrrrmin