XLM


  • XLM supports multi-GPU and multi-node training, and contains code for: - **Language model pretraining**: - **Causal Language Model** (CLM) - **Masked Language Model** (MLM) - **Translation Language Model** (TLM) - **GLUE** fine-tuning - **XNLI** fine-tuning - **Supervised / Unsupervised MT** training: - Denoising auto-encoder - Parallel data training - Online back-translation