OVERVIEW This repository contains PyTorch Lightning code for training and evaluating the PE-AG-GMoE model. It supports single- and multi-GPU runs (DDP), gradient accumulation and automatic ...