LightSeq2: Accelerated Training for Transformer-Based Models on GPUs | IEEE Conference Publication | IEEE Xplore