Concept of Tensor Model Parallelism#
Authors: Kichang Yang, Kevin Ko, Minho Ryu
Tensor Model Parallelism makes it possible to train larger models by partitioning the parameter tensors into multiple dimensions. We support 1D, 2D, 2.5D, and 3D tensor partitioning algorithms which make tensor parallel training more efficient.