Efficient Training on Multiple Consumer GPUs with RoundPipe
…In this paper, we propose RoundPipe, a novel pipeline schedule that breaks the weight binding constraint on consumer GPU servers. RoundPipe treats GPUs as a pool of stateless execution workers and dynamically…