News

"These models can get as big as we allow them too." To handle that, PyTorch 1.1 adds the ability to split networks across GPUs, known as "sharding" the model. Previously, PyTorch allowed ...