The optimizer caches some metadata on each executor
Tensor element type
cached models
weights of the cached models
gradients of the cached models
cached criterion
cached state
module running time
cached validation methods
cached optim methods