Loading TRAINED models with VISSL #577

davitpapikyan · 2023-01-08T23:58:34Z

❓ How to load an already trained model on VISSL

I'm using Torchvision ResNet50, have attached a linear classifier on top of it and have already trained the model on MNIST using VISSL (accuracy ~98%). Now I want to load my model and have smth like this:

model = load("checkpoint")
predictions = model(images)

I tried using Loading a pre-trained model in inference mode but it doesn't help. Seems like my model's weights are randomly initialized and accuracy on the same dataset is ~10%.
Note that the tutorial above shows how to load a pre-trained model, my case is different - I have modified the trunk by adding a linear classifier on top of it.

QuentinDuval · 2023-01-09T19:59:04Z

Hello @davitpapikyan,

Thanks a lot for using VISSL and thanks a lot for your questions :)

I am not 100% I understand your use case correctly:

Is it that you trained a model with VISSL and what to load it in a torchvision ResNet50?
Or is it that you trained a model with TorchVision manually and what to load it in VISSL?
Or it is something else entirely?

I would like to get some additional details so that I can help you appropriately.
Also, do you have logs you could share with me, so I can see if the weights are loaded properly?

Thank you,
Quentin

davitpapikyan · 2023-01-09T21:32:09Z

Hi @QuentinDuval ,
Thank you for contacting.

I have appended a classification head to Torchvision ResNet50 trunk, have fine-tuned the classifier on MNIST. Now I have "<checkpoint_name.torch>" checkpoint of that model (produced by run_distributed_engines.py) and want to load it like this:

model = load("checkpoint_name.torch")
predictions = model(images)

What would yo recommend me to do?
Unfortunately I don't have useful logs to share with you.

Thank yo very much.

QuentinDuval · 2023-01-09T23:24:35Z

Hi @davitpapikyan,

So there are several ways. One way is go through the run_distributed_engine.py again to evaluate the accuracy on a given dataset. Here is how it looks to evaluate on a given dataset with a ResNet50:

python /private/home/qduval/project/ssl_scaling/tools/run_distributed_engines.py config=benchmark/robustness_out_of_distribution/eval_resnet_8gpu_test_only +config/benchmark/robustness_out_of_distribution/datasets=imagenet_val  +config/benchmark/fulltune/imagenet1k/models=resnext50    config.MODEL.WEIGHTS_INIT.PARAMS_FILE=/checkpoint/qduval/vissl/supervised/rn50_torchvision.torch

The other way is the programmatic way in which you do it through code:

import torch
from vissl.models import build_model
from vissl.utils.hydra_config import compose_hydra_configuration, convert_to_attrdict


def load_model(overrides, checkpoint):
    cfg = compose_hydra_configuration(overrides)
    _, config = convert_to_attrdict(cfg, dump_config=False)
    model = build_model(config.MODEL, config.OPTIMIZER)
    checkpoint = torch.load(checkpoint)
    model.init_model_from_weights_params_file(config, checkpoint)
    return model

The overrides contain a list of the configuration relevant to the creation of your model, here is an example:

model = load_model(
    checkpoint="/checkpoint/qduval/vissl/supervised/rn50_torchvision.torch",
    overrides=[
        "config=benchmark/robustness_out_of_distribution/eval_resnet_8gpu_test_only",
        "+config/benchmark/fulltune/imagenet1k/models=resnext50",
    ]
)

Please tell me if that helps you :)

davitpapikyan · 2023-01-10T10:40:52Z

Unfortunately with your approach (load_model) the accuracy of my classifier degrades from 98% to 10%. I checked your repo and found another way which preserves the performance:

from vissl.models.base_ssl_model import BaseSSLMultiInputOutputModel
from classy_vision.generic.util import load_checkpoint
from vissl.utils.hydra_config import compose_hydra_configuration, convert_to_attrdict


config = compose_hydra_configuration(config)
_, config = convert_to_attrdict(config)

model = BaseSSLMultiInputOutputModel(config.MODEL, config.OPTIMIZER)
weights = load_checkpoint(checkpoint_path=config.MODEL.WEIGHTS_INIT.PARAMS_FILE)
vissl_state_dict = weights.get("classy_state_dict")
model.set_classy_state(vissl_state_dict["base_model"])

QuentinDuval · 2023-01-10T14:30:19Z

Oh interesting, it works on my end, what version of VISSL are you using ? I will check if there is a bug there.

davitpapikyan · 2023-01-10T14:56:25Z

Dear @QuentinDuval, I'm using 0.1.6 version of VISSL.

davitpapikyan · 2023-01-22T13:04:23Z

Dear @QuentinDuval, given a full-tuned model, how can one lead it?
Let's take Benchmark Full-Finetuning on ImageNet-1K as an example.
My approach fails because the full-tuned model has prefixes different from configs and I cannot handle them using APPEND_PREFIX or REMOVE_PREFIX options. Your approach loads the model successfully but its accuracy is ~10% instead of 98%. Can you please suggest a workable solution?

davitpapikyan · 2023-01-25T19:11:52Z

Here are the logs when loading the checkpoint:

INFO 2023-01-25 23:21:05,161 resnext.py:  64: ResNeXT trunk, supports activation checkpointing. Deactivated
INFO 2023-01-25 23:21:05,161 resnext.py:  87: Building model: ResNeXt50-1x64d-w1-BatchNorm2d
INFO 2023-01-25 23:21:05,600 feature_extractor.py:  50: Freezing model trunk...
INFO 2023-01-25 23:21:05,602 util.py: 276: Attempting to load checkpoint from ./vissl_tools/checkpoints/tochvision_resnet50_mnist_2048_10_sc_fulltune/model_final_checkpoint_phase0.torch
INFO 2023-01-25 23:21:05,665 util.py: 281: Loaded checkpoint from ./vissl_tools/checkpoints/tochvision_resnet50_mnist_2048_10_sc_fulltune/model_final_checkpoint_phase0.torch
INFO 2023-01-25 23:21:05,666 base_ssl_model.py: 446: Rank 0: Loading Trunk state dict....
INFO 2023-01-25 23:21:05,682 base_ssl_model.py: 459: Rank 0: Loading Heads state dict....
INFO 2023-01-25 23:21:05,683 base_ssl_model.py: 474: Rank 0: Model state dict loaded!
INFO 2023-01-25 23:21:05,683 checkpoint.py: 672: Loaded: base_model._feature_blocks.conv1.weight                              of shape: torch.Size([64, 3, 7, 7]) from checkpoint
INFO 2023-01-25 23:21:05,684 checkpoint.py: 672: Loaded: base_model._feature_blocks.bn1.weight                                of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,684 checkpoint.py: 672: Loaded: base_model._feature_blocks.bn1.bias                                  of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,684 checkpoint.py: 672: Loaded: base_model._feature_blocks.bn1.running_mean                          of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,684 checkpoint.py: 672: Loaded: base_model._feature_blocks.bn1.running_var                           of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,684 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.bn1.num_batches_tracked
INFO 2023-01-25 23:21:05,684 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.0.conv1.weight                     of shape: torch.Size([64, 64, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,684 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.0.bn1.weight                       of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,684 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.0.bn1.bias                         of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,684 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.0.bn1.running_mean                 of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,684 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.0.bn1.running_var                  of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,684 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer1.0.bn1.num_batches_tracked
INFO 2023-01-25 23:21:05,684 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.0.conv2.weight                     of shape: torch.Size([64, 64, 3, 3]) from checkpoint
INFO 2023-01-25 23:21:05,684 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.0.bn2.weight                       of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,685 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.0.bn2.bias                         of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,685 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.0.bn2.running_mean                 of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,685 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.0.bn2.running_var                  of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,685 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer1.0.bn2.num_batches_tracked
INFO 2023-01-25 23:21:05,685 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.0.conv3.weight                     of shape: torch.Size([256, 64, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,685 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.0.bn3.weight                       of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,685 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.0.bn3.bias                         of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,685 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.0.bn3.running_mean                 of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,685 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.0.bn3.running_var                  of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,685 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer1.0.bn3.num_batches_tracked
INFO 2023-01-25 23:21:05,685 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.0.downsample.0.weight              of shape: torch.Size([256, 64, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,685 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.0.downsample.1.weight              of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,686 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.0.downsample.1.bias                of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,686 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.0.downsample.1.running_mean        of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,686 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.0.downsample.1.running_var         of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,686 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer1.0.downsample.1.num_batches_tracked
INFO 2023-01-25 23:21:05,686 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.1.conv1.weight                     of shape: torch.Size([64, 256, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,686 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.1.bn1.weight                       of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,686 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.1.bn1.bias                         of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,686 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.1.bn1.running_mean                 of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,686 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.1.bn1.running_var                  of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,686 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer1.1.bn1.num_batches_tracked
INFO 2023-01-25 23:21:05,686 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.1.conv2.weight                     of shape: torch.Size([64, 64, 3, 3]) from checkpoint
INFO 2023-01-25 23:21:05,686 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.1.bn2.weight                       of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,686 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.1.bn2.bias                         of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,687 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.1.bn2.running_mean                 of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,687 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.1.bn2.running_var                  of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,687 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer1.1.bn2.num_batches_tracked
INFO 2023-01-25 23:21:05,687 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.1.conv3.weight                     of shape: torch.Size([256, 64, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,687 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.1.bn3.weight                       of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,687 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.1.bn3.bias                         of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,687 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.1.bn3.running_mean                 of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,687 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.1.bn3.running_var                  of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,687 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer1.1.bn3.num_batches_tracked
INFO 2023-01-25 23:21:05,687 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.2.conv1.weight                     of shape: torch.Size([64, 256, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,687 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.2.bn1.weight                       of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,687 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.2.bn1.bias                         of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,687 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.2.bn1.running_mean                 of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,688 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.2.bn1.running_var                  of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,688 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer1.2.bn1.num_batches_tracked
INFO 2023-01-25 23:21:05,688 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.2.conv2.weight                     of shape: torch.Size([64, 64, 3, 3]) from checkpoint
INFO 2023-01-25 23:21:05,688 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.2.bn2.weight                       of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,688 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.2.bn2.bias                         of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,688 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.2.bn2.running_mean                 of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,688 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.2.bn2.running_var                  of shape: torch.Size([64]) from checkpoint
INFO 2023-01-25 23:21:05,688 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer1.2.bn2.num_batches_tracked
INFO 2023-01-25 23:21:05,688 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.2.conv3.weight                     of shape: torch.Size([256, 64, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,688 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.2.bn3.weight                       of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,688 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.2.bn3.bias                         of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,688 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.2.bn3.running_mean                 of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,688 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer1.2.bn3.running_var                  of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,689 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer1.2.bn3.num_batches_tracked
INFO 2023-01-25 23:21:05,689 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.0.conv1.weight                     of shape: torch.Size([128, 256, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,689 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.0.bn1.weight                       of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,689 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.0.bn1.bias                         of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,689 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.0.bn1.running_mean                 of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,689 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.0.bn1.running_var                  of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,689 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer2.0.bn1.num_batches_tracked
INFO 2023-01-25 23:21:05,689 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.0.conv2.weight                     of shape: torch.Size([128, 128, 3, 3]) from checkpoint
INFO 2023-01-25 23:21:05,689 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.0.bn2.weight                       of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,689 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.0.bn2.bias                         of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,689 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.0.bn2.running_mean                 of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,689 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.0.bn2.running_var                  of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,689 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer2.0.bn2.num_batches_tracked
INFO 2023-01-25 23:21:05,690 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.0.conv3.weight                     of shape: torch.Size([512, 128, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,690 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.0.bn3.weight                       of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,690 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.0.bn3.bias                         of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,690 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.0.bn3.running_mean                 of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,690 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.0.bn3.running_var                  of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,690 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer2.0.bn3.num_batches_tracked
INFO 2023-01-25 23:21:05,690 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.0.downsample.0.weight              of shape: torch.Size([512, 256, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,690 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.0.downsample.1.weight              of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,690 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.0.downsample.1.bias                of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,690 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.0.downsample.1.running_mean        of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,690 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.0.downsample.1.running_var         of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,690 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer2.0.downsample.1.num_batches_tracked
INFO 2023-01-25 23:21:05,690 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.1.conv1.weight                     of shape: torch.Size([128, 512, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,691 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.1.bn1.weight                       of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,691 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.1.bn1.bias                         of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,691 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.1.bn1.running_mean                 of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,691 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.1.bn1.running_var                  of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,691 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer2.1.bn1.num_batches_tracked
INFO 2023-01-25 23:21:05,691 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.1.conv2.weight                     of shape: torch.Size([128, 128, 3, 3]) from checkpoint
INFO 2023-01-25 23:21:05,691 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.1.bn2.weight                       of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,691 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.1.bn2.bias                         of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,691 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.1.bn2.running_mean                 of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,691 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.1.bn2.running_var                  of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,691 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer2.1.bn2.num_batches_tracked
INFO 2023-01-25 23:21:05,691 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.1.conv3.weight                     of shape: torch.Size([512, 128, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,692 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.1.bn3.weight                       of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,692 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.1.bn3.bias                         of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,692 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.1.bn3.running_mean                 of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,692 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.1.bn3.running_var                  of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,692 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer2.1.bn3.num_batches_tracked
INFO 2023-01-25 23:21:05,692 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.2.conv1.weight                     of shape: torch.Size([128, 512, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,692 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.2.bn1.weight                       of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,692 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.2.bn1.bias                         of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,692 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.2.bn1.running_mean                 of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,692 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.2.bn1.running_var                  of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,692 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer2.2.bn1.num_batches_tracked
INFO 2023-01-25 23:21:05,692 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.2.conv2.weight                     of shape: torch.Size([128, 128, 3, 3]) from checkpoint
INFO 2023-01-25 23:21:05,692 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.2.bn2.weight                       of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,693 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.2.bn2.bias                         of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,693 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.2.bn2.running_mean                 of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,693 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.2.bn2.running_var                  of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,693 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer2.2.bn2.num_batches_tracked
INFO 2023-01-25 23:21:05,693 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.2.conv3.weight                     of shape: torch.Size([512, 128, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,693 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.2.bn3.weight                       of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,693 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.2.bn3.bias                         of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,693 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.2.bn3.running_mean                 of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,693 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.2.bn3.running_var                  of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,693 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer2.2.bn3.num_batches_tracked
INFO 2023-01-25 23:21:05,693 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.3.conv1.weight                     of shape: torch.Size([128, 512, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,693 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.3.bn1.weight                       of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,693 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.3.bn1.bias                         of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,693 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.3.bn1.running_mean                 of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,693 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.3.bn1.running_var                  of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,693 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer2.3.bn1.num_batches_tracked
INFO 2023-01-25 23:21:05,693 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.3.conv2.weight                     of shape: torch.Size([128, 128, 3, 3]) from checkpoint
INFO 2023-01-25 23:21:05,693 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.3.bn2.weight                       of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,693 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.3.bn2.bias                         of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,693 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.3.bn2.running_mean                 of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,693 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.3.bn2.running_var                  of shape: torch.Size([128]) from checkpoint
INFO 2023-01-25 23:21:05,694 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer2.3.bn2.num_batches_tracked
INFO 2023-01-25 23:21:05,694 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.3.conv3.weight                     of shape: torch.Size([512, 128, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,694 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.3.bn3.weight                       of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,694 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.3.bn3.bias                         of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,694 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.3.bn3.running_mean                 of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,694 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer2.3.bn3.running_var                  of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,694 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer2.3.bn3.num_batches_tracked
INFO 2023-01-25 23:21:05,694 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.0.conv1.weight                     of shape: torch.Size([256, 512, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,694 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.0.bn1.weight                       of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,694 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.0.bn1.bias                         of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,694 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.0.bn1.running_mean                 of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,694 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.0.bn1.running_var                  of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,694 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer3.0.bn1.num_batches_tracked
INFO 2023-01-25 23:21:05,694 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.0.conv2.weight                     of shape: torch.Size([256, 256, 3, 3]) from checkpoint
INFO 2023-01-25 23:21:05,694 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.0.bn2.weight                       of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,694 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.0.bn2.bias                         of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,694 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.0.bn2.running_mean                 of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,694 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.0.bn2.running_var                  of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,694 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer3.0.bn2.num_batches_tracked
INFO 2023-01-25 23:21:05,694 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.0.conv3.weight                     of shape: torch.Size([1024, 256, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,694 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.0.bn3.weight                       of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,694 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.0.bn3.bias                         of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,694 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.0.bn3.running_mean                 of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,694 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.0.bn3.running_var                  of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,694 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer3.0.bn3.num_batches_tracked
INFO 2023-01-25 23:21:05,694 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.0.downsample.0.weight              of shape: torch.Size([1024, 512, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,694 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.0.downsample.1.weight              of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,694 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.0.downsample.1.bias                of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,694 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.0.downsample.1.running_mean        of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,695 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.0.downsample.1.running_var         of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,695 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer3.0.downsample.1.num_batches_tracked
INFO 2023-01-25 23:21:05,695 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.1.conv1.weight                     of shape: torch.Size([256, 1024, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,695 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.1.bn1.weight                       of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,695 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.1.bn1.bias                         of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,695 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.1.bn1.running_mean                 of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,695 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.1.bn1.running_var                  of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,695 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer3.1.bn1.num_batches_tracked
INFO 2023-01-25 23:21:05,695 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.1.conv2.weight                     of shape: torch.Size([256, 256, 3, 3]) from checkpoint
INFO 2023-01-25 23:21:05,695 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.1.bn2.weight                       of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,695 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.1.bn2.bias                         of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,695 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.1.bn2.running_mean                 of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,695 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.1.bn2.running_var                  of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,695 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer3.1.bn2.num_batches_tracked
INFO 2023-01-25 23:21:05,695 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.1.conv3.weight                     of shape: torch.Size([1024, 256, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,695 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.1.bn3.weight                       of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,695 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.1.bn3.bias                         of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,695 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.1.bn3.running_mean                 of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,695 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.1.bn3.running_var                  of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,695 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer3.1.bn3.num_batches_tracked
INFO 2023-01-25 23:21:05,695 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.2.conv1.weight                     of shape: torch.Size([256, 1024, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,695 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.2.bn1.weight                       of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,695 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.2.bn1.bias                         of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,695 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.2.bn1.running_mean                 of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,695 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.2.bn1.running_var                  of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,695 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer3.2.bn1.num_batches_tracked
INFO 2023-01-25 23:21:05,695 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.2.conv2.weight                     of shape: torch.Size([256, 256, 3, 3]) from checkpoint
INFO 2023-01-25 23:21:05,695 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.2.bn2.weight                       of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,695 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.2.bn2.bias                         of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,696 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.2.bn2.running_mean                 of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,696 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.2.bn2.running_var                  of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,696 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer3.2.bn2.num_batches_tracked
INFO 2023-01-25 23:21:05,696 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.2.conv3.weight                     of shape: torch.Size([1024, 256, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,696 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.2.bn3.weight                       of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,696 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.2.bn3.bias                         of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,696 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.2.bn3.running_mean                 of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,696 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.2.bn3.running_var                  of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,696 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer3.2.bn3.num_batches_tracked
INFO 2023-01-25 23:21:05,696 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.3.conv1.weight                     of shape: torch.Size([256, 1024, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,696 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.3.bn1.weight                       of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,696 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.3.bn1.bias                         of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,696 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.3.bn1.running_mean                 of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,696 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.3.bn1.running_var                  of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,696 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer3.3.bn1.num_batches_tracked
INFO 2023-01-25 23:21:05,696 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.3.conv2.weight                     of shape: torch.Size([256, 256, 3, 3]) from checkpoint
INFO 2023-01-25 23:21:05,696 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.3.bn2.weight                       of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,696 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.3.bn2.bias                         of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,696 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.3.bn2.running_mean                 of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,696 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.3.bn2.running_var                  of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,696 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer3.3.bn2.num_batches_tracked
INFO 2023-01-25 23:21:05,696 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.3.conv3.weight                     of shape: torch.Size([1024, 256, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,696 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.3.bn3.weight                       of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,696 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.3.bn3.bias                         of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,696 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.3.bn3.running_mean                 of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,696 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.3.bn3.running_var                  of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,696 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer3.3.bn3.num_batches_tracked
INFO 2023-01-25 23:21:05,696 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.4.conv1.weight                     of shape: torch.Size([256, 1024, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,696 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.4.bn1.weight                       of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,697 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.4.bn1.bias                         of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,697 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.4.bn1.running_mean                 of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,697 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.4.bn1.running_var                  of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,697 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer3.4.bn1.num_batches_tracked
INFO 2023-01-25 23:21:05,697 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.4.conv2.weight                     of shape: torch.Size([256, 256, 3, 3]) from checkpoint
INFO 2023-01-25 23:21:05,697 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.4.bn2.weight                       of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,697 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.4.bn2.bias                         of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,697 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.4.bn2.running_mean                 of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,697 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.4.bn2.running_var                  of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,697 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer3.4.bn2.num_batches_tracked
INFO 2023-01-25 23:21:05,697 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.4.conv3.weight                     of shape: torch.Size([1024, 256, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,697 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.4.bn3.weight                       of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,697 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.4.bn3.bias                         of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,697 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.4.bn3.running_mean                 of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,697 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.4.bn3.running_var                  of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,697 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer3.4.bn3.num_batches_tracked
INFO 2023-01-25 23:21:05,697 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.5.conv1.weight                     of shape: torch.Size([256, 1024, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,697 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.5.bn1.weight                       of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,697 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.5.bn1.bias                         of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,697 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.5.bn1.running_mean                 of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,697 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.5.bn1.running_var                  of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,697 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer3.5.bn1.num_batches_tracked
INFO 2023-01-25 23:21:05,697 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.5.conv2.weight                     of shape: torch.Size([256, 256, 3, 3]) from checkpoint
INFO 2023-01-25 23:21:05,697 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.5.bn2.weight                       of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,697 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.5.bn2.bias                         of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,697 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.5.bn2.running_mean                 of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,697 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.5.bn2.running_var                  of shape: torch.Size([256]) from checkpoint
INFO 2023-01-25 23:21:05,697 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer3.5.bn2.num_batches_tracked
INFO 2023-01-25 23:21:05,697 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.5.conv3.weight                     of shape: torch.Size([1024, 256, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,698 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.5.bn3.weight                       of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,698 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.5.bn3.bias                         of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,698 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.5.bn3.running_mean                 of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,698 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer3.5.bn3.running_var                  of shape: torch.Size([1024]) from checkpoint
INFO 2023-01-25 23:21:05,698 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer3.5.bn3.num_batches_tracked
INFO 2023-01-25 23:21:05,698 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.0.conv1.weight                     of shape: torch.Size([512, 1024, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,698 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.0.bn1.weight                       of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,698 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.0.bn1.bias                         of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,698 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.0.bn1.running_mean                 of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,698 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.0.bn1.running_var                  of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,698 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer4.0.bn1.num_batches_tracked
INFO 2023-01-25 23:21:05,698 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.0.conv2.weight                     of shape: torch.Size([512, 512, 3, 3]) from checkpoint
INFO 2023-01-25 23:21:05,698 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.0.bn2.weight                       of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,698 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.0.bn2.bias                         of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,698 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.0.bn2.running_mean                 of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,698 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.0.bn2.running_var                  of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,698 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer4.0.bn2.num_batches_tracked
INFO 2023-01-25 23:21:05,698 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.0.conv3.weight                     of shape: torch.Size([2048, 512, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,698 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.0.bn3.weight                       of shape: torch.Size([2048]) from checkpoint
INFO 2023-01-25 23:21:05,698 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.0.bn3.bias                         of shape: torch.Size([2048]) from checkpoint
INFO 2023-01-25 23:21:05,698 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.0.bn3.running_mean                 of shape: torch.Size([2048]) from checkpoint
INFO 2023-01-25 23:21:05,698 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.0.bn3.running_var                  of shape: torch.Size([2048]) from checkpoint
INFO 2023-01-25 23:21:05,698 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer4.0.bn3.num_batches_tracked
INFO 2023-01-25 23:21:05,698 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.0.downsample.0.weight              of shape: torch.Size([2048, 1024, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,698 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.0.downsample.1.weight              of shape: torch.Size([2048]) from checkpoint
INFO 2023-01-25 23:21:05,698 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.0.downsample.1.bias                of shape: torch.Size([2048]) from checkpoint
INFO 2023-01-25 23:21:05,698 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.0.downsample.1.running_mean        of shape: torch.Size([2048]) from checkpoint
INFO 2023-01-25 23:21:05,698 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.0.downsample.1.running_var         of shape: torch.Size([2048]) from checkpoint
INFO 2023-01-25 23:21:05,698 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer4.0.downsample.1.num_batches_tracked
INFO 2023-01-25 23:21:05,699 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.1.conv1.weight                     of shape: torch.Size([512, 2048, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,699 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.1.bn1.weight                       of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,699 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.1.bn1.bias                         of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,699 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.1.bn1.running_mean                 of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,699 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.1.bn1.running_var                  of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,699 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer4.1.bn1.num_batches_tracked
INFO 2023-01-25 23:21:05,699 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.1.conv2.weight                     of shape: torch.Size([512, 512, 3, 3]) from checkpoint
INFO 2023-01-25 23:21:05,699 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.1.bn2.weight                       of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,699 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.1.bn2.bias                         of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,699 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.1.bn2.running_mean                 of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,699 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.1.bn2.running_var                  of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,699 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer4.1.bn2.num_batches_tracked
INFO 2023-01-25 23:21:05,699 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.1.conv3.weight                     of shape: torch.Size([2048, 512, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,699 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.1.bn3.weight                       of shape: torch.Size([2048]) from checkpoint
INFO 2023-01-25 23:21:05,699 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.1.bn3.bias                         of shape: torch.Size([2048]) from checkpoint
INFO 2023-01-25 23:21:05,699 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.1.bn3.running_mean                 of shape: torch.Size([2048]) from checkpoint
INFO 2023-01-25 23:21:05,699 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.1.bn3.running_var                  of shape: torch.Size([2048]) from checkpoint
INFO 2023-01-25 23:21:05,699 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer4.1.bn3.num_batches_tracked
INFO 2023-01-25 23:21:05,699 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.2.conv1.weight                     of shape: torch.Size([512, 2048, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,699 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.2.bn1.weight                       of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,699 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.2.bn1.bias                         of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,699 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.2.bn1.running_mean                 of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,699 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.2.bn1.running_var                  of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,699 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer4.2.bn1.num_batches_tracked
INFO 2023-01-25 23:21:05,699 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.2.conv2.weight                     of shape: torch.Size([512, 512, 3, 3]) from checkpoint
INFO 2023-01-25 23:21:05,699 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.2.bn2.weight                       of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,699 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.2.bn2.bias                         of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,699 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.2.bn2.running_mean                 of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,699 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.2.bn2.running_var                  of shape: torch.Size([512]) from checkpoint
INFO 2023-01-25 23:21:05,700 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer4.2.bn2.num_batches_tracked
INFO 2023-01-25 23:21:05,700 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.2.conv3.weight                     of shape: torch.Size([2048, 512, 1, 1]) from checkpoint
INFO 2023-01-25 23:21:05,700 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.2.bn3.weight                       of shape: torch.Size([2048]) from checkpoint
INFO 2023-01-25 23:21:05,700 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.2.bn3.bias                         of shape: torch.Size([2048]) from checkpoint
INFO 2023-01-25 23:21:05,700 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.2.bn3.running_mean                 of shape: torch.Size([2048]) from checkpoint
INFO 2023-01-25 23:21:05,700 checkpoint.py: 672: Loaded: base_model._feature_blocks.layer4.2.bn3.running_var                  of shape: torch.Size([2048]) from checkpoint
INFO 2023-01-25 23:21:05,700 checkpoint.py: 657: Ignored layer: base_model._feature_blocks.layer4.2.bn3.num_batches_tracked
INFO 2023-01-25 23:21:05,700 checkpoint.py: 672: Loaded: 0.channel_bn.weight                                                  of shape: torch.Size([2048]) from checkpoint
INFO 2023-01-25 23:21:05,700 checkpoint.py: 672: Loaded: 0.channel_bn.bias                                                    of shape: torch.Size([2048]) from checkpoint
INFO 2023-01-25 23:21:05,700 checkpoint.py: 672: Loaded: 0.channel_bn.running_mean                                            of shape: torch.Size([2048]) from checkpoint
INFO 2023-01-25 23:21:05,700 checkpoint.py: 672: Loaded: 0.channel_bn.running_var                                             of shape: torch.Size([2048]) from checkpoint
INFO 2023-01-25 23:21:05,700 checkpoint.py: 657: Ignored layer: 0.channel_bn.num_batches_tracked
INFO 2023-01-25 23:21:05,700 checkpoint.py: 672: Loaded: 0.clf.clf.0.weight                                                   of shape: torch.Size([10, 2048]) from checkpoint
INFO 2023-01-25 23:21:05,700 checkpoint.py: 672: Loaded: 0.clf.clf.0.bias                                                     of shape: torch.Size([10]) from checkpoint
INFO 2023-01-25 23:21:05,700 checkpoint.py: 685: Extra layers not loaded from checkpoint: []

@QuentinDuval Do you have any suggestion why the performance degrades? Seems that weights are being loaded correctly.

osoblanco · 2023-05-07T16:44:42Z

@QuentinDuval Any news on this issue ?

QuentinDuval self-assigned this Jan 9, 2023

QuentinDuval added the awaiting-user-response label Jan 9, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Loading TRAINED models with VISSL #577

Loading TRAINED models with VISSL #577

davitpapikyan commented Jan 8, 2023

QuentinDuval commented Jan 9, 2023

davitpapikyan commented Jan 9, 2023

QuentinDuval commented Jan 9, 2023

davitpapikyan commented Jan 10, 2023

QuentinDuval commented Jan 10, 2023 •

edited

Loading

davitpapikyan commented Jan 10, 2023

davitpapikyan commented Jan 22, 2023

davitpapikyan commented Jan 25, 2023 •

edited

Loading

osoblanco commented May 7, 2023

Loading TRAINED models with VISSL #577

Loading TRAINED models with VISSL #577

Comments

davitpapikyan commented Jan 8, 2023

❓ How to load an already trained model on VISSL

QuentinDuval commented Jan 9, 2023

davitpapikyan commented Jan 9, 2023

QuentinDuval commented Jan 9, 2023

davitpapikyan commented Jan 10, 2023

QuentinDuval commented Jan 10, 2023 • edited Loading

davitpapikyan commented Jan 10, 2023

davitpapikyan commented Jan 22, 2023

davitpapikyan commented Jan 25, 2023 • edited Loading

osoblanco commented May 7, 2023

QuentinDuval commented Jan 10, 2023 •

edited

Loading

davitpapikyan commented Jan 25, 2023 •

edited

Loading