Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

re-training diffusion model #12

Open
pokameng opened this issue Nov 11, 2022 · 1 comment
Open

re-training diffusion model #12

pokameng opened this issue Nov 11, 2022 · 1 comment

Comments

@pokameng
Copy link

Hi
@jiamings @bahjat-kawar
I have retrain the diffusion model from guide-diffusion, and my setting as follow:
MODEL_FLAGS="--image_size 256 --num_channels 256 --num_res_blocks 2 --learn_sigma True --use_scale_shift_norm true --attention_resolutions 32,16,8 --num_head_channels 64"
DIFFUSION_FLAGS="--resblock_updown True --diffusion_steps 1000 --noise_schedule linear --rescale_learned_sigmas False --rescale_timesteps False "
TRAIN_FLAGS="--lr 1e-4 --microbatch 4 --dropout 0.1"

but it was wrong with this ckpt:
Missing key(s) in state_dict: "temb.dense.0.weight", "temb.dense.0.bias", "temb.dense.1.weight", "temb.dense.1.bias", "conv_in.weight", "conv_in.bias", "down.0.block.0.norm1.weight", "down.0.block.0.norm1.bias", "down.0.block.0.conv1.weight", "down.0.block.0.conv1.bias", "down.0.block.0.temb_proj.weight", "down.0.block.0.temb_proj.bias", "down.0.block.0.norm2.weight", "down.0.block.0.norm2.bias", "down.0.block.0.conv2.weight", "down.0.block.0.conv2.bias", "down.0.block.1.norm1.weight", "down.0.block.1.norm1.bias", "down.0.block.1.conv1.weight", "down.0.block.1.conv1.bias", "down.0.block.1.temb_proj.weight", "down.0.block.1.temb_proj.bias", "down.0.block.1.norm2.weight", "down.0.block.1.norm2.bias", "down.0.block.1.conv2.weight", "down.0.block.1.conv2.bias", "down.0.downsample.conv.weight", "down.0.downsample.conv.bias", "down.1.block.0.norm1.weight", "down.1.block.0.norm1.bias", "down.1.block.0.conv1.weight", "down.1.block.0.conv1.bias", "down.1.block.0.temb_proj.weight", "down.1.block.0.temb_proj.bias", "down.1.block.0.norm2.weight", "down.1.block.0.norm2.bias", "down.1.block.0.conv2.weight", "down.1.block.0.conv2.bias", "down.1.block.1.norm1.weight", "down.1.block.1.norm1.bias", "down.1.block.1.conv1.weight", "down.1.block.1.conv1.bias", "down.1.block.1.temb_proj.weight", "down.1.block.1.temb_proj.bias", "down.1.block.1.norm2.weight", "down.1.block.1.norm2.bias", "down.1.block.1.conv2.weight", "down.1.block.1.conv2.bias", "down.1.downsample.conv.weight", "down.1.downsample.conv.bias", "down.2.block.0.norm1.weight", "down.2.block.0.norm1.bias", "down.2.block.0.conv1.weight", "down.2.block.0.conv1.bias", "down.2.block.0.temb_proj.weight", "down.2.block.0.temb_proj.bias", "down.2.block.0.norm2.weight", "down.2.block.0.norm2.bias", "down.2.block.0.conv2.weight", "down.2.block.0.conv2.bias", "down.2.block.0.nin_shortcut.weight", "down.2.block.0.nin_shortcut.bias", "down.2.block.1.norm1.weight", "down.2.block.1.norm1.bias", "down.2.block.1.conv1.weight", "down.2.block.1.conv1.bias", "down.2.block.1.temb_proj.weight", "down.2.block.1.temb_proj.bias", "down.2.block.1.norm2.weight", "down.2.block.1.norm2.bias", "down.2.block.1.conv2.weight", "down.2.block.1.conv2.bias", "down.2.downsample.conv.weight", "down.2.downsample.conv.bias", "down.3.block.0.norm1.weight", "down.3.block.0.norm1.bias", "down.3.block.0.conv1.weight", "down.3.block.0.conv1.bias", "down.3.block.0.temb_proj.weight", "down.3.block.0.temb_proj.bias", "down.3.block.0.norm2.weight", "down.3.block.0.norm2.bias", "down.3.block.0.conv2.weight", "down.3.block.0.conv2.bias", "down.3.block.1.norm1.weight", "down.3.block.1.norm1.bias", "down.3.block.1.conv1.weight", "down.3.block.1.conv1.bias", "down.3.block.1.temb_proj.weight", "down.3.block.1.temb_proj.bias", "down.3.block.1.norm2.weight", "down.3.block.1.norm2.bias", "down.3.block.1.conv2.weight", "down.3.block.1.conv2.bias", "down.3.downsample.conv.weight", "down.3.downsample.conv.bias", "down.4.block.0.norm1.weight", "down.4.block.0.norm1.bias", "down.4.block.0.conv1.weight", "down.4.block.0.conv1.bias", "down.4.block.0.temb_proj.weight", "down.4.block.0.temb_proj.bias", "down.4.block.0.norm2.weight", "down.4.block.0.norm2.bias", "down.4.block.0.conv2.weight", "down.4.block.0.conv2.bias", "down.4.block.0.nin_shortcut.weight", "down.4.block.0.nin_shortcut.bias", "down.4.block.1.norm1.weight", "down.4.block.1.norm1.bias", "down.4.block.1.conv1.weight", "down.4.block.1.conv1.bias", "down.4.block.1.temb_proj.weight", "down.4.block.1.temb_proj.bias", "down.4.block.1.norm2.weight", "down.4.block.1.norm2.bias", "down.4.block.1.conv2.weight", "down.4.block.1.conv2.bias", "down.4.attn.0.norm.weight", "down.4.attn.0.norm.bias", "down.4.attn.0.q.weight", "down.4.attn.0.q.bias", "down.4.attn.0.k.weight", "down.4.attn.0.k.bias", "down.4.attn.0.v.weight", "down.4.attn.0.v.bias", "down.4.attn.0.proj_out.weight", "down.4.attn.0.proj_out.bias", "down.4.attn.1.norm.weight", "down.4.attn.1.norm.bias", "down.4.attn.1.q.weight", "down.4.attn.1.q.bias", "down.4.attn.1.k.weight", "down.4.attn.1.k.bias", "down.4.attn.1.v.weight", "down.4.attn.1.v.bias", "down.4.attn.1.proj_out.weight", "down.4.attn.1.proj_out.bias", "down.4.downsample.conv.weight", "down.4.downsample.conv.bias", "down.5.block.0.norm1.weight", "down.5.block.0.norm1.bias", "down.5.block.0.conv1.weight", "down.5.block.0.conv1.bias", "down.5.block.0.temb_proj.weight", "down.5.block.0.temb_proj.bias", "down.5.block.0.norm2.weight", "down.5.block.0.norm2.bias", "down.5.block.0.conv2.weight", "down.5.block.0.conv2.bias", "down.5.block.1.norm1.weight", "down.5.block.1.norm1.bias", "down.5.block.1.conv1.weight", "down.5.block.1.conv1.bias", "down.5.block.1.temb_proj.weight", "down.5.block.1.temb_proj.bias", "down.5.block.1.norm2.weight", "down.5.block.1.norm2.bias", "down.5.block.1.conv2.weight", "down.5.block.1.conv2.bias", "mid.block_1.norm1.weight", "mid.block_1.norm1.bias", "mid.block_1.conv1.weight", "mid.block_1.conv1.bias", "mid.block_1.temb_proj.weight", "mid.block_1.temb_proj.bias", "mid.block_1.norm2.weight", "mid.block_1.norm2.bias", "mid.block_1.conv2.weight", "mid.block_1.conv2.bias", "mid.attn_1.norm.weight", "mid.attn_1.norm.bias", "mid.attn_1.q.weight", "mid.attn_1.q.bias", "mid.attn_1.k.weight", "mid.attn_1.k.bias", "mid.attn_1.v.weight", "mid.attn_1.v.bias", "mid.attn_1.proj_out.weight", "mid.attn_1.proj_out.bias", "mid.block_2.norm1.weight", "mid.block_2.norm1.bias", "mid.block_2.conv1.weight", "mid.block_2.conv1.bias", "mid.block_2.temb_proj.weight", "mid.block_2.temb_proj.bias", "mid.block_2.norm2.weight", "mid.block_2.norm2.bias", "mid.block_2.conv2.weight", "mid.block_2.conv2.bias", "up.0.block.0.norm1.weight", "up.0.block.0.norm1.bias", "up.0.block.0.conv1.weight", "up.0.block.0.conv1.bias", "up.0.block.0.temb_proj.weight", "up.0.block.0.temb_proj.bias", "up.0.block.0.norm2.weight", "up.0.block.0.norm2.bias", "up.0.block.0.conv2.weight", "up.0.block.0.conv2.bias", "up.0.block.0.nin_shortcut.weight", "up.0.block.0.nin_shortcut.bias", "up.0.block.1.norm1.weight", "up.0.block.1.norm1.bias", "up.0.block.1.conv1.weight", "up.0.block.1.conv1.bias", "up.0.block.1.temb_proj.weight", "up.0.block.1.temb_proj.bias", "up.0.block.1.norm2.weight", "up.0.block.1.norm2.bias", "up.0.block.1.conv2.weight", "up.0.block.1.conv2.bias", "up.0.block.1.nin_shortcut.weight", "up.0.block.1.nin_shortcut.bias", "up.0.block.2.norm1.weight", "up.0.block.2.norm1.bias", "up.0.block.2.conv1.weight", "up.0.block.2.conv1.bias", "up.0.block.2.temb_proj.weight", "up.0.block.2.temb_proj.bias", "up.0.block.2.norm2.weight", "up.0.block.2.norm2.bias", "up.0.block.2.conv2.weight", "up.0.block.2.conv2.bias", "up.0.block.2.nin_shortcut.weight", "up.0.block.2.nin_shortcut.bias", "up.1.block.0.norm1.weight", "up.1.block.0.norm1.bias", "up.1.block.0.conv1.weight", "up.1.block.0.conv1.bias", "up.1.block.0.temb_proj.weight", "up.1.block.0.temb_proj.bias", "up.1.block.0.norm2.weight", "up.1.block.0.norm2.bias", "up.1.block.0.conv2.weight", "up.1.block.0.conv2.bias", "up.1.block.0.nin_shortcut.weight", "up.1.block.0.nin_shortcut.bias", "up.1.block.1.norm1.weight", "up.1.block.1.norm1.bias", "up.1.block.1.conv1.weight", "up.1.block.1.conv1.bias", "up.1.block.1.temb_proj.weight", "up.1.block.1.temb_proj.bias", "up.1.block.1.norm2.weight", "up.1.block.1.norm2.bias", "up.1.block.1.conv2.weight", "up.1.block.1.conv2.bias", "up.1.block.1.nin_shortcut.weight", "up.1.block.1.nin_shortcut.bias", "up.1.block.2.norm1.weight", "up.1.block.2.norm1.bias", "up.1.block.2.conv1.weight", "up.1.block.2.conv1.bias", "up.1.block.2.temb_proj.weight", "up.1.block.2.temb_proj.bias", "up.1.block.2.norm2.weight", "up.1.block.2.norm2.bias", "up.1.block.2.conv2.weight", "up.1.block.2.conv2.bias", "up.1.block.2.nin_shortcut.weight", "up.1.block.2.nin_shortcut.bias", "up.1.upsample.conv.weight", "up.1.upsample.conv.bias", "up.2.block.0.norm1.weight", "up.2.block.0.norm1.bias", "up.2.block.0.conv1.weight", "up.2.block.0.conv1.bias", "up.2.block.0.temb_proj.weight", "up.2.block.0.temb_proj.bias", "up.2.block.0.norm2.weight", "up.2.block.0.norm2.bias", "up.2.block.0.conv2.weight", "up.2.block.0.conv2.bias", "up.2.block.0.nin_shortcut.weight", "up.2.block.0.nin_shortcut.bias", "up.2.block.1.norm1.weight", "up.2.block.1.norm1.bias", "up.2.block.1.conv1.weight", "up.2.block.1.conv1.bias", "up.2.block.1.temb_proj.weight", "up.2.block.1.temb_proj.bias", "up.2.block.1.norm2.weight", "up.2.block.1.norm2.bias", "up.2.block.1.conv2.weight", "up.2.block.1.conv2.bias", "up.2.block.1.nin_shortcut.weight", "up.2.block.1.nin_shortcut.bias", "up.2.block.2.norm1.weight", "up.2.block.2.norm1.bias", "up.2.block.2.conv1.weight", "up.2.block.2.conv1.bias", "up.2.block.2.temb_proj.weight", "up.2.block.2.temb_proj.bias", "up.2.block.2.norm2.weight", "up.2.block.2.norm2.bias", "up.2.block.2.conv2.weight", "up.2.block.2.conv2.bias", "up.2.block.2.nin_shortcut.weight", "up.2.block.2.nin_shortcut.bias", "up.2.upsample.conv.weight", "up.2.upsample.conv.bias", "up.3.block.0.norm1.weight", "up.3.block.0.norm1.bias", "up.3.block.0.conv1.weight", "up.3.block.0.conv1.bias", "up.3.block.0.temb_proj.weight", "up.3.block.0.temb_proj.bias", "up.3.block.0.norm2.weight", "up.3.block.0.norm2.bias", "up.3.block.0.conv2.weight", "up.3.block.0.conv2.bias", "up.3.block.0.nin_shortcut.weight", "up.3.block.0.nin_shortcut.bias", "up.3.block.1.norm1.weight", "up.3.block.1.norm1.bias", "up.3.block.1.conv1.weight", "up.3.block.1.conv1.bias", "up.3.block.1.temb_proj.weight", "up.3.block.1.temb_proj.bias", "up.3.block.1.norm2.weight", "up.3.block.1.norm2.bias", "up.3.block.1.conv2.weight", "up.3.block.1.conv2.bias", "up.3.block.1.nin_shortcut.weight", "up.3.block.1.nin_shortcut.bias", "up.3.block.2.norm1.weight", "up.3.block.2.norm1.bias", "up.3.block.2.conv1.weight", "up.3.block.2.conv1.bias", "up.3.block.2.temb_proj.weight", "up.3.block.2.temb_proj.bias", "up.3.block.2.norm2.weight", "up.3.block.2.norm2.bias", "up.3.block.2.conv2.weight", "up.3.block.2.conv2.bias", "up.3.block.2.nin_shortcut.weight", "up.3.block.2.nin_shortcut.bias", "up.3.upsample.conv.weight", "up.3.upsample.conv.bias", "up.4.block.0.norm1.weight", "up.4.block.0.norm1.bias", "up.4.block.0.conv1.weight", "up.4.block.0.conv1.bias", "up.4.block.0.temb_proj.weight", "up.4.block.0.temb_proj.bias", "up.4.block.0.norm2.weight", "up.4.block.0.norm2.bias", "up.4.block.0.conv2.weight", "up.4.block.0.conv2.bias", "up.4.block.0.nin_shortcut.weight", "up.4.block.0.nin_shortcut.bias", "up.4.block.1.norm1.weight", "up.4.block.1.norm1.bias", "up.4.block.1.conv1.weight", "up.4.block.1.conv1.bias", "up.4.block.1.temb_proj.weight", "up.4.block.1.temb_proj.bias", "up.4.block.1.norm2.weight", "up.4.block.1.norm2.bias", "up.4.block.1.conv2.weight", "up.4.block.1.conv2.bias", "up.4.block.1.nin_shortcut.weight", "up.4.block.1.nin_shortcut.bias", "up.4.block.2.norm1.weight", "up.4.block.2.norm1.bias", "up.4.block.2.conv1.weight", "up.4.block.2.conv1.bias", "up.4.block.2.temb_proj.weight", "up.4.block.2.temb_proj.bias", "up.4.block.2.norm2.weight", "up.4.block.2.norm2.bias", "up.4.block.2.conv2.weight", "up.4.block.2.conv2.bias", "up.4.block.2.nin_shortcut.weight", "up.4.block.2.nin_shortcut.bias", "up.4.attn.0.norm.weight", "up.4.attn.0.norm.bias", "up.4.attn.0.q.weight", "up.4.attn.0.q.bias", "up.4.attn.0.k.weight", "up.4.attn.0.k.bias", "up.4.attn.0.v.weight", "up.4.attn.0.v.bias", "up.4.attn.0.proj_out.weight", "up.4.attn.0.proj_out.bias", "up.4.attn.1.norm.weight", "up.4.attn.1.norm.bias", "up.4.attn.1.q.weight", "up.4.attn.1.q.bias", "up.4.attn.1.k.weight", "up.4.attn.1.k.bias", "up.4.attn.1.v.weight", "up.4.attn.1.v.bias", "up.4.attn.1.proj_out.weight", "up.4.attn.1.proj_out.bias", "up.4.attn.2.norm.weight", "up.4.attn.2.norm.bias", "up.4.attn.2.q.weight", "up.4.attn.2.q.bias", "up.4.attn.2.k.weight", "up.4.attn.2.k.bias", "up.4.attn.2.v.weight", "up.4.attn.2.v.bias", "up.4.attn.2.proj_out.weight", "up.4.attn.2.proj_out.bias", "up.4.upsample.conv.weight", "up.4.upsample.conv.bias", "up.5.block.0.norm1.weight", "up.5.block.0.norm1.bias", "up.5.block.0.conv1.weight", "up.5.block.0.conv1.bias", "up.5.block.0.temb_proj.weight", "up.5.block.0.temb_proj.bias", "up.5.block.0.norm2.weight", "up.5.block.0.norm2.bias", "up.5.block.0.conv2.weight", "up.5.block.0.conv2.bias", "up.5.block.0.nin_shortcut.weight", "up.5.block.0.nin_shortcut.bias", "up.5.block.1.norm1.weight", "up.5.block.1.norm1.bias", "up.5.block.1.conv1.weight", "up.5.block.1.conv1.bias", "up.5.block.1.temb_proj.weight", "up.5.block.1.temb_proj.bias", "up.5.block.1.norm2.weight", "up.5.block.1.norm2.bias", "up.5.block.1.conv2.weight", "up.5.block.1.conv2.bias", "up.5.block.1.nin_shortcut.weight", "up.5.block.1.nin_shortcut.bias", "up.5.block.2.norm1.weight", "up.5.block.2.norm1.bias", "up.5.block.2.conv1.weight", "up.5.block.2.conv1.bias", "up.5.block.2.temb_proj.weight", "up.5.block.2.temb_proj.bias", "up.5.block.2.norm2.weight", "up.5.block.2.norm2.bias", "up.5.block.2.conv2.weight", "up.5.block.2.conv2.bias", "up.5.block.2.nin_shortcut.weight", "up.5.block.2.nin_shortcut.bias", "up.5.upsample.conv.weight", "up.5.upsample.conv.bias", "norm_out.weight", "norm_out.bias", "conv_out.weight", "conv_out.bias".
Unexpected key(s) in state_dict: "time_embed.0.weight", "time_embed.0.bias", "time_embed.2.weight", "time_embed.2.bias", "input_blocks.0.0.weight", "input_blocks.0.0.bias", "input_blocks.1.0.in_layers.0.weight", "input_blocks.1.0.in_layers.0.bias", "input_blocks.1.0.in_layers.2.weight", "input_blocks.1.0.in_layers.2.bias", "input_blocks.1.0.emb_layers.1.weight", "input_blocks.1.0.emb_layers.1.bias", "input_blocks.1.0.out_layers.0.weight", "input_blocks.1.0.out_layers.0.bias", "input_blocks.1.0.out_layers.3.weight", "input_blocks.1.0.out_layers.3.bias", "input_blocks.2.0.in_layers.0.weight", "input_blocks.2.0.in_layers.0.bias", "input_blocks.2.0.in_layers.2.weight", "input_blocks.2.0.in_layers.2.bias", "input_blocks.2.0.emb_layers.1.weight", "input_blocks.2.0.emb_layers.1.bias", "input_blocks.2.0.out_layers.0.weight", "input_blocks.2.0.out_layers.0.bias", "input_blocks.2.0.out_layers.3.weight", "input_blocks.2.0.out_layers.3.bias", "input_blocks.3.0.in_layers.0.weight", "input_blocks.3.0.in_layers.0.bias", "input_blocks.3.0.in_layers.2.weight", "input_blocks.3.0.in_layers.2.bias", "input_blocks.3.0.emb_layers.1.weight", "input_blocks.3.0.emb_layers.1.bias", "input_blocks.3.0.out_layers.0.weight", "input_blocks.3.0.out_layers.0.bias", "input_blocks.3.0.out_layers.3.weight", "input_blocks.3.0.out_layers.3.bias", "input_blocks.4.0.in_layers.0.weight", "input_blocks.4.0.in_layers.0.bias", "input_blocks.4.0.in_layers.2.weight", "input_blocks.4.0.in_layers.2.bias", "input_blocks.4.0.emb_layers.1.weight", "input_blocks.4.0.emb_layers.1.bias", "input_blocks.4.0.out_layers.0.weight", "input_blocks.4.0.out_layers.0.bias", "input_blocks.4.0.out_layers.3.weight", "input_blocks.4.0.out_layers.3.bias", "input_blocks.5.0.in_layers.0.weight", "input_blocks.5.0.in_layers.0.bias", "input_blocks.5.0.in_layers.2.weight", "input_blocks.5.0.in_layers.2.bias", "input_blocks.5.0.emb_layers.1.weight", "input_blocks.5.0.emb_layers.1.bias", "input_blocks.5.0.out_layers.0.weight", "input_blocks.5.0.out_layers.0.bias", "input_blocks.5.0.out_layers.3.weight", "input_blocks.5.0.out_layers.3.bias", "input_blocks.6.0.in_layers.0.weight", "input_blocks.6.0.in_layers.0.bias", "input_blocks.6.0.in_layers.2.weight", "input_blocks.6.0.in_layers.2.bias", "input_blocks.6.0.emb_layers.1.weight", "input_blocks.6.0.emb_layers.1.bias", "input_blocks.6.0.out_layers.0.weight", "input_blocks.6.0.out_layers.0.bias", "input_blocks.6.0.out_layers.3.weight", "input_blocks.6.0.out_layers.3.bias", "input_blocks.7.0.in_layers.0.weight", "input_blocks.7.0.in_layers.0.bias", "input_blocks.7.0.in_layers.2.weight", "input_blocks.7.0.in_layers.2.bias", "input_blocks.7.0.emb_layers.1.weight", "input_blocks.7.0.emb_layers.1.bias", "input_blocks.7.0.out_layers.0.weight", "input_blocks.7.0.out_layers.0.bias", "input_blocks.7.0.out_layers.3.weight", "input_blocks.7.0.out_layers.3.bias", "input_blocks.7.0.skip_connection.weight", "input_blocks.7.0.skip_connection.bias", "input_blocks.8.0.in_layers.0.weight", "input_blocks.8.0.in_layers.0.bias", "input_blocks.8.0.in_layers.2.weight", "input_blocks.8.0.in_layers.2.bias", "input_blocks.8.0.emb_layers.1.weight", "input_blocks.8.0.emb_layers.1.bias", "input_blocks.8.0.out_layers.0.weight", "input_blocks.8.0.out_layers.0.bias", "input_blocks.8.0.out_layers.3.weight", "input_blocks.8.0.out_layers.3.bias", "input_blocks.9.0.in_layers.0.weight", "input_blocks.9.0.in_layers.0.bias", "input_blocks.9.0.in_layers.2.weight", "input_blocks.9.0.in_layers.2.bias", "input_blocks.9.0.emb_layers.1.weight", "input_blocks.9.0.emb_layers.1.bias", "input_blocks.9.0.out_layers.0.weight", "input_blocks.9.0.out_layers.0.bias", "input_blocks.9.0.out_layers.3.weight", "input_blocks.9.0.out_layers.3.bias", "input_blocks.10.0.in_layers.0.weight", "input_blocks.10.0.in_layers.0.bias", "input_blocks.10.0.in_layers.2.weight", "input_blocks.10.0.in_layers.2.bias", "input_blocks.10.0.emb_layers.1.weight", "input_blocks.10.0.emb_layers.1.bias", "input_blocks.10.0.out_layers.0.weight", "input_blocks.10.0.out_layers.0.bias", "input_blocks.10.0.out_layers.3.weight", "input_blocks.10.0.out_layers.3.bias", "input_blocks.10.1.norm.weight", "input_blocks.10.1.norm.bias", "input_blocks.10.1.qkv.weight", "input_blocks.10.1.qkv.bias", "input_blocks.10.1.proj_out.weight", "input_blocks.10.1.proj_out.bias", "input_blocks.11.0.in_layers.0.weight", "input_blocks.11.0.in_layers.0.bias", "input_blocks.11.0.in_layers.2.weight", "input_blocks.11.0.in_layers.2.bias", "input_blocks.11.0.emb_layers.1.weight", "input_blocks.11.0.emb_layers.1.bias", "input_blocks.11.0.out_layers.0.weight", "input_blocks.11.0.out_layers.0.bias", "input_blocks.11.0.out_layers.3.weight", "input_blocks.11.0.out_layers.3.bias", "input_blocks.11.1.norm.weight", "input_blocks.11.1.norm.bias", "input_blocks.11.1.qkv.weight", "input_blocks.11.1.qkv.bias", "input_blocks.11.1.proj_out.weight", "input_blocks.11.1.proj_out.bias", "input_blocks.12.0.in_layers.0.weight", "input_blocks.12.0.in_layers.0.bias", "input_blocks.12.0.in_layers.2.weight", "input_blocks.12.0.in_layers.2.bias", "input_blocks.12.0.emb_layers.1.weight", "input_blocks.12.0.emb_layers.1.bias", "input_blocks.12.0.out_layers.0.weight", "input_blocks.12.0.out_layers.0.bias", "input_blocks.12.0.out_layers.3.weight", "input_blocks.12.0.out_layers.3.bias", "input_blocks.13.0.in_layers.0.weight", "input_blocks.13.0.in_layers.0.bias", "input_blocks.13.0.in_layers.2.weight", "input_blocks.13.0.in_layers.2.bias", "input_blocks.13.0.emb_layers.1.weight", "input_blocks.13.0.emb_layers.1.bias", "input_blocks.13.0.out_layers.0.weight", "input_blocks.13.0.out_layers.0.bias", "input_blocks.13.0.out_layers.3.weight", "input_blocks.13.0.out_layers.3.bias", "input_blocks.13.0.skip_connection.weight", "input_blocks.13.0.skip_connection.bias", "input_blocks.13.1.norm.weight", "input_blocks.13.1.norm.bias", "input_blocks.13.1.qkv.weight", "input_blocks.13.1.qkv.bias", "input_blocks.13.1.proj_out.weight", "input_blocks.13.1.proj_out.bias", "input_blocks.14.0.in_layers.0.weight", "input_blocks.14.0.in_layers.0.bias", "input_blocks.14.0.in_layers.2.weight", "input_blocks.14.0.in_layers.2.bias", "input_blocks.14.0.emb_layers.1.weight", "input_blocks.14.0.emb_layers.1.bias", "input_blocks.14.0.out_layers.0.weight", "input_blocks.14.0.out_layers.0.bias", "input_blocks.14.0.out_layers.3.weight", "input_blocks.14.0.out_layers.3.bias", "input_blocks.14.1.norm.weight", "input_blocks.14.1.norm.bias", "input_blocks.14.1.qkv.weight", "input_blocks.14.1.qkv.bias", "input_blocks.14.1.proj_out.weight", "input_blocks.14.1.proj_out.bias", "input_blocks.15.0.in_layers.0.weight", "input_blocks.15.0.in_layers.0.bias", "input_blocks.15.0.in_layers.2.weight", "input_blocks.15.0.in_layers.2.bias", "input_blocks.15.0.emb_layers.1.weight", "input_blocks.15.0.emb_layers.1.bias", "input_blocks.15.0.out_layers.0.weight", "input_blocks.15.0.out_layers.0.bias", "input_blocks.15.0.out_layers.3.weight", "input_blocks.15.0.out_layers.3.bias", "input_blocks.16.0.in_layers.0.weight", "input_blocks.16.0.in_layers.0.bias", "input_blocks.16.0.in_layers.2.weight", "input_blocks.16.0.in_layers.2.bias", "input_blocks.16.0.emb_layers.1.weight", "input_blocks.16.0.emb_layers.1.bias", "input_blocks.16.0.out_layers.0.weight", "input_blocks.16.0.out_layers.0.bias", "input_blocks.16.0.out_layers.3.weight", "input_blocks.16.0.out_layers.3.bias", "input_blocks.16.1.norm.weight", "input_blocks.16.1.norm.bias", "input_blocks.16.1.qkv.weight", "input_blocks.16.1.qkv.bias", "input_blocks.16.1.proj_out.weight", "input_blocks.16.1.proj_out.bias", "input_blocks.17.0.in_layers.0.weight", "input_blocks.17.0.in_layers.0.bias", "input_blocks.17.0.in_layers.2.weight", "input_blocks.17.0.in_layers.2.bias", "input_blocks.17.0.emb_layers.1.weight", "input_blocks.17.0.emb_layers.1.bias", "input_blocks.17.0.out_layers.0.weight", "input_blocks.17.0.out_layers.0.bias", "input_blocks.17.0.out_layers.3.weight", "input_blocks.17.0.out_layers.3.bias", "input_blocks.17.1.norm.weight", "input_blocks.17.1.norm.bias", "input_blocks.17.1.qkv.weight", "input_blocks.17.1.qkv.bias", "input_blocks.17.1.proj_out.weight", "input_blocks.17.1.proj_out.bias", "middle_block.0.in_layers.0.weight", "middle_block.0.in_layers.0.bias", "middle_block.0.in_layers.2.weight", "middle_block.0.in_layers.2.bias", "middle_block.0.emb_layers.1.weight", "middle_block.0.emb_layers.1.bias", "middle_block.0.out_layers.0.weight", "middle_block.0.out_layers.0.bias", "middle_block.0.out_layers.3.weight", "middle_block.0.out_layers.3.bias", "middle_block.1.norm.weight", "middle_block.1.norm.bias", "middle_block.1.qkv.weight", "middle_block.1.qkv.bias", "middle_block.1.proj_out.weight", "middle_block.1.proj_out.bias", "middle_block.2.in_layers.0.weight", "middle_block.2.in_layers.0.bias", "middle_block.2.in_layers.2.weight", "middle_block.2.in_layers.2.bias", "middle_block.2.emb_layers.1.weight", "middle_block.2.emb_layers.1.bias", "middle_block.2.out_layers.0.weight", "middle_block.2.out_layers.0.bias", "middle_block.2.out_layers.3.weight", "middle_block.2.out_layers.3.bias", "output_blocks.0.0.in_layers.0.weight", "output_blocks.0.0.in_layers.0.bias", "output_blocks.0.0.in_layers.2.weight", "output_blocks.0.0.in_layers.2.bias", "output_blocks.0.0.emb_layers.1.weight", "output_blocks.0.0.emb_layers.1.bias", "output_blocks.0.0.out_layers.0.weight", "output_blocks.0.0.out_layers.0.bias", "output_blocks.0.0.out_layers.3.weight", "output_blocks.0.0.out_layers.3.bias", "output_blocks.0.0.skip_connection.weight", "output_blocks.0.0.skip_connection.bias", "output_blocks.0.1.norm.weight", "output_blocks.0.1.norm.bias", "output_blocks.0.1.qkv.weight", "output_blocks.0.1.qkv.bias", "output_blocks.0.1.proj_out.weight", "output_blocks.0.1.proj_out.bias", "output_blocks.1.0.in_layers.0.weight", "output_blocks.1.0.in_layers.0.bias", "output_blocks.1.0.in_layers.2.weight", "output_blocks.1.0.in_layers.2.bias", "output_blocks.1.0.emb_layers.1.weight", "output_blocks.1.0.emb_layers.1.bias", "output_blocks.1.0.out_layers.0.weight", "output_blocks.1.0.out_layers.0.bias", "output_blocks.1.0.out_layers.3.weight", "output_blocks.1.0.out_layers.3.bias", "output_blocks.1.0.skip_connection.weight", "output_blocks.1.0.skip_connection.bias", "output_blocks.1.1.norm.weight", "output_blocks.1.1.norm.bias", "output_blocks.1.1.qkv.weight", "output_blocks.1.1.qkv.bias", "output_blocks.1.1.proj_out.weight", "output_blocks.1.1.proj_out.bias", "output_blocks.2.0.in_layers.0.weight", "output_blocks.2.0.in_layers.0.bias", "output_blocks.2.0.in_layers.2.weight", "output_blocks.2.0.in_layers.2.bias", "output_blocks.2.0.emb_layers.1.weight", "output_blocks.2.0.emb_layers.1.bias", "output_blocks.2.0.out_layers.0.weight", "output_blocks.2.0.out_layers.0.bias", "output_blocks.2.0.out_layers.3.weight", "output_blocks.2.0.out_layers.3.bias", "output_blocks.2.0.skip_connection.weight", "output_blocks.2.0.skip_connection.bias", "output_blocks.2.1.norm.weight", "output_blocks.2.1.norm.bias", "output_blocks.2.1.qkv.weight", "output_blocks.2.1.qkv.bias", "output_blocks.2.1.proj_out.weight", "output_blocks.2.1.proj_out.bias", "output_blocks.2.2.in_layers.0.weight", "output_blocks.2.2.in_layers.0.bias", "output_blocks.2.2.in_layers.2.weight", "output_blocks.2.2.in_layers.2.bias", "output_blocks.2.2.emb_layers.1.weight", "output_blocks.2.2.emb_layers.1.bias", "output_blocks.2.2.out_layers.0.weight", "output_blocks.2.2.out_layers.0.bias", "output_blocks.2.2.out_layers.3.weight", "output_blocks.2.2.out_layers.3.bias", "output_blocks.3.0.in_layers.0.weight", "output_blocks.3.0.in_layers.0.bias", "output_blocks.3.0.in_layers.2.weight", "output_blocks.3.0.in_layers.2.bias", "output_blocks.3.0.emb_layers.1.weight", "output_blocks.3.0.emb_layers.1.bias", "output_blocks.3.0.out_layers.0.weight", "output_blocks.3.0.out_layers.0.bias", "output_blocks.3.0.out_layers.3.weight", "output_blocks.3.0.out_layers.3.bias", "output_blocks.3.0.skip_connection.weight", "output_blocks.3.0.skip_connection.bias", "output_blocks.3.1.norm.weight", "output_blocks.3.1.norm.bias", "output_blocks.3.1.qkv.weight", "output_blocks.3.1.qkv.bias", "output_blocks.3.1.proj_out.weight", "output_blocks.3.1.proj_out.bias", "output_blocks.4.0.in_layers.0.weight", "output_blocks.4.0.in_layers.0.bias", "output_blocks.4.0.in_layers.2.weight", "output_blocks.4.0.in_layers.2.bias", "output_blocks.4.0.emb_layers.1.weight", "output_blocks.4.0.emb_layers.1.bias", "output_blocks.4.0.out_layers.0.weight", "output_blocks.4.0.out_layers.0.bias", "output_blocks.4.0.out_layers.3.weight", "output_blocks.4.0.out_layers.3.bias", "output_blocks.4.0.skip_connection.weight", "output_blocks.4.0.skip_connection.bias", "output_blocks.4.1.norm.weight", "output_blocks.4.1.norm.bias", "output_blocks.4.1.qkv.weight", "output_blocks.4.1.qkv.bias", "output_blocks.4.1.proj_out.weight", "output_blocks.4.1.proj_out.bias", "output_blocks.5.0.in_layers.0.weight", "output_blocks.5.0.in_layers.0.bias", "output_blocks.5.0.in_layers.2.weight", "output_blocks.5.0.in_layers.2.bias", "output_blocks.5.0.emb_layers.1.weight", "output_blocks.5.0.emb_layers.1.bias", "output_blocks.5.0.out_layers.0.weight", "output_blocks.5.0.out_layers.0.bias", "output_blocks.5.0.out_layers.3.weight", "output_blocks.5.0.out_layers.3.bias", "output_blocks.5.0.skip_connection.weight", "output_blocks.5.0.skip_connection.bias", "output_blocks.5.1.norm.weight", "output_blocks.5.1.norm.bias", "output_blocks.5.1.qkv.weight", "output_blocks.5.1.qkv.bias", "output_blocks.5.1.proj_out.weight", "output_blocks.5.1.proj_out.bias", "output_blocks.5.2.in_layers.0.weight", "output_blocks.5.2.in_layers.0.bias", "output_blocks.5.2.in_layers.2.weight", "output_blocks.5.2.in_layers.2.bias", "output_blocks.5.2.emb_layers.1.weight", "output_blocks.5.2.emb_layers.1.bias", "output_blocks.5.2.out_layers.0.weight", "output_blocks.5.2.out_layers.0.bias", "output_blocks.5.2.out_layers.3.weight", "output_blocks.5.2.out_layers.3.bias", "output_blocks.6.0.in_layers.0.weight", "output_blocks.6.0.in_layers.0.bias", "output_blocks.6.0.in_layers.2.weight", "output_blocks.6.0.in_layers.2.bias", "output_blocks.6.0.emb_layers.1.weight", "output_blocks.6.0.emb_layers.1.bias", "output_blocks.6.0.out_layers.0.weight", "output_blocks.6.0.out_layers.0.bias", "output_blocks.6.0.out_layers.3.weight", "output_blocks.6.0.out_layers.3.bias", "output_blocks.6.0.skip_connection.weight", "output_blocks.6.0.skip_connection.bias", "output_blocks.6.1.norm.weight", "output_blocks.6.1.norm.bias", "output_blocks.6.1.qkv.weight", "output_blocks.6.1.qkv.bias", "output_blocks.6.1.proj_out.weight", "output_blocks.6.1.proj_out.bias", "output_blocks.7.0.in_layers.0.weight", "output_blocks.7.0.in_layers.0.bias", "output_blocks.7.0.in_layers.2.weight", "output_blocks.7.0.in_layers.2.bias", "output_blocks.7.0.emb_layers.1.weight", "output_blocks.7.0.emb_layers.1.bias", "output_blocks.7.0.out_layers.0.weight", "output_blocks.7.0.out_layers.0.bias", "output_blocks.7.0.out_layers.3.weight", "output_blocks.7.0.out_layers.3.bias", "output_blocks.7.0.skip_connection.weight", "output_blocks.7.0.skip_connection.bias", "output_blocks.7.1.norm.weight", "output_blocks.7.1.norm.bias", "output_blocks.7.1.qkv.weight", "output_blocks.7.1.qkv.bias", "output_blocks.7.1.proj_out.weight", "output_blocks.7.1.proj_out.bias", "output_blocks.8.0.in_layers.0.weight", "output_blocks.8.0.in_layers.0.bias", "output_blocks.8.0.in_layers.2.weight", "output_blocks.8.0.in_layers.2.bias", "output_blocks.8.0.emb_layers.1.weight", "output_blocks.8.0.emb_layers.1.bias", "output_blocks.8.0.out_layers.0.weight", "output_blocks.8.0.out_layers.0.bias", "output_blocks.8.0.out_layers.3.weight", "output_blocks.8.0.out_layers.3.bias", "output_blocks.8.0.skip_connection.weight", "output_blocks.8.0.skip_connection.bias", "output_blocks.8.1.norm.weight", "output_blocks.8.1.norm.bias", "output_blocks.8.1.qkv.weight", "output_blocks.8.1.qkv.bias", "output_blocks.8.1.proj_out.weight", "output_blocks.8.1.proj_out.bias", "output_blocks.8.2.in_layers.0.weight", "output_blocks.8.2.in_layers.0.bias", "output_blocks.8.2.in_layers.2.weight", "output_blocks.8.2.in_layers.2.bias", "output_blocks.8.2.emb_layers.1.weight", "output_blocks.8.2.emb_layers.1.bias", "output_blocks.8.2.out_layers.0.weight", "output_blocks.8.2.out_layers.0.bias", "output_blocks.8.2.out_layers.3.weight", "output_blocks.8.2.out_layers.3.bias", "output_blocks.9.0.in_layers.0.weight", "output_blocks.9.0.in_layers.0.bias", "output_blocks.9.0.in_layers.2.weight", "output_blocks.9.0.in_layers.2.bias", "output_blocks.9.0.emb_layers.1.weight", "output_blocks.9.0.emb_layers.1.bias", "output_blocks.9.0.out_layers.0.weight", "output_blocks.9.0.out_layers.0.bias", "output_blocks.9.0.out_layers.3.weight", "output_blocks.9.0.out_layers.3.bias", "output_blocks.9.0.skip_connection.weight", "output_blocks.9.0.skip_connection.bias", "output_blocks.10.0.in_layers.0.weight", "output_blocks.10.0.in_layers.0.bias", "output_blocks.10.0.in_layers.2.weight", "output_blocks.10.0.in_layers.2.bias", "output_blocks.10.0.emb_layers.1.weight", "output_blocks.10.0.emb_layers.1.bias", "output_blocks.10.0.out_layers.0.weight", "output_blocks.10.0.out_layers.0.bias", "output_blocks.10.0.out_layers.3.weight", "output_blocks.10.0.out_layers.3.bias", "output_blocks.10.0.skip_connection.weight", "output_blocks.10.0.skip_connection.bias", "output_blocks.11.0.in_layers.0.weight", "output_blocks.11.0.in_layers.0.bias", "output_blocks.11.0.in_layers.2.weight", "output_blocks.11.0.in_layers.2.bias", "output_blocks.11.0.emb_layers.1.weight", "output_blocks.11.0.emb_layers.1.bias", "output_blocks.11.0.out_layers.0.weight", "output_blocks.11.0.out_layers.0.bias", "output_blocks.11.0.out_layers.3.weight", "output_blocks.11.0.out_layers.3.bias", "output_blocks.11.0.skip_connection.weight", "output_blocks.11.0.skip_connection.bias", "output_blocks.11.1.in_layers.0.weight", "output_blocks.11.1.in_layers.0.bias", "output_blocks.11.1.in_layers.2.weight", "output_blocks.11.1.in_layers.2.bias", "output_blocks.11.1.emb_layers.1.weight", "output_blocks.11.1.emb_layers.1.bias", "output_blocks.11.1.out_layers.0.weight", "output_blocks.11.1.out_layers.0.bias", "output_blocks.11.1.out_layers.3.weight", "output_blocks.11.1.out_layers.3.bias", "output_blocks.12.0.in_layers.0.weight", "output_blocks.12.0.in_layers.0.bias", "output_blocks.12.0.in_layers.2.weight", "output_blocks.12.0.in_layers.2.bias", "output_blocks.12.0.emb_layers.1.weight", "output_blocks.12.0.emb_layers.1.bias", "output_blocks.12.0.out_layers.0.weight", "output_blocks.12.0.out_layers.0.bias", "output_blocks.12.0.out_layers.3.weight", "output_blocks.12.0.out_layers.3.bias", "output_blocks.12.0.skip_connection.weight", "output_blocks.12.0.skip_connection.bias", "output_blocks.13.0.in_layers.0.weight", "output_blocks.13.0.in_layers.0.bias", "output_blocks.13.0.in_layers.2.weight", "output_blocks.13.0.in_layers.2.bias", "output_blocks.13.0.emb_layers.1.weight", "output_blocks.13.0.emb_layers.1.bias", "output_blocks.13.0.out_layers.0.weight", "output_blocks.13.0.out_layers.0.bias", "output_blocks.13.0.out_layers.3.weight", "output_blocks.13.0.out_layers.3.bias", "output_blocks.13.0.skip_connection.weight", "output_blocks.13.0.skip_connection.bias", "output_blocks.14.0.in_layers.0.weight", "output_blocks.14.0.in_layers.0.bias", "output_blocks.14.0.in_layers.2.weight", "output_blocks.14.0.in_layers.2.bias", "output_blocks.14.0.emb_layers.1.weight", "output_blocks.14.0.emb_layers.1.bias", "output_blocks.14.0.out_layers.0.weight", "output_blocks.14.0.out_layers.0.bias", "output_blocks.14.0.out_layers.3.weight", "output_blocks.14.0.out_layers.3.bias", "output_blocks.14.0.skip_connection.weight", "output_blocks.14.0.skip_connection.bias", "output_blocks.14.1.in_layers.0.weight", "output_blocks.14.1.in_layers.0.bias", "output_blocks.14.1.in_layers.2.weight", "output_blocks.14.1.in_layers.2.bias", "output_blocks.14.1.emb_layers.1.weight", "output_blocks.14.1.emb_layers.1.bias", "output_blocks.14.1.out_layers.0.weight", "output_blocks.14.1.out_layers.0.bias", "output_blocks.14.1.out_layers.3.weight", "output_blocks.14.1.out_layers.3.bias", "output_blocks.15.0.in_layers.0.weight", "output_blocks.15.0.in_layers.0.bias", "output_blocks.15.0.in_layers.2.weight", "output_blocks.15.0.in_layers.2.bias", "output_blocks.15.0.emb_layers.1.weight", "output_blocks.15.0.emb_layers.1.bias", "output_blocks.15.0.out_layers.0.weight", "output_blocks.15.0.out_layers.0.bias", "output_blocks.15.0.out_layers.3.weight", "output_blocks.15.0.out_layers.3.bias", "output_blocks.15.0.skip_connection.weight", "output_blocks.15.0.skip_connection.bias", "output_blocks.16.0.in_layers.0.weight", "output_blocks.16.0.in_layers.0.bias", "output_blocks.16.0.in_layers.2.weight", "output_blocks.16.0.in_layers.2.bias", "output_blocks.16.0.emb_layers.1.weight", "output_blocks.16.0.emb_layers.1.bias", "output_blocks.16.0.out_layers.0.weight", "output_blocks.16.0.out_layers.0.bias", "output_blocks.16.0.out_layers.3.weight", "output_blocks.16.0.out_layers.3.bias", "output_blocks.16.0.skip_connection.weight", "output_blocks.16.0.skip_connection.bias", "output_blocks.17.0.in_layers.0.weight", "output_blocks.17.0.in_layers.0.bias", "output_blocks.17.0.in_layers.2.weight", "output_blocks.17.0.in_layers.2.bias", "output_blocks.17.0.emb_layers.1.weight", "output_blocks.17.0.emb_layers.1.bias", "output_blocks.17.0.out_layers.0.weight", "output_blocks.17.0.out_layers.0.bias", "output_blocks.17.0.out_layers.3.weight", "output_blocks.17.0.out_layers.3.bias", "output_blocks.17.0.skip_connection.weight", "output_blocks.17.0.skip_connection.bias", "out.0.weight", "out.0.bias", "out.2.weight", "out.2.bias".

can you help me?
Thanks!

@psen2022
Copy link

Have you solved this problem?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants