加载chatglm不成功 #12

tang576225574 · 2024-02-03T09:36:03Z

会报错：AttributeError: ‘ChatGLMTokenizer‘ object has no attribute ‘sp_tokenizer
请问如何解决

Fulin-Guo · 2024-04-02T09:07:15Z

把tokenization_chatglm.py中

self.sp_tokenizer = SPTokenizer(vocab_file, num_image_tokens=num_image_tokens)

这一行代码移动到super().__init__函数前面，如下：

class ChatGLMTokenizer(PreTrainedTokenizer):
    """
    Construct a ChatGLM tokenizer. Based on byte-level Byte-Pair-Encoding.

    Args:
        vocab_file (`str`):
            Path to the vocabulary file.
    """

    vocab_files_names = {"vocab_file": "ice_text.model"}
    max_model_input_sizes = PRETRAINED_POSITIONAL_EMBEDDINGS_SIZES
    model_input_names = ["input_ids", "attention_mask", "position_ids"]

    def __init__(
            self,
            vocab_file,
            do_lower_case=False,
            remove_space=False,
            bos_token='<sop>',
            eos_token='<eop>',
            end_token='</s>',
            mask_token='[MASK]',
            gmask_token='[gMASK]',
            padding_side="left",
            pad_token="<pad>",
            unk_token="<unk>",
            num_image_tokens=20000,
            **kwargs
    ) -> None:
        self.sp_tokenizer = SPTokenizer(vocab_file, num_image_tokens=num_image_tokens)
        
        super().__init__(
            do_lower_case=do_lower_case,
            remove_space=remove_space,
            padding_side=padding_side,
            bos_token=bos_token,
            eos_token=eos_token,
            end_token=end_token,
            mask_token=mask_token,
            gmask_token=gmask_token,
            pad_token=pad_token,
            unk_token=unk_token,
            num_image_tokens=num_image_tokens,
            **kwargs
        )

        """ Initialisation """

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

加载chatglm不成功 #12

加载chatglm不成功 #12

tang576225574 commented Feb 3, 2024

Fulin-Guo commented Apr 2, 2024

加载chatglm不成功 #12

加载chatglm不成功 #12

Comments

tang576225574 commented Feb 3, 2024

Fulin-Guo commented Apr 2, 2024