Skip to content

Commit

Permalink
refactor
Browse files Browse the repository at this point in the history
  • Loading branch information
Stanislas0 committed Feb 21, 2023
1 parent 14493cb commit d65d72a
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion codegeex/tokenizer/tokenizer.py
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,7 @@
from transformers.models.gpt2 import GPT2TokenizerFast


def encode_whitespaces(text, start_extra_id: int, max_len: int):
def encode_whitespaces(text: str, start_extra_id: int, max_len: int):
""" Encode whitespaces to extra tokens in GPT-J.
>>> encode_whitespaces('a\\n b\\n c', 10, 10)
Expand Down

0 comments on commit d65d72a

Please sign in to comment.