
transformers-keras


Transformer-based models implemented in TensorFlow 2.x (Keras).

中文文档 (Chinese documentation) | English

Contents

  • Installation
  • Models
      • Transformer
      • BERT
      • ALBERT

Installation

pip install -U transformers-keras

Models

Transformer

Train a new transformer:

from transformers_keras import TransformerTextFileDatasetBuilder
from transformers_keras import TransformerDefaultTokenizer
from transformers_keras import TransformerRunner


# tokenizers for the source and target vocabularies
src_tokenizer = TransformerDefaultTokenizer(vocab_file='testdata/vocab_src.txt')
tgt_tokenizer = TransformerDefaultTokenizer(vocab_file='testdata/vocab_tgt.txt')
dataset_builder = TransformerTextFileDatasetBuilder(src_tokenizer, tgt_tokenizer)

model_config = {
    'num_encoder_layers': 2,
    'num_decoder_layers': 2,
    'src_vocab_size': src_tokenizer.vocab_size,
    'tgt_vocab_size': tgt_tokenizer.vocab_size,
}

runner = TransformerRunner(model_config, dataset_builder, model_dir='/tmp/transformer')

# each entry is a (source_file, target_file) pair
train_files = [('testdata/train.src.txt', 'testdata/train.tgt.txt')]
runner.train(train_files, epochs=10, callbacks=None)
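
The training files above are plain text. As a rough sketch (assumption: the dataset builder expects line-aligned parallel corpora, one sentence per line; check TransformerTextFileDatasetBuilder for the exact format), you could create toy files like this:

import os

os.makedirs('testdata', exist_ok=True)

# write a tiny line-aligned parallel corpus (toy data, for illustration only)
src_sentences = ['你 好 吗', '谢 谢 你']
tgt_sentences = ['how are you', 'thank you']

with open('testdata/train.src.txt', 'w', encoding='utf-8') as f:
    f.write('\n'.join(src_sentences))
with open('testdata/train.tgt.txt', 'w', encoding='utf-8') as f:
    f.write('\n'.join(tgt_sentences))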

BERT

You can use BERT models in two ways:

Train a new BERT model

Use your own data to pretrain a BERT model.

from transformers_keras import BertForPretrainingModel

model_config = {
    'max_positions': 128,   # maximum sequence length
    'num_layers': 6,        # number of transformer encoder layers
    'vocab_size': 21128,    # vocabulary size (here: the Chinese BERT vocab)
}

model = BertForPretrainingModel(**model_config)

Load a pretrained BERT model

from transformers_keras import BertForPretrainingModel

# download the pretrained model and extract it to some path
PRETRAINED_BERT_MODEL = '/path/to/chinese_L-12_H-768_A-12'

model = BertForPretrainingModel.from_pretrained(PRETRAINED_BERT_MODEL)
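
from_pretrained expects the path of the extracted checkpoint directory. For Google's official BERT releases such as chinese_L-12_H-768_A-12, the extracted directory typically looks like this:

chinese_L-12_H-768_A-12/
├── bert_config.json
├── bert_model.ckpt.data-00000-of-00001
├── bert_model.ckpt.index
├── bert_model.ckpt.meta
└── vocab.txt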

After building the model, you can train the model with your own data.

Here is an example:

import tensorflow as tf

from transformers_keras import BertTFRecordDatasetBuilder

builder = BertTFRecordDatasetBuilder(max_sequence_length=128, record_option='GZIP')

loss = tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True)
metric = tf.keras.metrics.SparseCategoricalAccuracy(name='acc')
model.compile(optimizer='adam', loss=loss, metrics=[metric])
# run a forward pass on dummy inputs to build the model, so summary() shows shapes
model(model.dummy_inputs())
model.summary()

train_files = ['testdata/bert_custom_pretrain.tfrecord']
train_dataset = builder.build_train_dataset(train_files, batch_size=32)
model.fit(train_dataset, epochs=2)
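
Since the model supports the standard Keras training loop, saving weights with the usual Keras call should work once training finishes (a minimal sketch; the output path is arbitrary):

# save the pretrained weights; any writable path works
model.save_weights('/tmp/bert/pretrained_weights')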

ALBERT

You can use ALBERT models in two ways:

Train a new ALBERT model

You should convert your data to the TFRecord format. Modify the script transformers_keras/utils/bert_tfrecord_custom_generator.py as you need; a minimal sketch of writing such records follows.
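
This sketch is not the project's exact format: it writes a GZIP-compressed TFRecord file with tf.train.Example. The feature keys (input_ids, segment_ids, input_mask) are assumptions modeled on standard BERT pretraining data; check the script above for the keys this project actually expects.

import tensorflow as tf

def int64_feature(values):
    # wrap a list of ints as a tf.train.Feature
    return tf.train.Feature(int64_list=tf.train.Int64List(value=list(values)))

# hypothetical feature keys, following the usual BERT pretraining layout
features = {
    'input_ids': int64_feature([101, 2769, 102]),
    'segment_ids': int64_feature([0, 0, 0]),
    'input_mask': int64_feature([1, 1, 1]),
}
example = tf.train.Example(features=tf.train.Features(feature=features))

options = tf.io.TFRecordOptions(compression_type='GZIP')
with tf.io.TFRecordWriter('testdata/bert_custom_pretrain.tfrecord', options) as writer:
    writer.write(example.SerializeToString())

Then build the model: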

from transformers_keras import AlbertForPretrainingModel
from transformers_keras import BertTFRecordDatasetBuilder

# ALBERT uses the same data format as BERT
dataset_builder = BertTFRecordDatasetBuilder(
    max_sequence_length=128, record_option='GZIP', train_repeat_count=100, eos_token='T')

model_config = {
    'max_positions': 128,
    'num_layers': 6,
    'num_groups': 1,             # layer groups for cross-layer parameter sharing
    'num_layers_each_group': 1,  # distinct layers inside each group
    'vocab_size': 21128,
}

model = AlbertForPretrainingModel(**model_config)
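
With num_groups=1 and num_layers_each_group=1, ALBERT's cross-layer parameter sharing means the 6 layers reuse a single set of transformer weights rather than 6 separate ones (assuming these keys map to ALBERT's groups and inner layers as in the original paper). A quick arithmetic check, in plain Python rather than the library's API:

num_layers = 6
num_groups = 1
num_layers_each_group = 1

# distinct transformer-layer parameter sets actually stored
unique_layer_params = num_groups * num_layers_each_group
print(unique_layer_params)  # 1: all 6 layers share one set of weights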

Load a pretrained ALBERT model

from transformers_keras import AlbertForPretrainingModel

# download the pretrained model and extract it to some path
PRETRAINED_ALBERT_MODEL = '/path/to/zh_albert_large'

model = AlbertForPretrainingModel.from_pretrained(PRETRAINED_ALBERT_MODEL)

After building the model, you can train this model with your own data.

Here is an example:

import tensorflow as tf

from transformers_keras import BertTFRecordDatasetBuilder

builder = BertTFRecordDatasetBuilder(max_sequence_length=128, record_option='GZIP')

loss = tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True)
metric = tf.keras.metrics.SparseCategoricalAccuracy(name='acc')
model.compile(optimizer='adam', loss=loss, metrics=[metric])
model(model.dummy_inputs())
model.summary()

train_files = ['testdata/bert_custom_pretrain.tfrecord']
train_dataset = builder.build_train_dataset(train_files, batch_size=32)
model.fit(train_dataset, epochs=2)
