Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[examples/bert/build.py]: Load weights for BertModel and RobertaModel if --model_dir is provided #2187

Open
wants to merge 2 commits into
base: main
Choose a base branch
from

Conversation

tkhanipov
Copy link

Currently the example TensorRT LLM engine builder for Bert models simply ignores model weights if those are present in the model directory, it only reads the config.json file, making it essentially impossible to generate a working engine from a pretrained model. This change fixes that.

@tkhanipov
Copy link
Author

Created a bug: #2197

@lfr-0531
Copy link
Collaborator

lfr-0531 commented Sep 8, 2024

@symphonylyh Could you please take a look at this PR? Thanks~

@lfr-0531 lfr-0531 added the triaged Issue has been triaged by maintainers label Sep 8, 2024
@symphonylyh
Copy link
Collaborator

Hi @tkhanipov , thanks for the PR! We're currently doing a refinement of the BERT workflow, will address this problem and merge your PR 👍

@tkhanipov
Copy link
Author

Hi @symphonylyh! Thank you for the response.
Just curious: isn't this refinement to the BERT workflow you are talking about related to supporting BERT in Executor API? So far, AFAIU (please correct me if I am wrong), encoder only models are not supported there.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
triaged Issue has been triaged by maintainers
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants