Unable to run optimized BERT: model shape expected by framework [-1,-1] doesn't match model configuration shape [-1,-1]
Description
When deploying the optimized model, I get this error:
INVALID_ARGUMENT: getPluginCreator could not find plugin CustomEmbLayerNormPluginDynamic version 1
safeDeserializationUtils.cpp (259) - Serialization Error in load: 0 (Cannot deserialize plugin since corresponding IPluginCreator not found in Plugin Registry)
I1024 17:29:43.767217 538 model_config_utils.cc:203] autofilled config: name: "bertoptimized_plan"
platform: "tensorrt_plan"
max_batch_size: 1
input {
  name: "input_ids"
  data_type: TYPE_INT32
  dims: -1
  dims: -1
}
input {
  name: "segment_ids"
  data_type: TYPE_INT32
  dims: -1
  dims: -1
}
input {
  name: "input_mask"
  data_type: TYPE_INT32
  dims: -1
  dims: -1
}
output {
  name: "cls_squad_logits"
  data_type: TYPE_FP32
  dims: -1
  dims: -1
  dims: 2
  dims: 1
  dims: 1
}
default_model_filename: "model.plan"
** TRTIS Information **
What version of TRTIS are you using? nvcr.io/nvidia/tensorrtserver:19.09-py3
Are you using the TRTIS container or did you build it yourself? Used the container; did not build.
To Reproduce
Steps to reproduce the behavior:
1. Follow https://github.com/NVIDIA/TensorRT/tree/master/demo/BERT#example-workflow using the container nvcr.io/nvidia/tensorflow:19.09-py3
2. Save the model like: build/sample_bert -d $BERT_PATH -d $OUTPUT_PATH --fp16 --nheads 12 --saveEngine model.plan
3. Add the plan to the model repository on TRTIS
Expected behavior
The optimized model should load and accept inference requests.
Top GitHub Comments
The output of the build command is libbert_plugins.so and libcommon.so, in TensorRT/demo/BERT/build. I run the server with:
LD_PRELOAD=/opt/tensorrtserver/libbert_plugins.so:/opt/tensorrtserver/libcommon.so trtserver --model-repository=/models --strict-model-config=false
I just copied those files from TensorRT/demo/BERT/build into /opt/tensorrtserver.
This is a known issue. The tensorrtserver 19.09 release does not support the autofiller for dynamic-shaped inputs. TensorRT 6 dynamic-shaped plans need an explicit batch dimension in their bindings, whereas TRTIS does not need an explicit batch dimension in config.pbtxt, because it is added implicitly.
So, provide your own config.pbtxt as:
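A minimal sketch, assuming the leading -1 of each binding is the batch dimension that TRTIS adds implicitly, so it is dropped from dims and covered by max_batch_size:

name: "bertoptimized_plan"
platform: "tensorrt_plan"
max_batch_size: 1
input {
  name: "input_ids"
  data_type: TYPE_INT32
  dims: -1
}
input {
  name: "segment_ids"
  data_type: TYPE_INT32
  dims: -1
}
input {
  name: "input_mask"
  data_type: TYPE_INT32
  dims: -1
}
output {
  name: "cls_squad_logits"
  data_type: TYPE_FP32
  dims: -1
  dims: 2
  dims: 1
  dims: 1
}
default_model_filename: "model.plan"

Each input is then [-1] plus the implicit batch dimension, which matches the [-1, -1] binding shape of the plan.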
And run the server as:
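Presumably the same invocation as in the comment above, still preloading the plugin libraries so the CustomEmbLayerNormPluginDynamic creator is registered; with an explicit config.pbtxt, the --strict-model-config=false flag is no longer needed:

LD_PRELOAD=/opt/tensorrtserver/libbert_plugins.so:/opt/tensorrtserver/libcommon.so trtserver --model-repository=/models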
You can also wait for the 19.10 release, which has autofiller support for dynamic-shaped plans, or build your own server from the r19.10 branch.