[Bugfix] GptJ & GptNeoX batch inference error by YZP17121579 · Pull Request #742 · NVIDIA/FasterTransformer

YZP17121579 · 2023-08-11T09:18:53Z

GptJ & GptNeoX may generate random outputs when using batch inference mode and no prefix prompt.
The problem is caused by the nullptr check
in

Line 1064 in f8e42aa

if (tiled_prefix_prompt_lengths != nullptr) {

…inference

BasicCoder · 2023-08-12T01:48:54Z

I think this is a duplicate solution of #716 which is more elegant and efficient.

[Bugfix] GptJ & GptNeoX may generate random outputs when using batch …

43e0313

…inference

RobotGF mentioned this pull request Sep 8, 2023

LLaMA support #506

Open

Provide feedback