Hello, friends! Please tell me what I’m doing wrong. I am trying to reproduce the quantization results for any of the models, but the result is 4 times slower.
I follow exactly the same steps as specified in the repository's quantization instructions.
I could not find instructions for running a quantized model, so I run it the usual way.
There are no errors when quantizing or running the model.
Please help me :)