pytorch/serve

Enabling HTTP caching of inference results

Open

Opened on 23 Mar 2022

enhancement, help wanted

Description

Is your feature request related to a problem? Please describe.

Currently, TorchServe adds response headers that prevent inference results from being cached: https://github.com/pytorch/serve/blob/30f83500b0850e26ec55581f48a9307b1986f9f9/frontend/server/src/main/java/org/pytorch/serve/util/NettyUtils.java#L187-L190

As a result, reverse proxies such as nginx will not cache the results.

Describe the solution

It would be great to have a way to override this behavior, either from the model handler or through a configuration key.
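To make the request concrete, here is a minimal sketch of the precedence I have in mind: a value set by the handler wins, then a server-wide configuration value, and otherwise the current no-cache behavior is kept. Everything here is hypothetical: `resolve_cache_headers`, its parameters, and the `NO_CACHE_HEADERS` values are illustrative only and are not existing TorchServe APIs or the exact headers TorchServe sends.

```python
from typing import Dict, Optional

# Illustrative no-cache defaults (assumption). The headers TorchServe
# actually sets are in the NettyUtils.java lines linked above.
NO_CACHE_HEADERS: Dict[str, str] = {
    "Pragma": "no-cache",
    "Cache-Control": "no-store",
}


def resolve_cache_headers(
    handler_cache_control: Optional[str] = None,
    config_cache_control: Optional[str] = None,
) -> Dict[str, str]:
    """Choose caching headers for a response (hypothetical helper).

    Precedence: handler-provided value, then configuration value,
    then the no-cache defaults.
    """
    chosen = handler_cache_control or config_cache_control
    if not chosen:
        # Nothing was overridden: keep the existing no-cache behavior.
        return dict(NO_CACHE_HEADERS)
    return {"Cache-Control": chosen}


if __name__ == "__main__":
    # A configuration value alone enables caching for every response.
    print(resolve_cache_headers(config_cache_control="public, max-age=60"))
    # A handler value takes priority over the configuration value.
    print(resolve_cache_headers("max-age=5", "public, max-age=60"))
```

With something like this, a reverse proxy in front of TorchServe could cache responses whenever a `Cache-Control` value is supplied, while deployments that set nothing would behave as they do today.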

Describe alternative solutions

N/A
