pytorch/serve

Enabling HTTP caching of inference results

Open

#1527 opened on Mar 23, 2022

enhancement, help wanted

Description

Is your feature request related to a problem? Please describe.

Currently, TorchServe adds response headers that prevent inference results from being cached: https://github.com/pytorch/serve/blob/30f83500b0850e26ec55581f48a9307b1986f9f9/frontend/server/src/main/java/org/pytorch/serve/util/NettyUtils.java#L187-L190

As a result, reverse proxies such as nginx will not cache the results.
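
To illustrate why a proxy skips such responses, here is a minimal sketch of the kind of check a shared cache applies to `Cache-Control`. The class and method names are hypothetical, and the rule is deliberately simplified: it treats `no-cache` as not storable, which is my understanding of how nginx's proxy cache behaves by default, rather than following the full HTTP caching specification.

```java
import java.util.Arrays;
import java.util.Locale;

// Hypothetical helper, not part of TorchServe or nginx.
final class SharedCacheCheck {
    private SharedCacheCheck() {}

    // Returns true when no directive in the header forbids a shared cache
    // from storing the response (simplified rule, see the note above).
    static boolean mayStore(String cacheControl) {
        if (cacheControl == null || cacheControl.trim().isEmpty()) {
            return true; // nothing in the header forbids storing
        }
        return Arrays.stream(cacheControl.toLowerCase(Locale.ROOT).split(","))
                .map(String::trim)
                // Drop any "=value" part, e.g. "max-age=60" becomes "max-age".
                .map(directive -> directive.split("=", 2)[0].trim())
                .noneMatch(directive -> directive.equals("no-store")
                        || directive.equals("private")
                        || directive.equals("no-cache"));
    }
}
```

With a header value such as `no-cache, no-store, must-revalidate, private` the check returns false, while `public, max-age=60` passes.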

Describe the solution

It would be great to have a way to override this behavior, either from the model handler or through a configuration key.
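
One possible shape for the configuration-key option, as a rough sketch only: the key name `response_cache_control` and the `CacheHeaders` class are hypothetical and do not exist in TorchServe, and the default header values below are stand-ins for whatever the linked `NettyUtils` lines actually set.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Properties;

// Hypothetical sketch; not TorchServe's actual implementation.
final class CacheHeaders {
    // Assumed configuration key, invented for this example.
    static final String CACHE_CONTROL_KEY = "response_cache_control";

    private CacheHeaders() {}

    // Builds the cache-related response headers from the server configuration.
    static Map<String, String> forResponse(Properties config) {
        Map<String, String> headers = new LinkedHashMap<>();
        String override = config.getProperty(CACHE_CONTROL_KEY);
        if (override == null || override.trim().isEmpty()) {
            // Key not set: keep responses uncacheable (stand-in default values).
            headers.put("Pragma", "no-cache");
            headers.put("Cache-Control", "no-cache, no-store, must-revalidate, private");
        } else {
            // Key set: emit the operator-supplied Cache-Control value instead.
            headers.put("Cache-Control", override.trim());
        }
        return headers;
    }
}
```

Setting `response_cache_control=public, max-age=60` in this sketch would replace the no-cache headers, while leaving the key unset preserves the current behavior. A per-model override from the handler would need a separate mechanism that this sketch does not cover.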

Describe alternative solutions

N/A
