pytorch/serve

Enabling HTTP caching of inference results

Open

#1,527 opened on 2022年3月23日

GitHub で見る
 (2 comments) (4 reactions) (1 assignee)Java (3,844 stars) (790 forks)batch import
enhancementhelp wanted

説明

Is your feature request related to a problem? Please describe.

Currently, TorchServe adds headers that prevent from caching the inference results: https://github.com/pytorch/serve/blob/30f83500b0850e26ec55581f48a9307b1986f9f9/frontend/server/src/main/java/org/pytorch/serve/util/NettyUtils.java#L187-L190

This prevents some reverse-proxies like nginx from caching the results

Describe the solution

It would be great to have a way to override this behavior, either from the model handler, or through some configuration key.

Describe alternatives solution

N/A

コントリビューターガイド