pytorch/serve

Enabling HTTP caching of inference results

Open

#1,527 created on March 23, 2022

enhancement, help wanted

Description

Is your feature request related to a problem? Please describe.

Currently, TorchServe adds response headers that prevent inference results from being cached: https://github.com/pytorch/serve/blob/30f83500b0850e26ec55581f48a9307b1986f9f9/frontend/server/src/main/java/org/pytorch/serve/util/NettyUtils.java#L187-L190

As a result, reverse proxies such as nginx will not cache the results.

Describe the solution

It would be great to have a way to override this behavior, either from the model handler or through a configuration key.
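
To make the request concrete, here is a minimal sketch of the precedence I have in mind. It is not existing TorchServe code: `resolve_cache_headers`, the `inference_cache_control` key, and the default header values are all hypothetical names and assumptions for illustration, and the actual values set in `NettyUtils.java` may differ.

```python
from typing import Dict, Mapping, Optional

# Assumed stand-in for the no-cache headers the frontend sets today.
# The exact values in NettyUtils.java may differ.
DEFAULT_NO_CACHE_HEADERS: Dict[str, str] = {
    "Pragma": "no-cache",
    "Cache-Control": "no-cache, no-store, must-revalidate, private",
    "Expires": "Thu, 01 Jan 1970 00:00:00 GMT",
}


def resolve_cache_headers(
    config: Mapping[str, str],
    handler_headers: Optional[Mapping[str, str]] = None,
) -> Dict[str, str]:
    """Pick the caching headers for an inference response.

    Precedence (hypothetical): handler override, then config key,
    then the current no-cache defaults.
    """
    # 1. A Cache-Control header set by the model handler wins.
    #    HTTP header names are case-insensitive, so compare lowercased.
    if handler_headers:
        for name, value in handler_headers.items():
            if name.lower() == "cache-control":
                return {"Cache-Control": value}

    # 2. Otherwise use the (hypothetical) configuration key, if set.
    configured = config.get("inference_cache_control")
    if configured:
        return {"Cache-Control": configured}

    # 3. Fall back to today's behavior: responses are not cacheable.
    return dict(DEFAULT_NO_CACHE_HEADERS)
```

With something like this, setting `inference_cache_control` to `public, max-age=60` would let a reverse proxy cache results for a minute, while deployments that set nothing would keep the current behavior.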

Describe alternative solutions

N/A
