Repeated requests to a memory-heavy endpoint before the cache is populated

I have an endpoint that consumes a lot of memory, so I cache its result to reduce the memory overhead. The problem is that if a user requests it many times before the cached result has been produced, every one of those requests runs the expensive computation, memory usage spikes, and the OOM killer is triggered. How can I handle this?

I tried a rate limiter such as flask-limiter, but it only rejects the repeated requests; they are not served from the cached result once it is ready. Any help would be appreciated.
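To make the goal concrete, here is a minimal sketch of the behaviour I am after: concurrent requests for the same key should wait for a single computation and then share its result, instead of each starting their own. The names `get_or_compute`, `_cache`, and `_locks` are hypothetical and not part of Flask or flask-limiter, and the sketch assumes a single process serving requests with threads; with several worker processes the lock and the cache would have to live in something shared.

```python
import threading

# Hypothetical in-process cache and per-key locks (illustration only).
_cache = {}
_locks = {}
_locks_guard = threading.Lock()


def get_or_compute(key, compute):
    """Return the cached value for key, running compute() at most once per key."""
    # Fast path: the result is already cached.
    if key in _cache:
        return _cache[key]

    # Get (or create) the lock that belongs to this key.
    with _locks_guard:
        lock = _locks.setdefault(key, threading.Lock())

    # Only one thread computes; the others block here until it finishes.
    with lock:
        # Re-check: another thread may have filled the cache while we waited.
        if key not in _cache:
            _cache[key] = compute()
        return _cache[key]
```

In a view function this would be called as something like `get_or_compute("report", build_report)`, where `build_report` stands in for the expensive work. Waiting requests would still hold a connection open while the first one computes, but only one copy of the expensive computation would be in memory at a time.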

