Replies: 4 comments 8 replies
-
We see similar logs when a memcached cluster becomes overwhelmed and generally respond by scaling up the cluster. Sizing and operating memcached on large clusters is still something we're working out so I'd love to compare notes. One thing that can help is separating your cache configs. I notice that your logs indicate that you're using one cache cluster and all roles are configured together (
becomes
I'll also point out that we set We also set connections on our memcached servers: @electron0zero has a bit more experience here and my have insights. |
Beta Was this translation helpful? Give feedback.
-
@edgarkz as @joe-elliott said, we are also seeing some of these errors and figuring out the sizing things. here are few general heuristics I use to know when to scale a cache:
if you try these, let me know how it goes :) |
Beta Was this translation helpful? Give feedback.
-
Hey @electron0zero @joe-elliott, All the connection related issues are gone away and we have in place monitoring for scaling now. "caches available to storage backend" footer=true bloom=true offset_idx=false column_idx=false trace_id_idx=false page=false |
Beta Was this translation helpful? Give feedback.
-
if anyone looks for redis config, apparently tempo does have ttl configuration which is by default disabled. Thanks everyone |
Beta Was this translation helpful? Give feedback.
-
Hi,
1.Can you please assist with tempo queries logging the following timeouts from memcached?
Looking into memcached metrics I don't see anything hitting max connections or so.
2.should we scale up more instances of memcached? the chart supports it but will be any benefit from multiple instances?
single memcached instance deployed from distributed tempo helm chart
eks 1.26
tempo 2.4.1 distributed via helm chart
level=error ts=2024-03-28T21:20:22.175685654Z caller=memcached.go:153 msg="Failed to get keys from memcached" err="memcache: connect timeout to 10.100.213.119:11211" level=error ts=2024-03-28T21:20:22.175931644Z caller=memcached.go:153 msg="Failed to get keys from memcached" err="memcache: connect timeout to 10.100.213.119:11211" level=error ts=2024-03-28T21:20:22.434598693Z caller=memcached.go:236 msg="failed to put to memcached" name=parquet-footer|bloom|frontend-search err="server=10.100.213.119:11211: memcache: connect timeout to 10.100.213.119:11211" level=error ts=2024-03-28T21:20:23.133903161Z caller=memcached.go:236 msg="failed to put to memcached" name=parquet-footer|bloom|frontend-search err="server=10.100.213.119:11211: memcache: connect timeout to 10.100.213.119:11211" level=error ts=2024-03-28T21:20:24.174768798Z caller=memcached.go:236 msg="failed to put to memcached" name=parquet-footer|bloom|frontend-search err="server=10.100.213.119:11211: memcache: connect timeout to 10.100.213.119:11211" level=error ts=2024-03-28T21:20:24.175104043Z caller=memcached.go:236 msg="failed to put to memcached" name=parquet-footer|bloom|frontend-search err="server=10.100.213.119:11211: memcache: connect timeout to 10.100.213.119:11211" level=error ts=2024-03-28T21:20:24.195413573Z caller=memcached.go:236 msg="failed to put to memcached" name=parquet-footer|bloom|frontend-search err="server=10.100.213.119:11211: memcache: connect timeout to 10.100.213.119:11211" level=error ts=2024-03-28T21:20:24.195483153Z caller=memcached.go:236 msg="failed to put to memcached" name=parquet-footer|bloom|frontend-search err="server=10.100.213.119:11211: memcache: connect timeout to 10.100.213.119:11211" level=error ts=2024-03-28T21:20:24.195515305Z caller=memcached.go:236 msg="failed to put to memcached" name=parquet-footer|bloom|frontend-search err="server=10.100.213.119:11211: memcache: connect timeout to 10.100.213.119:1121
Beta Was this translation helpful? Give feedback.
All reactions