You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
If we remove the aggregator and do the query count(up{label_key="label_value"}) I expect to have the value 918, but the truth is promxy are returning the max value from the 3 server groups we have, which is 404 and in this case comes from the sum from the data which resides on Grafana mimir dc1
I also did a test, I added a dedicated label to each server group, named __dc__, and when we do the query count(count(up{stack="persistence"}) without (__dc__)), we have the desired value which is 918.
However, let's go and do the expected query count(up{stack="persistence"}). We will have the value 980 since they are counting the values from google-us-dc1 and google-us-dc2 twice because when we add custom labels per server group, we are saying the data on each server group is unique, which is not the case.
Although we are using Mimir, in the end, is a Prometheus query API that we are using, so I don't feel it is related.
We are not overriding the prefer_max option and we are using the version v0.0.91.
I already tried to debug in Promxy code, but I ran without ideas and I decided to open this issue. I'm open to contribute either way if I find something 🙌
The text was updated successfully, but these errors were encountered:
I believe there may be a typo in this example; as described this configuration has some overlapping DCs (google-us-dc1 is in mimir dc1 and dc2). Given that the example below has eu-dc1..5 -- I'm assuming mimir dc2 was supposed to be eu? (since otherwise i don't see eu dc3,4,5).
but the truth is promxy are returning the max value from the 3 server groups we have,
This sounds like maybe the servergroup configuration isn't quite right -- as the NodeReplacer (that does the max/rewrite) is done at the top-level. All of the servergroup merging is done lower down. So this does sound like an issue with the servergroup configuration rather than the aggregation rewrite in NodeReplacer.
Although we are using Mimir, in the end, is a Prometheus query API that we are using, so I don't feel it is related.
This seems correct; this seems like an issue with the promxy servergroup config not quite matching your setup.
We are not overriding the prefer_max option and we are using the version v0.0.91.
If we are running into prefer_max we are definitely hitting a servergroup configuration issue. The prefer_max is intended to handle merging of data within a servergroup (defined as a set of API endpoints that "have the same data").
I ran without ideas and I decided to open this issue
I'd be happy to give a hand here! Could you provide your promxy config? Or at least the servergroup configuration. As well as re-iterating the downstreams, their data, and desired merging behavior. I think from there we'll be able to make some progress :)
I have the following setup with 3 server groups
When we do the following query
count(up{label_key="label_value"}) by (region)
we have the following results:If we remove the aggregator and do the query
count(up{label_key="label_value"})
I expect to have the value918
, but the truth is promxy are returning the max value from the 3 server groups we have, which is404
and in this case comes from the sum from the data which resides onGrafana mimir dc1
I also did a test, I added a dedicated label to each server group, named
__dc__
, and when we do the querycount(count(up{stack="persistence"}) without (__dc__))
, we have the desired value which is918
.However, let's go and do the expected query
count(up{stack="persistence"})
. We will have the value980
since they are counting the values fromgoogle-us-dc1
andgoogle-us-dc2
twice because when we add custom labels per server group, we are saying the data on each server group is unique, which is not the case.Although we are using Mimir, in the end, is a Prometheus query API that we are using, so I don't feel it is related.
We are not overriding the
prefer_max
option and we are using the versionv0.0.91
.I already tried to debug in Promxy code, but I ran without ideas and I decided to open this issue. I'm open to contribute either way if I find something 🙌
The text was updated successfully, but these errors were encountered: