Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

some metrics were missing when vm-storage crashes #6294

Open
jinlongwang opened this issue May 17, 2024 · 1 comment
Open

some metrics were missing when vm-storage crashes #6294

jinlongwang opened this issue May 17, 2024 · 1 comment
Assignees
Labels
bug Something isn't working need more info

Comments

@jinlongwang
Copy link

jinlongwang commented May 17, 2024

Describe the bug

we found vm-storage node crashed at 2024-05-16 12:32, and then we found that many metrics were missing.
We were pretty sure that these metrics existed before, and we found them in a different store(hive).

for example:
image

we found that all the missing metrics are hashed to the same node, which is the node where the panic occurred
image

vm-storage panic logs
image

thanks for helping

To Reproduce

This problem has occurred twice in our production environment

sorry we have no way to reproduce this problem so far

Version

vmstorage-20230313-063739-tags-v1.87.5-cluster-0-g4793665c4

we build this version from cluster-v1.87.5 (there is no code changed)

Logs

No response

Screenshots

No response

Used command-line flags

No response

Additional information

No response

@jinlongwang jinlongwang added the bug Something isn't working label May 17, 2024
@hagen1778
Copy link
Collaborator

This problem has occurred twice in our production environment

Hey @jinlongwang! Do you know why the node has crashed? Have you tried following these recommendations https://docs.victoriametrics.com/troubleshooting/#unexpected-query-results ?

vmstorage-20230313-063739-tags-v1.87.5-cluster-0-g4793665c4

This is pretty old version. A lot has changed in indexdb and in the way of storing data. I'd recommend updating to the usptream version. Otherwise, it is unlikely I can help besides pointing to troubleshooting docs. In general, runtime crashes shouldn't corrupt indexdb (data structure which helps to locate time series) and affects searchability of time series. It is unexpected that you lost the data.

@hagen1778 hagen1778 self-assigned this May 23, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working need more info
Projects
None yet
Development

No branches or pull requests

2 participants