CHANGELOG
The following tip changes can be tested by building VictoriaMetrics components from the latest commits according to the following docs:
- How to build single-node VictoriaMetrics
- How to build cluster version of VictoriaMetrics
- How to build vmagent
- How to build vmalert
- How to build vmauth
- How to build vmctl
tip
- FEATURE: vmbackup: add support for server-side copy of existing backups. See these docs for details.
- FEATURE: vmagent: properly handle
unexpected EOFerror when parsing metrics in Prometheus exposition format. See this issue. -
FEATURE: vmagent: retry failed write request on the closed connection immediately, without waiting for backoff. This should improve data delivery speed and reduce amount of error logs emitted by vmagent when using idle connections. See related issue.
- BUGFIX: do not allow starting VictoriaMetrics components with improperly set boolean command-line flags in the form
-boolFlagName value, since this leads to silent incomplete flags' parsing. This form should be replaced with-boolFlagName=value. See this issue. - BUGFIX: vmagent: properly set labels from
-remoteWrite.labelcommand-line flag just before sending samples to the configured-remoteWrite.urlaccording to these docs. Previously these labels were incorrectly set before the relabeling configured via-remoteWrite.urlRelabelConfigsand the stream aggregation configured via-remoteWrite.streamAggr.config, so these labels could be lost or incorrectly transformed before sending the samples to remote storage. The fix allows using-remoteWrite.labelfor identifyingvmagentinstances in cluster mode. See this issue and these docs for more details. - BUGFIX: remove
DEBUGlogging when parsingiffilters inside relabeling rules and when parsingmatchfilters inside stream aggregation rules. - BUGFIX: properly replace
:chars in label names with_when-usePromCompatibleNamingcommand-line flag is passed tovmagent,vminsertor single-node VictoriaMetrics. This addresses this comment. - BUGFIX: Official Grafana dashboards for VictoriaMetrics: fix display of ingested rows rate for
Samples ingested/sandSamples ratepanels for vmagent's dasbhoard. Previously, not all ingested protocols were accounted in these panels. An extra panelRows ratewas added toIngestionsection to display the split for rows ingested rate by protocol. - BUGFIX: vmbackup: correctly check if specified
-dstbelongs to specified-storageDataPath. See this issue. - BUGFIX: vmctl: don't interrupt the migration process if no metrics were found for a specific tenant. See this issue.
v1.93.0
Released at 2023-08-12
v1.93.x is a line of LTS releases (e.g. long-time support). It contains important up-to-date bugfixes. The v1.93.x line will be supported for at least 12 months since v1.93.0 release
Update note: starting from this release, vmagent ignores timestamps provided by scrape targets by default - it associates scraped metrics with local timestamps instead. Set honor_timestamps: true in scrape configs if timestamps provided by scrape targets must be used instead. This change helps removing gaps for metrics collected from cadvisor such as container_memory_usage_bytes. This also improves data compression and query performance over metrics collected from cadvisor. See more details here.
- SECURITY: upgrade Go builder from Go1.20.6 to Go1.21.0 in order to fix this issue.
-
SECURITY: upgrade base docker image (Alpine) from 3.18.2 to 3.18.3. See alpine 3.18.3 release notes.
- FEATURE: MetricsQL: add
share_eq_over_time(m[d], eq)function for calculating the share (in the range[0...1]) of raw samples on the given lookbehind windowd, which are equal toeq. See this feature request. Thanks to @Damon07 for the pull request. - FEATURE: vmauth: allow configuring deadline for a backend to be excluded from the rotation on errors via
-failTimeoutcmd-line flag. This feature could be useful when it is expected for backends to be not available for significant periods of time. See this issue for details. Thanks to @SunKyu for the pull request. - FEATURE: vmalert: remove deprecated in v1.61.0
-rule.configCheckIntervalcommand-line flag. Use-configCheckIntervalcommand-line flag instead. - FEATURE: vmalert: remove support of deprecated web links of
/api/v1/<groupID>/<alertID>/statusform in favour of/api/v1/alerts?group_id=<>&alert_id=<>links. Links of/api/v1/<groupID>/<alertID>/statusform were deprecated in v1.79.0. See this issue for details. - FEATURE: vmctl: allow disabling binary export API protocol via
-vm-native-disable-binary-protocolcmd-line flag when migrating data from VictoriaMetrics. Disabling binary protocol can be useful for deduplication of the exported data before ingestion. For this, deduplication need to be configured at-vm-native-src-addrside and-vm-native-disable-binary-protocolshould be set on vmctl side. - FEATURE: vmctl: add support of
weekstep for time-based chunking migration. See this issue. - FEATURE: vmctl: allow specifying custom full url at
--remote-read-src-addrcommand-line flag if--remote-read-disable-path-appendcommand-line flag is set. This allows importing data from urls, which do not end with/api/v1/read. For example, from Promscale. See this issue. - FEATURE: vmui: add warning in query field of vmui for partial data responses. See this issue.
- FEATURE: vmui: allow displaying the full error message on click for trimmed error messages in vmui. See this issue.
- FEATURE: Official Grafana dashboards for VictoriaMetrics: add
Concurrent insertspanel to vmagent's dasbhoard. The new panel supposed to show whether the number of concurrent inserts processed by vmagent isn't reaching the limit. - FEATURE: Official Grafana dashboards for VictoriaMetrics: add panels for absolute Mem and CPU usage by vmalert. See related issue here.
- FEATURE: Official Grafana dashboards for VictoriaMetrics: correctly calculate
Bytes per pointvalue for single-server and cluster VM dashboards. Before, the calculation mistakenly accounted for the number of entries in indexdb in denominator, which could have shown lower values than expected. -
FEATURE: Alerting rules for VictoriaMetrics:
ConcurrentFlushesHitTheLimitalerting rule was moved from single-server and cluster alerts to the list of "health" alerts as it could be related to many VictoriaMetrics components. - BUGFIX: vmagent: return human readable error if opentelemetry has json encoding. Follow-up after PR.
- BUGFIX: vmagent: properly validate scheme for
proxy_urlfield at the scrape config. See this issue for details. - BUGFIX: vmagent: properly apply
iffilters during relabeling. Previously theiffilter could improperly work. See this issue and this pull request. - BUGFIX: vmagent: use local scrape timestamps for the scraped metrics unless
honor_timestamps: trueoption is explicitly set at scrape_config. This fixes gaps for metrics collected from cadvisor or similar exporters, which export metrics with invalid timestamps. See this issue and this comment for details. The issue has been introduced in v1.68.0. - BUGFIX: vmagent: fixes runtime panic at OpenTelemetry parser. Opentelemetry format allows histograms without
sumfields. Such histogram converted as counter with_countsuffix. See this issue. - BUGFIX: vmagent: keep unmatched series at stream aggregation when
-remoteWrite.streamAggr.dropInputis set tofalseto match intended behaviour introduced at v1.92.0. See this issue. - BUGFIX: vmalert: properly set
vmalert_config_last_reload_successfulvalue on configuration updates or rollbacks. The bug was introduced in v1.92.0 in this PR. - BUGFIX: vmalert: fix
vmalert_remotewrite_send_duration_seconds_totalvalue, before it didn't count in the real time spending on remote write requests. See this pr for details. - BUGFIX: vmbackupmanager: fix panic when creating a backup to a local filesystem on Windows. See this issue.
- BUGFIX: vmui: properly handle client address with
X-Forwarded-Forpart at the Active queries page. See this comment. - BUGFIX: MetricsQL: prevent from panic when the lookbehind window in square brackets of rollup function is parsed into negative value. See this issue.
v1.92.1
Released at 2023-07-28
- BUGFIX: vmalert: revert unit test feature for alerting and recording rules introduced in this pull request. See the following change.
v1.92.0
Released at 2023-07-27
Update note: starting from this release, stream aggregation writes the following samples to the configured remote storage by default:
- aggregated samples;
- the original input samples, which match zero
matchoptions from the provided config.
Previously only aggregated samples were written to the storage by default. The previous behavior can be restored in the following ways:
- by passing
-streamAggr.dropInputcommand-line flag to single-node VictoriaMetrics; - by passing
-remoteWrite.streamAggr.dropInputcommand-line flag per each configured-remoteWrite.streamAggr.configatvmagent.
- SECURITY: upgrade base docker image (alpine) from 3.18.0 to 3.18.2. See alpine 3.18.2 release notes.
-
SECURITY: upgrade Go builder from Go1.20.5 to Go1.20.6. See the list of issues addressed in Go1.20.6.
- FEATURE: reduce memory usage by up to 5x for setups with high churn rate and long retention. See the description for this change and this issue for details.
- FEATURE: reduce spikes in CPU and disk IO usage during
indexdbrotation (aka inverted index), which is performed once per-retentionPeriod. The new algorithm gradually pre-populates newly createdindexdbduring the last hour before the rotation. The number of pre-populated series in the newly createdindexdbcan be monitored viavm_timeseries_precreated_totalmetric. This should resolve this issue. - FEATURE: MetricsQL: allow selecting time series matching at least one of multiple
orfilters. For example,{env="prod",job="a" or env="dev",job="b"}selects series with either{env="prod",job="a"}or{env="dev",job="b"}labels. This functionality allows passing the selected series to rollup functions without the need to use subqueries. See these docs. - FEATURE: MetricsQL: add ability to preserve metric names for binary operation results via
keep_metric_namesmodifier. For example,({__name__=~"foo|bar"} / 10) keep_metric_namesleavesfooandbarmetric names in division results. See these docs. This helps to address issues like this one. - FEATURE: MetricsQL: add ability to copy all the labels from
oneside of many-to-one operations by specifying*insidegroup_left()orgroup_right(). Also allow adding a prefix for copied label names viagroup_left(*) prefix "..."syntax. For example, the following query copies Kubernetes namespace labels tokube_pod_infoseries and addsns_prefix for the copied label names:kube_pod_info * on(namespace) group_left(*) prefix "ns_" kube_namespace_labels. The labels fromon()list aren't prefixed. This feature resolves this and that questions at StackOverflow. - FEATURE: MetricsQL: add ability to specify durations via
WITHtemplates. Examples:WITH (w = 5m) m[w]is automatically transformed tom[5m]WITH (f(window, step, off) = m[window:step] offset off) f(5m, 10s, 1h)is automatically transformed tom[5m:10s] offset 1hThanks to @lujiajing1126 for the initial idea and implementation. See this feature request.
- FEATURE: vmui: added a new page with the list of currently running queries. See this issue and these docs.
- FEATURE: vmagent: add support for data ingestion via OpenTelemetry protocol. See these docs, this feature request and this pull request.
- FEATURE: vmagent: allow sharding outgoing time series among the configured remote storage systems. This can be useful for building horizontally scalable stream aggregation, when samples for the same time series must be aggregated by the same
vmagentinstance at the second level. See these docs and this feature request for details. - FEATURE: vmagent: allow configuring staleness interval in stream aggregation config. See this issue for details.
- FEATURE: vmagent: allow specifying a list of series selectors inside
ifoption of relabeling rules. The corresponding relabeling rule is executed when at least a single series selector matches. See these docs. - FEATURE: stream aggregation: allow specifying a list of series selectors inside
matchoption of stream aggregation configs. The input sample is aggregated when at least a single series selector matches. See this feature request. - FEATURE: stream aggregation: preserve input samples, which match zero
matchoptions from the configured aggregations. Previously all the input samples were dropped by default, so only the aggregated samples are written to the output storage. The previous behavior can be restored by passing-streamAggr.dropInputcommand-line flag to single-node VictoriaMetrics or by passing-remoteWrite.streamAggr.dropInputcommand-line flag tovmagent. - FEATURE: vmctl: add verbose output for docker installations or when TTY isn't available. See this issue.
- FEATURE: vmctl: interrupt backoff retries when import process is cancelled. The change makes vmctl more responsive in case of errors during the import. See this pull request.
- FEATURE: vmctl: update backoff policy on retries to reduce probability of overloading for
sourceordestinationdatabases. See this issue. - FEATURE: vmstorage: suppress "broken pipe" and "connection reset by peer" errors for search queries on vmstorage side. See this and this commits.
- FEATURE: Official Grafana dashboards for VictoriaMetrics: add panel for tracking rate of syscalls while writing or reading from disk via
process_io_(read|write)_syscalls_totalmetrics. - FEATURE: accept timestamps in milliseconds at
start,endandtimequery args in Prometheus querying API. See these docs and this feature request. - FEATURE: vmalert: update retry policy for pushing data to
-remoteWrite.url. By default, vmalert will make multiple retry attempts with exponential delay. The total time spent during retry attempts shouldn't exceed-remoteWrite.retryMaxTime(default is 30s). When retry time is exceeded vmalert drops the data dedicated for-remoteWrite.url. Before, vmalert dropped data after 5 retry attempts with 1s delay between attempts (not configurable). See-remoteWrite.retryMinIntervaland-remoteWrite.retryMaxTimecmd-line flags. - FEATURE: vmalert: expose
vmalert_remotewrite_send_duration_seconds_totalcounter, which can be used for determining high saturation of every connection to remote storage with an alerting querysum(rate(vmalert_remotewrite_send_duration_seconds_total[5m])) by(job, instance) > 0.9 * max(vmalert_remotewrite_concurrency) by(job, instance). This query triggers when a connection is saturated by more than 90%. This usually means that-remoteWrite.concurrencycommand-line flag must be increased in order to increase the number of concurrent writings into remote endpoint. See this feature request. - FEATUTE: vmalert: display the error message received during unsuccessful config reload in vmalert's UI. See this issue for details.
- FEATUTE: vmalert: allow disabling of
stepparam attached to instant queries. This might be useful for using vmalert with datasources that to not support this param, unlike VictoriaMetrics. See this issue for details. - FEATUTE: vmalert: support option for "blackholing" alerting notifications if
-notifier.blackholecmd-line flag is set. Enable this flag if you want vmalert to evaluate alerting rules without sending any notifications to external receivers (eg. alertmanager). See this issue for details. Thanks to @venkatbvc for the pull request. - FEATURE: vmalert: add unit test for alerting and recording rules, see more details here. Thanks to @Haleygo for the pull request.
- FEATURE: vmalert: allow overriding default GET params for rules with
graphitedatasource type, in the same way as it happens forprometheustype. See this issue. - FEATUTE: vmalert: support
keep_firing_forfield for alerting rules. See docs updated here and this issue. Thanks to @Haleygo for the pull request. - FEATURE: vmauth: expose
vmauth_user_request_duration_secondsandvmauth_unauthorized_user_request_duration_secondssummary metrics for measuring requests latency per user. - FEATURE: vmbackup: show backup progress percentage in log during backup uploading. See this issue.
- FEATURE: vmrestore: show restoring progress percentage in log during backup downloading. See this issue.
-
FEATURE: add ability to fine-tune Graphite API limits via the following command-line flags:
-search.maxGraphiteTagKeysfor limiting the number of tag keys returned from Graphite API for tags-search.maxGraphiteTagValuesfor limiting the number of tag values returned from Graphite API for tag values-search.maxGraphiteSeriesfor limiting the number of series (aka paths) returned from Graphite API for series See this issue. - BUGFIX: properly return series from /api/v1/series if it finds more than the
limitseries (limitis an optional query arg passed to this API). Previously thelimit exceeded errorerror was returned in this case. See this issue. - BUGFIX: vmui: fix application routing issues and problems with manual URL changes. See this pull request and this issue.
- BUGFIX: add validation for invalid partial RFC3339 timestamp formats in query and export APIs.
- BUGFIX: vmctl: interrupt explore procedure in influx mode if vmctl found no numeric fields.
- BUGFIX: vmctl: fix panic in case
--remote-read-filter-time-startflag is not set for remote-read mode. This flag is now required to use remote-read mode. See this issue. - BUGFIX: vmctl: fix formatting issue, which could add superflouos
scharacters at the end ofsamples/soutput during data migration. For example, it could writesamples/ssssss. See this issue. - BUGFIX: vmalert: use RFC3339 time format in query args instead of unix timestamp for all issued queries to Prometheus-like datasources.
- BUGFIX: vmalert: correctly calculate evaluation time for rules. Before, there was a low probability for discrepancy between actual time and rules evaluation time if evaluation interval was lower than the execution time for rules within the group.
- BUGFIX: vmalert: reset evaluation timestamp after modifying group interval. Before, there could have latency on rule evaluation time.
- BUGFIX: vmselect: fix timestamp alignment for Prometheus querying API if time argument is less than 10m from the beginning of Unix epoch.
- BUGFIX: vmagent: close HTTP connections to service discovery servers when they are no longer needed. This should prevent from possible connection exhasution in some cases. See this issue.
- BUGFIX: vmagent: do not show relabel debug links at the
/targetspage whenvmagentruns with-promscrape.dropOriginalLabelscommand-line flag, since it has no the original labels needed for relabel debug. See this issue. - BUGFIX: vminsert: fixed decoding of label values with slash when accepting data via pushgateway protocol. This fixes Prometheus golang client compatibility. See this issue.
- BUGFIX: MetricsQL: properly parse binary operations with reserved words on the right side such as
foo + (on{bar="baz"}). Previously such queries could lead to panic. See this issue. - BUGFIX: Official Grafana dashboards for VictoriaMetrics: display cache usage for all components on panel
Cache usage % by typefor cluster dashboard. Before, only vmstorage caches were shown.
v1.91.3
Released at 2023-06-30
-
SECURITY: upgrade Go builder from Go1.20.4 to Go1.20.5. See the list of issues addressed in Go1.20.5.
- BUGFIX: vmagent: fix possible panic at shutdown when stream aggregation is enabled. See this pull request for details.
- BUGFIX: vmagent: fixed service name detection for consulagent service discovery in case of a difference in service name and service id. See this issue for details.
- BUGFIX: vmalert: retry all errors except 4XX status codes while pushing via remote-write to the remote storage. Previously, errors like broken connection could prevent vmalert from retrying the request.
- BUGFIX: vmalert: properly interrupt retry attempts on vmalert shutdown. Before, vmalert could have waited for all retries to finish for shutdown.
- BUGFIX: vmbackupmanager: fix an issue with
vmbackupmanagernot being able to restore data from a backup stored in GCS. See this issue for details. - BUGFIX: VictoriaMetrics cluster: properly return error from /api/v1/query and /api/v1/query_range at
vmselectwhen the-search.maxSamplesPerQueryor-search.maxSamplesPerSerieslimit is exceeded. Previously incomplete response could be returned without the error ifvmselectruns with-replicationFactorgreater than 1. See this pull request. - BUGFIX: storage: prevent from possible crashloop after the migration from versions below
v1.90.0to newer versions. See this issue for details. - BUGFIX: vmui: fix a memory leak issue associated with chart updates. See this pull request.
- BUGFIX: vmbackupmanager: fix removing storage data dir before restoring from backup.
- BUGFIX: vmselect: wait for all vmstorage nodes to respond when the
-replicationFactorflag is set bigger than > 1. Before, vmselect could have skip waiting for the slowest replicas to respond. This could have resulted in issues illustrated here. Now, this optimization is disabled by default and could be re-enabled by passing-search.skipSlowReplicascmd-line flag to vmselect. See more details here.
v1.91.2
Released at 2023-06-02
v1.91.1
Released at 2023-06-01
- FEATURE:vmagent: Adds
follow_redirectsat service discovery level of scrape configuration. See this issue. Thanks to @Haleygo for the pull request. -
FEATURE: vmselect: Decreases startup time for vmselect with a big number of vmstorage nodes. See this issue. Thanks to @Haleygo for the pull request.
- BUGFIX: vmalert: Properly form path to static assets in WEB UI if
http.pathPrefixset. See this issue. - BUGFIX: vmalert: Properly set datasource query params. See this issue. Thanks to @gsakun for the pull request.
- BUGFIX: vmalert: properly return empty slices instead of nil for
/api/v1/rulesfor groups with present name but absentrules. See this issue. - BUGFIX: vmauth: Properly handle LOCAL command for proxy protocol. See this issue.
- BUGFIX: vmbackupmanager: Fixes crash on startup. See this issue.
- BUGFIX: vmui: fix bug with custom URL in global settings not respecting tenantID change. See this issue.
v1.91.0
Released at 2023-05-18
- SECURITY: upgrade Go builder from Go1.20.3 to Go1.20.4. See the list of issues addressed in Go1.20.4.
-
SECURITY: serve
/robots.txtcontent to disallow indexing of the exposed instances by search engines. See this issue for details. - FEATURE: update docker compose environment to V2 in respect to V1 deprecation notice from June 2023. See Migrate to Compose V2.
- FEATURE: deprecate
-bigMergeConcurrencycommand-line flag, since improper configuration for this flag frequently led to uncontrolled growth of unmerged parts, which, in turn, could lead to queries slowdown and increased CPU usage. The concurrency for background merges can be controlled via-smallMergeConcurrencycommand-line flag, though it isn't recommended to change this flag in general case. - FEATURE: do not execute the incoming request if it has been canceled by the client before the execution start. See this pull request.
- FEATURE: support time formats with timezones. For example,
2024-01-02+02:00meansJanuary 2, 2024at+02:00time zone. See these docs. - FEATURE: expose
process_*metrics at/metricspage of all the VictoriaMetrics components under Windows OS. See this pull request. - FEATURE: reduce the amounts of unimportant
INFOlogging during VictoriaMetrics startup / shutdown. This should improve visibility for potentially important logs. - FEATURE: upgrade base docker image (alpine) from 3.17.3 to 3.18.0. See alpine 3.18.0 release notes.
- FEATURE: VictoriaMetrics cluster: do not pollute logs with
cannot read hello: cannot read message with size 11: EOFmessages atvmstorageduring TCP health checks performed by Consul or other services. See this issue. - FEATURE: vmagent: support the ability to filter consul_sd_configs targets in more optimal way via new
filteroption. See this feature request. - FEATURE: vmagent: add support for consulagent_sd_configs. See this feature request.
- FEATURE: vmagent: emit a warning if too small value is passed to
-remoteWrite.maxDiskUsagePerURLcommand-line flag. See this issue. - FEATURE: vmalert: add support of recursive globs for
-ruleand-rule.templatescommand-line flags by using**in the glob pattern. See this issue. - FEATURE: vmalert: add ability to specify custom per-group HTTP headers sent to the configured notifiers. See this issue. Thanks to @Haleygo for the pull request.
- FEATURE: vmalert: detect alerting rules which don't match any series. See these docs and this feature request.
- FEATURE: vmalert: support loading rules via HTTP URL. See this issue. Thanks to @Haleygo for the pull request.
- FEATURE: vmalert: add buttons for filtering groups/rules with errors or with no-match warning in web UI for page
/groups. See this issue. - FEATURE: vmalert: do not retry remote-write requests for responses with 4XX status codes. This aligns with Prometheus remote write specification. Thanks to @MichaHoffmann for the pull request.
- FEATURE: vmauth: add ability to filter incoming requests by IP. See these docs and this feature request.
- FEATURE: vmauth: add ability to proxy requests to the specified backends for unauthorized users. See this feature request.
- FEATURE: vmauth: add ability to specify default route for unmatched requests. See this feature request.
- FEATURE: vmauth: retry
POSTrequests on the remaining backends if the currently selected backend isn't reachable. See this issue. - FEATURE: vmui: add ability to compare the data for the previous day with the data for the current day at Cardinality Explorer. See this feature request.
- FEATURE: vmui: display histograms as heatmaps in Metrics explorer. See this feature request.
- FEATURE: vmui: add
WITH templateplayground. See this feature request. - FEATURE: vmui: add ability to debug relabeling. See this feature request.
- FEATURE: vmui: add an ability to copy and execute queries listed at top queries page. Also make more human readable the query duration column. See this feature request and this pull request.
- FEATURE: vmui: increase default font size for better readability.
- FEATURE: vmui: cardinality explorer: return back a table with labels containing the highest number of unique label values. See issue.
- FEATURE: vmui: add notification icon for queries that do not match any time series. A warning icon appears next to the query field when the executed query does not match any time series. See this feature request.
- FEATURE: vmbackup: add
-s3StorageClasscommand-line flag for setting the storage class for AWS S3 backups. See this issue. Thanks to @justcompile for the pull request. - FEATURE: vmbackup: store backup creation and completion time in
backup_complete.ignorefile of backup contents. This allows determining the exact timestamp when the backup was created and completed. - FEATURE: vmbackupmanager: add
created_atfield to the output of/api/v1/backupsAPI andvmbackupmanager backup listcommand. See this doc for data format details. - FEATURE: vmbackupmanager: add commands for locking/unlocking backups against deletion by retention policy. See this doc for data format details.
- FEATURE: vmctl: add support for different time formats for
--vm-native-filter-time-startand--vm-native-filter-time-endcommand-line flags. See this issue. -
FEATURE: vmctl: set default value for
--vm-native-step-intervalcommand-line flag tomonth. This enables time-based chunking of data based on monthly step value when using native migration mode. See this issue. - BUGFIX: reduce the probability of sudden increase in the number of small parts on systems with small number of CPU cores.
- BUGFIX: reduce the possibility of increased CPU usage when data with timestamps older than one hour is ingested into VictoriaMetrics. This reduces spikes for the graph
sum(rate(vm_slow_per_day_index_inserts_total)). See this pull request. - BUGFIX: fix possible infinite loop during
indexdbrotation when-retentionTimezoneOffsetcommand-line flag is set and the local timezone is not UTC. See this issue. Thanks to @faceair for the fix. - BUGFIX: do not panic at Windows during snapshot deletion. Instead, delete the snapshot on the next restart. See this comment for details.
- BUGFIX: change the max allowed value for
-memory.allowedPercentfrom 100 to 200. See this issue. - BUGFIX: properly limit the number of OpenTSDB HTTP concurrent requests specified via
-maxConcurrentInsertscommand-line flag. See this issue. Thanks to @zouxiang1993 for the fix. - BUGFIX: do not ignore trailing empty field in CSV lines when importing data in CSV format. See this issue.
- BUGFIX: disallow
"chars when parsing Prometheus label names, since they aren't allowed by Prometheus text exposition format. Previously this could result in silent incorrect parsing of incorrect Prometheus labels such asfoo{"bar"="baz"}or{foo:"bar",baz="aaa"}. See this issue. - BUGFIX: VictoriaMetrics cluster: prevent from possible panic when the number of vmstorage nodes increases when automatic vmstorage discovery is enabled.
- BUGFIX: MetricsQL: fix a panic when the duration in the query contains uppercase
Msuffix. Such a suffix isn't allowed to use in durations, since it clashes witha millionsuffix, e.g. it isn't clear whetherrate(metric[5M])means rate over 5 minutes, 5 months or 5 million seconds. See this and this issues. - BUGFIX: vmagent: properly handle the
vm_promscrape_config_last_reload_successfulmetric after config reload. See this issue. - BUGFIX: vmagent: add
__meta_kubernetes_endpoints_namelabel for all ports discovered from endpoint. Previously, ports not matched byServicedid not have this label. See this issue for details. Thanks to @thunderbird86 for discovering and fixing the issue. - BUGFIX: vmalert: retry failed read request on the closed connection one more time. This improves rules execution reliability when connection between vmalert and datasource closes unexpectedly.
- BUGFIX: vmalert: properly display an error when using
queryfunction for templating value of-external.alert.sourceflag. See this issue. - BUGFIX: vmalert: properly return empty slices instead of nil for
/api/v1/rulesand/api/v1/alertsAPI handlers. See this issue. - BUGFIX: vmauth: do not return invalid auth credentials in http response by default, since it may be logged by client. See this issue.
- BUGFIX: vmui: fix the display of the tenant selector. See this issue.
- BUGFIX: vmui: fix UI freeze when the query returns non-histogram series alongside histogram series.
- BUGFIX: vmui: fix the text display on buttons in Safari 16.4.
- BUGFIX: alerts-health: update threshold for
TooHighMemoryUsagealert from 90% to 80%, since 90% is too high for production environments. - BUGFIX: vmbackup: fix compatibility with Windows OS. See this issue.
- BUGFIX: vmctl: fix performance issue when migrating data from VictoriaMetrics according to these docs. Add the ability to speed up the data migration via
--vm-native-disable-retriescommand-line flag. See this issue. - BUGFIX: stream aggregation: fix bug with duplicated labels during stream aggregation via single-node VictoriaMetrics. See this issue.
v1.90.0
Released at 2023-04-06
Update note: this release contains backwards-incompatible change in storage data format, so the previous versions of VictoriaMetrics will exit with the unexpected number of substrings in the part name error when trying to run them on the data created by v1.90.0 or newer versions. The solution is to upgrade to v1.90.0 or newer releases
- SECURITY: upgrade base docker image (alpine) from 3.17.2 to 3.17.3. See alpine 3.17.3 release notes.
-
SECURITY: upgrade Go builder from Go1.20.2 to Go1.20.3. See the list of issues addressed in Go1.20.3.
- FEATURE: open source Graphite Render API. This API allows using VictoriaMetrics as a drop-in replacement for Graphite at both data ingestion and querying sides and reducing infrastructure costs by up to 10x comparing to Graphite. See this case study as an example.
- FEATURE: release Windows binaries for single-node VictoriaMetrics, VictoriaMetrics cluster, vmbackup and vmrestore. See this, this and this issues. This release of VictoriaMetrics for Windows cannot delete snapshots due to Windows constraints. See this comment for details. This issue should be resolved in future releases.
- FEATURE: log metrics with truncated labels if the length of label value in the ingested metric exceeds
-maxLabelValueLen. This should simplify debugging for this case. - FEATURE: vmagent: show target URL when debugging target relabeling. This should simplify target relabel debugging a bit. See this pull request.
- FEATURE: vmagent: add support for VictoriaMetrics remote write protocol when sending / receiving data to / from Kafka. This protocol allows saving egress network bandwidth costs when sending data from
vmagenttoKafkalocated in another datacenter or availability zone. See this feature request. - FEATURE: vmagent: add
-kafka.consumer.topic.concurrencycommand-line flag. It controls the number of Kafka consumer workers to use byvmagent. It should eliminate the need to start multiplevmagentinstances to improve data transfer rate. See this feature request. - FEATURE: vmagent: add support for Kafka producer and consumer on
arm64machines. See this issue. - FEATURE: vmagent: delete unused buffered data at
-remoteWrite.tmpDataPathdirectory when there is no matching-remoteWrite.urlto send this data to. See this feature request. - FEATURE: vmagent: add the ability for hot reloading of stream aggregation configs. See these docs and this feature request.
- FEATURE: check the contents of
-relabelConfigand-streamAggr.configfiles additionally to-promscrape.configwhen single-node VictoriaMetrics runs with-dryRuncommand-line flag. This aligns the behaviour of single-node VictoriaMetrics with vmagent behaviour for-dryRuncommand-line flag. - FEATURE: vmui: automatically draw a heatmap graph when the query selects a single histogram. This simplifies analyzing histograms. See this feature request.
- FEATURE: vmui: add support for drag'n'drop and paste from clipboard in the "Trace analyzer" page. See this pull request.
- FEATURE: vmui: hide messages longer than 3 lines in the trace. You can view the full message by clicking on the
show morebutton. See this pull request. - FEATURE: vmui: add the ability to manually input date and time when selecting a time range. See this pull request.
- FEATURE: vmui: updated usability and the search process in cardinality explorer. Made this process straightforward for user. See this pull request.
- FEATURE: vmui: add the ability to collapse/expand the legend. See this pull request.
- FEATURE: vmui: add tips for working with the graph and legend. See this pull request.
- FEATURE: vmui: add
applyandcancelbuttons to settings popup. See this issue. - FEATURE: vmctl: automatically disable progress bar when TTY isn't available. See this issue.
-
FEATURE: vmauth: add
-configCheckIntervalcommand-line flag, which can be used for automatic re-reading the-auth.configfile. See this feature request. - BUGFIX: prevent from slow snapshot creating under high data ingestion rate. See this issue.
- BUGFIX: vmauth: suppress proxy protocol parsing errors in case of
EOF. Usually, the error is caused by health checks and is not a sign of an actual error. - BUGFIX: vmui: fix displaying errors for each query. See this issue.
- BUGFIX: vmbackup: fix snapshot not being deleted in case of error during backup. See this issue.
- BUGFIX: stream aggregation: suppress
series after deduperror message in logs when-remoteWrite.streamAggr.dedupIntervalcommand-line flag is set at vmagent or when-streamAggr.dedupIntervalcommand-line flag is set at single-node VictoriaMetrics. - BUGFIX: allow using dashes and dots in environment variables names referred in config files via
%{ENV-VAR.SYNTAX}. See these docs and this issue. - BUGFIX: return back query performance scalability on hosts with big number of CPU cores. The scalability has been reduced in v1.86.0. See this issue.
- BUGFIX: MetricsQL: properly convert VictoriaMetrics historgram buckets to Prometheus histogram buckets when VictoriaMetrics histogram contain zero buckets. Previously these buckets were ignored, and this could lead to missing Prometheus histogram buckets after the conversion. Thanks to @zklapow for the fix.
- BUGFIX: vmagent: fix CPU and memory usage spikes when files pointed by file_sd_config cannot be re-read. See this_issue.
- BUGFIX: prevent unexpected merges on start-up when
-storage.minFreeDiskSpaceBytesis set. See the issue. - BUGFIX: properly support comma-separated filters inside retention filters. See this issue.
- BUGFIX: verify response code when fetching configuration files via HTTP. See this issue.
- BUGFIX: vmalert: replace empty labels with
""instead of"<no value>"during templating, as Prometheus does. See this issue. - BUGFIX: vmctl: properly pass multiple filters from
--vm-native-filter-matchcommand-line flag to the data source. Previously filters from--vm-native-filter-matchwere only used to discover the metric names, and the metric names like__name__="metric_name"has been taken into account, while the remaining filters were ignored. For example--vm-native-src-addr={foo="bar",baz="abc"}may foundmetric_name{foo="bar",baz="abc"}and filter was treated as--vm-native-src-addr={__name__="metrics_name"}, e.g.foo="bar",baz="abc"filter was ignored. See this issue.
v1.89.1
Released at 2023-03-12
- BUGFIX: prevent from possible
cannot unmarshal timeseries from rollupResultCachepanic after the upgrade to v1.89.0.
v1.89.0
Released at 2023-03-12
Update note: this release can crash with cannot unmarshal timeseries from rollupResultCache panic after the upgrade from the previous releases. This issue can be fixed by removing caches stored on disk according to these docs. Another option is to upgrade to v1.89.1.
-
SECURITY: upgrade Go builder from Go1.20.1 to Go1.20.2. See the list of issues addressed in Go1.20.2.
- FEATURE: vmctl: increase the default value for
--remote-read-http-timeoutcommand-line option from 30s (30 seconds) to 5m (5 minutes). This reduces the probability of timeout errors when migrating big number of time series. See this pull request. - FEATURE: vmctl: migrate series one-by-one in vm-native mode. This allows better tracking the migration progress and resuming the migration process from the last migrated time series. See this pull request and this feature request.
- FEATURE: vmctl: add
--vm-native-src-headersand--vm-native-dst-headerscommand-line flags, which can be used for setting custom HTTP headers during vm-native migration mode. Thanks to @baconmania for the pull request. - FEATURE: vmctl: add
--vm-native-src-bearer-tokenand--vm-native-dst-bearer-tokencommand-line flags, which can be used for setting Bearer token headers for the source and the destination storage during vm-native migration mode. See this feature request. - FEATURE: vmctl: add
--vm-native-disable-http-keep-alivecommand-line flag to allowvmctlto use non-persistent HTTP connections in vm-native migration mode. Thanks to @baconmania for the pull request. - FEATURE: vmalert: log number of configration files found for each specified
-rulecommand-line flag. -
FEATURE: vmalert enterprise: concurrently read config files from S3, GCS or S3-compatible object storage. This significantly improves config load speed for cases when there are thousands of files to read from the object storage.
- BUGFIX: vmstorage: fix a bug, which could lead to incomplete or empty results for heavy queries selecting tens of thousands of time series. See this pull request.
- BUGFIX: vmselect: reduce memory usage and CPU usage when performing heavy queries. See this issue.
- BUGFIX: prevent from possible
invalid memory address or nil pointer dereferencepanic during background merge. The issue has been introduced at v1.85.0. See this issue. - BUGFIX: prevent from possible
SIGBUScrash on ARM architectures (Raspberry Pi), which deny unaligned access to 8-byte words. Thanks to @oliverpool for narrowing down the issue and for the initial attempt to fix it. - BUGFIX: VictoriaMetrics cluster: always return
is_partial: truein partial responses. Previously partial responses could be returned as non-partial in some cases. - BUGFIX: VictoriaMetrics cluster: properly take into account
-rpc.disableCompressioncommand-line flag atvmstorage. It was ignored since v1.78.0. See this pull request. - BUGFIX: vmagent: fix panic when writing data to Kafka. The panic has been introduced in v1.88.0.
- BUGFIX: vmui: stop showing
Please enter a valid Query and execute iterror message on the first load of vmui. - BUGFIX: vmui: properly process
Run in VMUIbutton click in VictoriaMetrics datasource plugin for Grafana. - BUGFIX: vmui: fix the display of the selected value for dropdowns on
Explorepage. - BUGFIX: vmui: do not send
stepparam for instant queries. See this issue. - BUGFIX: vmauth: fix
cannot serve httppanic when plain HTTP request is sent tovmauthconfigured to accept requests over proxy protocol-encoded request (e.g. whenvmauthruns with-httpListenAddr.useProxyProtocolcommand-line flag). The issue has been introduced at v1.87.0 when implementing this feature. - BUGFIX: vmgateway: properly parse RSA public key discovered via JWK endpoint.
v1.88.1
Released at 2023-02-27
- FEATURE: add
-snapshotCreateTimeoutflag to allow configuring timeout for snapshot process. See this issue. - FEATURE: expose
vm_http_requests_totalandvm_http_request_errors_totalmetrics forsnapshot/*paths at VictoriaMetrics clustervmstorageand VictoriaMetrics Single. See this issue. - FEATURE: vmgateway: add the ability to discover keys for JWT verification via OpenID discovery endpoint. See these docs.
- FEATURE: add
-internStringDisableCachecommand-line flag for disabling the cache for interned strings. This flag may be useful in some cases for reducing memory usage at the cost of higher CPU usage. -
FEATURE: add
-internStringCacheExpireDurationcommand-line flag for controlling the lifetime of cached interned strings. - BUGFIX: MetricsQL: fix panic when executing the query
aggr_func(rollup*(some_value)). The panic has been introduced in v1.88.0. - BUGFIX: vmagent: use the provided
-remoteWrite.*auth options when determining whether the remote storage supports VictoriaMetrics remote write protocol. Previously the auth options were ignored. This was preventing from automatic switch to VictoriaMetrics remote write protocol. - BUGFIX: vmagent: do not register
vm_promscrape_config_*metrics if-promscrape.configflag is not used. Previously those metrics were registered and never updated, which was confusing and could trigger false-positive alerts. - BUGFIX: vmctl: skip measurements with no fields when migrating data from influxdb. See this issue.
- BUGFIX: delete failed snapshot contents from disk on failed attempt to create snapshot. Previously failed snapshot contents could remain on disk in incomplete state. See this issue
v1.88.0
Released at 2023-02-24
- SECURITY: upgrade base docker image (alpine) from 3.17.1 to 3.17.2. See alpine 3.17.2 release notes.
-
SECURITY: upgrade Go builder from Go1.20.0 to Go1.20.1. See the list of issues addressed in Go1.20.1.
- FEATURE: vmagent: add support for VictoriaMetrics remote write protocol. This protocol allows saving egress network bandwidth costs when sending data from
vmagentto VictoriaMetrics located in another datacenter or availability zone. This also allows reducing disk IO under high load whenvmagentstarts queuing the collected data to disk when the remote storage is temporarily unavailable or cannot keep up with the data ingestion rate. See this feature request. - FEATURE: vmagent: add support for Kuma Control Plane targets discovery aka kuma_sd_configs. See this issue.
- FEATURE: vmgateway: add the ability to verify JWT signature via JWKS endpoint. See these docs.
- FEATURE: vmauth: add the ability to limit the number of concurrent requests on a per-user basis via
-maxConcurrentPerUserRequestscommand-line flag and viamax_concurrent_requestsconfig option. See this feature request and these docs. - FEATURE: vmauth: automatically retry failing
GETrequests on all the configured backends. Previously the backend error has been immediately returned to the client without retrying the request on the remaining backends. - FEATURE: vmauth: choose the backend with the minimum number of concurrently executed requests among the configured backends in a round-robin manner for serving the incoming requests. This allows spreading the load among backends more evenly, while improving the response time.
- FEATURE: vmalert enterprise: add ability to read alerting and recording rules from S3, GCS or S3-compatible object storage. See these docs.
- FEATURE: vmctl: automatically retry requests to remote storage if up to 5 errors occur during the data migration process. This should help continuing the data migration process on temporary errors. Previously
vmctlwas stopping after the first error. See this feature request. - FEATURE: MetricsQL: support optional 2nd argument
min,maxoravgfor rollup, rollup_delta, rollup_deriv, rollup_increase, rollup_rate and rollup_scrape_interval function. If the second argument is passed, then the function returns only the selected aggregation type. This change can be useful for situations where only one type of rollup calculation is needed. For example,rollup_rate(requests_total[1i], "max")would return only the max increase rates forrequests_totalmetric per each interval between adjacent points on the graph. See this article for details. - FEATURE: MetricsQL: support optional 2nd argument
open,low,high,closefor rollup_candlestick function. If the second argument is passed, then the function returns only the selected aggregation type. - FEATURE: MetricsQL: add share(q) aggregate function.
- FEATURE: MetricsQL: add
mad_over_time(m[d])function for calculating the median absolute deviation over raw samples on the lookbehind windowd. See this feature request. - FEATURE: MetricsQL: add
range_mad(q)function for calculating the median absolute deviation over points per each time series returned byq. - FEATURE: MetricsQL: add
range_zscore(q)function for calculating z-score over points per each time series returned fromq. - FEATURE: MetricsQL: add
range_trim_outliers(k, q)function for dropping outliers located farther thank*range_mad(q)from therange_median(q). This should help removing outliers during query time at this issue. - FEATURE: MetricsQL: add
range_trim_zscore(z, q)function for dropping outliers located farther thanz*range_stddev(q)fromrange_avg(q). This should help removing outliers during query time at this issue. - FEATURE: vmui: show
medianinstead ofavgin graph tooltip and line legend, sincemedianis more tolerant against spikes. See this issue. - FEATURE: add
-search.maxSeriesPerAggrFunccommand-line flag, which can be used for limiting the number of time series MetricsQL aggregate functions can return in a single query. This flag can be useful for preventing OOMs when count_values function is improperly used. - FEATURE: vmui: small UX improvements for mobile view. See this feature request and this pull request.
- FEATURE: add
-search.logQueryMemoryUsagecommand-line flag for logging queries, which need more memory than specified by this command-line flag. See this feature request. Thanks to @michal-kralik for the idea and the intial implementation. -
FEATURE: allow setting zero value for
-search.latencyOffsetcommand-line flag. This may be needed in some cases. Previously the minimum supported value for-search.latencyOffsetcommand-line flag was1s. - BUGFIX: vmagent: immediately cancel in-flight scrape requests during configuration reload when stream parsing mode is disabled. Previously
vmagentcould wait for long time until all the in-flight requests are completed before reloading the configuration. This could significantly slow down configuration reload. See this issue. - BUGFIX: vmagent: do not wait for 2 seconds after the first unsuccessful attempt to scrape the target before performing the next attempt. This should improve scrape speed when the target closes http keep-alive connection between scrapes. See this and this issues.
- BUGFIX: vmagent: fix Azure service discovery inside Azure Container App. See this issue. Thanks to @MattiasAng for the fix!
- BUGFIX: do not put auxiliary directories scheduled for removal into snapshots. This should prevent from
cannot create hard links from ...must-remove...errors when making snapshots / backups. See this issue. - BUGFIX: prevent from possible data ingestion slowdown and query performance slowdown during background merges of big parts on systems with small number of CPU cores (1 or 2 CPU cores). The issue has been introduced in v1.85.0 when implementing this feature. See also this issue.
- BUGFIX: properly parse timestamps in milliseconds when ingesting data via OpenTSDB telnet put protocol. Previously timestamps in milliseconds were mistakenly multiplied by 1000. Thanks to @Droxenator for the pull request.
- BUGFIX: MetricsQL: do not add extrapolated points outside the real points when using interpolate() function. See this issue.
v1.87.7
Released at 2023-08-12
v1.87.x is a line of LTS releases (e.g. long-time support). It contains important up-to-date bugfixes. The v1.87.x line will be supported for at least 12 months since v1.87.0 release
- SECURITY: upgrade Go builder from Go1.20.4 to Go1.21.0.
-
SECURITY: upgrade base docker image (Alpine) from 3.18.2 to 3.18.3. See alpine 3.18.3 release notes.
- BUGFIX: vmselect: fix timestamp alignment for Prometheus querying API if time argument is less than 10m from the beginning of Unix epoch.
- BUGFIX: vminsert: fixed decoding of label values with slash when accepting data via pushgateway protocol. This fixes Prometheus golang client compatibility. See this issue.
- BUGFIX: vmagent: properly validate scheme for
proxy_urlfield at the scrape config. See this issue for details. - BUGFIX: vmagent: close HTTP connections to service discovery servers when they are no longer needed. This should prevent from possible connection exhasution in some cases. See this issue.
- BUGFIX: vmagent: properly apply
iffilters during relabeling. Previously theiffilter could improperly work. See this issue and this pull request. - BUGFIX: vmagent: fix possible panic at shutdown when stream aggregation is enabled. See this pull request for details.
- BUGFIX: vmagent: use local scrape timestamps for the scraped metrics unless
honor_timestamps: trueoption is explicitly set at scrape_config. This fixes gaps for metrics collected from cadvisor or similar exporters, which export metrics with invalid timestamps. See this issue and this comment for details. - BUGFIX: vmauth: Properly handle LOCAL command for proxy protocol. See this issue.
- BUGFIX: VictoriaMetrics cluster: properly return error from /api/v1/query and /api/v1/query_range at
vmselectwhen the-search.maxSamplesPerQueryor-search.maxSamplesPerSerieslimit is exceeded. Previously incomplete response could be returned without the error ifvmselectruns with-replicationFactorgreater than 1. See this pull request. - BUGFIX: vmalert: correctly calculate evaluation time for rules. Before, there was a low probability for discrepancy between actual time and rules evaluation time if evaluation interval was lower than the execution time for rules within the group.
- BUGFIX: vmalert: reset evaluation timestamp after modifying group interval. Before, there could have latency on rule evaluation time.
- BUGFIX: vmalert: Properly set datasource query params. See this issue. Thanks to @gsakun for the pull request.
- BUGFIX: vmalert: Properly form path to static assets in WEB UI if
http.pathPrefixset. See this issue. - BUGFIX: vmalert: properly return empty slices instead of nil for
/api/v1/rulesfor groups with present name but absentrules. See this issue. - BUGFIX: vmctl: interrupt explore procedure in influx mode if vmctl found no numeric fields.
- BUGFIX: vmctl: fix panic in case
--remote-read-filter-time-startflag is not set for remote-read mode. This flag is now required to use remote-read mode. See this issue.
v1.87.6
Released at 2023-05-18
v1.87.x is a line of LTS releases (e.g. long-time support). It contains important up-to-date bugfixes. The v1.87.x line will be supported for at least 12 months since v1.87.0 release
- SECURITY: upgrade Go builder from Go1.20.3 to Go1.20.4. See the list of issues addressed in Go1.20.4.
- SECURITY: upgrade base docker image (alpine) from 3.17.3 to 3.18.0. See alpine 3.18.0 release notes.
-
SECURITY: serve
/robots.txtcontent to disallow indexing of the exposed instances by search engines. See this issue for details. - BUGFIX: reduce the probability of sudden increase in the number of small parts on systems with small number of CPU cores.
- BUGFIX: reduce the possibility of increased CPU usage when data with timestamps older than one hour is ingested into VictoriaMetrics. This reduces spikes for the graph
sum(rate(vm_slow_per_day_index_inserts_total)). See this pull request. - BUGFIX: do not ignore trailing empty field in CSV lines when importing data in CSV format. See this issue.
- BUGFIX: disallow
"chars when parsing Prometheus label names, since they aren't allowed by Prometheus text exposition format. Previously this could result in silent incorrect parsing of incorrect Prometheus labels such asfoo{"bar"="baz"}or{foo:"bar",baz="aaa"}. See this issue. - BUGFIX: MetricsQL: fix a panic when the duration in the query contains uppercase
Msuffix. Such a suffix isn't allowed to use in durations, since it clashes witha millionsuffix, e.g. it isn't clear whetherrate(metric[5M])means rate over 5 minutes, 5 months or 5 million seconds. See this and this issues. - BUGFIX: VictoriaMetrics cluster: prevent from possible panic when the number of vmstorage nodes increases when automatic vmstorage discovery is enabled.
- BUGFIX: properly limit the number of OpenTSDB HTTP concurrent requests specified via
-maxConcurrentInsertscommand-line flag. See this issue. Thanks to @zouxiang1993 for the fix. - BUGFIX: vmalert: properly return empty slices instead of nil for
/api/v1/rulesand/api/v1/alertsAPI handlers. See this issue. - BUGFIX: vmagent: add
__meta_kubernetes_endpoints_namelabel for all ports discovered from endpoint. Previously, ports not matched byServicedid not have this label. See this issue for details. Thanks to @thunderbird86 for discovering and fixing the issue. - BUGFIX: fix possible infinite loop during
indexdbrotation when-retentionTimezoneOffsetcommand-line flag is set and the local timezone is not UTC. See this issue. Thanks to @faceair for the fix. - BUGFIX: vmauth: do not return invalid auth credentials in http response by default, since it may be logged by client. See this issue.
- BUGFIX: alerts-health: update threshold for
TooHighMemoryUsagealert from 90% to 80%, since 90% is too high for production environments. - BUGFIX: vmagent: properly handle the
vm_promscrape_config_last_reload_successfulmetric after config reload. See this issue. - BUGFIX: stream aggregation: fix bug with duplicated labels during stream aggregation via single-node VictoriaMetrics. See this issue.
- BUGFIX: stream aggregation: suppress
series after deduperror message in logs when-remoteWrite.streamAggr.dedupIntervalcommand-line flag is set at vmagent or when-streamAggr.dedupIntervalcommand-line flag is set at single-node VictoriaMetrics.
v1.87.5
Released at 2023-04-06
v1.87.x is a line of LTS releases (e.g. long-time support). It contains important up-to-date bugfixes. The v1.87.x line will be supported for at least 12 months since v1.87.0 release
- SECURITY: upgrade base docker image (alpine) from 3.17.2 to 3.17.3. See alpine 3.17.3 release notes.
-
SECURITY: upgrade Go builder from Go1.20.2 to Go1.20.3. See the list of issues addressed in Go1.20.3.
- BUGFIX: MetricsQL: properly convert VictoriaMetrics historgram buckets to Prometheus histogram buckets when VictoriaMetrics histogram contain zero buckets. Previously these buckets were ignored, and this could lead to missing Prometheus histogram buckets after the conversion. Thanks to @zklapow for the fix.
- BUGFIX: vmagent: fix CPU and memory usage spikes when files pointed by file_sd_config cannot be re-read. See this_issue.
- BUGFIX: prevent unexpected merges on start-up when
-storage.minFreeDiskSpaceBytesis set. See the issue. - BUGFIX: properly support comma-separated filters inside retention filters. See this issue.
- BUGFIX: verify response code when fetching configuration files via HTTP. See this issue.
v1.87.4
Released at 2023-03-25
v1.87.x is a line of LTS releases (e.g. long-time support). It contains important up-to-date bugfixes. The v1.87.x line will be supported for at least 12 months since v1.87.0 release
- BUGFIX: prevent from slow snapshot creating under high data ingestion rate. See this issue.
- BUGFIX: vmauth: suppress proxy protocol parsing errors in case of
EOF. Usually, the error is caused by health checks and is not a sign of an actual error. - BUGFIX: vmbackup: fix snapshot not being deleted in case of error during backup. See this issue.
- BUGFIX: allow using dashes and dots in environment variables names referred in config files via
%{ENV-VAR.SYNTAX}. See these docs and this issue. - BUGFIX: return back query performance scalability on hosts with big number of CPU cores. The scalability has been reduced in v1.86.0. See this issue.
v1.87.3
Released at 2023-03-12
v1.87.x is a line of LTS releases (e.g. long-time support). It contains important up-to-date bugfixes. The v1.87.x line will be supported for at least 12 months since v1.87.0 release
-
SECURITY: upgrade Go builder from Go1.20.1 to Go1.20.2. See the list of issues addressed in Go1.20.2.
- BUGFIX: vmstorage: fix a bug, which could lead to incomplete or empty results for heavy queries selecting tens of thousands of time series. See this pull request.
- BUGFIX: vmselect: reduce memory usage and CPU usage when performing heavy queries. See this issue.
- BUGFIX: prevent from possible
invalid memory address or nil pointer dereferencepanic during background merge. The issue has been introduced at v1.85.0. See this issue. - BUGFIX: prevent from possible
SIGBUScrash on ARM architectures (Raspberry Pi), which deny unaligned access to 8-byte words. Thanks to @oliverpool for narrowing down the issue and for the initial attempt to fix it. - BUGFIX: VictoriaMetrics cluster: always return
is_partial: truein partial responses. Previously partial responses could be returned as non-partial in some cases. - BUGFIX: VictoriaMetrics cluster: properly take into account
-rpc.disableCompressioncommand-line flag atvmstorage. It was ignored since v1.78.0. See this pull request. - BUGFIX: vmagent: do not register
vm_promscrape_config_*metrics if-promscrape.configflag is not used. Previously those metrics were registered and never updated, which was confusing and could trigger false-positive alerts. - BUGFIX: vmctl: skip measurements with no fields when migrating data from influxdb. See this issue.
- BUGFIX: vmauth: fix
cannot serve httppanic when plain HTTP request is sent tovmauthconfigured to accept requests over proxy protocol-encoded request (e.g. whenvmauthruns with-httpListenAddr.useProxyProtocolcommand-line flag). The issue has been introduced at v1.87.0 when implementing this feature.
v1.87.2
Released at 2023-02-24
v1.87.x is a line of LTS releases (e.g. long-time support). It contains important up-to-date bugfixes. The v1.87.x line will be supported for at least 12 months since v1.87.0 release
- SECURITY: upgrade base docker image (alpine) from 3.17.1 to 3.17.2. See alpine 3.17.2 release notes.
-
SECURITY: upgrade Go builder from Go1.20.0 to Go1.20.1. See the list of issues addressed in Go1.20.1.
- BUGFIX: vmagent: immediately cancel in-flight scrape requests during configuration reload when stream parsing mode is disabled. Previously
vmagentcould wait for long time until all the in-flight requests are completed before reloading the configuration. This could significantly slow down configuration reload. See this issue. - BUGFIX: vmagent: do not wait for 2 seconds after the first unsuccessful attempt to scrape the target before performing the next attempt. This should improve scrape speed when the target closes http keep-alive connection between scrapes. See this and this issues.
- BUGFIX: vmagent: fix Azure service discovery inside Azure Container App. See this issue. Thanks to @MattiasAng for the fix!
- BUGFIX: do not put auxiliary directories scheduled for removal into snapshots. This should prevent from
cannot create hard links from ...must-remove...errors when making snapshots / backups. See this issue. - BUGFIX: prevent from possible data ingestion slowdown and query performance slowdown during background merges of big parts on systems with small number of CPU cores (1 or 2 CPU cores). The issue has been introduced in v1.85.0 when implementing this feature. See also this issue.
- BUGFIX: properly parse timestamps in milliseconds when ingesting data via OpenTSDB telnet put protocol. Previously timestamps in milliseconds were mistakenly multiplied by 1000. Thanks to @Droxenator for the pull request.
- BUGFIX: MetricsQL: do not add extrapolated points outside the real points when using interpolate() function. See this issue.
v1.87.1
Released at 2023-02-09
v1.87.x is a line of LTS releases (e.g. long-time support). It contains important up-to-date bugfixes. The v1.87.x line will be supported for at least 12 months since v1.87.0 release
- FEATURE: vmalert: alerts state restore procedure was changed to become asynchronous. It doesn't block groups start anymore which significantly improves vmalert's startup time. This also means that
-remoteRead.ignoreRestoreErrorscommand-line flag becomes deprecated now and will have no effect if configured. While previously state restore attempt was made for all the loaded alerting rules, now it is called only for alerts which became active after the first evaluation. See this issue. - FEATURE: vmui: optimize VMUI for use from smartphones and tablets. See this feature request.
- FEATURE: vmui: add ability to search tenants in the drop-down list for the tenant selector. See this feature request.
- FEATURE: vmui: add avg/min/max/last values to line legends and tooltips for graphs. See this feature request.
-
FEATURE: vmui: hide the default
per-job resource usagedashboard if there is a custom dashboard exists at the directory specified via-vmui.customDashboardsPathcommand-line flag. See this feature request. - BUGFIX: vmagent: fix panic in HashiCorp Nomad service discovery. Thanks to @mr-karan for the pull request.
- BUGFIX: vmalert: fix display of rules number per-group for groups with identical names in UI.
- BUGFIX: vmalert: prevent disabling state updates tracking per rule via setting values < 1. The minimum number of update states to track is now set to 1.
- BUGFIX: vmalert: properly update
debugandupdate_entries_limitrule's params on config's hot-reload. - BUGFIX: properly initialize the
vm_concurrent_insert_currentmetric before exposing it. Previously this metric could be left uninitialized in some cases, e.g. its value was zero. This could lead to false alerts for the queryavg_over_time(vm_concurrent_insert_current[1m]) >= vm_concurrent_insert_capacity. See this issue. - BUGFIX: vmagent: immediately cancel in-flight scrape requests during configuration reload when using stream parsing mode. Previously
vmagentcould wait for long time until all the in-flight requests are completed before reloading the configuration. This could significantly slow down configuration reload. See this issue. - BUGFIX: vmgateway: do not validate JWT signature if no public keys are provided. Previously this could result in the
error setting up jwt verificationerror.
v1.87.0
Released at 2023-02-01
v1.87.x is a line of LTS releases (e.g. long-time support). It contains important up-to-date bugfixes. The v1.87.x line will be supported for at least 12 months since v1.87.0 release
- FEATURE: stream aggregation: add the ability to de-duplicate input samples before aggregation via
-streamAggr.dedupIntervaland-remoteWrite.streamAggr.dedupIntervalcommand-line options. - FEATURE: vmui: add dark mode - it can be selected via
settingsmenu in the top right corner. See this pull request. - FEATURE: vmui: improve visual appearance of the top menu. See this feature request.
- FEATURE: vmui: embed fonts into binary instead of loading them from external sources. This allows using
vmuiin full from isolated networks without access to Internet. Thanks to @ScottKevill for the pull request. - FEATURE: vmui: add ability to switch between tenants by selecting the needed tenant in the drop-down list at the top right corner of the UI. See this pull request.
- FEATURE: vmagent: reduce memory usage when sending stale markers for targets, which expose big number of metrics. See this and this issues.
- FEATURE: vmagent: add
__meta_kubernetes_pod_container_idmeta-label to the targets discovered via kubernetes_sd_configs. This label has been added in Prometheus starting fromv2.42.0. See this feature request. - FEATURE: vmagent: add
__meta_azure_machine_sizemeta-label to the targets discovered via azure_sd_configs. This label has been added in Prometheus starting fromv2.42.0. See this pull request. - FEATURE: vmauth: allow limiting the number of concurrent requests sent to
vmauthvia-maxConcurrentRequestscommand-line flag. This allows controlling memory usage ofvmauthand the resource usage of backends behindvmauth. See this feature request. Thanks to @dmitryk-dk for the initial implementation. - FEATURE: allow using VictoriaMetrics components behind proxies, which communicate with the backend via proxy protocol. See this feature request. For example, vmauth accepts proxy protocol connections when it starts with
-httpListenAddr.useProxyProtocolcommand-line flag. - FEATURE: add
-internStringMaxLencommand-line flag, which can be used for fine-tuning RAM vs CPU usage in certain workloads. For example, if the stored time series contain long labels, then it may be useful reducing the-internStringMaxLenin order to reduce memory usage at the cost of increased CPU usage. See this issue. -
FEATURE: provide GOARCH=386 binaries for single-node VictoriaMetrics, vmagent, vmalert, vmauth, vmbackup and vmrestore components at releases page. See this feature request. Thanks to @denisgolius for the pull request.
- BUGFIX: fix a bug, which could prevent background merges for the previous partitions until restart if the storage didn't have enough disk space for final deduplication and down-sampling.
- BUGFIX: fix a bug, which could lead to increased CPU usage and disk IO usage when adding data to previous months and when the deduplication or downsampling is enabled. See this pull request.
- BUGFIX: VictoriaMetrics cluster: propagate all the timeout-related errors from
vmstoragetovmselect. Previously some timeout errors weren't returned fromvmselecttovmstorage. Instead,vmstoragecould log the error and close the connection tovmselect, sovmselectwas logging cryptic errors such ascannot execute funcName="..." on vmstorage "...": EOF. - BUGFIX: vmui: add support for time zone selection for older versions of browsers. See this pull request.
- BUGFIX: vmagent: update API version for ec2_sd_configs to fix the issue with missing
__meta_ec2_availability_zone_idattribute. - BUGFIX: vmagent: properly return
200 OKHTTP status code when importing data via Pushgateway protocol. See this issue. - BUGFIX: vmagent: do not add
exported_prefix to scraped metric names, which clash with the automatically generated metric names ifhonor_labels: trueoption is set in the scrape_config. See the this and this issues. - BUGFIX: vmauth: allow re-entering authorization info in the web browser if the entered info was incorrect. Previously it was non-trivial to do via the web browser, since
vmauthwas returning400 Bad Requestinstead of401 Unauthorizedhttp response code. - BUGFIX: vmauth: always log the client address and the requested URL on proxying errors. Previously some errors could miss this information.
- BUGFIX: vmbackup: fix snapshot not being deleted after backup completion. This issue could result in unnecessary snapshots being stored, it is required to delete unnecessary snapshots manually. See the this issue.
- BUGFIX: VictoriaMetrics cluster: fix panic on top-level vmselect nodes of multi-level setup when the
-replicationFactorflag is set and request containstracequery parameter. See this issue.
v1.86.2
Released at 2023-01-18
-
SECURITY: vmbackup: do not expose basic auth passwords from
-snapshot.createURLand-snapshot.deleteURLcommand-line flags in logs. Thanks to @toanju for the pull request. - FEATURE: vmui: add ability to show custom dashboards at vmui by specifying a path to a directory with dashboard config files via
-vmui.customDashboardsPathcommand-line flag. See this feature request and these docs. - FEATURE: vmui: apply the
stepglobally to all the displayed graphs. See this feature request. -
FEATURE: vmui: improve the appearance of graph lines by using more visually distinct colors. See this feature request.
- BUGFIX: do not slow down concurrently executed queries during assisted merges, since assisted merges already prioritize data ingestion over queries. The probability of assisted merges has been increased starting from v1.85.0 because of internal refactoring. This could result in slowed down queries when there is a plenty of free CPU resources. See this and this issues.
- BUGFIX: reduce the increased CPU usage at
vmselectto v1.85.3 level when processing heavy queries. See this issue. - BUGFIX: retention filters: fix
FATAL: cannot locate metric name for metricID=...: EOFpanic, which could occur when retention filters are enabled. - BUGFIX: vmagent: properly cancel in-flight service discovery requests for consul_sd_configs and nomad_sd_configs when the service list changes. See this issue.
- BUGFIX: vmagent: dockerswarm_sd_configs: apply
filtersonly to objects of the specifiedrole. Previously filters were applied to all the objects, which could cause errors when different types of objects were used with filters that were not compatible with them. See this issue. - BUGFIX: vmagent: suppress all the scrape errors when
-promscrape.suppressScrapeErrorsis enabled. Previously some scrape errors were logged even if-promscrape.suppressScrapeErrorsflag was set. - BUGFIX: vmagent: consistently put the scrape url with scrape target labels to all error logs for failed scrapes. Previously some failed scrapes were logged without this information.
- BUGFIX: vmagent: do not send stale markers to remote storage for series exceeding the configured series limit. See this issue.
- BUGFIX: vmagent: properly apply series limit when staleness tracking is disabled.
- BUGFIX: vmagent: reduce memory usage spikes when big number of scrape targets disappear at once. See this issue. Thanks to @lzfhust for the initial fix.
- BUGFIX: Pushgateway import: properly return
200 OKHTTP response code. See this issue. - BUGFIX: MetricsQL: properly parse
MandMisuffixes as1e6multipliers in1Mand1Minumeric constants. See this issue. The issue has been introduced in v1.86.0. - BUGFIX: vmui: properly display range query results at
Tableview. For example,up[5m]query now shows all the raw samples for the last 5 minutes for theupmetric at theTableview. See this issue.
v1.86.1
Released at 2023-01-10
- BUGFIX: return correct query results over time series with gaps. The issue has been introduced in v1.86.0.
- BUGFIX: properly take into account the timeout passed by
vmselecttovmstorageduring query execution. This issue could result in the following error logs atvmstorageunder load:cannot process vmselect request: cannot execute "search_v7": couldn't start executing the request in 0.000 seconds, since -search.maxConcurrentRequests=... concurrent requests are already executed. The issue has been introduced in v1.86.0.
v1.86.0
Released at 2023-01-10
It is recommended upgrading to VictoriaMetrics v1.86.1 because v1.86.0 contains a bug, which could lead to incorrect query results over time series with gaps.
Update note 1: This release changes the logic behind -maxConcurrentInserts command-line flag. Previously this flag was limiting the number of concurrent connections established from clients, which send data to VictoriaMetrics. Some of these connections could be temporarily idle. Such connections do not take significant CPU and memory resources, so there is no need in limiting their count. The new logic takes into account only those connections, which actively ingest new data to VictoriaMetrics and to vmagent. This means that the default -maxConcurrentInserts value should handle cases, which could require increasing the value in the previous releases. So it is recommended trying to remove the explicitly set -maxConcurrentInserts command-line flag after upgrading to this release and verifying whether this reduces CPU and memory usage.
Update note 2: The vm_concurrent_addrows_current and vm_concurrent_addrows_capacity metrics exported by vmstorage are replaced with vm_concurrent_insert_current and vm_concurrent_insert_capacity metrics in order to be consistent with the corresponding metrics exported by vminsert. Please update queries in dahsboards and alerting rules with new metric names if old metric names are used there.
- FEATURE: vmagent: add support for aggregation of incoming samples by time and by labels. See these docs and this feature request.
- FEATURE: vmagent: reduce memory usage when scraping big number of targets without the need to enable stream parsing mode.
- FEATURE: vmagent: add support for Prometheus-compatible target discovery for HashiCorp Nomad services via nomad_sd_configs. See this feature request. Thanks to @mr-karan for the implementation.
- FEATURE: vmagent: automatically pre-fetch
metric_relabel_configsand the target labels when clicking on thedebug metrics relabelinglink at thehttp://vmagent:8429/targetspage at the particular target. See these docs. - FEATURE: vmui: add ability to explore metrics exported by a particular
job/instance. See these docs and this feature request. - FEATURE: allow passing partial
RFC3339date/time totime,startandendquery args at querying APIs and export APIs. For example,2022is equivalent to2022-01-01T00:00:00Z, while2022-01-30T14is equivalent to2022-01-30T14:00:00Z. See these docs. - FEATURE: MetricsQL: allow using unicode letters in identifiers. For example,
температура{город="Киев"}is a valid MetricsQL expression now. Previously every non-ascii letters should be escaped with\char when used inside MetricsQL expression:\т\е\м\п\е\р\а\т\у\р\а{\г\о\р\о\д="Киев"}. Now both expressions are equivalent. Thanks to @hzwwww for the pull request. - FEATURE: relabeling: add support for
keepequalanddropequalrelabeling actions, which are supported by Prometheus starting from v2.41.0. These relabeling actions are almost identical tokeep_if_equalanddrop_if_equalrelabeling actions supported by VictoriaMetrics sincev1.38.0- see these docs - so it is recommended sticking tokeep_if_equalanddrop_if_equalactions instead of switching tokeepequalanddropequal. - FEATURE: csvimport: support empty values for imported metrics. See this issue.
- FEATURE: vmalert: allow configuring the default number of stored rule's update states in memory via global
-rule.updateEntriesLimitcommand-line flag or per-rule via rule'supdate_entries_limitconfiguration param. See these docs and this pull request. - FEATURE: improve the logic benhind
-maxConcurrentInsertscommand-line flag. Previously this flag was limiting the number of concurrent connections from clients, which write data to VictoriaMetrics or vmagent. Some of these connections could be idle for some time. These connections do not need significant amounts of CPU and memory, so there is no sense in limiting their count. The updated logic behind-maxConcurrentInsertslimits the number of active insert requests, not counting idle connections. - FEATURE: protect all the http endpoints with
-httpAuth.*command-line flag. Previously endpoints protected by-*AuthKeycommand-line flags weren't protected by-httpAuth.*. This could complicate the proper security setup. See this issue. - FEATURE: VictoriaMetrics cluster: add
-maxConcurrentInsertsand-insert.maxQueueDurationcommand-line flags tovmstorage, so they could be tuned if needed in the same way as atvminsertnodes. - FEATURE: VictoriaMetrics cluster: limit the number of concurrently executed requests at
vmstorageproportionally to the number of available CPU cores, since every request can saturate a single CPU core atvmstorage. Previously a singlevmstoragecould accept and start processing arbitrary number of concurrent requests received from big number ofvmselectnodes. This could result in increased RAM, CPU and disk IO usage or event to out of memory crash atvmstorageside under high load. The limit can be fine-tuned if needed via-search.maxConcurrentRequestscommand-line flag atvmstorageaccording to these docs.vmstoragenow exposes the following additional metrics athttp://vmstorage:8482/metricspage:vm_vmselect_concurrent_requests_capacity- the maximum number of requests allowed to execute concurrentlyvm_vmselect_concurrent_requests_current- the current number of concurrently executed requestsvm_vmselect_concurrent_requests_limit_reached_total- the total number of requests, which were put in the wait queue when-search.maxConcurrentRequestsconcurrent requests are being executedvm_vmselect_concurrent_requests_limit_timeout_total- the total number of canceled requests because they were sitting in the wait queue for more than-search.maxQueueDuration
- BUGFIX: vmui: properly update the
stepvalue in url after thestepinput field has been manually changed. This allows preserving the properstepwhen copy-n-pasting the url to another instance of web browser. See this issue. - BUGFIX: vmui: properly update tooltip when quickly hovering multiple lines on the graph. See this issue.
- BUGFIX: properly parse floating-point numbers without integer or fractional parts such as
.123and20.during data import. See this issue. - BUGFIX: MetricsQL: properly parse durations with uppercase suffixes such as
10S,5MS,1W, etc. See this issue. - BUGFIX: vmagent: fix a panic during target discovery when
vmagentruns with-promscrape.dropOriginalLabelscommand-line flag. See this issue. The bug has been introduced in v1.85.0. - BUGFIX: vmagent: dockerswarm_sd_configs: properly encode
filtersfield. See this issue. - BUGFIX: vmagent: fix possible resource leak after hot reload of the updated consul_sd_configs. See this issue.
- BUGFIX: vmagent: fix a panic in gce_sd_configs when the discovered instance has zero labels. See this issue. The issue has been introduced in v1.85.0.
- BUGFIX: properly return label names starting from uppercase such as
CamelCaseLabelfrom /api/v1/labels. See this issue. - BUGFIX: fix
opentsdbHTTP endpoint not respecting-httpAuth.*flags. See this issue - BUGFIX: consistently select the sample with the biggest value out of samples with identical timestamps during querying when the deduplication is enabled according to this feature request. Previously random samples could be selected during querying.
v1.85.3
See changes here
v1.85.2
See changes here
v1.85.1
See changes here
v1.85.0
See changes here
v1.84.0
See changes here
v1.83.1
See changes here
v1.83.0
See changes here
v1.82.1
See changes here
v1.82.0
See changes here
v1.81.2
See changes here
v1.81.1
See changes here
v1.81.0
See changes here
v1.80.0
See changes here
v1.79.14
See changes here
v1.79.13
See changes here
v1.79.12
See changes here
v1.79.11
See changes here
v1.79.10
See changes here
v1.79.9
See changes here
v1.79.8
See changes here
v1.79.7
See changes here
v1.79.6
See changes here
v1.79.5
See changes here
v1.79.4
See changes here
v1.79.3
See changes here
v1.79.2
See changes here
v1.79.1
See changes here
v1.79.0
See changes here
v1.78.1
See changes here
v1.78.0
See changes here
v1.77.2
See changes here
v1.77.1
See changes here
v1.77.0
See changes here
v1.76.1
See changes here
v1.76.0
See changes here
v1.75.1
See changes here
v1.75.0
See changes here
v1.74.0
See changes here
v1.73.1
See changes here
v1.73.0
See changes here
v1.72.0
See changes here
v1.71.0
See changes here
v1.70.0
See changes here
v1.69.0
See changes here
v1.68.0
See changes here
v1.67.0
See changes here
v1.66.2
See changes here
v1.66.1
See changes here
v1.66.0
See changes here
v1.65.0
See changes here
v1.64.1
See changes here
v1.64.0
See changes here
v1.63.0
See changes here
v1.62.0
See changes here
v1.61.1
See changes here
v1.61.0
See changes here
v1.60.0
See changes here
v1.59.0
See changes here
v1.58.0
See changes here
v1.57.1
See changes here
v1.57.0
See changes here
v1.56.0
See changes here
v1.55.1
See changes here
v1.55.0
See changes here
v1.54.1
See changes here
v1.54.0
See changes here
v1.53.1
See changes here
v1.53.0
See changes here
v1.52.0
See changes here
v1.51.0
See changes here
v1.50.2
See changes here
v1.50.1
See changes here
v1.50.0
See changes here
v1.49.0
See changes here
v1.48.0
See changes here
v1.47.0
See changes here
v1.46.0
See changes here
v1.45.0
See changes here
v1.44.0
See changes here
v1.43.0
See changes here
v1.42.0
See changes here
Previous releases
See releases page.