Published by bwplotka over 5 years ago
⚠️ IMPORTANT ⚠️ This is the last release that supports gossip. From Thanos v0.5.0, gossip will be completely removed.
Major improvements:
`--store.unhealthy-timeout=5m` flag).
⚠️ WARNING ⚠️ #798 adds a new default limit to Thanos Store: `--store.grpc.series-max-concurrency`. Most likely you will want to make it the same as `--query.max-concurrent` on Thanos Query.
New options:
New Store flags:
* `--store.grpc.series-sample-limit` limits the number of samples that might be retrieved on a single Series() call. By default it is 0 (no limit). Consider enabling it by setting it to more than 0 if you are running on limited resources.
* `--store.grpc.series-max-concurrency` limits the number of concurrent Series() calls in Thanos Store. By default it is 20. Consider making it lower or higher depending on the scale of your deployment.
New Store metrics:
* `thanos_bucket_store_queries_dropped_total` shows how many queries were dropped due to the samples limit;
* `thanos_bucket_store_queries_concurrent_max` is a constant metric which shows how many Series() calls can concurrently be executed by Thanos Store;
* `thanos_bucket_store_queries_in_flight` shows how many queries are currently "in flight" i.e. they are being executed;
* `thanos_bucket_store_gate_duration_seconds` shows how many seconds it took for queries to pass through the gate in both cases - when that fails and when it does not.
New Store tracing span:
* `store_query_gate_ismyturn` shows how long it took for a query to pass (or not) through the gate.
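The two Store limits above can be pictured as a gate in front of Series(). The following is only an illustrative Python sketch, not the Thanos implementation (which is written in Go): the class `SeriesGate`, the exception `SampleLimitExceeded`, and the idea of passing per-chunk sample counts are all assumptions made for the example. The counters loosely mirror what `thanos_bucket_store_queries_in_flight` and `thanos_bucket_store_queries_dropped_total` report.

```python
import threading


class SampleLimitExceeded(Exception):
    """Raised when a single call would return more samples than allowed."""


class SeriesGate:
    """Hypothetical gate: caps concurrent calls and samples per call."""

    def __init__(self, max_concurrency=20, sample_limit=0):
        # Default values follow the release notes: 20 concurrent calls,
        # and a sample limit of 0, which means "no limit".
        self._sem = threading.BoundedSemaphore(max_concurrency)
        self._lock = threading.Lock()
        self.max_concurrency = max_concurrency
        self.sample_limit = sample_limit
        self.in_flight = 0
        self.dropped_total = 0

    def run(self, chunk_sample_counts):
        """Consume per-chunk sample counts; return the total number of samples."""
        with self._sem:  # blocks while max_concurrency calls are already running
            with self._lock:
                self.in_flight += 1
            try:
                total = 0
                for count in chunk_sample_counts:
                    total += count
                    if self.sample_limit and total > self.sample_limit:
                        with self._lock:
                            self.dropped_total += 1
                        raise SampleLimitExceeded(
                            f"{total} samples exceeds limit {self.sample_limit}"
                        )
                return total
            finally:
                with self._lock:
                    self.in_flight -= 1
```

In this sketch a query that exceeds the sample limit is dropped as a whole rather than truncated, which is the behaviour the "queries dropped" metric name suggests.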
#1016 Added option for another DNS resolver (miekg/dns client).
Note that this is required to have SRV resolution working on Golang 1.11+ with KubeDNS below v1.14.
New Querier and Ruler flag: `--store.sd-dns-resolver`, which allows you to specify the resolver to use: either `golang` or `miekgdns`.
#986 Saves some startup and sync time in the Store Gateway, as it no longer needs to compute the index cache from the block index on its own for larger blocks.
The Store Gateway can still do it, but it first checks the bucket to see whether an index cache file has already been uploaded.
At the same time, the compactor precomputes the index cache file on every compaction.
New Compactor flag: `--index.generate-missing-cache-file` was added to allow quicker addition of index cache files. If enabled, it precomputes missing files on compactor startup. Note that it will take time and it's only a one-off step per bucket.
#887 Compact: Added new `--block-sync-concurrency` flag, which allows you to configure the number of goroutines to use when syncing block metadata from object storage.
#928 Query: Added `--store.response-timeout` flag. If a Store doesn't send any data within the specified duration, that Store will be ignored and partial data will be returned if partial response is enabled. 0 disables the timeout.
#893 S3 storage backend has graduated to `stable` maturity level.
#936 Azure storage backend has graduated to `stable` maturity level.
#937 S3: added trace functionality. You can add `trace.enable: true` to enable the minio client's verbose logging.
#953 Compact: now has a hidden flag `--debug.accept-malformed-index`. Compaction index verification will ignore out of order label names.
#963 GCS: added possibility to inline ServiceAccount into GCS config.
#1010 Compact: added new flag `--compact.concurrency`. Number of goroutines to use when compacting groups.
#1028 Query: added `--query.default-evaluation-interval`, which sets the default evaluation interval for sub queries.
#980 Ability to override Azure storage endpoint for other regions (China)
#1021 Query API `series` now supports POST method.
#939 Query API `query_range` now supports POST method.
#970 Deprecated `partial_response_disabled` proto field. Added `partial_response_strategy` instead. Both in gRPC and Query API.
No `PartialResponseStrategy` field for `RuleGroups` by default means `abort` strategy (old PartialResponse disabled), as this is the recommended option for Rules and alerts.
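To make the #970 change concrete, here is a small Python sketch of how the deprecated boolean could map onto the new strategy field. It is an illustration only: the function names are hypothetical, and the `warn` value for the non-disabled case is an assumption (the notes above only state that a disabled partial response corresponds to `abort`).

```python
from enum import Enum
from typing import Optional


class PartialResponseStrategy(Enum):
    WARN = "warn"    # assumed name for "return partial data with a warning"
    ABORT = "abort"  # fail the request instead of returning partial data


def strategy_from_legacy(partial_response_disabled: bool) -> PartialResponseStrategy:
    """Translate the deprecated partial_response_disabled flag (hypothetical helper)."""
    if partial_response_disabled:
        return PartialResponseStrategy.ABORT
    return PartialResponseStrategy.WARN


def rule_group_strategy(configured: Optional[str]) -> PartialResponseStrategy:
    """Resolve a rule group's strategy; a missing field means abort, per the notes."""
    if configured is None:
        return PartialResponseStrategy.ABORT
    return PartialResponseStrategy(configured.lower())
```

For example, `rule_group_strategy(None)` resolves to the abort strategy, matching the default described for `RuleGroups`.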
Metrics:
`thanos_rule_evaluation_with_warnings_total` to Ruler.
`thanos_ruler_query_apis*` are now `thanos_ruler_query_apis_*` for consistency.
`thanos_querier_store_apis*` are now `thanos_querier_store_apis__*` for consistency.
`thanos_bucket_store_series*` are now `thanos_bucket_store_series_*` for consistency.
`strategy` label.
Ruler tracing spans:
`/rule_instant_query HTTP[client]` is now `/rule_instant_query_part_resp_abort HTTP[client]` if request is for abort strategy.
#1009: Upgraded Prometheus (~v2.7.0-rc.0 to v2.8.1) and TSDB (`v0.4.0` to `v0.6.1`) deps.
Changes that affect Thanos:
`--sync-delay` to `--consistency-delay` #1053
For ruler, essentially the whole TSDB CHANGELOG applies between v0.4.0 and v0.6.1: https://github.com/prometheus/tsdb/blob/master/CHANGELOG.md
Note that this was added on TSDB and Prometheus: [FEATURE] Time-overlapping blocks are now allowed. #370
However, due to the nature of Thanos compaction (distributed systems), this is disabled for the Thanos compactor for now, for safety reasons.
#868 Go has been updated to 1.12.
#1055 Gossip flags are now disabled by default and deprecated.
#964 repair: Repair process now sorts the series and labels within block.
#1073 Store: index cache for requests. It now calculates the size properly (includes slice header), has an anti-deadlock safeguard and reports more metrics.
`thanos_objstore_bucket_last_successful_upload_time` now does not appear when no blocks have been uploaded so far.
0s
See the full changelog here
Published by bwplotka over 5 years ago
Published by bwplotka over 5 years ago
Published by improbable-ludwik over 5 years ago
Published by domgreen over 5 years ago
`bucket inspect` command for better insights on blocks in object storage.
`--web.route-prefix`, `--web.external-prefix`, `--web.prefix-header`. Details here
`count_values` PromQL function. #4585
`block-sync-concurrency` with `20` default, so no change by default)
`put_user_metadata` option to config.
`insecure_skip_verify` option to config.
Published by bwplotka almost 6 years ago
Xmas patch releasing 2 critical fixes (Azure, DNS SD) and an awesome new store UI page.
This also includes the first mitigation for https://github.com/improbable-eng/thanos/issues/335
Changelog also available here.
`/stores`.
Published by bwplotka almost 6 years ago
The next Thanos release, adding support for a new discovery method, gRPC mTLS and two new object store providers (Swift and Azure).
Note that there are many necessary breaking changes in flags that relate to bucket configuration.
Changelog also available here.
`thanos_objstore_gcs_bucket_operations_total` in favor of generic bucket operation metrics.
`thanos_` prefix to memberlist (gossip) metrics. Make sure to update your dashboards and rules.
`"X-Amz-Acl": "bucket-owner-full-control"` metadata for s3 upload operation.
`--objstore.config-file` to reference the bucket configuration file in yaml format. Detailed information can be found in document storage.
`thanos rule`, static configuration of query nodes via `--query`
`thanos rule`, file based discovery of query nodes using `--query.file-sd-config.files`
`thanos query`, file based discovery of store nodes using `--store.file-sd-config.files`
`/-/healthy` endpoint to Querier.
`dns+` and `dnssrv+` prefixes for the respective lookup. Details here
`--cluster.disable` flag to disable gossip functionality completely.
`thanos_rule_loaded_rules` metric.
`thanos_compactor_retries_total` metric not being registered.
Published by bwplotka about 6 years ago
First Thanos minor release.
This is still not a major version, so backward compatibility is NOT guaranteed. See: https://semver.org/#spec-item-4.
See changelog for changes.
The major changes in comparison to v0.1.0-rc.2:
`tsdb.path` flag to `data-dir`
`model.Duration`). Example change: `1m0s` won't work, while `1m` will work.
Published by bwplotka over 6 years ago
See changelog for changes.
The major changes in comparison to v0.1.0-rc.1:
Published by bwplotka over 6 years ago
A release candidate for `v0.1.0`. See CHANGELOG.md for details.
Published by bwplotka over 6 years ago
A release candidate for `v0.1.0`. See CHANGELOG.md for details.