Skip to content

feat: PartitionExtrema + SortExec observer + BoundedWindowAggExec passthrough#23090

Open
avantgardnerio wants to merge 3 commits into
apache:mainfrom
coralogix:brent/partition-extrema
Open

feat: PartitionExtrema + SortExec observer + BoundedWindowAggExec passthrough#23090
avantgardnerio wants to merge 3 commits into
apache:mainfrom
coralogix:brent/partition-extrema

Conversation

@avantgardnerio

@avantgardnerio avantgardnerio commented Jun 22, 2026

Copy link
Copy Markdown
Contributor

Which issue does this PR close?

Implements the proposal in #23089.

Rationale for this change

See #23089 for the full design rationale, including the dual-semantics motivation, alternatives considered, and coexistence with Partitioning::Range. Short version: range-aware operators (parallel window functions, future dynamic-range repartitioning, range-elimination optimizations) want to ask an ExecutionPlan for the lex-min / lex-max of a partition's output along its declared ordering. Today there's no way to ask.

What changes are included in this PR?

Pure addition; zero behavior change in any path that doesn't call the new method.

  • ExecutionPlan::runtime_partition_extrema(&self, partition) -> Result<Option<PartitionExtrema>> — default Ok(None).
  • PartitionExtrema { kind, min, max, row_count } and enum ExtremaKind { Observed, Expanded }. Observed (the only kind any operator in this PR returns) means the reported range literally bounds the partition's data. Expanded is reserved for future operators that deliberately route rows outside the reported range as a "halo" for a downstream filter to strip. The dual semantics live on the enum so passthroughs that don't care don't have to match.
  • SortExec override: a per-partition slot is populated inside the sort code path (each sort_batch_chunked call folds first/last sorted rows into the slot, zero-copy via RecordBatch::slice). Once execution has consumed the input, the slot holds the lex-min / lex-max along the declared ordering.
  • BoundedWindowAggExec override: passthrough. BWAG extends its input's equivalence properties and appends new window-result columns on the right of the schema, so the leading sort exprs remain stable in the output along the same column indices.

Are these changes tested?

7 unit tests in datafusion/physical-plan/src/sorts/sort.rs::tests:

  • test_runtime_partition_extrema_before_execute_is_none — caller contract: reading without a poll returns Ok(None).
  • test_runtime_partition_extrema_after_full_sort — two batches, in-memory merge path; extrema match expected lex-min / lex-max with kind = Observed.
  • test_runtime_partition_extrema_descending_swaps_min_max — DESC sort: min is the largest value, max is the smallest.
  • test_runtime_partition_extrema_per_partition — two input partitions with preserve_partitioning=true: each output partition's extrema track its own range.
  • test_runtime_partition_extrema_default_is_none — default trait impl returns Ok(None) on a non-overriding operator (EmptyExec).
  • Plus two more under earlier commits, covering the chunk-fold path.

Are there any user-facing changes?

  • New public types in datafusion::physical_plan: PartitionExtrema, ExtremaKind.
  • New trait method ExecutionPlan::runtime_partition_extrema with a default Ok(None). Existing custom ExecutionPlan implementations are not required to change.
  • No SQL surface changes.

…ded"

- `runtime_partition_extrema`: replace the "observed-vs-intended" pointer
  with a one-sentence handoff to `ExtremaKind::Observed` /
  `ExtremaKind::Expanded`. The discriminant is now the source of truth
  for which interpretation applies.
- `ExtremaKind::Expanded`: rename "intended primary range" → "primary
  range". The earlier doc carried "intended" as a leftover from the
  observed-vs-intended framing the enum replaces; with the variant
  named `Expanded`, "intended" only adds an extra synonym to map.
BWAG processes its input in order and appends new window-result
columns on the right of the existing schema. Its equivalence
properties are built by extending the input's, leaving the input's
leading sort expressions in place along the same column indices in
the output. So along the output's declared ordering, the partition
range observed (or expanded) by the upstream is the same range we
emit — a clean passthrough.

Skipped: CoalesceBatchesExec. The operator is deprecated since 52.0.0
("we now use BatchCoalescer from arrow-rs instead of a dedicated
operator"); coalescing now happens inside other operators' streams,
so there's no dedicated plan node to override.
@alamb

alamb commented Jun 22, 2026

Copy link
Copy Markdown
Contributor

I left comments in the linked tickets like #23093 / #23094

@Dandandan

Copy link
Copy Markdown
Contributor

run benchmarks

@Dandandan

Copy link
Copy Markdown
Contributor

run benchmark sort_tpch10 tpch10

(just making sure to run some benchmarks(

@adriangbot

Copy link
Copy Markdown

Hi @Dandandan, your benchmark configuration could not be parsed (#23090 (comment)).

Error: invalid configuration: invalid type: string "(just making sure to run some benchmarks(", expected struct CommentConfig at line 2 column 1

Usage:

run benchmark <name>           # run specific benchmark(s)
run benchmarks                 # run default suite
run benchmarks <name1> <name2> # run specific benchmarks

Any benchmark name is accepted: bench.sh suite names (e.g. tpch, clickbench_partitioned, wide_schema) and Criterion bench targets (e.g. sql_planner) are resolved automatically. A name that matches neither fails on the runner.

Per-side configuration (run benchmark tpch followed by):

env:
# shared env is inherited by BOTH the build and the run, so build
# flags go here. Builds default to no debuginfo for speed; opt back
# in for hung-job gdb dumps and cap jobs to stay within memory:
CARGO_PROFILE_RELEASE_DEBUG: "1"
CARGO_BUILD_JOBS: "1"
baseline:
ref: v45.0.0
env:
# per-side env only reaches the benchmark run, not the build
DATAFUSION_RUNTIME_MEMORY_LIMIT: 1G
changed:
ref: v46.0.0
env:
DATAFUSION_RUNTIME_MEMORY_LIMIT: 2G

File an issue against this benchmark runner

@Dandandan

Copy link
Copy Markdown
Contributor

run benchmark sort_tpch10 tpch10

@adriangbot

Copy link
Copy Markdown

🤖 Benchmark running (GKE) | trigger
Instance: c4a-highmem-16 (12 vCPU / 65 GiB) | Linux bench-c4855406731-773-hc6c6 6.12.85+ #1 SMP Mon May 11 08:17:35 UTC 2026 aarch64 GNU/Linux

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Comparing brent/partition-extrema (2cc5bd1) to f9c1e9e (merge-base) diff using: tpch10
Results will be posted here when complete


File an issue against this benchmark runner

@adriangbot

Copy link
Copy Markdown

🤖 Benchmark running (GKE) | trigger
Instance: c4a-highmem-16 (12 vCPU / 65 GiB) | Linux bench-c4855401518-771-2689j 6.12.85+ #1 SMP Mon May 11 08:17:35 UTC 2026 aarch64 GNU/Linux

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Comparing brent/partition-extrema (2cc5bd1) to f9c1e9e (merge-base) diff using: tpch
Results will be posted here when complete


File an issue against this benchmark runner

@adriangbot

Copy link
Copy Markdown

🤖 Benchmark running (GKE) | trigger
Instance: c4a-highmem-16 (12 vCPU / 65 GiB) | Linux bench-c4855406731-772-khm8t 6.12.85+ #1 SMP Mon May 11 08:17:35 UTC 2026 aarch64 GNU/Linux

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Comparing brent/partition-extrema (2cc5bd1) to f9c1e9e (merge-base) diff using: sort_tpch10
Results will be posted here when complete


File an issue against this benchmark runner

@adriangbot

Copy link
Copy Markdown

🤖 Benchmark running (GKE) | trigger
Instance: c4a-highmem-16 (12 vCPU / 65 GiB) | Linux bench-c4855401518-769-pdcmv 6.12.85+ #1 SMP Mon May 11 08:17:35 UTC 2026 aarch64 GNU/Linux

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Comparing brent/partition-extrema (2cc5bd1) to f9c1e9e (merge-base) diff using: clickbench_partitioned
Results will be posted here when complete


File an issue against this benchmark runner

@adriangbot

Copy link
Copy Markdown

🤖 Benchmark running (GKE) | trigger
Instance: c4a-highmem-16 (12 vCPU / 65 GiB) | Linux bench-c4855401518-770-5rwhx 6.12.85+ #1 SMP Mon May 11 08:17:35 UTC 2026 aarch64 GNU/Linux

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected

Comparing brent/partition-extrema (2cc5bd1) to f9c1e9e (merge-base) diff using: tpcds
Results will be posted here when complete


File an issue against this benchmark runner

@adriangbot

Copy link
Copy Markdown

🤖 Benchmark completed (GKE) | trigger

Instance: c4a-highmem-16 (12 vCPU / 65 GiB)

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected
Details

Comparing HEAD and brent_partition-extrema
--------------------
Benchmark tpch_sf1.json
--------------------
┏━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━┓
┃ Query     ┃                           HEAD ┃        brent_partition-extrema ┃       Change ┃
┡━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━┩
│ QQuery 1  │ 38.24 / 40.05 ±2.14 / 43.74 ms │ 37.96 / 39.27 ±1.39 / 41.24 ms │    no change │
│ QQuery 2  │ 19.15 / 19.67 ±0.60 / 20.81 ms │ 19.03 / 19.20 ±0.15 / 19.40 ms │    no change │
│ QQuery 3  │ 32.90 / 33.50 ±0.49 / 34.26 ms │ 32.86 / 34.06 ±1.36 / 36.71 ms │    no change │
│ QQuery 4  │ 17.31 / 17.74 ±0.66 / 19.06 ms │ 17.20 / 17.33 ±0.14 / 17.59 ms │    no change │
│ QQuery 5  │ 37.13 / 38.00 ±0.88 / 39.64 ms │ 38.33 / 40.37 ±1.75 / 43.00 ms │ 1.06x slower │
│ QQuery 6  │ 16.03 / 16.28 ±0.25 / 16.74 ms │ 16.27 / 16.72 ±0.77 / 18.26 ms │    no change │
│ QQuery 7  │ 44.10 / 46.05 ±1.20 / 47.83 ms │ 44.60 / 46.03 ±1.28 / 48.26 ms │    no change │
│ QQuery 8  │ 42.99 / 43.08 ±0.07 / 43.18 ms │ 42.80 / 43.06 ±0.16 / 43.25 ms │    no change │
│ QQuery 9  │ 49.25 / 50.03 ±0.59 / 50.74 ms │ 49.93 / 51.16 ±0.96 / 52.47 ms │    no change │
│ QQuery 10 │ 41.99 / 42.33 ±0.23 / 42.61 ms │ 42.16 / 42.51 ±0.44 / 43.37 ms │    no change │
│ QQuery 11 │ 13.21 / 13.34 ±0.20 / 13.74 ms │ 13.03 / 13.33 ±0.32 / 13.89 ms │    no change │
│ QQuery 12 │ 23.75 / 24.23 ±0.31 / 24.70 ms │ 23.48 / 24.35 ±1.10 / 26.47 ms │    no change │
│ QQuery 13 │ 32.52 / 34.15 ±1.06 / 35.42 ms │ 32.73 / 33.94 ±1.01 / 35.16 ms │    no change │
│ QQuery 14 │ 23.66 / 23.83 ±0.10 / 23.96 ms │ 23.58 / 23.78 ±0.11 / 23.88 ms │    no change │
│ QQuery 15 │ 31.10 / 32.85 ±1.63 / 35.67 ms │ 31.31 / 31.66 ±0.51 / 32.66 ms │    no change │
│ QQuery 16 │ 13.88 / 14.02 ±0.18 / 14.36 ms │ 14.00 / 14.17 ±0.10 / 14.31 ms │    no change │
│ QQuery 17 │ 73.08 / 74.18 ±0.87 / 75.59 ms │ 71.85 / 72.40 ±0.44 / 73.09 ms │    no change │
│ QQuery 18 │ 59.00 / 60.47 ±1.64 / 63.62 ms │ 58.16 / 60.55 ±2.31 / 64.79 ms │    no change │
│ QQuery 19 │ 33.31 / 34.12 ±0.78 / 35.36 ms │ 33.26 / 33.60 ±0.39 / 34.32 ms │    no change │
│ QQuery 20 │ 31.90 / 32.41 ±0.36 / 32.89 ms │ 32.03 / 32.40 ±0.39 / 33.12 ms │    no change │
│ QQuery 21 │ 53.08 / 54.81 ±1.02 / 56.08 ms │ 54.63 / 56.46 ±1.51 / 58.58 ms │    no change │
│ QQuery 22 │ 13.92 / 14.08 ±0.14 / 14.25 ms │ 13.78 / 14.01 ±0.20 / 14.35 ms │    no change │
└───────────┴────────────────────────────────┴────────────────────────────────┴──────────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━┓
┃ Benchmark Summary                      ┃          ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━┩
│ Total Time (HEAD)                      │ 759.24ms │
│ Total Time (brent_partition-extrema)   │ 760.36ms │
│ Average Time (HEAD)                    │  34.51ms │
│ Average Time (brent_partition-extrema) │  34.56ms │
│ Queries Faster                         │        0 │
│ Queries Slower                         │        1 │
│ Queries with No Change                 │       21 │
│ Queries with Failure                   │        0 │
└────────────────────────────────────────┴──────────┘

Resource Usage

tpch — base (merge-base)

Metric Value
Wall time 5.0s
Peak memory 1.2 GiB
Avg memory 513.1 MiB
CPU user 21.7s
CPU sys 1.7s
Peak spill 0 B

tpch — branch

Metric Value
Wall time 5.0s
Peak memory 1.2 GiB
Avg memory 509.6 MiB
CPU user 21.9s
CPU sys 1.7s
Peak spill 0 B

File an issue against this benchmark runner

@adriangbot

Copy link
Copy Markdown

🤖 Benchmark completed (GKE) | trigger

Instance: c4a-highmem-16 (12 vCPU / 65 GiB)

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected
Details

Comparing HEAD and brent_partition-extrema
--------------------
Benchmark tpch_sf10.json
--------------------
┏━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━┓
┃ Query     ┃                               HEAD ┃            brent_partition-extrema ┃       Change ┃
┡━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━┩
│ QQuery 1  │  313.37 / 315.52 ±2.18 / 319.29 ms │  310.70 / 314.95 ±2.98 / 319.88 ms │    no change │
│ QQuery 2  │  104.44 / 105.80 ±1.13 / 107.63 ms │  103.13 / 106.46 ±2.12 / 108.65 ms │    no change │
│ QQuery 3  │  235.52 / 240.09 ±3.00 / 244.55 ms │  233.31 / 238.81 ±3.98 / 243.50 ms │    no change │
│ QQuery 4  │  114.00 / 117.02 ±2.60 / 121.51 ms │  114.07 / 116.15 ±1.43 / 118.41 ms │    no change │
│ QQuery 5  │  349.58 / 358.24 ±6.04 / 365.21 ms │  353.59 / 362.41 ±6.89 / 371.71 ms │    no change │
│ QQuery 6  │  124.53 / 126.94 ±3.44 / 133.73 ms │  125.71 / 128.22 ±3.76 / 135.67 ms │    no change │
│ QQuery 7  │  465.24 / 473.14 ±5.35 / 480.77 ms │  471.33 / 477.87 ±4.06 / 482.24 ms │    no change │
│ QQuery 8  │  381.76 / 390.96 ±7.65 / 404.81 ms │  383.49 / 388.17 ±2.84 / 391.21 ms │    no change │
│ QQuery 9  │  555.16 / 562.37 ±3.93 / 566.32 ms │ 566.23 / 582.98 ±13.21 / 603.62 ms │    no change │
│ QQuery 10 │ 298.19 / 312.59 ±15.57 / 342.54 ms │  300.94 / 308.50 ±5.60 / 316.89 ms │    no change │
│ QQuery 11 │     88.95 / 92.83 ±4.05 / 99.64 ms │    85.84 / 90.49 ±7.07 / 104.52 ms │    no change │
│ QQuery 12 │  177.89 / 178.84 ±0.93 / 180.32 ms │  181.13 / 189.52 ±8.72 / 204.43 ms │ 1.06x slower │
│ QQuery 13 │  286.97 / 297.28 ±7.31 / 308.07 ms │ 291.05 / 302.96 ±10.19 / 318.12 ms │    no change │
│ QQuery 14 │  174.46 / 177.32 ±3.57 / 183.72 ms │  173.59 / 177.25 ±3.34 / 182.90 ms │    no change │
│ QQuery 15 │  305.75 / 308.85 ±2.58 / 312.32 ms │  306.49 / 312.66 ±3.88 / 317.35 ms │    no change │
│ QQuery 16 │     64.89 / 67.94 ±1.82 / 69.47 ms │     65.35 / 66.67 ±1.49 / 69.23 ms │    no change │
│ QQuery 17 │ 632.45 / 642.25 ±11.26 / 657.45 ms │  646.59 / 651.05 ±5.56 / 661.94 ms │    no change │
│ QQuery 18 │  693.20 / 708.83 ±9.56 / 719.66 ms │ 679.70 / 707.39 ±19.34 / 737.56 ms │    no change │
│ QQuery 19 │ 250.06 / 262.26 ±12.80 / 279.13 ms │ 248.27 / 265.85 ±13.59 / 282.73 ms │    no change │
│ QQuery 20 │ 279.26 / 294.62 ±12.89 / 318.24 ms │  279.16 / 290.99 ±8.52 / 301.71 ms │    no change │
│ QQuery 21 │  657.59 / 668.46 ±5.94 / 673.86 ms │  664.91 / 667.64 ±3.16 / 673.52 ms │    no change │
│ QQuery 22 │     60.55 / 63.51 ±2.47 / 67.04 ms │     60.85 / 64.37 ±4.35 / 72.72 ms │    no change │
└───────────┴────────────────────────────────────┴────────────────────────────────────┴──────────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃ Benchmark Summary                      ┃           ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ Total Time (HEAD)                      │ 6765.63ms │
│ Total Time (brent_partition-extrema)   │ 6811.35ms │
│ Average Time (HEAD)                    │  307.53ms │
│ Average Time (brent_partition-extrema) │  309.61ms │
│ Queries Faster                         │         0 │
│ Queries Slower                         │         1 │
│ Queries with No Change                 │        21 │
│ Queries with Failure                   │         0 │
└────────────────────────────────────────┴───────────┘

Resource Usage

tpch10 — base (merge-base)

Metric Value
Wall time 35.0s
Peak memory 4.6 GiB
Avg memory 1.4 GiB
CPU user 344.4s
CPU sys 19.8s
Peak spill 0 B

tpch10 — branch

Metric Value
Wall time 35.0s
Peak memory 5.0 GiB
Avg memory 1.5 GiB
CPU user 344.5s
CPU sys 20.3s
Peak spill 0 B

File an issue against this benchmark runner

@adriangbot

Copy link
Copy Markdown

🤖 Benchmark completed (GKE) | trigger

Instance: c4a-highmem-16 (12 vCPU / 65 GiB)

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected
Details

Comparing HEAD and brent_partition-extrema
--------------------
Benchmark tpcds_sf1.json
--------------------
┏━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━┓
┃ Query     ┃                                  HEAD ┃               brent_partition-extrema ┃       Change ┃
┡━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━┩
│ QQuery 1  │           5.86 / 6.34 ±0.83 / 8.01 ms │           5.63 / 6.25 ±0.89 / 7.99 ms │    no change │
│ QQuery 2  │        81.34 / 82.30 ±0.61 / 82.95 ms │        82.11 / 82.41 ±0.34 / 83.03 ms │    no change │
│ QQuery 3  │        29.96 / 30.26 ±0.18 / 30.50 ms │        29.88 / 30.03 ±0.10 / 30.19 ms │    no change │
│ QQuery 4  │     511.55 / 514.85 ±3.49 / 521.16 ms │     508.29 / 511.46 ±3.13 / 517.43 ms │    no change │
│ QQuery 5  │        52.52 / 53.23 ±0.42 / 53.66 ms │        52.55 / 52.71 ±0.15 / 52.97 ms │    no change │
│ QQuery 6  │        37.37 / 37.94 ±0.50 / 38.59 ms │        37.72 / 38.07 ±0.29 / 38.36 ms │    no change │
│ QQuery 7  │       95.54 / 97.93 ±3.50 / 104.89 ms │       94.78 / 96.99 ±2.24 / 100.98 ms │    no change │
│ QQuery 8  │        38.23 / 38.64 ±0.34 / 39.03 ms │        37.59 / 38.31 ±0.49 / 38.76 ms │    no change │
│ QQuery 9  │        52.23 / 56.79 ±2.29 / 58.27 ms │        55.34 / 56.70 ±1.18 / 58.31 ms │    no change │
│ QQuery 10 │        64.79 / 65.07 ±0.26 / 65.49 ms │        63.80 / 64.17 ±0.21 / 64.42 ms │    no change │
│ QQuery 11 │     319.36 / 324.79 ±3.51 / 330.35 ms │     313.62 / 317.56 ±2.22 / 320.53 ms │    no change │
│ QQuery 12 │        29.22 / 29.41 ±0.16 / 29.68 ms │        28.88 / 29.14 ±0.24 / 29.44 ms │    no change │
│ QQuery 13 │     120.74 / 123.11 ±1.97 / 126.41 ms │     120.18 / 121.40 ±1.00 / 122.92 ms │    no change │
│ QQuery 14 │     421.60 / 428.00 ±4.73 / 433.52 ms │     420.98 / 424.68 ±2.51 / 428.24 ms │    no change │
│ QQuery 15 │        60.59 / 62.61 ±1.31 / 64.25 ms │        61.35 / 61.68 ±0.29 / 62.16 ms │    no change │
│ QQuery 16 │           6.94 / 7.08 ±0.21 / 7.50 ms │           6.73 / 6.85 ±0.15 / 7.13 ms │    no change │
│ QQuery 17 │        81.60 / 84.11 ±2.89 / 89.66 ms │        81.50 / 83.77 ±2.56 / 87.77 ms │    no change │
│ QQuery 18 │     125.77 / 127.50 ±1.33 / 129.70 ms │     126.59 / 127.25 ±0.76 / 128.56 ms │    no change │
│ QQuery 19 │        42.68 / 43.81 ±1.41 / 46.59 ms │        42.61 / 43.64 ±1.67 / 46.96 ms │    no change │
│ QQuery 20 │        37.11 / 37.48 ±0.25 / 37.78 ms │        36.84 / 37.42 ±0.40 / 38.08 ms │    no change │
│ QQuery 21 │        18.29 / 18.50 ±0.19 / 18.78 ms │        18.15 / 18.54 ±0.36 / 19.20 ms │    no change │
│ QQuery 22 │        63.47 / 63.83 ±0.38 / 64.56 ms │        65.18 / 67.18 ±2.77 / 72.67 ms │ 1.05x slower │
│ QQuery 23 │     358.20 / 361.73 ±3.69 / 368.07 ms │     355.17 / 361.57 ±5.48 / 371.60 ms │    no change │
│ QQuery 24 │     230.95 / 234.78 ±5.38 / 245.44 ms │     229.31 / 233.81 ±7.21 / 248.09 ms │    no change │
│ QQuery 25 │     113.77 / 115.32 ±1.92 / 118.96 ms │     111.83 / 114.96 ±2.01 / 118.12 ms │    no change │
│ QQuery 26 │        59.63 / 59.93 ±0.16 / 60.09 ms │        58.79 / 59.39 ±0.40 / 59.91 ms │    no change │
│ QQuery 27 │           6.43 / 6.55 ±0.18 / 6.91 ms │           6.49 / 6.63 ±0.11 / 6.80 ms │    no change │
│ QQuery 28 │        61.55 / 62.21 ±0.59 / 63.31 ms │        58.10 / 60.58 ±2.18 / 63.76 ms │    no change │
│ QQuery 29 │      99.03 / 103.78 ±4.44 / 112.12 ms │     100.24 / 103.10 ±4.28 / 111.41 ms │    no change │
│ QQuery 30 │        32.99 / 33.42 ±0.35 / 34.06 ms │        32.97 / 33.44 ±0.25 / 33.61 ms │    no change │
│ QQuery 31 │     112.52 / 116.07 ±4.99 / 125.96 ms │     114.35 / 116.31 ±2.46 / 121.08 ms │    no change │
│ QQuery 32 │        21.50 / 21.92 ±0.23 / 22.12 ms │        22.11 / 22.59 ±0.45 / 23.26 ms │    no change │
│ QQuery 33 │        38.60 / 39.05 ±0.42 / 39.80 ms │        38.84 / 39.85 ±1.10 / 41.50 ms │    no change │
│ QQuery 34 │        10.34 / 10.70 ±0.35 / 11.29 ms │        10.26 / 10.49 ±0.13 / 10.63 ms │    no change │
│ QQuery 35 │        73.25 / 74.76 ±1.21 / 76.18 ms │        74.32 / 75.21 ±0.62 / 75.95 ms │    no change │
│ QQuery 36 │           6.08 / 6.19 ±0.15 / 6.49 ms │           5.96 / 6.18 ±0.18 / 6.43 ms │    no change │
│ QQuery 37 │           7.36 / 7.53 ±0.13 / 7.70 ms │           7.33 / 7.39 ±0.06 / 7.48 ms │    no change │
│ QQuery 38 │        63.71 / 65.99 ±2.52 / 70.70 ms │        64.78 / 65.90 ±1.36 / 68.29 ms │    no change │
│ QQuery 39 │     450.71 / 458.78 ±4.30 / 462.62 ms │     450.68 / 460.55 ±9.07 / 477.16 ms │    no change │
│ QQuery 40 │        23.90 / 24.46 ±1.02 / 26.49 ms │        23.56 / 23.88 ±0.20 / 24.10 ms │    no change │
│ QQuery 41 │        11.97 / 12.08 ±0.07 / 12.17 ms │        12.13 / 12.18 ±0.06 / 12.27 ms │    no change │
│ QQuery 42 │        24.47 / 24.60 ±0.09 / 24.69 ms │        24.60 / 24.82 ±0.12 / 24.94 ms │    no change │
│ QQuery 43 │           4.91 / 5.05 ±0.16 / 5.34 ms │           4.95 / 5.07 ±0.17 / 5.41 ms │    no change │
│ QQuery 44 │           9.52 / 9.66 ±0.11 / 9.82 ms │           9.47 / 9.63 ±0.09 / 9.73 ms │    no change │
│ QQuery 45 │        41.82 / 44.67 ±4.48 / 53.60 ms │        39.36 / 43.91 ±5.76 / 55.29 ms │    no change │
│ QQuery 46 │        12.42 / 12.79 ±0.29 / 13.23 ms │        12.59 / 12.91 ±0.33 / 13.47 ms │    no change │
│ QQuery 47 │     236.13 / 242.01 ±5.50 / 252.07 ms │     231.14 / 239.22 ±4.65 / 244.17 ms │    no change │
│ QQuery 48 │       98.15 / 99.72 ±1.55 / 102.16 ms │        97.84 / 98.26 ±0.31 / 98.75 ms │    no change │
│ QQuery 49 │        78.31 / 79.62 ±1.14 / 81.14 ms │        77.81 / 79.23 ±1.66 / 82.31 ms │    no change │
│ QQuery 50 │        59.79 / 60.51 ±0.56 / 61.28 ms │        59.42 / 60.23 ±0.68 / 61.20 ms │    no change │
│ QQuery 51 │       93.00 / 95.81 ±2.98 / 101.44 ms │        92.07 / 93.15 ±0.81 / 94.21 ms │    no change │
│ QQuery 52 │        24.85 / 24.97 ±0.08 / 25.08 ms │        24.88 / 27.55 ±3.31 / 33.92 ms │ 1.10x slower │
│ QQuery 53 │        30.63 / 30.93 ±0.31 / 31.51 ms │        30.31 / 30.95 ±0.54 / 31.89 ms │    no change │
│ QQuery 54 │        56.01 / 56.79 ±0.46 / 57.47 ms │        56.62 / 57.55 ±0.68 / 58.51 ms │    no change │
│ QQuery 55 │        23.97 / 24.18 ±0.17 / 24.43 ms │        24.13 / 24.58 ±0.36 / 25.17 ms │    no change │
│ QQuery 56 │        40.19 / 40.34 ±0.13 / 40.58 ms │        40.94 / 41.17 ±0.36 / 41.88 ms │    no change │
│ QQuery 57 │     179.62 / 182.36 ±4.63 / 191.61 ms │     176.54 / 178.95 ±1.84 / 182.06 ms │    no change │
│ QQuery 58 │     118.46 / 121.09 ±3.27 / 127.36 ms │     116.72 / 120.17 ±3.52 / 126.75 ms │    no change │
│ QQuery 59 │     119.38 / 120.48 ±1.82 / 124.10 ms │     119.29 / 119.57 ±0.25 / 119.88 ms │    no change │
│ QQuery 60 │        39.88 / 41.14 ±1.11 / 43.14 ms │        41.13 / 43.04 ±3.27 / 49.58 ms │    no change │
│ QQuery 61 │        12.62 / 12.87 ±0.24 / 13.28 ms │        12.94 / 13.04 ±0.10 / 13.22 ms │    no change │
│ QQuery 62 │        47.28 / 47.96 ±0.52 / 48.88 ms │        46.94 / 47.29 ±0.18 / 47.42 ms │    no change │
│ QQuery 63 │        30.53 / 30.80 ±0.26 / 31.30 ms │        30.84 / 31.08 ±0.21 / 31.33 ms │    no change │
│ QQuery 64 │     413.56 / 422.29 ±8.22 / 436.00 ms │     416.26 / 420.92 ±4.17 / 427.39 ms │    no change │
│ QQuery 65 │     146.38 / 150.72 ±2.44 / 152.97 ms │     149.78 / 153.39 ±2.66 / 156.75 ms │    no change │
│ QQuery 66 │        80.73 / 81.28 ±0.39 / 81.90 ms │        80.06 / 80.89 ±0.66 / 81.64 ms │    no change │
│ QQuery 67 │     246.65 / 256.44 ±7.06 / 267.38 ms │     253.38 / 257.96 ±4.96 / 267.00 ms │    no change │
│ QQuery 68 │        12.20 / 12.44 ±0.14 / 12.60 ms │        12.31 / 12.55 ±0.19 / 12.77 ms │    no change │
│ QQuery 69 │        58.83 / 59.25 ±0.44 / 59.92 ms │        58.81 / 59.31 ±0.54 / 60.22 ms │    no change │
│ QQuery 70 │     106.29 / 111.90 ±6.52 / 123.53 ms │     107.06 / 111.60 ±7.08 / 125.45 ms │    no change │
│ QQuery 71 │        36.14 / 36.86 ±0.36 / 37.12 ms │        36.23 / 37.24 ±0.85 / 38.66 ms │    no change │
│ QQuery 72 │ 2117.15 / 2228.83 ±83.21 / 2335.60 ms │ 2172.15 / 2207.75 ±21.15 / 2233.04 ms │    no change │
│ QQuery 73 │         9.90 / 10.07 ±0.13 / 10.28 ms │        10.30 / 10.49 ±0.10 / 10.59 ms │    no change │
│ QQuery 74 │     181.11 / 186.43 ±8.29 / 202.95 ms │     178.55 / 185.25 ±8.17 / 201.11 ms │    no change │
│ QQuery 75 │     152.64 / 154.55 ±1.71 / 157.70 ms │     151.93 / 154.25 ±2.48 / 158.85 ms │    no change │
│ QQuery 76 │        36.07 / 36.55 ±0.26 / 36.78 ms │        36.76 / 36.95 ±0.19 / 37.20 ms │    no change │
│ QQuery 77 │        61.56 / 62.37 ±0.58 / 63.07 ms │        62.49 / 63.15 ±0.68 / 64.44 ms │    no change │
│ QQuery 78 │     189.97 / 194.33 ±5.15 / 204.13 ms │     188.96 / 192.44 ±3.25 / 198.37 ms │    no change │
│ QQuery 79 │        68.20 / 69.63 ±2.35 / 74.30 ms │        68.27 / 69.01 ±0.56 / 69.94 ms │    no change │
│ QQuery 80 │     101.56 / 103.67 ±2.62 / 107.84 ms │      99.32 / 100.97 ±1.26 / 102.49 ms │    no change │
│ QQuery 81 │        26.73 / 27.02 ±0.27 / 27.41 ms │        25.91 / 26.46 ±0.30 / 26.75 ms │    no change │
│ QQuery 82 │        17.24 / 17.41 ±0.11 / 17.53 ms │        17.00 / 17.07 ±0.04 / 17.11 ms │    no change │
│ QQuery 83 │        41.41 / 41.96 ±0.29 / 42.19 ms │        40.92 / 41.63 ±1.18 / 43.97 ms │    no change │
│ QQuery 84 │        31.18 / 32.55 ±1.50 / 35.41 ms │        30.82 / 31.20 ±0.28 / 31.59 ms │    no change │
│ QQuery 85 │     110.74 / 112.25 ±1.67 / 115.28 ms │     109.56 / 113.22 ±4.82 / 122.24 ms │    no change │
│ QQuery 86 │        25.99 / 26.21 ±0.22 / 26.60 ms │        26.05 / 26.19 ±0.10 / 26.32 ms │    no change │
│ QQuery 87 │        64.38 / 66.12 ±1.41 / 67.97 ms │        65.91 / 66.80 ±0.91 / 68.55 ms │    no change │
│ QQuery 88 │        64.00 / 64.84 ±0.48 / 65.31 ms │        65.01 / 67.23 ±3.91 / 75.04 ms │    no change │
│ QQuery 89 │        36.79 / 37.15 ±0.23 / 37.38 ms │        36.53 / 37.43 ±0.56 / 38.18 ms │    no change │
│ QQuery 90 │        17.53 / 17.78 ±0.16 / 17.96 ms │        17.43 / 17.64 ±0.31 / 18.27 ms │    no change │
│ QQuery 91 │        47.15 / 48.02 ±0.52 / 48.62 ms │        47.08 / 47.87 ±0.69 / 48.94 ms │    no change │
│ QQuery 92 │        30.53 / 32.03 ±1.10 / 33.95 ms │        30.94 / 31.31 ±0.42 / 32.10 ms │    no change │
│ QQuery 93 │        51.04 / 52.10 ±0.64 / 52.98 ms │        51.50 / 52.73 ±1.16 / 54.66 ms │    no change │
│ QQuery 94 │        38.64 / 39.14 ±0.47 / 39.88 ms │        39.76 / 40.01 ±0.28 / 40.52 ms │    no change │
│ QQuery 95 │        82.87 / 84.13 ±0.68 / 84.93 ms │        82.25 / 83.58 ±0.74 / 84.45 ms │    no change │
│ QQuery 96 │        25.17 / 25.97 ±1.35 / 28.66 ms │        24.89 / 25.03 ±0.21 / 25.44 ms │    no change │
│ QQuery 97 │        47.38 / 48.30 ±1.14 / 50.54 ms │        47.67 / 50.27 ±1.80 / 53.11 ms │    no change │
│ QQuery 98 │        44.08 / 44.45 ±0.23 / 44.78 ms │        43.37 / 44.59 ±1.16 / 46.02 ms │    no change │
│ QQuery 99 │        71.50 / 72.23 ±0.70 / 73.39 ms │        70.88 / 71.11 ±0.16 / 71.29 ms │    no change │
└───────────┴───────────────────────────────────────┴───────────────────────────────────────┴──────────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━┓
┃ Benchmark Summary                      ┃            ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━┩
│ Total Time (HEAD)                      │ 10624.46ms │
│ Total Time (brent_partition-extrema)   │ 10577.06ms │
│ Average Time (HEAD)                    │   107.32ms │
│ Average Time (brent_partition-extrema) │   106.84ms │
│ Queries Faster                         │          0 │
│ Queries Slower                         │          2 │
│ Queries with No Change                 │         97 │
│ Queries with Failure                   │          0 │
└────────────────────────────────────────┴────────────┘

Resource Usage

tpcds — base (merge-base)

Metric Value
Wall time 55.0s
Peak memory 2.3 GiB
Avg memory 1.6 GiB
CPU user 241.1s
CPU sys 6.5s
Peak spill 0 B

tpcds — branch

Metric Value
Wall time 55.0s
Peak memory 2.2 GiB
Avg memory 1.6 GiB
CPU user 238.0s
CPU sys 6.3s
Peak spill 0 B

File an issue against this benchmark runner

@adriangbot

Copy link
Copy Markdown

🤖 Benchmark completed (GKE) | trigger

Instance: c4a-highmem-16 (12 vCPU / 65 GiB)

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected
Details

Comparing HEAD and brent_partition-extrema
--------------------
Benchmark clickbench_partitioned.json
--------------------
┏━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━┓
┃ Query     ┃                                  HEAD ┃               brent_partition-extrema ┃        Change ┃
┡━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━┩
│ QQuery 0  │          1.22 / 4.00 ±5.45 / 14.91 ms │          1.25 / 4.09 ±5.56 / 15.21 ms │     no change │
│ QQuery 1  │        13.10 / 13.28 ±0.16 / 13.53 ms │        13.14 / 13.38 ±0.16 / 13.64 ms │     no change │
│ QQuery 2  │        36.12 / 36.40 ±0.25 / 36.75 ms │        36.99 / 37.29 ±0.31 / 37.85 ms │     no change │
│ QQuery 3  │        30.57 / 31.31 ±0.81 / 32.64 ms │        30.91 / 31.23 ±0.29 / 31.73 ms │     no change │
│ QQuery 4  │     227.20 / 229.88 ±1.99 / 232.81 ms │     227.23 / 231.05 ±4.70 / 240.01 ms │     no change │
│ QQuery 5  │     275.30 / 277.47 ±1.50 / 279.88 ms │     269.20 / 275.24 ±4.81 / 283.82 ms │     no change │
│ QQuery 6  │           1.26 / 1.42 ±0.23 / 1.86 ms │           1.28 / 1.41 ±0.22 / 1.85 ms │     no change │
│ QQuery 7  │        13.75 / 14.05 ±0.18 / 14.30 ms │        14.34 / 14.69 ±0.26 / 15.05 ms │     no change │
│ QQuery 8  │     323.09 / 326.94 ±2.94 / 330.76 ms │     319.70 / 325.79 ±5.86 / 334.44 ms │     no change │
│ QQuery 9  │     458.11 / 468.94 ±9.65 / 481.39 ms │     453.84 / 461.12 ±6.30 / 468.88 ms │     no change │
│ QQuery 10 │        69.78 / 73.05 ±3.98 / 80.88 ms │        69.27 / 70.69 ±1.14 / 72.09 ms │     no change │
│ QQuery 11 │        82.38 / 83.82 ±1.14 / 85.59 ms │        80.53 / 82.41 ±1.67 / 85.37 ms │     no change │
│ QQuery 12 │     269.11 / 274.28 ±3.94 / 279.81 ms │     268.62 / 273.99 ±5.07 / 282.41 ms │     no change │
│ QQuery 13 │     363.42 / 374.72 ±8.26 / 388.38 ms │    367.92 / 382.35 ±10.88 / 394.86 ms │     no change │
│ QQuery 14 │     280.71 / 283.35 ±2.68 / 288.37 ms │    284.22 / 293.62 ±14.90 / 323.35 ms │     no change │
│ QQuery 15 │     270.38 / 274.65 ±6.14 / 286.78 ms │    270.11 / 289.21 ±20.97 / 326.36 ms │  1.05x slower │
│ QQuery 16 │     611.93 / 618.36 ±4.48 / 622.65 ms │     616.92 / 627.98 ±5.81 / 633.18 ms │     no change │
│ QQuery 17 │    614.53 / 628.89 ±13.25 / 652.41 ms │     630.00 / 635.96 ±6.65 / 646.03 ms │     no change │
│ QQuery 18 │ 1247.16 / 1269.57 ±14.73 / 1288.91 ms │ 1260.97 / 1294.94 ±23.94 / 1325.51 ms │     no change │
│ QQuery 19 │        27.91 / 28.80 ±0.92 / 30.31 ms │        28.00 / 28.35 ±0.30 / 28.75 ms │     no change │
│ QQuery 20 │    517.37 / 525.60 ±10.87 / 546.07 ms │     517.34 / 523.96 ±3.37 / 526.53 ms │     no change │
│ QQuery 21 │     512.21 / 518.04 ±3.57 / 522.21 ms │     517.78 / 528.26 ±5.91 / 533.74 ms │     no change │
│ QQuery 22 │   978.64 / 991.78 ±12.52 / 1014.34 ms │  985.04 / 1001.73 ±14.12 / 1019.10 ms │     no change │
│ QQuery 23 │ 3050.59 / 3088.89 ±20.91 / 3112.85 ms │ 3054.11 / 3067.57 ±10.24 / 3085.07 ms │     no change │
│ QQuery 24 │       41.91 / 48.74 ±11.50 / 71.63 ms │        41.65 / 47.23 ±7.48 / 62.05 ms │     no change │
│ QQuery 25 │     110.78 / 113.91 ±3.73 / 121.23 ms │     110.96 / 111.86 ±0.98 / 113.07 ms │     no change │
│ QQuery 26 │        41.65 / 42.38 ±0.73 / 43.68 ms │        41.56 / 47.10 ±9.82 / 66.70 ms │  1.11x slower │
│ QQuery 27 │     670.86 / 678.92 ±6.01 / 688.46 ms │    670.02 / 678.62 ±11.44 / 701.27 ms │     no change │
│ QQuery 28 │ 3034.37 / 3057.45 ±23.36 / 3100.10 ms │ 3015.90 / 3047.33 ±17.30 / 3064.91 ms │     no change │
│ QQuery 29 │        40.87 / 47.80 ±8.43 / 60.68 ms │        40.99 / 41.65 ±0.67 / 42.78 ms │ +1.15x faster │
│ QQuery 30 │    300.08 / 314.27 ±13.80 / 336.36 ms │    297.69 / 311.52 ±11.31 / 326.20 ms │     no change │
│ QQuery 31 │    278.47 / 292.57 ±12.02 / 308.62 ms │     293.75 / 299.17 ±4.19 / 304.67 ms │     no change │
│ QQuery 32 │   967.55 / 999.65 ±48.86 / 1096.89 ms │    922.04 / 940.77 ±12.95 / 958.88 ms │ +1.06x faster │
│ QQuery 33 │ 1463.28 / 1501.51 ±31.99 / 1538.94 ms │ 1441.49 / 1478.67 ±30.39 / 1518.84 ms │     no change │
│ QQuery 34 │ 1482.61 / 1517.83 ±27.69 / 1567.66 ms │ 1446.91 / 1490.67 ±29.94 / 1525.69 ms │     no change │
│ QQuery 35 │    285.80 / 318.23 ±42.16 / 400.24 ms │    276.72 / 312.71 ±56.29 / 424.38 ms │     no change │
│ QQuery 36 │        64.42 / 68.25 ±2.77 / 71.85 ms │        66.44 / 70.82 ±3.02 / 74.41 ms │     no change │
│ QQuery 37 │        36.05 / 39.89 ±3.15 / 44.27 ms │       36.63 / 42.62 ±11.34 / 65.29 ms │  1.07x slower │
│ QQuery 38 │        42.76 / 48.72 ±6.26 / 60.75 ms │        40.10 / 43.22 ±3.56 / 49.55 ms │ +1.13x faster │
│ QQuery 39 │     139.14 / 150.73 ±7.23 / 156.84 ms │     136.88 / 152.06 ±8.71 / 161.54 ms │     no change │
│ QQuery 40 │        14.41 / 15.21 ±0.79 / 16.70 ms │        13.90 / 14.63 ±0.47 / 15.23 ms │     no change │
│ QQuery 41 │        14.51 / 20.84 ±7.41 / 34.61 ms │        13.90 / 14.03 ±0.08 / 14.14 ms │ +1.49x faster │
│ QQuery 42 │        13.64 / 13.86 ±0.13 / 14.01 ms │        13.16 / 13.70 ±0.89 / 15.48 ms │     no change │
└───────────┴───────────────────────────────────────┴───────────────────────────────────────┴───────────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━┓
┃ Benchmark Summary                      ┃            ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━┩
│ Total Time (HEAD)                      │ 19728.26ms │
│ Total Time (brent_partition-extrema)   │ 19654.69ms │
│ Average Time (HEAD)                    │   458.80ms │
│ Average Time (brent_partition-extrema) │   457.09ms │
│ Queries Faster                         │          4 │
│ Queries Slower                         │          3 │
│ Queries with No Change                 │         36 │
│ Queries with Failure                   │          0 │
└────────────────────────────────────────┴────────────┘

Resource Usage

clickbench_partitioned — base (merge-base)

Metric Value
Wall time 100.0s
Peak memory 10.8 GiB
Avg memory 4.5 GiB
CPU user 1010.5s
CPU sys 70.8s
Peak spill 0 B

clickbench_partitioned — branch

Metric Value
Wall time 100.0s
Peak memory 10.5 GiB
Avg memory 4.4 GiB
CPU user 1009.4s
CPU sys 69.6s
Peak spill 0 B

File an issue against this benchmark runner

@adriangbot

Copy link
Copy Markdown

🤖 Benchmark completed (GKE) | trigger

Instance: c4a-highmem-16 (12 vCPU / 65 GiB)

CPU Details (lscpu)
Architecture:                            aarch64
CPU op-mode(s):                          64-bit
Byte Order:                              Little Endian
CPU(s):                                  16
On-line CPU(s) list:                     0-15
Vendor ID:                               ARM
Model name:                              Neoverse-V2
Model:                                   1
Thread(s) per core:                      1
Core(s) per cluster:                     16
Socket(s):                               -
Cluster(s):                              1
Stepping:                                r0p1
BogoMIPS:                                2000.00
Flags:                                   fp asimd evtstrm aes pmull sha1 sha2 crc32 atomics fphp asimdhp cpuid asimdrdm jscvt fcma lrcpc dcpop sha3 sm3 sm4 asimddp sha512 sve asimdfhm dit uscat ilrcpc flagm sb paca pacg dcpodp sve2 sveaes svepmull svebitperm svesha3 svesm4 flagm2 frint svei8mm svebf16 i8mm bf16 dgh rng bti
L1d cache:                               1 MiB (16 instances)
L1i cache:                               1 MiB (16 instances)
L2 cache:                                32 MiB (16 instances)
L3 cache:                                80 MiB (1 instance)
NUMA node(s):                            1
NUMA node0 CPU(s):                       0-15
Vulnerability Gather data sampling:      Not affected
Vulnerability Indirect target selection: Not affected
Vulnerability Itlb multihit:             Not affected
Vulnerability L1tf:                      Not affected
Vulnerability Mds:                       Not affected
Vulnerability Meltdown:                  Not affected
Vulnerability Mmio stale data:           Not affected
Vulnerability Reg file data sampling:    Not affected
Vulnerability Retbleed:                  Not affected
Vulnerability Spec rstack overflow:      Not affected
Vulnerability Spec store bypass:         Mitigation; Speculative Store Bypass disabled via prctl
Vulnerability Spectre v1:                Mitigation; __user pointer sanitization
Vulnerability Spectre v2:                Mitigation; CSV2, BHB
Vulnerability Srbds:                     Not affected
Vulnerability Tsa:                       Not affected
Vulnerability Tsx async abort:           Not affected
Vulnerability Vmscape:                   Not affected
Details

Comparing HEAD and brent_partition-extrema
--------------------
Benchmark sort_tpch10.json
--------------------
┏━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━┓
┃ Query ┃                                   HEAD ┃                brent_partition-extrema ┃    Change ┃
┡━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━┩
│ Q1    │   2466.32 / 2480.41 ±7.99 / 2490.07 ms │   2474.94 / 2482.84 ±6.30 / 2489.29 ms │ no change │
│ Q2    │  2015.56 / 2052.62 ±21.92 / 2077.40 ms │  1997.65 / 2069.42 ±37.48 / 2102.33 ms │ no change │
│ Q3    │  9119.96 / 9182.45 ±34.41 / 9221.44 ms │  9035.51 / 9110.48 ±60.32 / 9174.17 ms │ no change │
│ Q4    │  3120.45 / 3132.53 ±13.22 / 3157.46 ms │  3104.85 / 3116.10 ±13.78 / 3142.54 ms │ no change │
│ Q5    │  3043.17 / 3088.50 ±45.97 / 3176.41 ms │  3049.57 / 3064.54 ±10.41 / 3076.86 ms │ no change │
│ Q6    │   3186.28 / 3202.78 ±9.30 / 3213.55 ms │  3178.86 / 3204.08 ±17.47 / 3227.09 ms │ no change │
│ Q7    │  5986.60 / 6014.56 ±21.27 / 6036.83 ms │  5986.97 / 6026.19 ±37.24 / 6089.71 ms │ no change │
│ Q8    │  4264.36 / 4412.58 ±94.55 / 4523.19 ms │ 4207.31 / 4424.13 ±126.28 / 4555.75 ms │ no change │
│ Q9    │  4765.96 / 4788.43 ±14.84 / 4810.84 ms │  4733.21 / 4782.78 ±38.79 / 4831.28 ms │ no change │
│ Q10   │ 7029.39 / 7407.92 ±214.26 / 7650.94 ms │ 7156.47 / 7371.02 ±138.12 / 7537.72 ms │ no change │
│ Q11   │  3698.12 / 3804.92 ±97.60 / 3984.70 ms │  3717.02 / 3767.69 ±42.71 / 3846.35 ms │ no change │
└───────┴────────────────────────────────────────┴────────────────────────────────────────┴───────────┘
┏━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━┳━━━━━━━━━━━━┓
┃ Benchmark Summary                      ┃            ┃
┡━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━╇━━━━━━━━━━━━┩
│ Total Time (HEAD)                      │ 49567.71ms │
│ Total Time (brent_partition-extrema)   │ 49419.26ms │
│ Average Time (HEAD)                    │  4506.16ms │
│ Average Time (brent_partition-extrema) │  4492.66ms │
│ Queries Faster                         │          0 │
│ Queries Slower                         │          0 │
│ Queries with No Change                 │         11 │
│ Queries with Failure                   │          0 │
└────────────────────────────────────────┴────────────┘

Resource Usage

sort_tpch10 — base (merge-base)

Metric Value
Wall time 250.1s
Peak memory 19.0 GiB
Avg memory 7.2 GiB
CPU user 938.2s
CPU sys 73.2s
Peak spill 0 B

sort_tpch10 — branch

Metric Value
Wall time 250.1s
Peak memory 19.4 GiB
Avg memory 7.3 GiB
CPU user 933.4s
CPU sys 71.8s
Peak spill 0 B

File an issue against this benchmark runner

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

physical-plan Changes to the physical-plan crate

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants