fix: apply recursive CTE column-list aliases to the static term #23098

Open
tomsanbear wants to merge 1 commit into apache:main from tomsanbear:recursive-cte-column-list-alias


Which issue does this PR close?

Rationale for this change

`WITH RECURSIVE t(n) AS (...)` failed to plan because the CTE's declared column-list names (the `t(n)` part) were never applied to the recursive working relation. They were applied (via `apply_table_alias`) only after the whole CTE plan was built, but the working table is derived from the static term's schema before that point. As a result, the self-reference could not resolve the declared names, and planning failed with `Schema error: No field named n. Valid fields are t."Int64(1)".` PostgreSQL and DuckDB accept the query; aliasing inside the static SELECT (`SELECT 1 AS n`) was the only workaround.
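
The ordering problem can be illustrated with a small toy model. This is only a sketch: `resolve` and `plan_self_reference_old` are hypothetical names invented for illustration, schemas are reduced to plain lists of field names, and none of it is DataFusion's actual API or error formatting.

```rust
// Toy model only: hypothetical helpers, not DataFusion's API.

/// Look up `name` among a relation's output field names.
fn resolve(fields: &[String], name: &str) -> Result<usize, String> {
    fields.iter().position(|f| f == name).ok_or_else(|| {
        format!(
            "No field named {}. Valid fields are {}.",
            name,
            fields.join(", ")
        )
    })
}

/// Old ordering: the work table copies the static term's schema as-is,
/// because the declared column list is applied only after the whole CTE
/// has been planned. The self-reference therefore never sees the names.
fn plan_self_reference_old(static_fields: &[String], referenced: &str) -> Result<usize, String> {
    let work_table = static_fields.to_vec(); // declared names not applied yet
    resolve(&work_table, referenced)
}
```

With a static term of `SELECT 1`, the only field in this model is `Int64(1)`, so a lookup of `n` returns an error, while a lookup of `Int64(1)` succeeds.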

What changes are included in this PR?

Apply the column-list aliases to the static term inside `recursive_cte()`, before the work table is created, so the working relation and the self-reference carry the declared names. On the recursive path the caller now applies only the relation-name alias, since the column aliases are already in place; this avoids a redundant projection on top of the `RecursiveQuery` node. The non-UNION fallback applies the aliases directly; non-recursive CTEs are unchanged. A column/alias-count mismatch is now reported at the static term, which gives a clearer error than the previous "No field named …".
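
A minimal sketch of the new ordering, under the same simplification of treating a schema as a list of field names. `apply_column_aliases` and `build_work_table` are hypothetical names, and the mismatch message is invented for illustration; the real change lives in `recursive_cte()` and may differ in every detail.

```rust
// Toy model only: hypothetical helpers, not DataFusion's API.

/// Rename the static term's output fields to the declared column list.
/// A count mismatch is reported here, at the static term.
fn apply_column_aliases(
    static_fields: &[String],
    declared: &[&str],
) -> Result<Vec<String>, String> {
    if static_fields.len() != declared.len() {
        // Illustrative wording, not the message the planner emits.
        return Err(format!(
            "column list declares {} name(s) but the static term produces {} column(s)",
            declared.len(),
            static_fields.len()
        ));
    }
    Ok(declared.iter().map(|d| d.to_string()).collect())
}

/// New ordering: alias the static term first, then derive the work table
/// from the aliased schema so the self-reference can resolve the names.
fn build_work_table(
    static_fields: &[String],
    declared: &[&str],
) -> Result<Vec<String>, String> {
    let aliased = if declared.is_empty() {
        static_fields.to_vec() // no column list: keep the static term's names
    } else {
        apply_column_aliases(static_fields, declared)?
    };
    Ok(aliased) // the work table copies the static term's (now aliased) schema
}
```

In this model, `build_work_table` for a one-column static term with the declared list `["n"]` yields a work table whose only field is `n`, and a two-name list against the same static term fails up front.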

Are these changes tested?

Yes. This PR adds `cte.slt` cases for single- and multi-column column-list recursive CTEs (asserting that the recursion produces the expected rows), UNION (DISTINCT), the arity-mismatch error, and an EXPLAIN case that locks in the plan shape (no extra projection over `RecursiveQuery`).

Are there any user-facing changes?

WITH RECURSIVE t(n) AS (...) and multi-column column lists now plan and execute correctly, matching PostgreSQL/DuckDB. No API changes.

Note: this pull request was created with the help of AI tools (Claude Code). I reviewed the full diff myself before submission.

`WITH RECURSIVE t(n) AS (SELECT 1 UNION ALL SELECT n + 1 FROM t WHERE n < 10)`
failed to plan with `Schema error: No field named n. Valid fields are
t."Int64(1)".`. The CTE's declared column-list names were applied (via
`apply_table_alias`) only after the whole CTE plan was built, but the recursive
working relation is derived from the schema of the static term before that, so
the self-reference could never resolve the declared names.

Apply the column-list aliases to the static term inside `recursive_cte`, before
the work table is created, so the working relation and the self-reference expose
the declared names. The relation-name alias is still added by the caller; on the
recursive path the column re-alias is skipped to avoid a redundant projection on
top of the `RecursiveQuery` node. The non-UNION fallback applies the aliases
directly. Non-recursive CTEs are unchanged.

Adds sqllogictest coverage for single- and multi-column column-list recursive
CTEs, UNION (DISTINCT), and the column/alias-count mismatch error.

Labels

sql (SQL Planner), sqllogictest (SQL Logic Tests (.slt))

Development

Successfully merging this pull request may close these issues.

Recursive CTE column-list alias t(n) is ignored, fails to plan with "No field named n"