fix(introspection): preserve data validation attributes, enum ordering, and comments during db pull by svetch · Pull Request #672 · zenstackhq/zenstack-v3

svetch · 2026-02-08T15:01:05Z

Retain data validation attributes (e.g., @email) on fields after
introspection (#670)
Preserve original declaration order of enums instead of moving them
to the end of the schema file (#669)
Preserve triple-slash comments above enum declarations (#669)
Fixes Json defaults not resolves to string literal value.
Fixes self-refencing models duplicates filed name

Fixes zenstackhq/zenstack#2342, fixes zenstackhq/zenstack#2341

Summary by CodeRabbit

New Features
- Consolidates duplicate per-column enums during DB pulls to restore shared enums and clean schemas.
Improvements
- Better detection of database-managed attributes to reduce noisy diffs.
- Preserves original declaration ordering and comments in generated schemas.
- Improved handling of Json/Bytes default values across DB introspection.
- Safer relation-name generation to avoid naming clashes during pulls.
- Refined code-generation defaults for quote and indent; CLI indent parsing adjusted.
Tests
- Expanded tests covering enum consolidation, naming, ordering, and schema preservation.

- Retain data validation attributes (e.g., @email) on fields after introspection (#670) - Preserve original declaration order of enums instead of moving them to the end of the schema file (#669) - Preserve triple-slash comments above enum declarations (#669) Fixes #669, fixes #670

coderabbitai · 2026-02-08T15:01:33Z

📝 Walkthrough

Walkthrough

Adds enum consolidation to pull flow, replaces hardcoded attribute exclusions with a new helper, preserves original declaration/comments in the code generator, extends Json/Bytes default-value handling across providers, adjusts CLI indent parsing, and updates tests and generator defaults.

Changes

Cohort / File(s)	Summary
Enum consolidation & pull logic `packages/cli/src/actions/pull/index.ts`	Added exported `consolidateEnums({ newModel, oldModel })` to map per-column enums to shared enums, group and merge identical enums, rename kept enums, update attributes and field references, remove duplicates, and add clash-avoidance when deriving opposite field names in `syncRelation`.
DB action integration `packages/cli/src/actions/db.ts`	Invoke `consolidateEnums({ newModel, oldModel })` after syncing relations; swapped hardcoded attribute exclusion checks for `isDatabaseManagedAttribute(...)`; defaulted generator options with `quote: options.quote ?? 'single'` and `indent: options.indent ?? 4`.
Pull utilities `packages/cli/src/actions/pull/utils.ts`	Added exported `isDatabaseManagedAttribute(name: string)` to detect DB-managed attribute names (`@relation`, `@id`, `@unique`) and attributes starting with `@db`.
Providers: default-value handling `packages/cli/src/actions/pull/provider/mysql.ts`, `packages/cli/src/actions/pull/provider/postgresql.ts`, `packages/cli/src/actions/pull/provider/sqlite.ts`	Extended `getDefaultValue` handling to include `Json` and `Bytes` field types; treat defaults as string literals or route through type-cast conversion when value includes a cast.
CLI option parsing `packages/cli/src/index.ts`	Removed `parseInt` coercion from the `--indent` option; option still defaults to `4` but is no longer coerced by Commander (runtime value may be a string).
Tests: pull behavior & fixtures `packages/cli/test/db/pull.test.ts`	Updated fixtures and expectations (e.g., `UsersRole`→`Role`, `UserStatus`→`Status`); added tests for self-referencing models with multiple FKs, enum consolidation and ordering, comment preservation, validation/transform attributes, and provider-specific scenarios; adjusted many assertions accordingly.
Code generation / ordering & comments `packages/language/src/zmodel-code-generator.ts`	Stopped reordering declarations (preserve original source order); emit triple-slash comment blocks before enum declarations and enum fields; enum-field emission now places comment block before the indented field line.

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Poem

🐇 I found lone enums scattered in rows,
I nudged them home where their old names still glow,
Kept your /// whispers, each comment in place,
Tamed Json and Bytes so defaults show their face,
A little hop — your schema’s back in its flow.

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 14.29% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title directly and clearly summarizes the main changes: preserving data validation attributes, enum ordering, and comments during introspection.
Linked Issues check	✅ Passed	The PR addresses all coding requirements from both linked issues: enum ordering preserved via code generator changes [`#669`], triple-slash comments preserved [`#669`], data validation attributes retained via isDatabaseManagedAttribute logic [`#670`], and JSON/Bytes defaults fixed.
Out of Scope Changes check	✅ Passed	All changes are directly related to the linked issues: enum handling, comment preservation, attribute management, and default value fixes for JSON/Bytes types are all in scope.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing touches

📝 Generate docstrings

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment

No actionable comments were generated in the recent review. 🎉

🧹 Recent nitpick comments

packages/cli/src/actions/pull/index.ts (2)
536-548: LGTM — clash avoidance for opposite field names in self-referencing models.

Clean approach to avoid duplicate field names. The re-derivation pattern at line 546 mirrors the existing disambiguation at line 469.

One minor note: relation.references.columns[0] is typed as string | null (line 107). In practice it's always non-null here because targetFields resolution (lines 436–440) would have bailed out earlier otherwise, but a non-null assertion (!) would make the intent explicit and match the style at line 469.
Optional: add non-null assertion for consistency
-            `${lowerCaseFirst(sourceModel.name)}_${firstColumn}To${relation.references.table}_${relation.references.columns[0]}`,
+            `${lowerCaseFirst(sourceModel.name)}_${firstColumn}To${relation.references.table}_${relation.references.columns[0]!}`,
658-670: as any cast on field.type — consider a narrower type assertion.

The (field.type as any).reference cast works but is broad. If the AST type for field.type exposes a writable reference property (or can be cast to a more specific mutable type), that would be safer against future refactors.

Not blocking since this pattern appears common for AST manipulation in this codebase. Based on learnings, production code should prefer proper type annotations or targeted type assertions over broad casts.

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

Copilot

Pull request overview

This PR improves db pull introspection/merge behavior to better preserve original schema authoring intent (validation attributes, enum positioning, and enum comments) while also addressing MySQL enum-type fragmentation.

Changes:

Stop re-sorting declarations during codegen so enums keep their original placement in the schema file.
Preserve triple-slash comments for enum declarations (and enum fields) during generation.
Add enum consolidation logic during pull (notably for MySQL per-column synthetic enums) and expand CLI/test coverage for these behaviors.

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
packages/language/src/zmodel-code-generator.ts	Removes enum-last sorting; emits enum-level and enum-field comments so order/comments are preserved on generation.
packages/cli/test/db/pull.test.ts	Adds regression tests for enum ordering, enum comments, and validation attributes; adjusts some existing enum expectations.
packages/cli/src/index.ts	Changes `--indent` option handling (arg parsing removed).
packages/cli/src/actions/pull/index.ts	Adds `consolidateEnums` to map MySQL synthetic per-column enums back to shared enums using old schema references.
packages/cli/src/actions/db.ts	Invokes enum consolidation during pull; changes attribute add/delete syncing and generator option defaults.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

packages/cli/test/db/pull.test.ts

packages/cli/src/index.ts

packages/cli/src/actions/db.ts

coderabbitai

Actionable comments posted: 2

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

packages/cli/src/actions/db.ts (1)
465-494: ⚠️ Potential issue | 🟠 Major

Tightened attribute filter may silently drop DB-side attribute changes like @unique.

The filter now only considers @relation and @db.* for add/remove diffing. This correctly preserves user-defined validation attributes (@email, @length, etc.) that the DB doesn't know about.

However, DB-backed attributes like @unique and @updatedAt also fall outside this filter. If a column gains or loses a unique constraint in the DB, the change won't be reflected in the schema during a merge pull — the old @unique won't be removed, and a new one won't be added.

Please verify this is the intended trade-off, or consider also including @unique, @id, and @updatedAt in the filter.
#!/bin/bash
# Check how `@unique`, `@id`, `@updatedAt` are handled during the merge phase
# to see if there's separate handling outside the attribute diff
rg -n '@unique|@id|@updatedAt' packages/cli/src/actions/db.ts -C 5

🤖 Fix all issues with AI agents

In `@packages/cli/src/index.ts`:
- Line 173: Commander option for '--indent' currently lacks an arg parser so the
value becomes a string and violates PullOptions.indent: number; update the
Option creation for '--indent <number>' to provide an argParser that converts
the incoming string to a number (e.g., parseInt/Number) and validate it (ensure
it's an integer >= 0) before applying the .default(4) so PullOptions.indent is
always a proper number at runtime.

In `@packages/cli/test/db/pull.test.ts`:
- Around line 1265-1275: The test's expected substring spacing doesn't match the
formatter output; update the contains assertions against restoredSchema so the
model block uses four spaces before the id field's directive and one space
before the email field's directive (i.e., change the expected lines in the User
model to "id     Int    `@id` `@default`(autoincrement())" and "email  String
`@unique`") so the restoredSchema.contains checks match the zmodel-formatter.ts
output.

packages/cli/src/index.ts

coderabbitai

Actionable comments posted: 2

🤖 Fix all issues with AI agents

In `@packages/cli/src/actions/pull/utils.ts`:
- Around line 31-33: The isDatabaseManagedAttribute function currently returns
['@relation','@id','@unique'] and checks name.startsWith('@db.'), but misses
'@updatedAt' so db pull/introspection-generated `@updatedAt` attributes are
filtered out; update the whitelist inside isDatabaseManagedAttribute to include
'@updatedAt' (i.e., ensure the returned array contains '@updatedAt') so
attribute diffing in db.ts will treat `@updatedAt` as database-managed and sync
correctly.

In `@packages/cli/test/db/pull.test.ts`:
- Around line 1265-1275: The zero-restoration test assumes enum consolidation
from an old schema but for MySQL the synthetic enum name (e.g., UserStatus) is
produced when there is no old schema; update the test in pull.test.ts (the
assertions that inspect restoredSchema) to handle MySQL: either seed a non-empty
old schema so index.ts's enum consolidation logic can map synthetic enums back
to shared names, or change the MySQL branch of the assertions to expect the
synthetic enum name (e.g., expect(restoredSchema).contains("enum UserStatus {"))
while keeping the existing expectation for PostgreSQL/native enums; detect the
current connector (MySQL vs Postgres) in the test and assert the appropriate
enum name accordingly.

packages/cli/src/actions/pull/utils.ts

packages/cli/test/db/pull.test.ts

Prevents field name collisions during introspection by refining the naming strategy for self-referencing relations with multiple foreign keys. Extends support for JSON and Bytes default values across MySQL, PostgreSQL, and SQLite providers to ensure consistent schema restoration. Adds test cases for self-referencing models to verify the avoidance of duplicate fields.

coderabbitai

Actionable comments posted: 1

🤖 Fix all issues with AI agents

In `@packages/cli/src/actions/pull/index.ts`:
- Around line 638-647: consolidateEnums currently renames keepEnum to oldEnum
but leaves the synthetic @@map attribute added by syncEnums, so
getDbName(keepEnum) still returns the old mapped name and prevents matching;
after the line keepEnum.name = oldEnum.name (inside consolidateEnums) remove any
@@map attribute from keepEnum (or alternatively copy attributes from oldEnum
onto keepEnum) so that getDbName(keepEnum) reflects the renamed enum; reference
the functions/symbols consolidateEnums, keepEnum, oldEnum, syncEnums and
getDbName when making the change.

🧹 Nitpick comments (2)

packages/cli/src/actions/pull/index.ts (2)

650-661: Casting field.type to any to assign reference — consider a narrower assertion.

The (field.type as any).reference = { ... } pattern works but bypasses type safety on the AST node. A narrower cast such as (field.type as { reference: ... }).reference = ... would preserve partial type checking.

This is a minor concern given the AST mutation context, but worth noting per repo conventions around avoiding untyped casts in production code. Based on learnings, production code in packages/* should prefer explicit type annotations or targeted type assertions over broad any casts.

576-611: Enum mapping relies on the first field match — may miss mappings if the first matched field's old counterpart changed type.

The inner loop breaks after finding the first field that references newEnum (line 607). If that field's old counterpart no longer references an enum (e.g., it was changed to String), the mapping is lost even though other fields may still reference the shared enum.

In practice this is unlikely for the consolidation use case (MySQL synthetic enums), but a continue instead of break at line 607 (falling through to check more fields) would make it more robust.

packages/cli/src/actions/pull/index.ts

coderabbitai

Actionable comments posted: 1

🤖 Fix all issues with AI agents

In `@packages/cli/src/actions/pull/index.ts`:
- Around line 567-674: The code assigns oldEnum.attributes directly to
keepEnum.attributes which leaves attribute AST nodes with their $container still
pointing at oldEnum; in consolidateEnums, after preparing keepEnum and before
updating references, replace keepEnum.attributes with a shallow copy of
oldEnum.attributes and re-parent each attribute node by setting its $container
(and $parent if used elsewhere) to keepEnum so the attributes' parent pointers
are correct; operate on the symbols keepEnum and oldEnum within consolidateEnums
and ensure you mutate the attribute objects' container references rather than
leaving them pointing to oldEnum.

🧹 Nitpick comments (1)

packages/cli/src/actions/pull/index.ts (1)

654-666: as any cast to mutate reference — consider a narrower type assertion.

Line 660 uses (field.type as any).reference to rewrite the enum reference. While functional, this bypasses type safety entirely. A narrower assertion (e.g., casting to a type with a writable reference property) would be preferable to avoid accidentally setting the wrong shape.

packages/cli/src/actions/pull/index.ts

svetch · 2026-02-09T20:02:37Z

@coderabbitai resume

coderabbitai · 2026-02-09T20:02:47Z

✅ Actions performed

Reviews resumed.

Copilot AI review requested due to automatic review settings February 8, 2026 15:01

Copilot started reviewing on behalf of svetch February 8, 2026 15:01 View session

Copilot AI reviewed Feb 8, 2026

View reviewed changes

packages/cli/test/db/pull.test.ts Outdated Show resolved Hide resolved

packages/cli/src/index.ts Show resolved Hide resolved

packages/cli/src/actions/db.ts Show resolved Hide resolved

coderabbitai bot reviewed Feb 8, 2026

View reviewed changes

packages/cli/src/index.ts Show resolved Hide resolved

coderabbitai bot reviewed Feb 8, 2026

View reviewed changes

packages/cli/src/actions/pull/utils.ts Show resolved Hide resolved

packages/cli/test/db/pull.test.ts Outdated Show resolved Hide resolved

fix: address PR comments

5c932f2

svetch force-pushed the main branch from d616524 to 5c932f2 Compare February 8, 2026 15:32

coderabbitai bot reviewed Feb 8, 2026

View reviewed changes

packages/cli/src/actions/pull/index.ts Show resolved Hide resolved

coderabbitai bot reviewed Feb 9, 2026

View reviewed changes

packages/cli/src/actions/pull/index.ts Show resolved Hide resolved

svetch force-pushed the main branch from eea21e5 to 69e23b2 Compare February 9, 2026 19:57

fix: address PR comments

b1e473f

svetch force-pushed the main branch from 69e23b2 to b1e473f Compare February 9, 2026 20:57

ymc9 approved these changes Feb 10, 2026

View reviewed changes

ymc9 merged commit d9cdd9c into zenstackhq:dev Feb 10, 2026
6 of 7 checks passed

Uh oh!

Comments

Conversation

svetch commented Feb 8, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Feb 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Poem

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

svetch commented Feb 9, 2026

Uh oh!

coderabbitai bot commented Feb 9, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

svetch commented Feb 8, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Feb 8, 2026 •

edited

Loading