fix(writer): spark 38811 insert alter table add columns #3479
Shekharrajak wants to merge 5 commits into apache:main
Conversation
    }

    // Refresh the catalog table cache so subsequent reads see the new data
    catalogTable.foreach { ct =>
This was a different issue: while running the test, I realised the table needs to be refreshed to get the new data.
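For illustration, a minimal sketch of the kind of refresh the snippet above performs, assuming a `SparkSession` named `spark` and an `Option[CatalogTable]` named `catalogTable` (the helper name is mine, not the PR's code):

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.catalyst.catalog.CatalogTable

// Hypothetical helper: after writing files for a catalog table, invalidate the
// cached relation so subsequent reads pick up the newly written data.
def refreshAfterWrite(spark: SparkSession, catalogTable: Option[CatalogTable]): Unit = {
  catalogTable.foreach { ct =>
    // refreshTable invalidates cached metadata and data for the given table name.
    spark.catalog.refreshTable(ct.identifier.quotedString)
  }
}
```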
      .setOutputPath(outputPath)
      .setCompression(codec)
-     .addAllColumnNames(cmd.query.output.map(_.name).asJava)
+     .addAllColumnNames(cmd.outputColumnNames.asJava)
Operator {
  plan_id: 42
  parquet_writer: ParquetWriter {
    output_path: "file:/.../spark-warehouse/t"
    compression: SNAPPY
    column_names: ["i", "s"]
  }
......
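For context, a hypothetical repro of the insert-after-ALTER-TABLE-ADD-COLUMNS scenario this change targets, assuming a `SparkSession` named `spark`; the table and column names are mine, chosen to mirror the `["i", "s"]` columns in the serialized operator above (the real test lives in CometParquetWriterSuite):

```scala
// Sketch only: after ADD COLUMNS, the insert query's output attribute names
// ("a", "b") differ from the table's column names ("i", "s"), so the writer
// must take its column names from cmd.outputColumnNames, not the query output.
spark.sql("CREATE TABLE t (i INT) USING parquet")
spark.sql("ALTER TABLE t ADD COLUMNS (s STRING)")
spark.sql("INSERT INTO t SELECT 1 AS a, 'x' AS b")
spark.table("t").show()
```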
spark/src/test/scala/org/apache/comet/parquet/CometParquetWriterSuite.scala
andygrove left a comment:
Thanks @Shekharrajak, this is looking good overall. Would it be possible to add assertions to the new tests to verify that the plan (or the key part of the plan) is actually using Comet operators?
Force-pushed from 1354593 to 788bcab
788bcab: assertCometNativeWrite helps validate the expected execution plan. Please have a look.
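For illustration only, a rough sketch of the kind of plan assertion being discussed, assuming the `capturedPlan` captured by the listener shown below and a simple class-name check; the actual assertCometNativeWrite helper in the test suite may look different:

```scala
import org.apache.spark.sql.execution.QueryExecution

// Sketch only: fail the test unless a Comet operator appears in the write plan.
def assertCometNativeWriteSketch(capturedPlan: Option[QueryExecution]): Unit = {
  assert(capturedPlan.isDefined, "No write command plan was captured")
  val plan = capturedPlan.get.executedPlan
  // Look for an operator whose class name indicates a Comet (native) operator.
  assert(
    plan.exists(_.getClass.getSimpleName.contains("Comet")),
    s"Expected a Comet operator in the write plan:\n$plan")
}
```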
    val listener = new org.apache.spark.sql.util.QueryExecutionListener {
      override def onSuccess(funcName: String, qe: QueryExecution, durationNs: Long): Unit = {
        if (funcName == "command") {
Could we also use this directly?

if (qe.executedPlan.exists(_.isInstanceOf[DataWritingCommandExec])) {
  capturedPlan = Some(qe)
}
This did not work, since we require stripAQEPlan to get past the AQE wrapper.
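A minimal sketch of the variant being discussed, with the AQE unwrapping that stripAQEPlan performs inlined for clarity; the `capturedPlan` variable and listener registration are assumptions mirroring the snippet above, not the PR's exact code:

```scala
import org.apache.spark.sql.execution.QueryExecution
import org.apache.spark.sql.execution.adaptive.AdaptiveSparkPlanExec
import org.apache.spark.sql.execution.command.DataWritingCommandExec
import org.apache.spark.sql.util.QueryExecutionListener

var capturedPlan: Option[QueryExecution] = None

val listener = new QueryExecutionListener {
  override def onSuccess(funcName: String, qe: QueryExecution, durationNs: Long): Unit = {
    // Equivalent of stripAQEPlan: peel off the adaptive wrapper if present.
    val plan = qe.executedPlan match {
      case a: AdaptiveSparkPlanExec => a.executedPlan
      case p => p
    }
    // Capture the query execution for the write command only.
    if (plan.exists(_.isInstanceOf[DataWritingCommandExec])) {
      capturedPlan = Some(qe)
    }
  }
  override def onFailure(funcName: String, qe: QueryExecution, exception: Exception): Unit = ()
}
// Registered via spark.listenerManager.register(listener) in the test.
```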
    val maxWaitTimeMs = 5000
    val checkIntervalMs = 50
    var iterations = 0
    while (capturedPlan.isEmpty && iterations < maxWaitTimeMs / checkIntervalMs) {
Wait for some time to make sure the query plan has completed.
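A minimal sketch of the full polling loop being discussed, assuming `capturedPlan` is the `Option[QueryExecution]` set by the listener above; the loop body and final assertion are my completion of the snippet, not the PR's exact code:

```scala
// Sketch only: poll until the listener has captured the write plan, or time out.
val maxWaitTimeMs = 5000
val checkIntervalMs = 50
var iterations = 0
while (capturedPlan.isEmpty && iterations < maxWaitTimeMs / checkIntervalMs) {
  Thread.sleep(checkIntervalMs)
  iterations += 1
}
assert(capturedPlan.isDefined, s"Plan was not captured within ${maxWaitTimeMs}ms")
```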
Which issue does this PR close?
Closes #3422
Rationale for this change
Comet bypasses Spark's logicalPlanOutputWithNames() entirely. It must explicitly use cmd.outputColumnNames (the table's actual column names) to achieve the same result.
What changes are included in this PR?
Renames attributes in the logical plan (outputColumns) before passing them to FileFormatWriter, as sketched below.
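For illustration, a rough sketch of the renaming idea; the function name and signature are mine, and it mirrors what Spark's DataWritingCommand.logicalPlanOutputWithNames does rather than the PR's exact code:

```scala
import org.apache.spark.sql.catalyst.expressions.Attribute

// Sketch only: rename the query's output attributes to the table's column names
// (cmd.outputColumnNames) so the written files use the table schema's names.
def renameToTableColumns(
    queryOutput: Seq[Attribute],
    outputColumnNames: Seq[String]): Seq[Attribute] = {
  queryOutput.zip(outputColumnNames).map {
    case (attr, name) => if (attr.name == name) attr else attr.withName(name)
  }
}
```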
How are these changes tested?
Unit tests.