Commit
[SPARK-46084][PS] Refactor data type casting operation for Categorical type

### What changes were proposed in this pull request?

This PR proposes to refactor the data type casting operation, especially `DataTypeOps.astype`, for the Categorical type.

### Why are the changes needed?

To improve performance, debuggability, and readability by using the official API. We can leverage the PySpark functions `coalesce` and `create_map` instead of implementing the logic in Python from scratch.

### Does this PR introduce _any_ user-facing change?

No, it's an internal optimization.

### How was this patch tested?

The existing CI should pass.

### Was this patch authored or co-authored using generative AI tooling?

No.

Closes apache#43993 from itholic/refactor_cat.

Authored-by: Haejoon Lee <haejoon.lee@databricks.com>
Signed-off-by: Dongjoon Hyun <dhyun@apple.com>