Databricks IN-DB support for create or replace table when overwriting an existing table
Status:
Accepting Votes
Submitted by AStasi on 08-30-2022 06:19 AM
Currently, the Databricks in-database connector offers the following options when writing to the database:
- Append Existing
- Overwrite Table (Drop)
- Create New Table
- Create Temporary Table
This request is to add a fifth option that would execute:
- Create or Replace Table
Why is this important?
- Create or Replace is similar to Overwrite Table (Drop) in that it fully replaces the existing table. However, the key differences are:
  - Drop Table completely removes the table and its data from Databricks
    - Any users or processes connected to that table live will fail during the writing process
    - No history is maintained on the table, and table history is a key feature of Databricks Delta Lake
  - Create or Replace does not remove the table
    - Any users or processes connected to that table live will not fail, as the table is not dropped
    - History is maintained for table versions, which is a key feature of Databricks Delta Lake
While this request was specific to testing on Azure Databricks, the Databricks documentation for both Azure and AWS recommends using "Replace" instead of "Drop" and "Create" for Delta tables.
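To make the difference concrete, here is a rough sketch of the SQL each mode would send to Databricks. The helper `build_overwrite_sql`, the mode names, and the table names are hypothetical and only for illustration; this is not how the Alteryx connector is actually implemented.

```python
def build_overwrite_sql(table: str, select_sql: str, mode: str) -> list[str]:
    """Return the SQL statements for a given overwrite mode (illustrative only)."""
    if mode == "drop":
        # Existing behaviour as described above: the table disappears
        # between the two statements, and its Delta history is lost.
        return [
            f"DROP TABLE IF EXISTS {table}",
            f"CREATE TABLE {table} AS {select_sql}",
        ]
    if mode == "replace":
        # Requested behaviour: a single statement, so the table is never
        # dropped and the write is recorded as a new Delta table version.
        return [f"CREATE OR REPLACE TABLE {table} AS {select_sql}"]
    raise ValueError(f"Unknown mode: {mode!r}")


if __name__ == "__main__":
    # Hypothetical table names, used only to show the generated statements.
    for mode in ("drop", "replace"):
        for stmt in build_overwrite_sql(
            "sales.orders", "SELECT * FROM staging.orders", mode
        ):
            print(f"[{mode}] {stmt}")
```

After a replace-style write, running `DESCRIBE HISTORY sales.orders` on a Delta table should list the earlier versions, which is the history the Drop option discards.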