Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
  1. What is a Delta Lake ?

    It’s an open-source storage layer for Spark that enables relational database capabilities for batch and streaming data.

  2. What’s the advantage of using Delta Lake in a LakeHouse Architecture ?

    Delta Lake offers the benefits of a relational database system with the flexibility of storing data in a data lake.

  3. Do you need to work directly with Delta Lake APIs to use tables in a Fabric LakeHouse ?

    No, we can also use SQL-based data manipulation.

  4. What are the benefits of using Delta tables ?

    1. Querying and data modification

    2. ACID Transactions

    3. Versioning and time travel

    4. Batch and streaming data

    5. Standard formats and interoperability

  5. What’s the difference between managed and external tables ?

    Managed tables have both table definition and data files managed by the Spark Runtime for the Fabric LakeHouse while external tables have the table definition mapped to an alternative file storage location.

  6. What’s the advantage of saving data in delta format without creating a table definition ?

    It allows to persist the results of data transformations and later overlay a table definition or process directly using the Delta Lake API.