Building a management layer to your data lake for structured/unstructured data

แชร์
ฝัง
  • เผยแพร่เมื่อ 16 ก.ย. 2024
  • Intro: The challenges in managing a data lake for structured and unstructured data.
    Achieving manageability:
    1. The components of the architecture and their role
    Opentable formats
    Catalogs
    Data Version control systems
    2. How it all fits together
    Example using Databricks technologies
    Example using Apache Iceberg
    Example using AWS technologies
    3. Discussion
    Language: English
    About the lecturer: Einat Orr is the CEO and Co-founder of Treeverse, the company behind lakeFS, an open source platform that delivers a git-like experience to object-storage based data lakes. She received her PhD. in Mathematics from Tel Aviv University, in the field of optimization in graph theory. Einat previously led several engineering organizations, most recently as CTO at SimilarWeb.
    big-data-demys...

ความคิดเห็น •