Your learning journey starts here – select a chapter group

Part 8 explores modern data storage and management technologies for the construction industry. It analyzes efficient formats for handling large amounts of information - from simple CSV and XLSX to the more productive Apache Parquet and ORC with a detailed comparison of their capabilities and limita-tions. The concepts of data warehouses (DWH), data lakes (Data Lakes) and their hybrid solutions (Data Lakehouse), as well as the principles of data gov-ernance (Data Governance) and data minimalism (Data Minimalism) are dis-cussed. The problems of Data Swamp) and strategies to prevent chaos in information systems are covered in detail. New approaches to working with data are presented, including vector databases and their application in con-struction through the concept of Bounding Box. This part also touches upon the DataOps and VectorOps methodologies as new standards for organizing data workflows.

filtercontent
  • ALL THE CHAPTERS IN THIS PART
  • DATA INFRASTRUCTURE: FROM STORAGE FORMATS TO DIGITAL REPOSITORIES (8)
  • DATA WAREHOUSE MANAGEMENT AND CHAOS PREVENTION (4)