Dive into the Code


Every week, fresh solutions are released through our social channels
Don't miss the new solutions
Facing Issues? Reach Out!
Offer support for any troubleshooting or advanced use-cases



Offer support for any troubleshooting or advanced use-cases
Change language
Post's Highlights
Stay updated: news and insights
We’re Here to Help
Fresh solutions are released through our social channels
Dive into the world of data-driven construction with this accessible guide, perfect for professionals and novices alike.
From the basics of data management to cutting-edge trends in digital transformation, this book
will be your comprehensive guide to using data in the construction industry.
No construction project in the world has ever started in a CAD program. Before a drawing or model takes shape in CAD, it passes through the conceptualization stage (Fig. 6.4-1, stages 1-2), where the focus...
In addition to the development of CAD database access tools and open and simplified CAD -formats, the emergence of LLM -tools (Large Language Models) is revolutionizing the processing of design data. Whereas previously the access...
The process of data processing from DWG -files due to the unstructured nature of the information – has always been a complex task, requiring specialized software and often manual analysis. However, with the development of...
When working with the design data of the future, it is unlikely that anyone really needs to understand the geometric kernels of proprietary tools or learn hundreds of incompatible formats containing the same information. However,...
After the steps of collecting, structuring, cleaning and verifying the information, a coherent and analyzable data set has been formed. The previous parts of the book covered the systematization and structuring of heterogeneous sources –...
In today’s construction industry, where project data is characterized by complexity and multi-level structure, visualization plays a key role. Visualization of data allows project managers and engineers to visualize complex patterns and trends hidden in...
In today’s construction industry, the management of performance indicators (KPI and ROI) and their visualization through reports and dashboards play a key role in improving productivity and project management efficiency. As in any business, in...
A variety of charts and graphs are used to visualize indicators and metrics, which are typically combined into data showcases and dashboards. These dashboards provide a centralized view of the status of a project or...
Data interpretation is the final stage of analysis, where information makes sense and begins to “speak”. This is where the answers to the key questions are formulated: “what to do?” and “how to do?” (Fig....
When key performance indicators (KPI) stop growing, despite the increase in data volumes and team size, company management inevitably comes to the realization of the need to automate processes. Sooner or later this realization becomes...
The first stage of the ETL process – Extract) – starts with writing code to collect data sets to be further checked and processed. To do this, we scan all the folders of the production...
The Transform step is where the data is processed and transformed. This process may include correctness checking, normalization, filling in missing values and validation using automated tools According to the PwC study “Data-Driven. What Students...
After completion of the Transform stage, when the data have been brought to a structured form and verified, the final stage – Load, where the data can be both loaded into target system and visualized...
At the data loading stage it is possible not only to visualize data, upload them to tables or databases, but also to automatically generate reports, including the necessary graphs, charts and key analytical indicators that...
Automating reporting at the ETL stage Load is an important step in data processing, especially when the results of the analysis need to be presented in a format that is easy to communicate and understand....
At the Load stage, the results were generated in the form of tables, graphs and final PDF reports prepared in accordance with the established requirements. Further it is possible to export this data into machine-readable...
It’s time to move on to building a full-fledged ETL process that covers all key stages of data handling in a single scenario – extraction, transformation and loading. Let’s build an automated ETL-Pipeline that allows...
In the previous chapters on creating data requirements and automating ETL, we step-by-step broke down the process of data preparation, transformation, validation, and visualization. These activities were implemented as separate code blocks (Fig. 7.2-18 –...
Data from CAD systems and databases (BIM) are some of the most sophisticated and dynamically updated data sources in the business of construction companies. These applications not only describe the project using geometry, but also...
Apache Airflow is a free and open source platform, designed to automate, orchestrate and monitor workflows (ETL -conveyors). Working with large amounts of data is required every day: Download files from different sources – Extract...
Apache Airflow is widely used to organize complex data processing processes, allowing to build flexible ETL -conveyors. Apache Airflow can be run either through a web interface or programmatically through Python code (Fig. 7.4-2). In...
Apache NiFi is a powerful open source platform,designed to automate data flows between different systems. Originally developed in 2006 by the US National Security Agency (NSA) under the name “Niagara Files” for internal use. In...
n8n is an Open Source Low-Code / No-Code platform for building automated workflows, characterized by ease of use, flexibility and the ability to quickly integrate with a wide range of external services. No-Code is a...
Today’s construction companies operate in an environment of high uncertainty: changing material prices, delayed deliveries, labor shortages and tight project deadlines. The use of analytical dashboards, ETL -conveyors and BI systems helps companies quickly find...
Everything in the Universe consists of the smallest building blocks – atoms and molecules, and over time all living and non-living things inevitably return to this initial state. In nature, this process occurs with astonishing...
Data warehouses allow companies to collect and combine information from different systems, creating a single center for subsequent analytics. Collected historical data enables not only deeper analysis of processes, but also the identification of patterns...
Storage formats play a key role in the scalability, reliability, and performance of analytics infrastructure. For data analysis and processing – such as filtering, grouping, and aggregation – our examples used Pandas DataFrame – a...
One of the popular formats for storing and processing big data is Apache Parquet. This format is designed specifically for columnar storage (similar to Pandas), which allows you to significantly reduce memory footprint and increase...
Just as the Parquet format is optimized for efficient storage of large amounts of information, the Data Warehouse is optimized for integrating and structuring data to support analytics, forecasting and management decision making. In today’s...
Classic DWH – data warehouses, designed to store structured data in a format optimized for analytical queries, have faced limitations in handling unstructured data and scalability. In response to these challenges, Data Lakes) have emerged,...
To combine the best features of DWH (structured, manageable, high performance analytics) and Data Lake (scalability, handling heterogeneous data), the Data Lakehouse approach was developed. This architecture combines the flexibility of data lakes with the...
Some construction and engineering companies are already using the concept of Common Data Environment (CDE) according to ISO 19650. In essence, the CDE performs the same functions as a data warehouse (DWH) in other industries:...
Vector databases are a new class of repositories that do not just store data, but allow searching by meaning, comparing objects by semantic proximity, and creating intelligent systems: from recommendations to automatic analysis and context...
Understanding and implementing the concepts of Data Governance, Data Minimalism, and preventing Data Swamp are key to successfully managing data warehouses and delivering business value (Fig. 8.2-3). According to a study by Gartner (2017), 85%...
While Data Governance is responsible for controlling and organizing data, DataOps helps ensure its accuracy, consistency and smooth flow within the company. This is especially critical for a number of business cases in construction, where...
Traditional approaches to building data warehousing often result in the creation of disparate “silos of information” where important insights are inaccessible for analysis and decision making. Modern storage concepts, such as Data Warehouse, Data Lake...
The term “big data” does not have a strict definition. The concept originally appeared when the volume of information began to exceed the capabilities of traditional methods of its processing. Today, the volume and complexity...
Traditionally, construction was based on subjective hypotheses and personal experience. Engineers assumed – with a certain degree of probability – how the material would behave, what loads the structure would withstand and how long the...
Working with open datasets provides a unique opportunity to put into practice the principles discussed in previous chapters: judicious feature selection, representative sampling, visualization, and critical analysis. In this chapter, we will explore how complex...
In the following example we will analyze a large dataset using data from different CAD tools (BIM). To collect and create the large dataset, a specialized automated web crawler (script) was used, configured to automatically...
This website uses cookies
We use cookies to personalise content and ads, to provide social media features and to analyse our traffic. We also share information about your use of our site with our social media, advertising and analytics partners who may combine it with other information that you’ve provided to them or that they’ve collected from your use of their services.
Cookies are small text files that can be used by websites to make a user's experience more efficient.
The law states that we can store cookies on your device if they are strictly necessary for the operation of this site. For all other types of cookies we need your permission. This means that cookies which are categorized as necessary, are processed based on GDPR Art. 6 (1) (f). All other cookies, meaning those from the categories preferences and marketing, are processed based on GDPR Art. 6 (1) (a) GDPR.
This site uses different types of cookies. Some cookies are placed by third party services that appear on our pages.
You can at any time change or withdraw your consent from the Cookie Declaration on our website.
Learn more about who we are, how you can contact us and how we process personal data in our Privacy Policy.
Please state your consent ID and date when you contact us regarding your consent.

Linux

macOS
🚀 Welcome to the future of data in construction!
You're taking your first step into the world of open data, working with normalized, structured data—the foundation of data analytics and modern automation tools.

By downloading, you agree to the DataDrivenConstruction terms of use
Stay ahead with the latest updates on converters, tools, AI, LLM
and data analytics in construction — Subscribe now!
🚀 Welcome to the future of data in construction!
You're taking your first step into the world of open data, working with normalized, structured data—the foundation of data analytics and modern automation tools.

By downloading, you agree to the DataDrivenConstruction terms of use
Stay ahead with the latest updates on converters, tools, AI, LLM
and data analytics in construction — Subscribe now!
🚀 Welcome to the future of data in construction!
You're taking your first step into the world of open data, working with normalized, structured data—the foundation of data analytics and modern automation tools.

By downloading, you agree to the DataDrivenConstruction terms of use
Stay ahead with the latest updates on converters, tools, AI, LLM
and data analytics in construction — Subscribe now!
🚀 Welcome to the future of data in construction!
You're taking your first step into the world of open data, working with normalized, structured data—the foundation of data analytics and modern automation tools.

By downloading, you agree to the DataDrivenConstruction terms of use
Stay ahead with the latest updates on converters, tools, AI, LLM
and data analytics in construction — Subscribe now!
🚀 Welcome to the future of data in construction!
You're taking your first step into the world of open data, working with normalized, structured data—the foundation of data analytics and modern automation tools.

By downloading, you agree to the DDC terms of use
🚀 Welcome to the future of data in construction!
You're taking your first step into the world of open data, working with normalized, structured data—the foundation of data analytics and modern automation tools.

By downloading, you agree to the DataDrivenConstruction terms of use
Stay ahead with the latest updates on converters, tools, AI, LLM
and data analytics in construction — Subscribe now!
🚀 Welcome to the future of data in construction!

By downloading, you agree to the DataDrivenConstruction terms of use
Stay ahead with the latest updates on converters, tools, AI, LLM
and data analytics in construction — Subscribe now!
CHAPTER 016: HiPPO or the Danger of Opinions in Decision Making
Discusses the importance of structured data over subjective opinions for decision-making.
CHAPTER 035: Data Transformation: The Critical Foundation of Modern Business Analysis
Covers data cleaning, validation, and structuring, including handling "dirty" data.
CHAPTER 025: Structured Data
Explores working with structured formats like XLSX, CSV, and SQL.
CHAPTER 051: Learning how to turn documents, PDF, pictures and texts into structured formats
Practical methods for converting different Documents abd geometric data into structured tables.
CHAPTER 007: Corporate Mycelium: How Data Connects to Business Processes
Explains data integration across systems via "mycelium" networks of managers and tools.
CHAPTER 014: Data Silos and Their Impact on Company Performance
Challenges of siloed systems and the need for workflow automation.
CHAPTER 132: n8n & Workflow Automation
Examples of integrating data using ETL tools (Apache NiFi, Airflow, n8n).
CHAPTER 046: Choosing an IDE: From LLM Experiments to Business Solutions
Setting up automated pipelines in Jupyter Notebook, VS Code, and Google Collab.
CHAPTER 041: LLM Chatbots: ChatGPT, LlaMa, Mistral, Claude, DeepSeek, QWEN, Grok for Automating Data Processing
The role of LLMs in data analysis and code generation.
CHAPTER 045: RAG: Intelligent LLM Assistants with Access to Corporate Data
Retrieval-Augmented Generation (RAG) for document and database queries.
CHAPTER 044: AI in the Company and How to Deploy Your Own LLM
implementing local LLMs to process sensitive construction data
CHAPTER 029: Text Data: Between Unstructured Chaos and Structure
details methods to transform raw text (contracts, reports) into structured formats
