In the construction industry, information comes from many sources – drawings, specifications, schedules and reports. To effectively manage this flow of information, it needs to be structured. Structured data allows you to organize information in a convenient, readable and accessible form.
According to JB Knowledge’s 5th Annual Construction Technology Report (Deloitte, “Data-Driven Management in Digital Capital Projects,” December 16, 2016), 67% of construction project management professionals track and evaluate job performance manually or using spreadsheets.
Some of the most common structured data formats are XLSX and CSV. They are widely used for storing, processing and analyzing information in spreadsheets. In such spreadsheets, data is presented in the form of rows and columns, which makes them easy to read, edit, and analyze
XLSX, a format created by Microsoft, is based on the use of XML -structures and is archived using the ZIP algorithm. The main features of the format:
- Support for complex formulas, charts and macros.
- Ability to store data in different sheets as well as format information.
- Optimized for Microsoft Excel, but compatible with other office suites as well.
CSV format is a plain text file in which values are separated by commas, semicolons, or other delimiter characters. Key benefits:
- Universal compatibility with various programs and operating systems.
- Easy import/export to databases and analytical systems.
- Easy processing even in text editors.
However, CSV does not support formulas and formatting, so its main application is data exchange between systems and mass updates of information. Due to its versatility and platform independence, CSV has become a popular tool for data transfer in heterogeneous IT environments.
The two formats XLSX and CSV act as a link between different systems that work with structured data (Fig. 3.1-4). They are particularly useful in tasks where readability, manual editing and basic compatibility are important

Platform independence makes CSV the most popular format for data transfer in heterogeneous IT environments and systems.
However, XLSX and CSV are not designed for high-performance computing or long-term storage of large amounts of data. More modern structured formats such as Apache Parquet, Apache ORC, Feather, HDF5 are used for such purposes. These formats will be discussed in more detail in the chapter “Storing Big Data: Analyzing Popular Formats and Their Effectiveness” in Part 9 of this book
In practice, Excel with XLSX format is more often used for small tasks and automation of routine processes. More complex scenarios require the use of data management systems such as ERP, PMIS CAFM, CPM, SCM and others (Fig. 3.2-1). These systems store structured data on which the organization and management of the company’s information flows are based.
Modern data management information systems used in the construction industry rely on structured data organized as tables. For reliable, scalable and holistic management of large volumes of information, application and system developers are turning to relational database management systems (RDBMS).