What approach defines how data is handled in a data lake regarding schemas?

Answer

Schema-on-Read

The defining characteristic of a data lake architecture concerning schema definition is the 'schema-on-read' approach. This methodology dictates that data is stored in its native, raw format without any requirement for predefined structures or models upon ingestion. The structure or schema is applied only at the time the data is requested or queried for a specific analytical purpose. This contrasts sharply with traditional data warehouses, which mandate 'schema-on-write,' meaning data must conform to a rigorous model before it is even stored in the repository.

What approach defines how data is handled in a data lake regarding schemas?
inventionmedicinetechnologydatadata lake