How large can data output be from a single whole-genome sequencing run?

Answer

Hundreds of gigabytes

The explosive growth of data volume, particularly in areas like genomics, was a key driver forcing healthcare institutions towards data lake architecture. The complexity and sheer size of the data generated by modern sequencing methods are substantial. Specifically, a single run of whole-genome sequencing can produce hundreds of gigabytes of data. Attempting to structure this massive volume and complexity into a traditional, highly structured data warehouse schema before specific research questions are formulated is described as being both inefficient and costly, highlighting the inherent advantage of the data lake for handling such large binary objects natively.

How large can data output be from a single whole-genome sequencing run?
inventionmedicinetechnologydatadata lake