A data lake is a centralized repository that allows for the storage of all structured, semi-structured, and unstructured data, in its original format and at any scale. In its most simple form, data is stored without conversion or adherence to a predefined scheme. This allows highly skilled data analysts or data scientists to explore their data closer to the original context.