// ROBOTICS AND SMART FACTORIES TERM

Data Lake

A large, centralized repository that stores vast amounts of raw data in its native format, without a predefined structure. It can hold structured, semi-structured, and unstructured data.

Data Lake — illustration from Wikipedia
Image via Wikipedia

TECHNICAL DEFINITION

A data lake is a centralized repository designed to store vast quantities of raw, unstructured, semi-structured, and structured data at scale, enabling flexible data exploration, advanced analytics, and machine learning applications without prior schema definition.

BACKGROUND

The Fourth Industrial Revolution, also known as 4IR, Industry 4.0 or the Intelligence Age, is a neologism describing rapid technological advancement in the 21st century. It follows the Third Industrial Revolution. The term was popularized in 2016 by Klaus Schwab, the World Economic Forum founder and former executive chairman, who asserts that these developments represent a significant shift in industrial capitalism.

READ MORE ON WIKIPEDIA

SYNONYMS & ALIASES

  • Big data repository
  • Enterprise data lake

USAGE NOTE

Data lakes are used in manufacturing to store diverse operational data, from sensor readings to production logs, for future analysis and AI model training.

DEVELOPERS

Organizations developing technology related to Data Lake.

  • Amazon Web Services (AWS)

    Provides a suite of cloud computing services, including AWS Lake Formation and Amazon S3, which are foundational for building data lakes. AWS offers specific solutions for manufacturers to ingest, store, and analyze data from operational technology (OT) and information technology (IT) systems.

  • Microsoft Azure

    A cloud computing platform offering Azure Data Lake Storage and Azure Synapse Analytics to create large-scale data repositories for manufacturing. These services enable manufacturers to consolidate data from IoT devices, production lines, and supply chains for predictive maintenance and operational intelligence.

  • Databricks

    Offers a unified data analytics platform based on the 'Lakehouse' paradigm, combining data lakes' flexibility with data warehouses' performance. For manufacturing, it enables large-scale data engineering and AI on sensor, machine, and production data to improve operational efficiency.

  • Snowflake

    Provides the Manufacturing Data Cloud, a platform that enables manufacturers to build a central data lake for seamless data access and governance. It helps unify IT and OT data, facilitating applications like supply chain visibility, product quality analysis, and production optimization.

  • Siemens

    Through its Siemens Xcelerator portfolio and industrial IoT solutions, Siemens helps manufacturers create data lakes for operational technology (OT) data. This enables the analysis of data from machines and plants for performance optimization, predictive maintenance, and digital twin creation.

  • Cloudera

    Provides the Cloudera Data Platform (CDP), an enterprise data cloud that enables manufacturers to build and manage multi-functional data lakes. It supports the ingestion and processing of vast amounts of industrial data for use cases like predictive maintenance and yield optimization.

  • Rockwell Automation

    Offers the FactoryTalk platform, which helps manufacturers collect and contextualize data from industrial operations. It provides the foundation for an operational data lake, enabling analytics and insights by unifying information from disparate control systems and enterprise applications.

  • PTC

    Develops the ThingWorx Industrial IoT (IIoT) platform, which enables companies to connect to industrial assets and build a data lake of operational information. The platform facilitates the storage, contextualization, and analysis of machine data for applications in smart manufacturing.

  • Google Cloud

    Offers a comprehensive suite of services, including Cloud Storage and BigQuery, to build scalable data lakes for the manufacturing industry. Their Manufacturing Data Engine helps unify data from factory floors and business systems to enable AI-driven insights and process optimization.

RELATED TERMS IN SMART FACTORY