About 82 results
Open links in new tab
  1. What is Apache Hadoop and MapReduce - Azure HDInsight

    Feb 28, 2025 · The Hadoop ecosystem includes related software and utilities, including Apache Hive, Apache HBase, Spark, Kafka, and many others. Azure HDInsight is a fully managed, full-spectrum, …

  2. What is Azure HDInsight | Microsoft Learn

    Feb 24, 2025 · What is HDInsight and the Hadoop technology stack? Azure HDInsight is a managed cluster platform that makes it easy to run big data frameworks like Apache Spark, Apache Hive, …

  3. Azure Data Lake Storage Introduction - Azure Storage | Microsoft Learn

    May 15, 2026 · Azure Data Lake Storage is primarily designed to work with Hadoop and all frameworks that use the Apache Hadoop Distributed File System (HDFS) as their data access layer. Hadoop …

  4. Manage Apache Hadoop clusters in HDInsight by using the Azure portal

    Feb 20, 2025 · Most Hadoop jobs are batch jobs that run only occasionally. For most Hadoop clusters, there are large periods of time when the cluster isn't used for processing. With HDInsight, your data …

  5. Azure Data Lake Storage Gen2 overview in HDInsight

    Jun 13, 2024 · Azure Data Lake Storage Gen2 takes core features from Azure Data Lake Storage Gen1 and integrates them into Azure Blob storage. These features include a file system that is compatible …

  6. Use SSH with Hadoop - Azure HDInsight | Microsoft Learn

    May 9, 2024 · Learn how to use Secure Shell (SSH) to securely connect to Apache Hadoop on Azure HDInsight. For information on connecting through a virtual network, see Azure HDInsight virtual …

  7. Choose a data storage technology - Azure Architecture Center

    Oct 4, 2024 · Compare big data storage technology options in Azure, including key selection criteria and a capability matrix.

  8. Transform data using Hadoop MapReduce activity - Azure Data …

    Mar 25, 2026 · Learn how to process data by running Hadoop MapReduce programs on an Azure HDInsight cluster with Azure Data Factory or Synapse Analytics.

  9. Using the HDFS CLI with Azure Data Lake Storage - Azure Storage

    Nov 18, 2024 · Use the Hadoop Distributed File System (HDFS) CLI for Azure Data Lake Storage. Create a container, get a list of files or directories, and more.

  10. Introduction to Azure Data Factory - Azure Data Factory

    6 days ago · Learn about Azure Data Factory, a cloud data integration service that orchestrates and automates movement and transformation of data.

  11. Apache Hadoop & secure transfer storage - Azure HDInsight

    Mar 24, 2024 · The use of Azure Storage (WASB) instead of Apache Hadoop HDFS as the default data store For information on how HDInsight uses Azure Storage, see Use Azure Storage with HDInsight.