数据科学家培训:Hadoop大数据管理 由于Hadoop已经成为业界的大数据标准平台,许多公司都推出了各自版本的Hadoop,也有一些公司则围绕Hadoop开发产品。在Hadoop生态系统中,规模最大、知名度最高的公司则是Cloudera。本课程基于Cloudera公司的CDH hadoop发行版。学习配置、部署、维护以及保护Apache Hadoop集群以及包括Hive,Impala,Yarn等在内的生态系统项目时所必需的技术知识、技能与能力。
培训时间:4天 Hadoop 大数据培训学习内容
Apache Hadoop的应用案例 Hadoop分布式文件系统 Hadoop数据载入 MapReduce 规划Hadoop机群 Hadoop安装和基本配置 安装配置Hive,Impala和Pig Hadoop客户端 高级配置 Hadoop安全 管理和调度作业 集群维护 集群监测和排错 教学大纲 The Case for Apache Hadoop Why Hadoop? Core Hadoop Components Fundamental Concepts HDFS HDFS Features Writing and Reading Files NameNode Memory Considerations Overview of HDFS Security Using the Namenode Web UI Using the Hadoop File Shell
Getting Data into HDFS Ingesting Data from External Sources with Flume Ingesting Data from Relational Databases with Sqoop REST Interfaces Best Practices for Importing Data
YARN and MapReduce What Is MapReduce? Basic MapReduce Concepts YARN Cluster Architecture Resource Allocation Failure Recovery Using the YARN Web UI MapReduce Version 1
Planning Your Hadoop Cluster General Planning Considerations Choosing the Right Hardware Network Considerations Configuring Nodes Planning for Cluster Management
Hadoop Installation and Initial Configuration Deployment Types Installing Hadoop Specifying the Hadoop Configuration Performing Initial HDFS Configuration Performing Initial YARN and MapReduce Configuration Hadoop Logging
Installing and Configuring Hive, Impala, and Pig Hive Impala Pig
Hadoop Clients What is a Hadoop Client? Installing and Configuring Hadoop Clients Installing and Configuring Hue Hue Authentication and Authorization
Cloudera Manager The Motivation for Cloudera Manager |
<点击:上海涛德Oracle OCM认证及BI商业智能课程>|人工智能培训-上海涛德 ( 沪ICP备14006824号 )|网站地图
GMT+8, 2018-4-27 03:09 , Processed in 0.159158 second(s), 14 queries , Gzip On.