马马虎虎学了30多天的大数据课程,从最开始的不明所以到现在略知一二,准备将所学的内容进行一下梳理。
与CDH同级的概念是 HDP,Apache Hadoop.
本文讲讲CDH相关的概念.
CDH是Cloudera这个公司发布的产品,官网地址https://www.cloudera.com/
到官方文档地址https://www.cloudera.com/documentation.html 可知CDH为Cloudera Enterprise产品中的一员。
查看Cloudera Enterprise文档的Introduction(当前5.12为最高版本) https://www.cloudera.com/documentation/enterprise/latest/topics/introduction.html
Cloudera provides a scalable, flexible, integrated platform that makes it easy to manage rapidly increasing volumes and varieties of data in your enterprise. Cloudera products and solutions enable you to deploy and manage Apache Hadoop and related projects, manipulate and analyze your data, and keep that data secure and protected.
Cloudera provides the following products and tools:
看完说明后,大体了解到Cloudera提供如下产品和工具:CDH,Apache Impala,Cloudera Search,Cloudera Manager,Cloudera Navigator . 其中CDH包含Apache Impala和Cloudera Search. 总结起来,Cloudera提供CDH,Cloudera Manager,Cloudera Navigator三大件.
文档后面章节对这三大件各做了简介
CDH Overview
CDH delivers the core elements of Hadoop
Introduction文档中有提到,关于CDH各组件的信息,超出了Cloudera文档的范围。各组件的使用我以后会在使用中编写相应的文档。
Cloudera Manager 5 Overview
With Cloudera Manager, you can easily deploy and centrally operate the complete CDH stack and other managed services.
说白了CM可以使CDH的安装和管理简化.
Terminology
Architecture
Cloudera Navigator Data Management Overview
Cloudera Navigator Data Management is a complete solution for data governance, auditing, and related data management tasks that is fully integrated with the Hadoop platform.
这个解释有些抽象,后面FAQ中有一个问题回复比较简明
Is Cloudera Navigator a module of Cloudera Manager?
Not exactly. Cloudera Navigator is installed separately, after Cloudera Manager is installed, and it interacts behind the scenes with Cloudera Manager to deliver some of its core functionality. Cloudera Manager is used by cluster administrators to manage the cluster and all its services. Cloudera Navigator is used by administrators but also by security and governance teams, data stewards, and others to audit, trace data lineage from source raw data through final form, and perform other comprehensive data governance and stewardship tasks.
如果不涉及到数据安全审计等方面,Cloudera Navigator可以不用安装。
了解了CDH相关的概念后,开始准备安装。安装会单独写个文档,网上可参考的安装文档也很多。我准备参考官方文档,依照官方文档中的步骤内容进行。
https://www.cloudera.com/documentation/enterprise/latest/topics/introduction.html