数十亿条记录数以百万计的半结构化的、 未格式化的数据字段是现实的今天，我们储存的数据类型。传统的数据库都受严格的数据布局要求和约束，不幸的是，不能进行扩展以满足大数据要求。HBase 园丁如何将数据存储在分布式系统中。本课程中，开始使用 HBase: Hadoop 数据库，教你如何从一开始使用 HBase 来完成。首先，您将学习如何设计和布局格式数据的柱状优化磁盘寻求减少读取延迟时间。接下来，您将学习如何操作和访问此数据使用命令行 HBase 壳以及 HBase Java API。最后，您将学习来处理此数据执行复杂的聚合和分组操作与 HBase 使用 MapReduce 编程模型。通过这门课程结束时，你会准备好开始制作您的数据更易于管理使用 HBase。
Getting Started with HBase: The Hadoop Database
MP4 | Video: AVC 1280x720 | Audio: AAC 44KHz 2ch | Duration: 2.5 Hours | 412 MB
Genre: eLearning | Language: English
As the data you store expands in size, traditional relational databases may no longer work. HBase has the ability to deal with billions of rows of data and each record can contains millions of fields. This course will help you get started with HBase.
Billions of records with millions of fields of semi-structured, unformatted data is the reality of the kind of data we are storing today. Traditional databases are bound by strict data layout requirements and constraints that, unfortunately, do not scale to meet big data requirements. HBase reimagines how data can be stored in a distributed system. This course, Getting Started with HBase: The Hadoop Database, teaches you how to use HBase from the start to finish. First, you'll learn how to design and layout data in a columnar format in order to optimize disk seeks and reduce read latency. Next, you'll learn how to manipulate and access this data using the command line HBase shell as well as the HBase Java API. Finally, you'll learn to process this data by performing complex aggregation and grouping operations using the MapReduce programming model with HBase. By the end of this course, you'll be ready to start making your data much more manageable using HBase.