Details
- Type: Task
- Status: Resolved
- Priority: Major
- Resolution: Fixed
- Affects Version/s: 0.17.0
- Fix Version/s: 0.18.0
- Component/s: Documentation
- Labels: None
Description
Update the front page to reflect reorganization along modular lines.
Kite SDK
Kite is a high-level data layer for Hadoop: an API and a set of tools that speed up development. You configure how Kite stores your data in Hadoop instead of building and maintaining that infrastructure yourself.
High-level Tools
Kite's API and tools are built around the dataset, a consistent interface for working with your Hadoop data. You control implementation details, such as whether to use Avro or Parquet format and HDFS or HBase storage, but you only have to tell Kite what to do; Kite handles the implementation for you.
Kite's command-line interface helps you manage datasets with pre-built tasks like creating datasets, migrating schemas, and loading data. It also helps you configure Kite and other Hadoop projects.
The Kite Data API provides programmatic access to datasets. Using the API, you can build applications that interact directly with Kite datasets.
Low-level Control
When you create a dataset, you control your data layout, record schema, and other options with straightforward configuration. Then you can focus on building your application while Kite handles data storage for you. Kite automatically partitions records when writing and prunes partitions when reading, so a read scans only the partitions that can contain matching records.
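The partition-on-write, prune-on-read behavior can be illustrated with a small conceptual sketch. This is not Kite's API or implementation (Kite is a Java library); the `PartitionedStore` class, its methods, and the year/month partition key are hypothetical names chosen only to show the idea.

```python
from collections import defaultdict
from datetime import datetime


class PartitionedStore:
    """Toy in-memory store that groups records by (year, month).

    Hypothetical illustration only; it does not mirror Kite's classes.
    """

    def __init__(self):
        # Maps a partition key such as (2014, 3) to the records stored in it.
        self.partitions = defaultdict(list)
        # Number of partitions the most recent read actually opened.
        self.partitions_scanned = 0

    def write(self, record):
        # Partition on write: derive the key from the record's timestamp.
        ts = record["timestamp"]
        self.partitions[(ts.year, ts.month)].append(record)

    def read(self, year, month):
        # Prune on read: partitions whose key cannot match are skipped
        # without looking at any of their records.
        self.partitions_scanned = 0
        results = []
        for key, records in self.partitions.items():
            if key != (year, month):
                continue
            self.partitions_scanned += 1
            results.extend(records)
        return results


store = PartitionedStore()
store.write({"user": "alice", "timestamp": datetime(2014, 3, 5)})
store.write({"user": "bob", "timestamp": datetime(2014, 3, 20)})
store.write({"user": "carol", "timestamp": datetime(2014, 4, 1)})

march = store.read(2014, 3)
print([r["user"] for r in march])
```

The caller never names a partition when writing, and a read for March opens one partition out of two. In a real dataset the same idea applies to directories or key ranges rather than in-memory lists.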
Configuration-based Transformation
Kite Morphlines is a flexible way to express data transformations as configuration.
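As a rough illustration of what "transformations as configuration" means, the sketch below keeps the pipeline as plain data and uses a small interpreter to apply it. It does not use Morphlines syntax or any Morphlines command; the command names `rename_field` and `to_upper`, the `PIPELINE` structure, and `run_pipeline` are all hypothetical.

```python
def rename_field(record, source, target):
    """Return a copy of the record with one field renamed."""
    out = dict(record)
    out[target] = out.pop(source)
    return out


def to_upper(record, field):
    """Return a copy of the record with one string field upper-cased."""
    out = dict(record)
    out[field] = out[field].upper()
    return out


# Hypothetical command registry: configuration refers to commands by name.
COMMANDS = {"rename_field": rename_field, "to_upper": to_upper}

# The transformation itself is data, so it can be changed without
# touching application code.
PIPELINE = [
    {"command": "rename_field", "args": {"source": "name", "target": "user"}},
    {"command": "to_upper", "args": {"field": "user"}},
]


def run_pipeline(pipeline, record):
    """Apply each configured command to the record, in order."""
    for step in pipeline:
        record = COMMANDS[step["command"]](record, **step["args"])
    return record


result = run_pipeline(PIPELINE, {"name": "alice", "id": 1})
print(result)
```

Reordering or replacing steps in `PIPELINE` changes the transformation without editing any function, which is the property the configuration-based approach relies on.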
Contents
__Overview
Kite Solutions
__Sandbox
____Creating a Dataset
____Viewing with Impala
__Sessionization
__Network Analytics
__Recommendation Engine
Command Line Interface
__Install the CLI
__CLI Reference
Kite API
__Kite Data Module API
__Kite Javadocs
__Using Kite with Apache Maven
Datasets
__Data Module Overview
__Dataset Lifecycle
__Schema Evolution
__Partitioned Datasets
__Parquet vs. Avro Format
__Column Mapping
__Dataset URIs
__Restricted Views
__HBase Storage Cells