[KITE-254] Automatically set flume path from the partition strategy - Cloudera Open Source

Details

Type: Improvement
Status: Resolved
Priority: Major
Resolution: Duplicate
Affects Version/s: 0.9.0, 0.10.0
Fix Version/s: None
Component/s: None
Labels:
None

Description

Currently, sending records to partitioned Kite datasets via Flume requires the user to configure the HDFS file path by hand so that it matches the dataset's partition strategy. From the examples:

tier1.sinks.sink-1.hdfs.path = /tmp/data/events/year=%{cdk.partition.year}/month=%{cdk.partition.month}/day=%{cdk.partition.day}/hour=%{cdk.partition.hour}/minute=%{cdk.partition.minute}

This leaves a lot up to the user, who must keep the path in sync with the partition strategy, and it isn't clearly documented. Ideally, users would configure a Kite dataset sink that handles partitioning itself.
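
To illustrate how regular the hand-written path is, here is a minimal sketch that derives the same template from an ordered list of partition field names. The function name `flume_hdfs_path` and its parameters are hypothetical, chosen for this illustration; it is not part of Kite or Flume. It assumes each partition field maps to a `name=%{cdk.partition.name}` path segment, as in the example above.

```python
def flume_hdfs_path(dataset_root, partition_fields, header_prefix="cdk.partition."):
    """Build a Flume hdfs.path template from ordered partition field names.

    Hypothetical helper: assumes every field becomes one
    'name=%{<header_prefix>name}' directory segment, as in the example path.
    """
    segments = [
        # Doubled braces emit literal '{' and '}' around the header name.
        f"{name}=%{{{header_prefix}{name}}}"
        for name in partition_fields
    ]
    return "/".join([dataset_root.rstrip("/")] + segments)


if __name__ == "__main__":
    fields = ["year", "month", "day", "hour", "minute"]
    print("tier1.sinks.sink-1.hdfs.path = " + flume_hdfs_path("/tmp/data/events", fields))
```

With the five time-based fields from the example, this reproduces the path shown in the description, which suggests the setting could be generated from the partition strategy rather than typed by hand.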

Issue Links

relates to

KITE-255 Remove Kite's log4j appender

Open

People

Assignee:

Unassigned

Reporter:

Ryan Blue

Votes:

0

Watchers:

0

Dates

Created:

06/Dec/13 10:25 PM

Updated:

09/Dec/13 11:37 AM

Resolved:

09/Dec/13 11:37 AM