Wednesday Oct 16, 2013

Loading from Multiple Data Sources with Oracle Loader for Hadoop

Oracle Loader for Hadoop can be used to load data from multiple data sources (for example Hive, HBase), and data in multiple formats (for example Apache weblogs, JSON files).   There are two ways to do this:

(1) Use an input format implementation.  Oracle Loader for Hadoop includes several input format implementations.  In addition, a user can develop their own input format implementation for proprietary data sources and formats.

(2) Leverage the capabilities of Hive, and use Oracle Loader for Hadoop to load from Hive.

These approaches are discussed in our Oracle Open World 2013 presentation
About

Oracle Loader for Hadoop and Oracle SQL Connector for HDFS

Search

Categories
Archives
« April 2014
SunMonTueWedThuFriSat
  
1
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
   
       
Today