読書会(Hadoop 第2版)第7回議事録
[ 戻る ]
=============================================
Java読書会BOF 「Hadoop 第2版」を読む会 第7回
=============================================
.. csv-table:: 開催概要
"日時", "2012年4月14日(土) 10:00 - 17:00"
"場所", "川崎市産業振興会館 第2会議室"
"出席者(敬称略)", "高橋(徹)、村山、岩室、今井、門脇、吉本、小棚木(書記)"
"会場費", "300円/ひとり"
議事
====
11章 Pig
-------------------------
11.7 実践Pig
------------
11.7.1 並列性
```````````````````````
11.7.2 パラメータ置換
```````````````````````
11.7.2.1 動的パラメータ
```````````````````````
* バッククォートで置換がWIndowsではどうなるのか気になる。
11.7.2.2 パラメータ置換の処理
`````````````````````````````
12章 Hive
----------
12.1 Hiveのインストール
-----------------------
12.1.1 Hiveシェル
``````````````````
12.2 例
---------------
* where句に in が使えるといいな。
例:
WHERE temperature != 9999 AND (quality = 0 OR quality = 1 OR quality = 4 OR quality = 5 OR quality =9)
->
WHERE temperature != 9999 AND quality IN (0, 1, 4, 5, 9)
12.3 Hiveの実行
----------------
12.3.1 Hiveの設定
``````````````````
12.3.1.1 ロギング
``````````````````
12.3.2 Hiveのサービス
`````````````````````
* 【誤記】 囲み記事のANT_LIBのパス名: /opt/and/lib -> /opt/ant/lib
12.3.2.1 Hiveのクライアント
```````````````````````````
12.3.3 メタストア
`````````````````
12.4 伝統的なデータベースとの比較
---------------------------------
12.4.1 読み込み時のスキーマ適用と書き込み時のスキーマ適用の比較
```````````````````````````````````````````````````````````````
12.4.2 更新、トランザクション、インデックス
```````````````````````````````````````````
12.5 HiveQL
-------------
* 表12-2のデータ型の「一時」とは何?
-> 「日時」の間違いではないか。
* 【誤記】表示12-2 SQLのデータ型: 一時 -> 日時
12.5.1 データ型
````````````````
12.5.1.1 プリミティブ型
```````````````````````
12.5.1.2 変換
```````````````
12.5.1.3 複合型
````````````````
12.5.2 演算子と関数
```````````````````
12.6 テーブル
--------------
12.6.1 管理対象テーブルと外部テーブル
`````````````````````````````````````
12.6.2 パーティションとバケット
```````````````````````````````
12.6.2.1 パーティション
````````````````````````
12.6.2.2 バケット
```````````````````
12.6.3 ストレージフォーマット
`````````````````````````````
12.6.3.1 でフォルトのストレージフォーマット:デリミタで区切られたテキスト
`````````````````````````````````````````````````````````````````````````
12.6.3.2 バイナリストレージフォーマット:SequenceFileとRCFile
``````````````````````````````````````````````````````````````
12.6.3.3 例:RegexSerDe
`````````````````````````
12.6.4 データのインポート
`````````````````````````
12.6.4.1 INSERT OVERWRITE TABLE
``````````````````````````````````
12.6.4.2 複数テーブルへのインサート
```````````````````````````````````
12.6.4.3 CREATE TABLE ... AS SELECT
``````````````````````````````````````
12.6.5 テーブルの修正
`````````````````````
12.6.6 テーブルのドロップ
`````````````````````````
12.7. データのクエリ
---------------------
12.7.1 ソートと集計
````````````````````
12.7.2 MapReduceのスクリプト
`````````````````````````````
12.7.3 結合
```````````
12.7.3.1 内部結合
``````````````````
12.7.3.2 外部結合
`````````````````
12.7.3.3 セミ結合
`````````````````
12.7.3.4 map結合
````````````````
12.7.4 サブクエリ
``````````````````
12.7.5 ビュー
`````````````
12.8 ユーザー定義関数
---------------------
* 【誤記】 12.8.1の直前の3行目と2行目: LITERAL VIEW -> LATERAL VIEW
12.8.1 UDFの書き方
```````````````````
12.8.2 UDAFの書き方
```````````````````
12.8.2.1 より複雑なUDAF
```````````````````````
13章 HBase
----------
13.1 HBaseの基礎
-----------------
13.1.1 背景
```````````
13.2 概念
---------
13.2.1 データモデルに関する弾丸ツアー
`````````````````````````````````````
13.2.1.1 領域
`````````````
* 領域は原文では regions。
13.2.1.2 ロック
```````````````
13.2.2 実装
```````````
13.2.2.1 HBaseの操作
`````````````````````
13.3 インストール
-----------------
13.3.1 試運転
`````````````
13.4 クライアント
-----------------
13.4.1 Java
````````````
13.4.1.1 MapReduce
``````````````````
13.4.2 Avro, REST, Thrift
`````````````````````````
13.4.2.1 REST
`````````````
13.4.2.2 Thrift
```````````````
13.4.2.3 Avro
`````````````
13.5 例
-------
13.5.1 スキーマ
```````````````
13.5.2 データのロード
`````````````````````
.. note:: 次回は、p.456 13.5.2.1 最適化のための注意事項の数行前(直前)より
[ 戻る ]