課程簡介
介紹
分散式計算原理
-
Apache Spark(阿帕奇斯帕克酒店)
Hadoop
Data Serialization的原理
-
數據物件如何通過網路傳遞
物件序列化
序列化方法
節儉
協定緩衝區
阿帕奇 Avro
數據結構
尺寸、速度、格式特性
持久數據存儲
與動態語言集成
動態類型化
模式
未標記的數據
變更管理
Data Serialization 和分散式計算
-
Avro 作為 Hadoop 的子專案
Java 序列化
Hadoop 序列化
Avro 序列化
將 Avro 與
-
Hive (阿夫羅塞爾德)
清管 (AvroStorage)
移植現有 RPC 框架
總結和結論
最低要求
- 大致熟悉分散式計算。
客戶評論 (5)
Trainer's preparation & organization, and quality of materials provided on github.
Mateusz Rek - MicroStrategy Poland Sp. z o.o.
Course - Impala for Business Intelligence
The VM I liked very much The Teacher was very knowledgeable regarding the topic as well as other topics, he was very nice and friendly I liked the facility in Dubai.
Safar Alqahtani - Elm Information Security
Course - Big Data Analytics in Health
I thought he did a great job of tailoring the experience to the audience. This class is mostly designed to cover data analysis with HIVE, but me and my co-worker are doing HIVE administration with no real data analytics responsibilities.
ian reif - Franchise Tax Board
Course - Data Analysis with Hive/HiveQL
I genuinely enjoyed the many hands-on sessions.
Jacek Pieczątka
Course - Administrator Training for Apache Hadoop
The fact that all the data and software was ready to use on an already prepared VM, provided by the trainer in external disks.