Hadoop Programming
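Created by kamiya, last modified by Alma on 2016-08-12

MapReduce - Programming

Online practice: http://cloudcomputing.ruc.edu.cn

Processing
- select: parse each input record and emit only the fields that are needed.
- where: applied in the same pass over the input; test each record and discard the ones that are not needed.
- aggregation: min, max, sum
- group by: implemented by the Reducer, which receives all values that share a key.
- sort
- join: map join, reduce join

Third-Party Libraries

Option 1: list the dependencies with -libjars (LIBJARS is a comma-separated list of jar files).
    export LIBJARS=$MYLIB/commons-lang-2.3.jar
    hadoop jar prohadoop-0.0.1-SNAPSHOT.jar org.aspress.prohadoop.c3.WordCountUsingToolRunner -libjars $LIBJARS

Option 2: bundle the dependencies into the application JAR.
    hadoop jar prohadoop-0.0.1-SNAPSHOT-jar-with-dependencies.jar org.aspress.prohadoop.c3.WordCountUsingToolRunner
The dependent libraries are now included inside the application JAR file.

Option 1 is usually the better choice: dependencies specified with -libjars can use the Public Cache, whereas dependencies bundled into the JAR have to be copied on every run.

Reference book: MapReduce Design Patterns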
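The mapping from SQL-style operations to MapReduce phases listed under Processing (select and where in the map step, group by and aggregation in the reduce step) can be illustrated with a small in-memory sketch. It deliberately uses no Hadoop classes, so it only mimics the map, shuffle, and reduce phases and is not the Hadoop API. The record layout "region,product,amount", the class name SelectWhereGroupBy, and the minAmount threshold are all assumptions made up for this illustration. It roughly corresponds to: SELECT region, SUM(amount) WHERE amount >= minAmount GROUP BY region.

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// In-memory sketch only: no Hadoop classes are used.
// The record layout "region,product,amount" is a hypothetical example.
class SelectWhereGroupBy {

    // Map phase: parse one input line and emit at most one (key, value) pair.
    // SELECT: keep only the region and amount fields.
    // WHERE:  drop records whose amount is below minAmount.
    static void map(String line, int minAmount, List<Map.Entry<String, Integer>> out) {
        String[] fields = line.split(",");
        if (fields.length != 3) {
            return; // skip malformed records
        }
        String region = fields[0].trim();
        int amount = Integer.parseInt(fields[2].trim());
        if (amount >= minAmount) {
            out.add(new AbstractMap.SimpleEntry<>(region, amount));
        }
    }

    // Shuffle phase: collect all emitted values under their key.
    // A TreeMap keeps the keys sorted, similar to how a reducer
    // is handed its keys in sorted order.
    static Map<String, List<Integer>> shuffle(List<Map.Entry<String, Integer>> pairs) {
        Map<String, List<Integer>> grouped = new TreeMap<>();
        for (Map.Entry<String, Integer> pair : pairs) {
            grouped.computeIfAbsent(pair.getKey(), k -> new ArrayList<>()).add(pair.getValue());
        }
        return grouped;
    }

    // Reduce phase: called once per key (GROUP BY); sums its values (SUM).
    static int reduce(List<Integer> values) {
        int sum = 0;
        for (int value : values) {
            sum += value;
        }
        return sum;
    }

    // Runs the three phases in sequence over the given input lines.
    static Map<String, Integer> run(List<String> lines, int minAmount) {
        List<Map.Entry<String, Integer>> emitted = new ArrayList<>();
        for (String line : lines) {
            map(line, minAmount, emitted);
        }
        Map<String, Integer> result = new TreeMap<>();
        for (Map.Entry<String, List<Integer>> group : shuffle(emitted).entrySet()) {
            result.put(group.getKey(), reduce(group.getValue()));
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> input = Arrays.asList(
                "east,apple,30",
                "west,pear,5",
                "east,plum,15",
                "west,fig,20",
                "north,kiwi,8");
        System.out.println(run(input, 10)); // prints {east=45, west=20}
    }
}
```

Changing reduce to track a minimum or maximum instead of a sum gives the other aggregations from the list. In a real job the same logic would presumably be split between a Mapper and a Reducer class, with the framework performing the shuffle.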