Category: hadoop
-

Hadoop Tutorial Series, Issue #4: To Use Or Not To Use A Combiner
Explains when Hadoop Combiners help (or hurt) performance and correctness, with code‑level guidance.
-

Hadoop Tutorial Series, Issue #3: Counters In Action
Shows how to instrument MapReduce jobs with Hadoop Counters to track custom metrics during large‑scale processing.
-

Hadoop Tutorial Series, Issue #2: Getting Started With (Customized) Partitioning
Teaches key partitioning patterns (e.g., partial sorts to specific reducers) to control data flow in MapReduce jobs.
-

Hadoop Tutorial Series, Issue #1: Setting Up Your MapReduce Learning Playground
Step‑by‑step setup of a Cloudera VM + Maven project so you can quickly experiment with Hadoop wordcount and beyond.
