This is a Work in Progress…
Pre-Processing of Data (Unstructured)
- MapReduce
Pre-Processing of Data (Structured or Semi-Structured)
- Hadoop (see below)
Statistical Analysis (After Pre-Processing)
- R is used for statistical analysis, which happens after the data has been processed. However, R holds data in memory by default, which limits the size of the data it can work with.
- Hadoop covers both data storage and data processing at massive scale.
- Pig and Hive are tools that belong to the Hadoop ecosystem.
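
The MapReduce step listed for unstructured data can be illustrated with a word count, the usual introductory example. What follows is a minimal single-machine sketch in plain Python and not Hadoop's actual API: the function names (map_phase, shuffle, reduce_phase, word_count) and the sample input are made up for illustration, and a real cluster would run the map and reduce phases in parallel across many machines.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every word in one line of raw text."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group emitted values by key, the step a framework performs between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values collected for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence over an iterable of text lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


if __name__ == "__main__":
    # Hypothetical sample input standing in for unstructured text.
    sample = ["big data big tools", "data at scale"]
    print(word_count(sample))
```

The per-word counts produced this way are the kind of compact, pre-processed result that could then be handed to R for statistical analysis.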