The BigData Landscape

This is a Work in Progress…

Pre Processing of Data  (Un-Structured)

  • Map Reduce

Pre Processing of Data  (Structured or Semi-Structured)

  • PIG
  • Hive
  • Hadoop (see below)

Statistical Analysis (After Pre-Processing)

  • R is used for statistical analysis which happens after processing of data . However there is some limitation on size of data which can be used.


  • covers both data storage and data processing at massive scale.
  • PIG and HIVE are tools which belong to Hadoop.

Cloud Advisory Services | Security Advisory Services | Data Science Advisory and Research

Specializing in high volume web and cloud application architecture, Anuj Varma’s customer base includes Fortune 100 companies (, British Petroleum, Schlumberger).

All content on this site is original and owned by AdverSite Web Holdings, Inc. – the parent company of No part of it may be reproduced without EXPLICIT consent from the owner of the content.

Anuj Varma – who has written posts on Anuj Varma, Technology Architect.

Leave a Reply

Your email address will not be published. Required fields are marked *