This is a Work in Progress…

Preprocessing of Data (Unstructured)

  • MapReduce
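
To make the MapReduce idea concrete, here is a minimal single-machine sketch of a word count over raw text lines. It only imitates the map, shuffle and reduce phases in plain Python; it does not use Hadoop's API. The function names (map_phase, shuffle, reduce_phase, word_count) are hypothetical and chosen for illustration.

```python
from collections import defaultdict


def map_phase(line):
    """Emit a (word, 1) pair for every whitespace-separated word in one line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group emitted values by key, imitating the step between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values collected for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run the three phases in sequence over an iterable of text lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())
```

In a real cluster the map and reduce calls would run in parallel on different machines; this sketch runs them sequentially, so it shows only the data flow.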

Preprocessing of Data (Structured or Semi-Structured)

  • Pig
  • Hive
  • Hadoop (see below)
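
As a rough illustration of the kind of work these tools do on structured data, the sketch below performs a group-and-count in plain Python. The table name, column names and sample rows are hypothetical, and the HiveQL in the comment is shown only for comparison; the Python code does not call Hive or Pig.

```python
from collections import Counter

# Hypothetical rows of an already-structured table: (user_id, city).
visits = [
    (1, "Austin"),
    (2, "Dallas"),
    (3, "Austin"),
]


def count_by_city(rows):
    """Count rows per city, an in-memory analogue of a GROUP BY.

    A comparable HiveQL query (assuming a table named visits) would be:
        SELECT city, COUNT(*) FROM visits GROUP BY city;
    """
    return Counter(city for _user_id, city in rows)


counts = count_by_city(visits)
```

With Hive or Pig, the same declarative grouping would be handed to the cluster to execute instead of running in a single process.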

Statistical Analysis (After Preprocessing)

  • R is used for statistical analysis, which happens after the data has been preprocessed. However, there are limits on the size of the data it can handle.
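
One common way to work around a data-size limit, assuming the limit comes from holding the whole dataset in memory, is to compute summary statistics in a single streaming pass. The sketch below uses Welford's online algorithm for the mean and sample variance. It is written in Python rather than R, and the names RunningStats and summarize are hypothetical.

```python
class RunningStats:
    """Track count, mean and sample variance one value at a time (Welford)."""

    def __init__(self):
        self.n = 0
        self.mean = 0.0
        self._m2 = 0.0  # running sum of squared deviations from the mean

    def update(self, x):
        """Fold one new observation into the running statistics."""
        self.n += 1
        delta = x - self.mean
        self.mean += delta / self.n
        self._m2 += delta * (x - self.mean)

    @property
    def variance(self):
        """Sample variance (n - 1 denominator); 0.0 with fewer than two values."""
        return self._m2 / (self.n - 1) if self.n > 1 else 0.0


def summarize(values):
    """Consume any iterable, such as a file read line by line, without storing it."""
    stats = RunningStats()
    for value in values:
        stats.update(value)
    return stats
```

Because only three numbers are kept between updates, the input can be far larger than available memory; analyses that need the full dataset at once would still require sampling or pre-aggregation first.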

Hadoop

  • Covers both data storage and data processing at massive scale.
  • Pig and Hive are tools that belong to the Hadoop ecosystem.

Anuj holds professional certifications in Google Cloud, AWS as well as certifications in Docker and App Performance Tools such as New Relic. He specializes in Cloud Security, Data Encryption and Container Technologies.

Anuj Varma – who has written posts on Anuj Varma, Hands-On Technology Architect, Clean Air Activist.