• Introduction to Hadoop and big data
• Design and anatomy of a distributed file system
• Apache Pig and Sqoop
• MapReduce
• Virtual Machine
• Hive and HBase
Hadoop is an open-source framework that collects, processes, analyses and stores large amounts of data on clustered systems. Storing the data across a cluster of machines (a distributed system) lets the user process it in parallel.
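To give a feel for parallel processing, here is a minimal sketch of a word count, the usual first MapReduce exercise. It is an illustration only: it uses plain Java streams on one machine rather than the Hadoop API, and the names `WordCountSketch`, `mapBlock` and `countWords` are made up for this example. Each string in the list stands in for one block of a file that a cluster would store on a different node.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.stream.Collectors;

public class WordCountSketch {

    // "Map" step (hypothetical helper): split one block of text into lowercase words.
    static List<String> mapBlock(String block) {
        return Arrays.stream(block.toLowerCase().split("\\W+"))
                .filter(word -> !word.isEmpty())
                .collect(Collectors.toList());
    }

    // Process the blocks in parallel, then "reduce" by counting occurrences per word.
    // A real Hadoop job would spread this work over many machines; here the
    // parallel stream only spreads it over local threads.
    static Map<String, Long> countWords(List<String> blocks) {
        return blocks.parallelStream()
                .flatMap(block -> mapBlock(block).stream())
                .collect(Collectors.groupingBy(word -> word, TreeMap::new, Collectors.counting()));
    }

    public static void main(String[] args) {
        // Two sample "blocks" standing in for pieces of a large distributed file.
        List<String> blocks = List.of(
                "Hadoop stores large data",
                "Hadoop processes data in parallel");
        System.out.println(countWords(blocks));
    }
}
```

The point of the sketch is the shape of the work: every block is handled independently, and only the final counts are combined, which is what makes the job easy to spread across a cluster.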
• Hadoop developer
• Hadoop trainer
If you know Java or are interested in learning it, this course is a good option for advancing your technical knowledge.
4 months (192 hours)