Star Big Data Programming

Big Data Hadoop training will make you an expert in HDFS, MapReduce, HBase, Hive, Pig, YARN, Oozie, Flume, and Sqoop through real-time use cases from the retail, social media, aviation, tourism, and finance domains. It equips you with in-depth knowledge of writing code with the MapReduce framework and managing large data sets with HBase. The topics covered in this course include Hive, Pig, and the setup of a Hadoop cluster.


Star Certified - Big Data Developer assumes that the learner is a working software professional, or someone with programming experience, who wants to understand the complexities of Big Data development.

Big Data Programming Course Objectives

  • About Hadoop and its eco-system.
  • How to create programs for Hadoop.
  • How to create a MapReduce application.
  • How to interact with related technologies such as Spark, Sqoop, Pig, Hive, and Oozie.
  • The flexibility and applications of Big Data to suit various industries.
  • About the complexities of commercial distributions of Hadoop.
  • About the various practical aspects of installing and working with Hadoop.

Course Outcome

Understand the finer nuances of Big Data technology and have answers to many related questions. You will also be comfortable working with Big Data tools, platforms, and their architecture to store, program, process, and manage the data.

Table Of Contents Outline

Part 1: Exploring Big Data and Hadoop

1. Introducing Data and Big Data.

2. Identifying the Business Applications of Big Data.

3. Big Data and Hadoop.

Part 2: Discussing Hadoop Eco-system

4. Introduction to MapReduce.

5. Exploring HDFS: Storing Data.

6. YARN and MapReduce.

Part 3: Programming for MapReduce.

7. Developing a First Application for MapReduce.

8. Exploring the Working of the MapReduce Process.

9. Exploring Additional Features of MapReduce.

Part 4: Exploring Related Technologies.

10. Exploring Avro - data serialization system.

11. Exploring Parquet - columnar storage format.

12. Exploring Flume - service for streaming event data.

13. Exploring Sqoop - transferring bulk data.

14. Exploring Pig - analyzing large data.

15. Exploring Hive - data warehouse.

16. Exploring Oozie - workflow scheduler.

17. Exploring Crunch - joining and data aggregation.

18. Exploring Spark and Scala.

19. Exploring HBase - big data store.

20. Exploring ZooKeeper - coordination service for distributed applications.

21. Exploring Storm - real-time computation.

22. Machine Learning with Mahout.

Part 5: Databases and Hadoop

23. Interacting with NoSQL Databases.

Part 6: Exploring Advanced Topics

24. Hadoop and Security.

25. Exploring Apache Drill and Google BigQuery.

26. Exploring Cloudera.

27. Exploring Hortonworks.

28. Exploring Pivotal HD.

29. Exploring HDInsight.

30. Exploring IBM InfoSphere (Hadoop and Relational Databases).

31. Exploring Hadoop and AWS.

Part 7: Case Studies and Lab Exercises

Exam Details

Exam Codes: Big Data Programming S07-116 (Academy customers use the same codes)
Launch Date: Jul 01 2017
Number of Questions: 75
Type of Questions: Multiple choice
Length of Test: 150 minutes
Passing Score: 75%
Recommended Experience: Graduates with a Java programming background are eligible for Big Data Hadoop training. Basic knowledge of a programming language such as Java, C, or Python, familiarity with Linux, and a strong grasp of OOP concepts are an added advantage.
Languages: English

