Bigdata/Hadoop

The Big Data Hadoop course has been designed to impart an in-depth knowledge of Big Data processing using Hadoop. The course is packed with real-life projects and case studies to be executed in the Cloud Lab.

Big Data Development

The Big Data Hadoop course has been designed to impart an in-depth knowledge of Big Data processing using Hadoop. The course is packed with real-life projects and case studies to be executed in the Cloud Lab. This program covers Big Data Analytics process involved in storing, processing and managing Big Data – both structured and unstructured data, as well as the data analytics layer on top of Big Data systems, using both more traditional predictive models by connecting an analytics tool like R to Big Data Systems.

Content:

Linux overview and directory structure

  1. Installation of ubuntu
  2. Linux commands
  3. Java and python Installation
  4. Big Data Overview
  5. Hadoop Architecture & Components
  6. Hadoop Configuration
  7. Hadoop Processing – Map Reduce & HDFS
  8. Python
  9. Map Reduce with python
  10. Pig
    • Pig’s Data Model
    • Pig Functions
    • Input and Output formats to MR program
    • Case Study
  11. Overview of R, R data types and objects, reading and writing data
  12. Control structures, functions
  13. Loop functions, Simulation
  14. Database Connectivity
  15. Introduction to Scala
  16. Creating a Scala Project
  17. Classes, Objects and Methods
  18. Scala GUI and Connectivity
  19. Spark Overview
  20. RDD(Resilient Distributed Datasets) Fundamentals
  21. Cluster Architectures for Spark
  22. Spark Job Execution
  23. Introduction to MongoDB
  24. MongoDB API
  25. Indexing and Data Modeling
  26. Connection with Python
  27. Rest API
  28. Introduction to Cassandra
  29. Architecture of Cassandra and Configuration
  30. Cassandra Data Model
  31. CQL
  32. Connection with python
  33. Kafka
  34. Kafka Streaming