Big Data Training by Experts
Our Training Process
Big Data - Syllabus, Fees & Duration
MODULE 1: Introduction to Big Data and Hadoop
- Types of Digital Data
- Introduction to Big Data
- Big Data Analytics
- History of Hadoop
- Apache Hadoop
- Analysing Data with Unix tools
- Analysing Data with Hadoop
- Hadoop Streaming
- Hadoop Echo System
- IBM Big Data Strategy
- Introduction to Infosphere BigInsights
- Big Sheets
MODULE 2: HDFS (Hadoop Distributed File System)
- The Design of HDFS
- HDFS Concepts
- Command Line Interface
- Hadoop file system interfaces
- Data flow
- Data Ingest with Flume and Scoop and Hadoop archives
- Hadoop I/O: Compression
- Serialization
- Avro and File-Based Data structures
MODULE 3: Map Reduce
- Anatomy of a Map Reduce Job Run
- Failures, Job Scheduling
- Shuffle and Sort
- Task Execution
- Map Reduce Types and Formats
- Map Reduce Features
MODULE 4: Hadoop Eco System
- Pig : Introduction to PIG
- Execution Modes of Pig
- Comparison of Pig with Databases
- Grunt, Pig Latin
- User Defined Functions
- Data Processing operators
- Hive : Hive Shell
- Hive Services
- Hive Metastore
- Comparison with Traditional Databases
- HiveQL
- Tables
- Querying Data and User Defined Functions
- Hbase: HBasics
- Concepts
- Clients
- Hbase Versus RDBMS
- Big SQL: Introduction
MODULE 5: Data Analytics with R
- Machine Learning: Introduction
- Supervised Learning
- Unsupervised Learning
- Collaborative Filtering
- Big Data Analytics with BigR.
This syllabus is not final and can be customized as per needs/updates