Course Description
The demand for Big Data Hadoop professionals is increasing across the globe and it’s a great opportunity for the IT professionals to move into the most sought technology in the present day world. ExcelR offers Big Data and Hadoop course in Bangalore and instructor-led live online session delivered by industry experts who are considered to be the best trainers in the industry. The training is studded with loads of practical assignments, case studies and project work, which ensures the hands-on experience for the participants. The training program is meticulously designed to become a professional of Big data Hadoop developer and crack the job in the space of Big Data.
Course Curriculum
- What is Big Data
- Need and significance of innovative technologies
- 3 Vs (Characteristics)
- Forms of Data & Sources
- Various Hadoop Distributions
- Significance of HDFS in Hadoop
- HDFS Features
- Daemons of Hadoop and functionalities
- Data Storage in HDFS
- Accessing HDFS
- Data Flow
- Hadoop Archives
- Introduction to MapReduce
- MapReduce Architecture
- MapReduce Programming Model
- MapReduce Algorithm and Phases
- Data Types
- Input Splits and Records
- Basic MapReduce Program
- Introduction to Apache Pig
- MapReduce Vs. Apache Pig
- SQL Vs. Apache Pig
- Different Data types in Apache Pig
- Modes of Execution in Apache Pig
- Execution Mechanism
- Data Processing Operators
- How to write a simple PIG Script
- UDFs in PIG
- The Metastore
- Comparison with Traditional Databases
- HiveQL
- Tables
- Querying Data
- User-Defined Functions
- Introduction to HBase
- HBase Vs HDFS
- Use Cases
- Basics Concepts
- HBase Architecture
- Zookeeper
- Clients
- MapReduce integration
- MapReduce over HBase
- Schema definition
- Basics of MySQL database
- Install and Configuration
- Load/Update/Delete – DML transactions on database
- Import and Export data
- Other MySQL functions
- Introduction to Sqoop
- Sqoop Architecture and Internals
- MySQL client and server installation
- How to connect relational database using Sqoop
- Sqoop Commands
- overview
- Installation
- The basic syntax
- Data types
- Programming practice
- Basics of Python
- Variables, expressions and statements
- Functions, Structures, Strings
- Strings and Files
- Basic visualizations
- Basic Statistics
- Spark Architecture (Eco System)
- SparkR setup
- Pyspark and Spark-Shell (scala) interfaces
- Spark SQL
- Spark MLLib
- Spark Streaming
Contact Our Team of Experts