+91-20-65330101(Pune)
02164 - 225500(Karad)

Big Data - Hadoop Developer

Big Data - Hadoop Developer


Course Name:Big Data - Hadoop Developer
Duration: 6 Weekends
Fees: Rs.12,000/-
Faculty: Nilesh Ghule
Batch Schedule: Weekend batch - Sat & Sun (10:00 am to 5:00 pm)
  • Core Java programming skills
  • Any RDBMS (like Oracle or MySQL)
  • XML awareness, Linux commands familiarity
  • Students and Freshers.
  • Professionals willing to switch to Big Data / Hadoop developer stream
  • Work in clustered Hadoop environment.
  • Get pre-installed VM with Hadoop & Eco system setup.
  • Get in-depth understanding of Hadoop internals/core.
  • Learn with plenty of demo codes/programs.
  • Focus on core features of Hadoop and eco-system, than touching various features.
  • Quality is having higher weightage than the quantity.
  • JVM, JRE, JDK, Packaging jar, Execution
  • Wrapper classes, inheritance, abstract class
  • Interfaces, Method overriding, Reflection
  • Java generics, Collection framework
  • IO framework, Garbage collection, Member class
  • RDBMS vs NoSQL databases, NoSQL db types
  • ACID vs BASE, CAP theorm, MongoDb Intro
  • Applications, installation, documents, collections
  • Mongo client & tools, JSON & BSON, Data type
  • CRUD operations, Schema design/modeling
  • Aggregation or Indexing, Scaling
  • Big Data intro – What & Why?
  • BigData vs. RDBMS
  • Hadoop2 architecture, daemons
  • Hadoop2 installation in modes
  • File System, Distributed FS & HDFS
  • HDFS: namenode, sec namenode, data node
  • Replica mechanism, FS shell cmds, Web UI
  • MapReduce (MR) & alternatives, YARN
  • Hadoop eco-systems, applications/use cases
  • Hadoop FS apis, dealing with local/HDFS files
  • Hadoop data types, Parsing MR job args
  • MR programming: Mapper & Reducer impl
  • Input splits, Input/Output formats
  • Job execution on YARN, Understanding logs
  • AppMaster, ResourceManager & NodeManager
  • Partitioner & Combiners, Custom Writables
  • Counters, Testing, Analyzing execution
  • Inner/Outer Joins in MR, Cartesian product,
  • Hadoop streaming, SequenceFiles
  • HBase introduction, Column oriented Db
  • HBase vs RDBMS, HBase installation
  • HBase architecture, Master & Region servers
  • HBase column schema, Internal storage
  • Hbase Shell, commands, Java client APIs
  • Distribution system coordination, Zookeeper
  • MapReduce & HBase integration
  • Hive introduction, architecture, installation
  • Hive CLI, Security, Beeline, Metastore & Derby
  • Hive managed & external tables,
  • Hive QL: Loading, Filtering, Grouping, Joins
  • Hive simple & complex types, DDL, DML, DQL
  • Hive indexes, views, query optimizations
  • Hive serialization / deserialization, Loading data
  • Partitioning: static & dynamic – use cases
  • Bucketing, use cases of Partitions & Buckets
  • Hive functions, operators and Hive UDF impl.
  • Thrift server, Java connectivity, Hive vs Impala
  • Sqoop intro, use cases, sqoop execution
  • Import data into Hadoop, Export from Hadoop
  • Sqoop for importing in Hive or HBase
  • Flume intro, flume architecture, applications
  • Events, source, channel and sink
  • Capturing logs data, IoT data

For Registration Click Here

Back to Top