Three Days Extensive Hands-on Workshop on Big Data using Hadoop, Spark, NoSQL/Cassandra


















Learner's Place Professional Academy is offering 3 days extensive workshop on Big Data. Big Data workshop is targeted towards technical people who want to get a jumpstart on Big Data, with a specific focus on Hadoop, Spark, NoSQL and Cassandra. The workshop is targeted for a small focused group of attendees with more than 70% hands on lab. You will not only learn the technology but also get familiarized with industry specific applications. Workshop instructors Mark Kerzner and Sujee Maniyam are industry practitioners and veterans of Big Data.


Date: March 25 - 27, 2016


Time: 9AM - 6PM


Location: Biltmore Hotel, 2151 Laurelwood Rd, Santa Clara, CA 95054


Day 1 : NoSQL with Cassandra




Learn NoSQL data modeling with the popular Cassandra data base


9:00AM – 10:00AM - NoSQL landscape       


10:00AM – 12:00PM - Cassandra architecture and concepts   


12:00PM - 1:00PM - Lunch Break


1:00PM - 2:00PM - CQL


2:00PM - 3:00PM - Data modeling in Cassandra using CQL


3:00PM - 3:15PM - Break


3:15PM - 4:15PM - queries


4:15PM - 5:15PM - indexes


5:15PM - 5:45PM - composite keys


5:45PM - 6:00PM - Wrap –up Day 1



Day 2 : Hadoop




Learn to use Hadoop - the Big Data platform


9:00AM - 11:30AM - Hadoop intro


11:30 AM - 12:00PM - HDFS


12:00 PM -- 1:00PM - Lunch Break


1:00 PM - 3:00PM - Map Reduce primer


3:00PM - 3:15PM - Break


3:15PM - 4:45PM - Hive


4:45PM - 5:45PM - Querying data in Hadoop


5:45PM - 6:00PM - Wrap –up Day 2


Day 3 Spark




Continue learning Big Data analytics with emerging technology - Apache Spark


9:00 AM - 10:00 AM - Scala primer (quick introduction)


10:00 AM – 12:00  PM - Spark architecture / design


12:00 PM - 1:00PM - Lunch Break


1:00PM - 2:00PM - Spark Shell


2:00PM - 3:00PM - RDDs


3:00PM - 3:15PM - Break


3:15PM - 5.00PM - Spark SQL / Dataframes


5:00PM - 5:45PM - Spark streaming


5:45PM - 6:00PM - Wrap-up and Q&A


NOTE: Agenda subject to change without notice


Lab requirement:


A reasonably modern laptop (Need to be able to connect to clusters running on cloud services… corporate laptops with overly restrictive firewalls are not recommended)


  • SSH client (For Windows use Putty / SecureCRT ; Mac and Linux come with ssh clients)

  • Chrome browser with Markdown Preview Plus plugin

  • Nice to have : a programmer’s editor

    • Windows : Sublime, NotePad++, Programmer’s NotePad, TextPad

    • Mac : Sublime, TextWrangler

    • Linux : Sublime, GEdit, vim, Emacs


Who should attend?


This course is appropriate for any Big Data enthusiast including Software Programmer, Project Manager, Product Manager, Architect, DBA or Quality Analyst. Prior experience with programming is not necessary.


Instructor : Mark Kerzner












Mark is an experienced/hands-on BigData architect. He has been developing software for over 20 years in a variety of technologies (enterprise, web, HPC) and for a variety of verticals (healthcare, O&G, legal, financial).


He currently focuses on Hadoop, BigData,NOSQL and Amazon Cloud Services. Mark has been doing Hadoop training for individuals and corporations; his classes are hands-on and draw heavily on his industry experience.


Mark stays active in the Hadoop / Startup communities. He runs Houston Hadoop Meetup. Mark contributes to a number of Hadoop-based projects.

Instructor : Sujee Maniyam












Sujee has been developing software for 15 years. In the last few years he has been consulting and teaching Hadoop, NOSQL and Cloud technologies.

Sujee stays active in Hadoop / Open Source community. He runs a developer focused meetup and Hadoop hackathons called ‘Big Data Gurus’. He has presented at variety of meetups.


Sujee contributes to Hadoop project and other open source projects. He writes about Hadoop and other technologies on his website.

Book written by Mark & Sujee