Cherhan.net - Introduction to Big Data and Hadoop Ecosystem
The rise of social media has become ubiquitous in modern society. Social media generates large amount of data, which challenges state-of-the-art data computation and analysis methods. This workshop covers the basics of Big Data, Big Data analytics concepts, and help you to be familiar with Hadoop ecosystem.
We will take a look into the following topics of Hadoop, the most popular Big Data platform:
Hadoop Components: MapReduce
- Hadoop programming: Hive, Pig, Spark.
After completing the course, participants will bring home a Virtual Machine (VM) with ready-to-use Big Data solutions, pre-installed with Hadoop packages and datasets that can be used with their datasets. Participants will:
Understand what is big data
Able to describe Hadoop ecosystem, and
Apply Big Data processing using Hadoop
Dr. Lau Cher Han, a data scientist with CTO background and graduated from Queensland University of Technology (QUT) with Philosophy Doctorate. Dr. Lau’s research interest has been in the area of text mining and big data analysis. During his PhD, he researched a machine learning model that detects news topics from Twitter, using text mining, sentiment analysis and organic metrics. This model is used by Australian media companies to build systems that present topics to support news editor decision making process.
7:00 PM - 10:00 PM MYT
- @CAT Penang
Standard RM89.00 Early Bird RM69.00
- Venue Address
- ACAT Penang, 16, Gat Lebuh China, George Town, 10200 George Town, Pulau Pinang, Malaysia Malaysia