Data Engineering: Big Data Technologies
Training Provider: DIGIPEN INSTITUTE OF TECHNOLOGY SINGAPORE PTE. LTD.
Course Reference: TGS-2023022301
Course Fee: S$1,750 (Original: S$3,500, Save S$1,750)
About This Course
Upon successful completion of the module, the trainee will be able to perform the following specific tasks:
- Demonstrate aptitude in working with Linux commands
- Apply HDFS commands to manage data and MapReduce to process it
- Manage and process streaming data with Apache Kafka
- Train scalable machine learning models with Apache Spark
- Design Hive queries to process data in a distributed manner
What You'll Learn
This module introduces the core concepts of several big data technologies: Apache Kafka, Apache Hadoop, Apache Spark, Apache Sqoop and Apache Hive. Trainees will build skills in working in the Linux environment, using stream processing services, applying HDFS commands, training scalable machine learning models with Spark, migrating data from relational databases to HDFS with Sqoop, and using Hive for distributed data processing.
Entry Requirements
Diploma with 2 years of working experience.
Course Details
Note: To apply for this course, visit the SkillsFuture website or contact the training provider directly.