Hadoop is a big data framework that stores and processes very large datasets (terabytes to petabytes).
Components - Storage: HDFS; Processing: MapReduce (MR1, MR2).
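To give a feel for the processing model, here is a minimal sketch of the map, shuffle, and reduce steps behind a word count, written in plain Python. It is not Hadoop code and does not use any Hadoop API; the function names and the sample lines are assumptions made up for illustration.

```python
from collections import defaultdict


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle step: group all emitted values by their key (the word)."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce step: sum the counts collected for a single word."""
    return key, sum(values)


def word_count(lines):
    """Run the three steps in sequence on an in-memory list of lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle_phase(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


# Hypothetical sample input; a real job would read blocks of files from HDFS.
sample_lines = ["big data big pipelines", "data pipelines process data"]
counts = word_count(sample_lines)
print(counts)
```

In a real cluster the map and reduce steps run in parallel on many machines and the shuffle moves data between them; the sketch only shows the order of the steps.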
SQL
SQL is the language used to access and process data in a database.
Components - Basics, Databases, Tables, Views, Joins.
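As a small illustration of tables, views, and joins, the sketch below uses Python's built-in sqlite3 module with an in-memory database. The table names, column names, and rows are hypothetical and chosen only for this example; the course itself may use a different database.

```python
import sqlite3

# In-memory database, so nothing is written to disk.
conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# Two hypothetical tables: students and the courses they are enrolled in.
cur.execute("CREATE TABLE students (id INTEGER PRIMARY KEY, name TEXT)")
cur.execute("CREATE TABLE enrollments (student_id INTEGER, course TEXT)")

cur.executemany(
    "INSERT INTO students VALUES (?, ?)",
    [(1, "Asha"), (2, "Ravi")],
)
cur.executemany(
    "INSERT INTO enrollments VALUES (?, ?)",
    [(1, "Hive"), (1, "Spark"), (2, "Scala")],
)

# A view that stores the join as a reusable, named query.
cur.execute(
    """
    CREATE VIEW student_courses AS
    SELECT s.name, e.course
    FROM students AS s
    JOIN enrollments AS e ON e.student_id = s.id
    """
)

# Query the view like an ordinary table.
rows = cur.execute(
    "SELECT name, course FROM student_courses ORDER BY name, course"
).fetchall()
print(rows)

conn.close()
```

The view does not copy any data; it runs the join each time it is queried.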
Hive
Hive is a data warehouse system which is used to analyze structured data.
Components - Basics, Data types, Tables, Views, Joins.
Scala
Scala is a programming language, similar to Java and Python.
Components - Data types, Classes, Objects, Functions, Loops, Singleton Objects, OOP, etc.
Spark
Spark is a big data framework designed to process huge data volumes in a faster, more optimised way.
Spark is available in Scala, Java, and Python (the Python API is called PySpark).
Components - Spark Core, Spark SQL.
About Course
Data Engineering
Data engineering is the practice of developing, testing, and maintaining data pipelines and architectures, which data scientists use for analysis.
What Will You Learn?
Learn Data Engineering - Level 1: the skills required to land a job.
Material Includes
Presentations
Temporary Recordings
Datasets
Project work
Requirements
Basic programming knowledge in Java, basic SQL, and basic Linux.
Laptop or PC is required for training and practice.
Audience
Engineering and computer science students and professionals