A bank is launching a new credit card and wants to identify
prospects it can target in its marketing campaign.
It has received prospect data from various internal and 3rd party sources. The data has various issues such as missing or unknown values in certain fields.The data needs to be cleansed before any kind of analysis can be done.
Since the data is in huge volume with billions of records, the bank has asked you to use Big Data Hadoop and Spark technology to cleanse, transform and analyze this data.
- Big Data Hadoop concepts
- Hadoop - Hands-On
- Spark concepts and hands-on
- Project - Bank prospects marketing data cleansing using Spark
- Bonus - Bank prospects data transformation using AWS S3, Glue and Athena
- Bonus project - Build your first Machine Learning Model