Micro-project on big data technologies with Spark
- Colab-Spark setup
- Data loading
- EDA & Preprocessing
- Pipelines & Experiments
- Text preprocessing
- Text classification
  - BoW models + LogReg
  - Transfer Learning (at least an attempt 😀)
- Entity Recognition & Entity Linking
...and much more 🤘