In this repository, you can find my course work for the subject Large scale data analysis taken at IT University of Copenhagen. Here is a quick overview:
- Introduction to spark (exercise)
- Time series prediction of power production
- Yelp review analysis at scale
- MLflow model tracking
- Final exam report
The course has been managed by Maria Sinziiana Astefanoaei to whom goes full credit for the course content which might be included partially in my course work.