projects
Simple app to extract text from pictures using Tesseract
Episode 223 - Build a Screen Recorder with Electron
🔍 Data pipeline for crawling PDFs from the Web and transforming their contents into structured data using AWS textract. Built with AWS CDK + TypeScript
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
A very simple command line tool for downloading YouTube videos.
Data Engineering pipeline hosted entirely in the AWS ecosystem utilizing DocumentDB as the database
DEPRECATED - Raspberry Pi Kubernetes cluster that runs HA/HP Drupal 8
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
Building a fake news detector from initial ideation to model deployment
Price Crawler - Tracking Price Inflation
Project files for the post: Running PySpark Applications on Amazon EMR using Apache Airflow: Using the new Amazon Managed Workflows for Apache Airflow (MWAA) on AWS.
My Insight Data Engineering Fellowship project. I implemented a big data processing pipeline based on lambda architecture, that aggregates Twitter and US stock market data for user sentiment anal…