Projects

Practical projects in data engineering, cloud and AI-assisted development.

Irish Hospital Healthcare Analytics

Complete

Analytics platform ingesting Irish hospital patient reviews from Google Places and SerpAPI, combined with government waitlist data (NTPF), enriched with NLP sentiment analysis, and visualised through interactive dashboards.

Apache Airflow, AWS S3, PostgreSQL, Python, Apache Spark (+5 more)

Airbnb Listings Pipeline

Complete

Batch data pipeline that automates the collection of Airbnb listing snapshots from the Inside Airbnb dataset, stores them in MinIO, and loads them into PostgreSQL for historical trend analysis of the London short-term rental market.

Apache Airflow, MinIO, PostgreSQL, Apache Spark, Metabase (+3 more)

Batch Energy Analytics Pipeline (AWS)

Complete

Serverless batch pipeline that ingests monthly electricity generation data from the Ember API for 8 European countries, transforms nested JSON into Parquet format via AWS Glue, and visualises energy production trends through an Amazon QuickSight dashboard.

AWS S3, AWS Glue ETL, AWS Glue Crawler, Amazon Athena, Amazon QuickSight (+2 more)
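The nested-JSON-to-tabular step can be illustrated with a minimal sketch. The payload shape and field names below (`data`, `generation`, `generation_twh`) are assumptions made up for illustration, not the actual Ember API schema, and the sample values are invented. In the project itself this flattening runs in AWS Glue before the rows are written as Parquet.

```python
from typing import Any


def flatten_records(payload: dict[str, Any]) -> list[dict[str, Any]]:
    """Turn a nested per-country payload into one flat row per country/month/source."""
    rows = []
    for country in payload.get("data", []):
        for entry in country.get("generation", []):
            rows.append(
                {
                    "country": country["country"],
                    "month": entry["month"],
                    "source": entry["source"],
                    # Values arrive as strings in this hypothetical payload.
                    "generation_twh": float(entry["generation_twh"]),
                }
            )
    return rows


# Invented sample data, only to show the shape of the transformation.
sample = {
    "data": [
        {
            "country": "IE",
            "generation": [
                {"month": "2024-01", "source": "wind", "generation_twh": "1.2"},
                {"month": "2024-01", "source": "gas", "generation_twh": "1.5"},
            ],
        },
        {
            "country": "DE",
            "generation": [
                {"month": "2024-01", "source": "solar", "generation_twh": "0.9"},
            ],
        },
    ]
}

rows = flatten_records(sample)
```

Each flat row maps directly onto a Parquet column set, which is what makes the data queryable from Athena and QuickSight.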

dbt Analytics Engineering (AWS Redshift)

In Progress

dbt project that builds analytics data models on AWS Redshift in staging, intermediate, and mart layers. Based on the Olist Brazilian e-commerce dataset (2016–2018) and designed for self-service analytics with dimensional modelling.

dbt, AWS Redshift, SQL, AWS Glue, Python

Real-Time Maritime Analytics Pipeline (AWS)

Complete

Fully serverless real-time pipeline ingesting live AIS vessel position data via WebSocket, processing it through an event-driven Lambda architecture, archiving raw events in S3, and visualising vessel activity in real time through OpenSearch Dashboards.

Amazon Kinesis, AWS Lambda, Amazon SQS, Amazon Kinesis Firehose, Amazon S3 (+3 more)
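As a rough sketch of the event-driven Lambda step, the handler below decodes a batch of Kinesis records and keeps only messages that carry a position. The base64-encoded `Records[].kinesis.data` layout is the standard Kinesis event format for Lambda. The AIS message fields (`mmsi`, `lat`, `lon`) are hypothetical names chosen for illustration, not the project's actual schema.

```python
import base64
import json


def handler(event, context):
    """Decode Kinesis records and collect vessel positions (illustrative only)."""
    positions = []
    for record in event.get("Records", []):
        # Kinesis delivers each payload base64-encoded.
        raw = base64.b64decode(record["kinesis"]["data"])
        message = json.loads(raw)
        # Skip messages without coordinates (hypothetical field names).
        if message.get("lat") is None or message.get("lon") is None:
            continue
        positions.append(
            {"mmsi": message.get("mmsi"), "lat": message["lat"], "lon": message["lon"]}
        )
    return {"processed": len(positions), "positions": positions}


def _encode(message: dict) -> dict:
    """Build a fake Kinesis record for local testing."""
    data = base64.b64encode(json.dumps(message).encode("utf-8")).decode("ascii")
    return {"kinesis": {"data": data}}


# Invented test event: one valid position and one message without coordinates.
test_event = {
    "Records": [
        _encode({"mmsi": 123456789, "lat": 53.35, "lon": -6.26}),
        _encode({"mmsi": 987654321}),
    ]
}

result = handler(test_event, None)
```

Running the handler locally against a hand-built event like this is a cheap way to check the decoding logic before deploying it behind a real stream.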

Credit Card Fraud Detection (AWS SageMaker)

Complete

End-to-end ML pipeline for credit card fraud detection on AWS SageMaker, with a focus on handling class imbalance, threshold optimisation for business decisions, and the development of reusable, domain-agnostic ML frameworks.

AWS SageMaker, AWS S3, AWS Athena, Python, XGBoost (+4 more)
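Threshold optimisation for business decisions can be sketched as a search for the score cutoff with the lowest total cost. The cost figures (a missed fraud costing ten times a false alarm) and the sample scores are assumptions invented for illustration; the project's actual cost model and method may differ.

```python
def best_threshold(scores, labels, cost_fn=10.0, cost_fp=1.0):
    """Return the (threshold, cost) pair that minimises total misclassification cost.

    A transaction is flagged as fraud when its score is >= threshold.
    cost_fn and cost_fp are hypothetical business costs per error.
    """
    best_t, best_cost = None, float("inf")
    for t in sorted(set(scores)):
        false_pos = sum(1 for s, y in zip(scores, labels) if s >= t and y == 0)
        false_neg = sum(1 for s, y in zip(scores, labels) if s < t and y == 1)
        cost = cost_fp * false_pos + cost_fn * false_neg
        if cost < best_cost:
            best_t, best_cost = t, cost
    return best_t, best_cost


# Invented model scores and true labels (1 = fraud).
scores = [0.05, 0.10, 0.20, 0.35, 0.60, 0.80, 0.90]
labels = [0, 0, 0, 1, 0, 1, 1]

threshold, cost = best_threshold(scores, labels)
```

With these sample values the search settles on a cutoff well below 0.5, because a missed fraud is priced far higher than a false alarm. With heavy class imbalance this is typical: the default 0.5 cutoff is rarely the cheapest one.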

Versicherungswerk — Consulting Engagement

Ongoing

Consulting engagement for a German digital insurance platform (versicherungswerk.de). Focus: defining project structure and governance, introducing Claude Code as an audit and analysis tool, establishing working methods, and setting technical direction.

Claude Code, GitHub, Kanban, Miro, N8N (+2 more)

Role: Consultant