Clicky

Joel, Pinto da Mata - CV

joelpintomata.com | LinkedIn | joelmatacv@runbox.com

Work Experience

Data Engineer at KPN - DSH - Real-Time Exchange Platform, December 2022 - Present

  • Development of the DSH/KPN DataCatalogue, a DataMesh/DataMarket platform.
  • Implementation of Python (+ pySpark) ETL pipelines for data ingestion into DataHub.
  • Pipeline orchestration with Airflow, incorporating custom instrumentation for traceability, recoverability and blacklisting.
  • Development of a confluent-compliant, multi-tenant Schema Registry using Spring Boot.
  • Led an open-source contribution for DataHu b Azure Blob ingestion.

Tech Stack: Kafka, Kafka Streams, Airflow, Python, pySpark, Java, DataHub, Apicurio, Kubernetes, Flux

Senior Data Engineer at DPG Media Nederland, April 2022 – December 2022

  • Full SDSL of a real-time delivery disruption detection and notification system.
  • Implementationg using a serverless architecture using event-driven data pipelines.
  • Managed the project lifecycle with AWS tooling and infrastructure provisioning using Terraform.
  • Conducted Stakeholder management and equirement and technical analysis activities.

Tech Stack: AWS (Lambda, API Gateway, CodeBuild, SQS, SNS), Node.js, Python, Terraform

Senior Data Engineer at Scival, April 2021–April 2022

  • Developed a large-scale analytics platform for research data insights.
  • Created data processing and enrichment pipelines with Java, Python, Scala, and Spark.
  • Conducted data analysis and modeling using Apache Hive (S3).
  • Performed ad hoc data analysis with Jupyter notebooks.

Tech Stack: Java, Scala, Python, Spark, Kafka, Hive, AWS (EMR, S3, Redshift, Notebooks)

Delegate Architect/Senior Software Engineer at Mendeley, March 2020–April 2021

Software Engineer at Mendeley, March 2020 – April 2021

  • Developed a fully customizable solution for research data management, significantly increasing customer engagement and reducing onboarding time.
  • Led cross-team technical initiatives, established architectural principles, and maintained technical artifacts.
  • Migration of existing solutions to microservices and server-less architectures.

Tech Stack: Java, Spring Boot (+ Cloud Functions), MariaDB, PostgreSQL, AWS, SQS, ActiveMQ, Jenkins, Kubernetes

Delegate Architect/Senior Software Engineer at Research Data Search, September 2018–March 2020

  • Developed Mendeley’s Research Data search engine.
  • Led cross-team technical initiatives, ensuring adherence to architectural standards.
  • Developed data pipelines for metadata extraction, data classification, and enrichment.

Tech Stack: Java, Python, Spring Boot (+ Cloud Functions), Solr, Spark, Apache NiFi, AWS, EMR, GitLab, Kubernetes, Docker

Senior Big Data Engineer at PublicSonar, March 2018 – September 2018

  • Developed a social media real-time analysis platform for early warning and incident management.
  • Built data processing and analysis pipelines with Scala, Spark, Java, and Kafka Streams.
  • Developed data ingestion pipelines using Apache NiFi.

Tech Stack: Golang, Kafka, RabbitMQ, Mongo, GitLab, Docker

Senior Backend Developer at iFlavours

October 2015 – February 2018

  • Developed a greenfield e-commerce web shop and price comparator.

Tech Stack: Java, Scala, Spark, Spring Boot, Play Framework, Mongo, MariaDB, AWS, GitLab, Terraform, Docker

IT Engineer at European Space Agency (ESTEC), February 2011 – September 2015

Tech Stack: Java, Hibernate, Oracle, Apex

Various Development Roles between 2007 and 2011

Education

  • Masters in Computer Engineering - Universidade Nova de Lisboa (FCT), Portugal

Certifications

  • The Open Group Certified: TOGAF®
  • Oracle Certified Master Java SE 6 Developer
  • Oracle Certified Professional Java SE 5 Programmer
  • Spring Core V3
  • ITIL V3 Foundations
  • Green Belt Foundational Security Training

Projects

  • Project Thrive: Mentoring junior developers through the OfferZen Foundation.

Skills

  • Languages: (+)Java, Python, (-)Scala, (-)Node.js, (-)Golang
  • Data Processing: Kafka, Spark, Airflow, Apache NiFi
  • Cloud Services: AWS (Lambda, API Gateway, EMR, S3, Redshift, SQS, SNS)
  • DevOps: Terraform, Docker, GitLab, Kubernetes