Joel, Pinto da Mata - CV
joelpintomata.com | LinkedIn | joelmatacv@runbox.com
Work Experience
- Development of the DSH/KPN DataCatalogue, a DataMesh/DataMarket platform.
- Implementation of Python (+ pySpark) ETL pipelines for data ingestion into DataHub.
- Pipeline orchestration with Airflow, incorporating custom instrumentation for traceability, recoverability and blacklisting.
- Development of a confluent-compliant, multi-tenant Schema Registry using Spring Boot.
- Led an open-source contribution for DataHu b Azure Blob ingestion.
Tech Stack: Kafka, Kafka Streams, Airflow, Python, pySpark, Java, DataHub, Apicurio, Kubernetes, Flux
- Full SDSL of a real-time delivery disruption detection and notification system.
- Implementationg using a serverless architecture using event-driven data pipelines.
- Managed the project lifecycle with AWS tooling and infrastructure provisioning using Terraform.
- Conducted Stakeholder management and equirement and technical analysis activities.
Tech Stack: AWS (Lambda, API Gateway, CodeBuild, SQS, SNS), Node.js, Python, Terraform
- Developed a large-scale analytics platform for research data insights.
- Created data processing and enrichment pipelines with Java, Python, Scala, and Spark.
- Conducted data analysis and modeling using Apache Hive (S3).
- Performed ad hoc data analysis with Jupyter notebooks.
Tech Stack: Java, Scala, Python, Spark, Kafka, Hive, AWS (EMR, S3, Redshift, Notebooks)
Software Engineer at Mendeley, March 2020 – April 2021
- Developed a fully customizable solution for research data management, significantly increasing customer engagement and reducing onboarding time.
- Led cross-team technical initiatives, established architectural principles, and maintained technical artifacts.
- Migration of existing solutions to microservices and server-less architectures.
Tech Stack: Java, Spring Boot (+ Cloud Functions), MariaDB, PostgreSQL, AWS, SQS, ActiveMQ, Jenkins, Kubernetes
Delegate Architect/Senior Software Engineer at Research Data Search, September 2018–March 2020
- Developed Mendeley’s Research Data search engine.
- Led cross-team technical initiatives, ensuring adherence to architectural standards.
- Developed data pipelines for metadata extraction, data classification, and enrichment.
Tech Stack: Java, Python, Spring Boot (+ Cloud Functions), Solr, Spark, Apache NiFi, AWS, EMR, GitLab, Kubernetes, Docker
Senior Big Data Engineer at PublicSonar, March 2018 – September 2018
- Developed a social media real-time analysis platform for early warning and incident management.
- Built data processing and analysis pipelines with Scala, Spark, Java, and Kafka Streams.
- Developed data ingestion pipelines using Apache NiFi.
Tech Stack: Golang, Kafka, RabbitMQ, Mongo, GitLab, Docker
Senior Backend Developer at iFlavours
October 2015 – February 2018
- Developed a greenfield e-commerce web shop and price comparator.
Tech Stack: Java, Scala, Spark, Spring Boot, Play Framework, Mongo, MariaDB, AWS, GitLab, Terraform, Docker
IT Engineer at European Space Agency (ESTEC), February 2011 – September 2015
Tech Stack: Java, Hibernate, Oracle, Apex
Various Development Roles between 2007 and 2011
Education
- Masters in Computer Engineering - Universidade Nova de Lisboa (FCT), Portugal
Certifications
- The Open Group Certified: TOGAF®
- Oracle Certified Master Java SE 6 Developer
- Oracle Certified Professional Java SE 5 Programmer
- Spring Core V3
- ITIL V3 Foundations
- Green Belt Foundational Security Training
Projects
- Project Thrive: Mentoring junior developers through the OfferZen Foundation.
Skills
- Languages: (+)Java, Python, (-)Scala, (-)Node.js, (-)Golang
- Data Processing: Kafka, Spark, Airflow, Apache NiFi
- Cloud Services: AWS (Lambda, API Gateway, EMR, S3, Redshift, SQS, SNS)
- DevOps: Terraform, Docker, GitLab, Kubernetes