ML Engineer • Data Scientist

Javier Guzmán Figueira Domínguez

I'm an AI enthusiast, currently focused on applying Machine Learning and Deep Learning at scale. I love working as part of a team: designing, helping each other, discussing which are the right questions, and building systems that serve a purpose. I love learning and challenges!

Experiences

Sr Machine Learning Engineer

Februrary 2021 - Present

Additionally to the maintenance and improvement of previously developed services and tools, focused on building a solution that oversees the quality of speech data delivered to clients and eases the delivery process.

Among other details, this system includes the development of:

  • a micro-service and a REST API (FastAPI),
  • a binary classification model for audio (deployed in Kubernetes using MLflow),
  • an on-demand metrics computation using Druid.

As well, cooperated with stakeholders to understand the impact of our solutions on their needs and guide them on its usage. And, helped new team members in onboarding processes and first steps inside the team.

Languages & Frameworks:

Python

FastAPI

MLFlow

Platforms and Databases:

RabbitMQ

Apache Kafka

SQLServer

Druid

Data and ML Libraries:

PyTorch

Pandas

Other:

Kubernetes

Kibana

Grafana

Machine Learning Engineer

February 2020 - Februrary 2021

In the context of a data crowdsourcing company, worked in the squad responsible for ensuring the quality of the data submitted by the crowd.

To achieve this goal, worked towards the productization of metrics and models which allowed taking dynamic actions as part of the data flow and building analytical tools for monitoring purposes.

Other tasks performed:

  • Improve CI/CD and packaging of artifacts from data scientists and internal tools.
  • Development and maintenance of end-to-end tests.
  • Helping data scientists in their day-to-day routine.
  • Maintenance of micro-services and throughput enhancement.
  • Monitoring and alerting of infrastructure.
Languages & Frameworks:

Python (90%)

Flask

.NET (C#)

Platforms:

Apache Spark

Superset

RabbitMQ

Apache Kafka

Data libraries:

Pandas

Numpy

Matplotlib

Seaborn

Databases:

SQLServer

PosgreSQL

CosmosDB

Druid

Other:

Apache Jupyter

Docker

Kubernetes

Machine Learning Engineer

October 2016 - February 2020

As part of the e-learning team, I help to solve the emerging challenges through making tasks as:

  • Data visualization and exploratory analysis using code notebooks.
  • Development of proofs of concept applying state-of-the-art methods.
  • Support to design architectures to deploy the chosen solutions.
  • Development of machine learning and deep learning models using both classical tools, big data and streaming platforms.

Some of the ML/DL techniques I have applied to projects are:

  • Pattern mining and frequent itemsets mining.
  • Hidden Markov Models and LSTM Networks.
  • Recommender systems using ALS, autoencoders and embeddings.
  • Anomaly detection using clustering techniques (kMeans, DBSCAN).
Languages:

Python

R

Scala

Typescript

Platforms:

Apache Spark

Metabase

RabbitMQ

Apache Kafka

Data libraries:

Pandas

Numpy

Matplotlib

ggplot2

Machine Learning/Deep Learning libraries:

Scikit-Learn

SparkML

Keras

TensorFlow

Caret

Databases:

MySQL

PostgreSQL

MongoDB

Other:

Apache Jupyter

Apache Zeppelin

Docker

Jenkins

Logstash

Node.js

React

Co-founder - Architect - FullStack Developer

September 2015 - May 2017

Co-foundated, designed and developed of a collaborative platform for students oriented to the early creation of a professional profile. For this, designed and early developed a microservice-based architecture, by using the Netflix OSS stack over the Spring framework: Zuul, Hystrix, Ribbon, Eureka, Spring Stream...

Technologies & languages:

Java

Spring

JavaScript

AngularJS

Docker

PosgreSQL

MongoDB

Neo4J

Apache Kafka

Full Stack Developer Intern

March 2016 - May 2016

Maintained and developed a management application broadly used on call centers.

Technologies used:

.NET

C#

SQL Server

Android Developer

September 2015 - April 2016

First contact with a multidisciplinary development team through a collaboration in this startup as an Android developer implementing UX/UI designs.

Technologies used:

Android

Python

Git

Skills & Proficiency

Sample of technologies and languages that I know. They are subjectively classified according to my own knowledge of them. It seeks to choose technologies representative of different fields, not all of them.

Python & R

Machine Learning

Deep Learning

Apache Spark & Scala

Big Data Tools

SQL & NoSQL

Architecture Design

Java & Node.js

C/C++

Agile methodologies

DevOps

Other Skills

Docker Jenkins Git Bash Scrum TDD Clean Code

Projects

These are some projects in those I have collaborated, both academically or professionally.

Feature selection in big image sets - Research of feature selection methods over Spark using the Imagenet dataset. Features are extracted both using transfer learning (Keras) and classical artificial vision methods (LBP, BoW with SURF, KAZE...).
MetaDW - (BSc CSE Project) Metadata management and presentation system in a data warehouse environment for Business Intelligence purposes.
Gradox - Platform of collaboration between students, mainly focused on the University and oriented to the early creation of a professional profile.
Chattyhive - An social chat service that allows you to interact with other users through the topics you like and to create communities to share information in real time.
ChuckBot - Sample of a bot for ChatOps, built with the Hubot framework.

Certifications

Mongo University

Machine Learning

June 2017
Coursera, Stanford University

English B2

September 2009