PhD-educated Data Scientist specializing in applying advanced machine learning and statistical analysis to complex time-series and physiological data. Possesses over five years of hands-on experience building robust data pipelines in Python (Pandas, NumPy, Scikit-learn, PyTorch) and SQL. Expert in the full data science workflow, including data curation, advanced feature engineering, dimensionality reduction (PCA, UMAP), and developing hybrid models. A key project involved using unsupervised clustering (HDBSCAN) to discover novel features (“syllables of movement”) from raw data, which then powered a highly accurate Random Forest classifier to predict complex human behaviours.
Doctor of Philosophy (Engineering), 2021
The University of Sheffield
Master of Engineering, 2011
Universidad La Salle Bajio
BEng in Mechanical and Electrical Engineering, 2009
Universidad La Salle Bajio
Python (Expert: Pandas, NumPy, Scikit-learn, Matplotlib), SQL, MATLAB, Version Control (Git, GitHub, GitLab, Bitbucket)
Algorithms: Unsupervised Clustering (K-Means, Hierarchical, Density-based), Supervised Classification (Decision Trees, Random Forest), Regression, Bayesian Methods. Techniques: Feature Engineering, Dimensionality Reduction (e.g., PCA, T-SNE, UMAP), Model Training & Evaluation, Statistical Analysis, Time Series Analysis.
Scikit-learn, Pandas, NumPy, Matplotlib, Pytorch
Marker-based (e.g., Vicon Motion Systems, CODA motion), and marker-less (Openpose, DeepLabCut, OpenCV), Action Classification, Motion Modelling
Unity, HTC Vive, Oculus Quest
Spanish (Native), English, German (Basic-Intermediate)
May you wish to contact me, please fill out the form!