John Simonsen

Data Science & MLOps Specialist

John Simonsen Headshot

Professional Summary

Conifer, CO

Master's Graduate in Data Science specializing in MLOps and the end-to-end machine learning lifecycle. My expertise is demonstrated through developing a complete RAG-based Q&A application, from data ingestion and containerization to deployment on Kubernetes. I have authored deployment blueprints for AWS using Terraform and created CI/CD pipelines with GitHub Actions to ensure robust, automated workflows. Combining these advanced MLOps skills with a decade of experience in systems administration, I am well-equipped to build, scale, and maintain reliable AI/ML solutions.

Technical Skills

AI / Machine Learning

PyTorch Scikit-learn NLTK LangChain LLMs (Ollama) RAG Neural Networks (CNN, LSTM) NLP Time Series Regression Classification Clustering

MLOps, Cloud & Infrastructure

AWS (EKS, ECR, S3) Kubernetes Docker Terraform (IaC) CI/CD (GitHub Actions, GitLab) MLflow DVC FastAPI Linux (Ubuntu)

Languages & Databases

Python SQL ChromaDB (Vector DB) PostgreSQL MySQL Bash Java C++ HTML/CSS/JS

Projects & Portfolio

MLOps Stack Q&A Bot | End-to-end RAG Application

  • Developed a Retrieval-Augmented Generation (RAG) application designed to answer questions about MLOps tools (ZenML, MLflow, Docker, etc.) by utilizing official documentation.
  • Engineered a data ingestion pipeline involving web scraping, embedding generation (using Ollama), and storage in a ChromaDB vector database.
  • Built and containerized a RESTful API using FastAPI and Docker, and integrated it with a front-end web interface.
  • Deployed the containerized application on a local Kubernetes cluster for development and testing.
  • Authored a complete blueprint for cloud deployment, including Terraform configuration for AWS (EKS, ECR, S3) and a CI/CD pipeline using GitHub Actions.

Flight Delay Predictor | MLOps, CI/CD & Model Deployment

  • Developed and deployed a machine learning model for predicting flight delays using Python, FastAPI, Docker, and MLflow.
  • Utilized MLflow for model tracking, versioning, and deployment, showcasing experience with a leading MLOps platform.
  • Implemented a CI/CD pipeline using GitLab to automate the testing (Pytest) and deployment process.

Customer Sentiment Analysis | NLP & LSTM Networks

  • Developed and trained an LSTM neural network model using PyTorch to classify customer sentiment from product and service reviews.
  • Implemented text data preparation pipeline including cleaning, tokenization (NLTK), lemmatization, and padding.
  • Employed techniques like embedding layers, dropout regularization, and early stopping to mitigate overfitting.

Professional Experience

AI Trainer

2024 - Present

Alignerr (Remote) & DataAnnotation (Remote)

  • Contribute to specialized AI model evaluation projects, assessing capabilities in code generation, technical problem-solving, and multi-language instruction adherence to refine model performance.
  • Perform comparative analysis on AI model variations by evaluating side-by-side outputs to guide development and enhance user interaction quality.
  • Provide detailed feedback on AI-generated code and technical outputs, identifying errors and logical flaws to improve model reasoning, accuracy, and robustness.

Systems Administrator & Technical Instructor

2016 - 2023

Northern Utah Academy for Math, Engineering, and Science (Layton, UT)

  • System Administration: Administered and maintained a 28-workstation Ubuntu lab, managing updates, user provisioning, and troubleshooting.
  • Automation & Tool Development: Developed automation scripts (Bash, Python) and CI/CD pipelines (GitHub Actions) to streamline classroom operations, including automated grading and system imaging.
  • Mentorship: Led student teams in the CyberPatriot competition for five years, consistently achieving top rankings (1st-3rd) in state competitions.

Education & Awards

Master of Science in Data Analytics - Data Science

APRIL 2025

Western Governors University

View Graduate Coursework Highlights

Code available in the MSDA Graduate Coursework Repository.

  • Advanced Analytics: Neural networks, deep learning (PyTorch), NLP, sentiment analysis.
  • Machine Learning: Supervised (SVMs, k-NN), unsupervised (k-means, t-SNE), time series forecasting.
  • Deployment: Operationalization, scalable data pipelines, deployment at scale.
  • Optimization: Linear optimization (Python), gradient/non-gradient algorithms.

Bachelor of Science in Technology and Engineering Education

MAY 2015

Utah State University

Awards

  • Eagle Scout Award with Bronze Palm
  • Winner, Northrup Grumman Coding Challenge, Hill Air Force Base