~/trevordalton
$ whoami

Trevor Dalton

// Senior Data Engineer

Building agentic AI pipelines and data infrastructure at M Science. Passionate about LLMs and the engineering challenges of putting them into production. When I'm not coding I'm lifting weights, playing pickleball, or reading. Always happy to talk books or anything else.

#

experience

Senior Data Engineer Jan 2026–Present · Data Engineer Mar 2022–Dec 2025
2022 – Present
  • Designed an agentic eReceipt tagging pipeline using LangGraph, BM25 retrieval, and OpenAI/Anthropic LLMs to automatically classify thousands of daily transactions against a video game product taxonomy
  • Fine-tuned multiple BERT-based classification models using PyTorch and Hugging Face to automate large-scale data labeling and improve data accuracy with F1-scores above 98%. Deployed models with MLflow, streamlining operations and cutting manual effort by 50%
  • Derived actionable insights from large-scale video game sales data using SQL and Python, uncovering user behavior trends to inform client product strategy and marketing decisions
  • Construct PySpark/SQL ETL pipelines for full raw-to-deliverable processing of large-scale data
  • Optimize AWS EC2 configurations to cut pipeline costs and runtime by as much as 60%
  • Developed DataOps-controlled pipeline orchestration systems, reducing runtimes by over 20%
python databricks pyspark pytorch hugging face langgraph mlflow snowflake aws airflow
Data Engineer May 2021–Mar 2022 · Data Engineering Intern May 2020–May 2021
2020 – 2022
  • Engineered a JavaScript multi-source ETL pipeline to connect, aggregate and analyze data
  • Designed frontend interface for a master data pipeline to derive insights from enterprise application data
  • Developed an Ontology-driven ETL Orchestration tool used by 50+ developers using React
  • Built scalable desktop applications using React, Node.js, TypeScript, and Electron
  • Developed machine learning and graph database models for supply chain and maintenance forecasting to mitigate risk and reduce lifetime costs of advanced weapons systems
  • Delivered valuable insights on large datasets using statistical analysis and MatPlotLib visualizations
python javascript typescript react node.js neo4j graphql aws
#

education

M.S. Information and Data Science
2024

The MIDS program at the Berkeley School of Information is recognized as one of the nation's top-tier data science programs. The program's focus on collaborative problem-solving has taught me how to form effective teams from a diverse set of individuals.

  • Natural Language Processing
  • Time Series and Panel Data Analysis
  • Computer Vision
  • Experiments and Causal Inference
  • Machine Learning
  • Data Engineering
  • Statistics
  • Research Design and Analysis
B.S. Computer Science
2021

Utah's preeminent research institution and where I first cut my teeth at software development. Despite the challenging curriculum I was able to thrive thanks to the help of my peers and professors who offered ample support.

  • Algorithms
  • Artificial Intelligence
  • Data Visualization
  • Operating Systems
  • Database Systems
  • Machine Learning
  • NLP
  • Information Systems
  • Computer Systems
A.S. Computer Science
2018

Taking full advantage of the Success Academy Program I engaged in concurrent enrollment classes which allowed me to graduate high school with enough credits to attain my associate's degree at the age of 18. The supportive community and passionate professors ignited my love for computing.

  • Data Structures
  • Algorithms
#

projects

Datadrip

Datadrip: AI for Financial Analysts

Automates earnings analysis for publicly traded companies. Extracts and summarizes financial data from earnings presentations using computer vision, generative AI, and visual Q/A. Built in 16 weeks; awarded 2nd place in Berkeley MIDS capstone presentations.

You can view the demo here.

↗ Website
Bank Document Verifier

Bank Document Verifier

A collaborative workspace geared to enable financial institutions to automate their application processes. Created using Angular, TypeScript, and SCSS. Here, banks can automatically verify important documents such as W2s, Schedule C's, etc. and approve applicants. BDV allows for real-time communication to ensure loans are processed accurately and timely.

The Bank Document Verifier team was awarded 2nd place in the University of Utah's School of Computing's 2021 Spring capstone presentations

↗ Demo
Aye-Aye

Aye-Aye: Semantic Lexicon Induction

A semi-supervised lexicon induction algorithm! Given just a few seed words this the Aye-Aye can learn any semantic category to a high degree of precision

↗ GitHub
Personal Website

Personal Website

Passion project of mine I have maintained since 2018 to keep with my web development roots and have an online portfolio for people to stay updated on my journey

↗ GitHub
ASCII Converter

ASCII Image Converter

An application that allows users to submit images and convert them to their ASCII equivalents. Multiprocessing is used to speed up conversion on larger images. The image above is an egg I tested this on.

↗ GitHub