I do a mix of data engineering and machine learning. On the data engineering side, I help build out the organization's data platform: I develop Apache Airflow DAGs to automate manual ETL tasks, design custom Airflow operators to integrate with AWS, and convert legacy SQL scripts to Spark SQL to run on the platform. On the machine learning side, I use Spark MLlib and TensorFlow to design models that forecast retail demand, determine product groupings, and recommend items to customers.
I generate dashboards using Python and Bokeh for doctors, staff, and hospital management; build complex ETL pipelines to consume data from the hospital's decentralized data stores; and develop machine learning models to improve quality of care and assist in both clinical and business decisions.
2015 - 2017 (about 2 years)
I founded PaddleSoft to help paddlers plan their whitewater adventures. My work ranged from building a river search engine with Elasticsearch, to creating a paddling social networking site with Neo4j and Rails, to training a machine learning algorithm to predict river flows, to scraping paddling websites with Python and using the scraped text to train Word2Vec and NLP algorithms, to using Node.js and Kafka to create a real-time river flow map.
Worked with the hospital data analytics/PMO team to pull patient and financial data from SQL databases, transform it for consumption by APIs, and create interactive visualizations. Additionally, automated several manual processes, such as weekly reporting to the hospital VPs.
Developing a Facebook scraping and analysis engine.
Apache Spark, Jupyter Notebook, Kafka · The goal of this project is to develop a Facebook scraping engine that easily integrates with existing data pipelines.
What I Do
I specialize in developing end-to-end machine learning applications. This includes everything from initial data collection, to exploratory data analysis, to training/fine-tuning models, to deploying the models into production.
Created a Ruby on Rails application to collect stream flow and weather data in a PostgreSQL database, developed a time-series neural network in MATLAB to predict the flow of the stream, and displayed the predictions in a graph using Chart.js and jQuery.
I'm looking for a job involving machine learning. I'm particularly interested in using my skills to adapt cutting-edge machine learning research to business problems. However, I'm also capable of data engineering tasks such as ETL and pipeline design.