Hello, I'm

Jonathan Musni

Senior Data Engineer & Analytics Expert

I build scalable data solutions that drive business impact. I specialize in cloud architectures, performance optimization, and turning complex data challenges into strategic advantages.

63% Cost Reduction
96% Performance Boost
5+ Years Experience
Jonathan Musni running marathon in New York
Marathon Finisher

About Me

From healthcare tech to data engineering excellence

My Journey

My path to data engineering began in an unexpected place - the operating rooms of St. Luke's Medical Center in the Philippines. As a Biomedical Equipment Technician, I learned the critical importance of precision and reliability when technology directly impacts lives.

This foundation led me through operations support at Sky Cable Corporation, where I optimized network performance while self-studying data science. DXC Technology sponsored my Master's in Data Science at Drexel University, opening doors to the fascinating world of AI and machine learning.

Today, I focus on building robust, scalable data solutions that drive real business value while maintaining the precision and attention to detail that started my journey in healthcare technology.

Education

MS Data Science (Drexel, 3.92 GPA)
Currently: MS Information Systems (Part-time)

Location

Philadelphia, PA
Originally from the Philippines

Beyond Code

Spartan racer & marathoner
Philadelphia tech community

Professional Journey

Building data solutions that drive impact

MS Information Systems (Part-time) & Continuous Learning

Harrisburg University & Self-directed • 2024 - Present

Advanced studies and continuous learning in cutting-edge technologies:

  • Pursuing MS in Information Systems and Engineering Management (Part-time)
  • DataExpert Combined Track (Data and Analytics Engineering) - 2025
  • Exploring Model Context Protocol, data structures and algorithms (DSA), and LangGraph
  • Sharing knowledge through technical blogs and GitHub projects
Data Engineering • AI Engineering • System Architecture • Research

Data Engineer

Comcast • 2023 - 2024

Contracted data engineering services for a major telecommunications company:

  • Built scalable data pipelines for analytics platforms
  • Enhanced data observability and monitoring systems
  • Implemented modern data engineering best practices
  • Earned multiple certifications including AWS Data Engineer Associate
Apache Spark • AWS • Databricks • Data Pipelines

Data Engineer

DXC Technology Platform X • 2021 - 2023

Transitioned to full-time data engineering with exceptional results:

  • 96% latency improvement (2 minutes to 2 seconds) through dataset migration
  • 63% AWS S3 storage cost reduction while maintaining data integrity
  • Made the Platform X Data Lake GDPR compliant
  • Graduated with an MS in Data Science from Drexel University (2021)
Apache Spark • AWS S3 • Data Lakes • GDPR Compliance • Python

Data Scientist

DXC Technology AI Studio • 2019 - 2021

Pioneered ML and AI applications while pursuing Master's degree:

  • Built a machine-learning-powered racing simulator
  • Developed the backend for an AI-powered virtual reality application
  • Pursued MS in Data Science at Drexel University (sponsored by DXC)
  • Moved to Philadelphia, USA in 2019
Machine Learning • Python • VR/AR • Data Science

Electronics Engineer - Area Operations

Sky Cable Operations • 2016 - 2019

Provided technical operations support for telecommunications infrastructure:

  • Managed area operations for cable television and internet services
  • Optimized network performance and system reliability
  • Self-studied data science and Python during free time
  • Briefly trained as a Civil Aviation Authority officer candidate (2016)
Network Operations • Electronics Engineering • System Optimization • Self-study

Biomedical Equipment Technician

St. Luke's Medical Center • 2015 - 2016

Critical healthcare technology support in operating theaters:

  • Ensured reliability of medical equipment during surgeries
  • Maintained precision and attention to detail in life-critical environments
  • Passed the Philippine Electronics Engineering Licensure Exam (2015)
  • Graduated with a BS in Electronics Engineering from Polytechnic University of the Philippines (2014)
Medical Equipment • Quality Control • Electronics Engineering • Critical Systems

Featured Projects

Real-world solutions with measurable impact

Databricks • Delta Live Tables • K-means Clustering • GitHub Actions • Delta Lake

Retail Customer Analytics Pipeline

End-to-end data engineering pipeline with medallion architecture (Bronze/Silver/Gold layers). Features automated K-means customer segmentation, RFM analysis, and comprehensive data quality monitoring.
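As a rough illustration of the RFM (recency, frequency, monetary) step mentioned above, here is a minimal pure-Python sketch. It is not the project's code: the function name `rfm_table`, the tuple layout of `orders`, and the sample data are all hypothetical, and the actual pipeline runs on Databricks rather than in plain Python.

```python
from datetime import date


def rfm_table(orders, as_of):
    """Aggregate (customer_id, order_date, amount) rows into per-customer RFM values.

    Hypothetical helper for illustration only.
    """
    table = {}
    for customer_id, order_date, amount in orders:
        row = table.setdefault(
            customer_id,
            {"last_order": order_date, "frequency": 0, "monetary": 0.0},
        )
        # Track the most recent order, the order count, and the total spend.
        row["last_order"] = max(row["last_order"], order_date)
        row["frequency"] += 1
        row["monetary"] += amount
    return {
        cid: {
            "recency_days": (as_of - row["last_order"]).days,
            "frequency": row["frequency"],
            "monetary": round(row["monetary"], 2),
        }
        for cid, row in table.items()
    }


# Made-up sample orders, not real project data.
orders = [
    ("c1", date(2024, 1, 5), 40.0),
    ("c1", date(2024, 3, 1), 60.0),
    ("c2", date(2023, 11, 20), 15.5),
]
rfm = rfm_table(orders, as_of=date(2024, 3, 31))
```

In a setup like the one described, per-customer values of this kind would typically be scaled and then passed to K-means for segmentation.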

CrewAI • GPT-4 • Multi-Agent Systems • LangChain

Multi-Agent Job Application Helper

An AI system that uses the CrewAI multi-agent framework to ethically optimize job applications. Features coordinated agents for resume analysis, job matching, and application enhancement while maintaining authenticity.

LangGraph • PostgreSQL • ChromaDB • OpenAI

Stateful Conversational AI Chatbot

Advanced chatbot with hierarchical context retrieval using LangGraph. Features a 3-tier memory architecture: the last 5 conversations (PostgreSQL), vector search (ChromaDB), and the LLM's own knowledge base as a fallback.
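The tiered lookup order can be sketched in plain Python. This is an illustrative toy, not the project's implementation: the class `TieredMemory` is hypothetical, a bounded deque stands in for the PostgreSQL tier, a list with word-overlap scoring stands in for ChromaDB vector search, and the final tier just signals that the LLM should answer on its own.

```python
from collections import deque


class TieredMemory:
    """Hypothetical 3-tier lookup: recent turns, then archive search, then LLM."""

    def __init__(self, recent_limit=5):
        self.recent = deque(maxlen=recent_limit)  # stand-in for the PostgreSQL tier
        self.archive = []                         # stand-in for the ChromaDB tier

    def remember(self, question, answer):
        # When the recent tier is full, keep the turn about to be evicted.
        if len(self.recent) == self.recent.maxlen:
            self.archive.append(self.recent[0])
        self.recent.append((question, answer))

    def lookup(self, question):
        # Tier 1: exact match against the most recent conversations.
        for past_question, answer in reversed(self.recent):
            if past_question == question:
                return "recent", answer
        # Tier 2: toy similarity (shared words) in place of vector search.
        words = set(question.lower().split())
        best_answer, best_score = None, 0
        for past_question, answer in self.archive:
            score = len(words & set(past_question.lower().split()))
            if score > best_score:
                best_answer, best_score = answer, score
        if best_answer is not None:
            return "archive", best_answer
        # Tier 3: nothing stored helps, so defer to the LLM's own knowledge.
        return "llm", None


memory = TieredMemory(recent_limit=2)
memory.remember("what is iceberg", "a table format")
memory.remember("what is spark", "a compute engine")
memory.remember("what is kafka", "a distributed log")
```

Checking the cheapest, most specific tier first keeps common follow-up questions fast and reserves the more expensive tiers for cases where nothing recent applies.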

FastMCP • AWS Glue • Apache Iceberg • Model Context Protocol

Model Context Protocol Implementation

MCP client and server implementations featuring a specialized Iceberg server for AWS Glue integration. Supports database management, schema inspection, data querying, and snapshot history tracking.

Docker • Apache Spark • MinIO • GDPR Compliance

Apache Iceberg Handbook

Comprehensive technical collection covering Copy-on-Write vs Merge-on-Read strategies, GDPR compliance implementation, and interactive Jupyter notebooks with a complete Docker environment setup.
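The Copy-on-Write vs Merge-on-Read trade-off can be shown with a simplified in-memory model. This sketch is not taken from the handbook and does not use Iceberg's API. The two classes are hypothetical, a Python list stands in for a data file, and a set of keys stands in for a delete file.

```python
class CopyOnWriteTable:
    """Toy model: a delete rewrites the whole data file right away."""

    def __init__(self, rows):
        self.data_file = list(rows)
        self.rows_rewritten = 0  # write cost paid at delete time

    def delete(self, key):
        # Rewrite the file without the matching rows.
        self.data_file = [row for row in self.data_file if row["id"] != key]
        self.rows_rewritten += len(self.data_file)

    def scan(self):
        # Reads are cheap: the data file is already clean.
        return list(self.data_file)


class MergeOnReadTable:
    """Toy model: a delete only records the key; readers filter it out."""

    def __init__(self, rows):
        self.data_file = list(rows)
        self.delete_file = set()

    def delete(self, key):
        # Cheap write: the data file itself is untouched.
        self.delete_file.add(key)

    def scan(self):
        # Reads pay the cost of merging data with recorded deletes.
        return [row for row in self.data_file if row["id"] not in self.delete_file]


rows = [{"id": i} for i in range(4)]
cow, mor = CopyOnWriteTable(rows), MergeOnReadTable(rows)
cow.delete(2)
mor.delete(2)
```

Both tables return the same rows on a scan. The difference is when the work happens, which matters for GDPR-style erasure: under Merge-on-Read the deleted row stays physically in the data file until a later rewrite or compaction.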

Technical Expertise

Modern data stack and cloud technologies

Cloud Platforms

AWS
Azure
Databricks

Programming Languages

Python
SQL
PySpark

Data Processing

Apache Spark
Snowflake
Kafka
Flink
Trino

Tools & Frameworks

Airflow
dbt

Data Warehouse

Snowflake
PostgreSQL
Redshift
DynamoDB

Data Lakehouse

Delta Lake
Apache Iceberg
S3 Tables

AI Engineering

Model Context Protocol
RAG Systems
Vector Databases
Agentic AI

Professional Certifications & Education

Education: MS Data Science (Drexel University, 2021) • MS Information Systems (Harrisburg University, In Progress - Part-time) • BS Electronics Engineering (Polytechnic University of the Philippines, 2014)

Running Journey

I started running during the pandemic, and it has helped me mentally through some tough times. I love joining races and running with friends, but running alone is where I most often find clarity on whatever I am working through, whether in my personal or professional life.

Spartan Stadion 5K

2021 Washington DC

First Spartan race - stadium obstacles and endurance challenge

Spartan Beast 21K

2023 Cincinnati, Ohio

Half-marathon-distance obstacle race with 30+ obstacles over challenging terrain

Philadelphia Marathon

2023 Philadelphia, PA

26.2 miles through the City of Brotherly Love - home marathon achievement

Delaware Marathon

2024 Wilmington, Delaware

Second marathon completion - building endurance and consistency

5

Major Races

2

Marathons

3

Spartan Races

4+

Years Running

Let's Connect

Ready to discuss data engineering opportunities

Location

Philadelphia, PA

Schedule a Meeting

Book 30-min call

Resume

Download PDF