Tony DiRubbo

Data Scientist & AI Engineer

Building intelligent systems at the intersection of artificial intelligence and data infrastructure. Currently exploring the applications of large language models in real-world applications while maintaining a strong foundation in data engineering.

Tony DiRubbo

About Me

I am a graduate of Syracuse University with a master's in Applied Data Science, with pathways in Artificial Intelligence and Data Pipelines & Platforms. My journey in data science began during my undergraduate studies at St. John Fisher University, where I developed a strong foundation in statistical reasoning and computational thinking.

My academic work has been complemented by professional experiences at CooperVision and Tessy Corporation, where I've applied data science concepts to production systems. I'm currently working as a Graduate Student AI Researcher, developing adaptive AI resources for education.

What drives me is the opportunity to translate complex data into actionable insights and build systems that are both technically sound and user-focused. Whether it's designing data warehouses, developing machine learning models, or creating AI-powered applications, I'm passionate about creating value through data.

Software Engineering
Machine Learning
DevOps
Cloud Platforms
Web Development
Quant Statistics
Data Warehousing
Data Visualization

All Projects

Machine Learning to Predict Diabetes

Source Code

  • Built ML pipelines to predict Type II diabetes using non-clinical data from the CDC BRFSS survey
  • Engineered and encoded 28+ features; applied SMOTE and PCA to handle high dimensionality
  • Compared classification models (Logistic Regression, Random Forest, XGBoost, Neural Networks)
  • Applied cross-validation, threshold tuning, focal loss, and synthetic sampling to improve performance

Deep Learning to Predict Nutritional Content of Food from Images

Read Report Source Code

  • Curated MM-Food-100K by parsing nutrient fields and selecting the top 50 classes for balanced training
  • Built a ResNet-50 transfer learning model with augmentation, frozen layers, and epoch optimization
  • Created a multithreaded pipeline to download, validate, and preprocess 100K+ food images
  • Mapped model predictions to mean nutrient vectors to infer calories and macros from a single image

API Evaluation and Analysis

Read Report Source Code

  • Worked with a PhD graduate to evaluate their API which is used to rank cross country races
  • Used Python libraries including Pandas, Matplotlib, and MongoDB to read and evaluate JSON data
  • Developed a report which evaluates the performance of the API based on key metrics
  • Created a program which allows for quick analysis of future races

Machine Learning with Heart Disease

View Project

  • Developed models which input various recorded health metrics to predict heart disease development
  • Project was completed alone with mentorship from a professor
  • R, Excel, and Python were utilized to create various models which were evaluated against one another
  • Findings were presented at an end of semester conference for senior thesis projects

Award Winning Data Journalism Article

Read Article

  • Used data to prove if the NYC fully remote school district was a viable option
  • Data was independently collected and sorted with MS Excel, then Tableau was used for visualizations
  • Techniques from Tableau were developed on from a marketing visualization class
  • The article won awards and scholarships from the Washington Media Scholars Foundations

Work Experience

Tessy Corportation

March 2026 - Present

Enterprise Application Architect

  • Built a React Application to allow for automatic tracking and document management of quote requests
  • Utilized TypeScript and SharePoint Framework to build custom components for SharePoint pages
  • Implemented Microsoft Fabric to provide a new data warehouse pipeline for project analysis

Tessy Corportation

December 2025 - March 2026

Digital Workflow / System Automation Co-op

  • Assisted in maintaining and extending capabilities for a .NET microservice environment
  • Designed a JSON-driven workflow engine with dependency-aware task states and overrides
  • Created and optimized SQL stored procedures for frontend integration using Azure Web Services
  • Implemented a CI/CD DevOps pipeline for quick microservice upgrades and delivery

Syracuse University

January 2025 - December 2025

Graduate Student AI Researcher

  • Developing adaptive AI resources to enhance student learning and engagement
  • Collaborating with a team of five graduate students, meeting bi-weekly to coordinate research efforts
  • Leading semester-based projects focused on consistent AI themes across multiple terms
  • Presenting research progress and outcomes at end-of-semester poster symposiums

CooperCompanies

March 2025 - August 2025

Global Master Data Co-op

  • Utilized Java, Visual Basic, and Oracle to develop applications to assist with the data pipeline
  • Constructed Python scripts to perform large-scale data changes and updates
  • Reviewed and signed-off on data entry/changes to ensure data governance between regional systems
  • Pulled data for inquiry and analysis for finance, sustainability, and customer service teams using SQL

St John Fisher University

August 2021 - May 2024

Peer Math and Computer Science Tutor

  • Aided students in their development of various math and computer science principals
  • Students requested appointments on a short-term basis and could also visit during assigned hours
  • Technical support for Microsoft Excel, R, and Java was also given
  • Students found themselves able to properly explain concepts and not need to request further support

Team Connection, NC

Summer 2023

Web Development Intern

  • Maintained and developed the storefront and website for the Team Connection brand
  • Was the sole developer and worked closely with the president and marketing director
  • Utilized WordPress, JavaScript, and SQL to develop webpages and maintain storefront data
  • Internship concluded with a presentation with recommended next steps for the brand's webpage

Connect With Me