Senior Data Engineer

Anthony Sottile

Building enterprise data platforms, AI-powered analytics, and interactive experiences that turn complex data into decisive action.

About Me

Anthony Sottile

I'm a data engineer who loves building things end-to-end — from designing lakehouse architectures that wrangle millions of rows, to crafting the React dashboards executives actually enjoy using. My sweet spot is turning messy, siloed data into governed platforms that drive real business decisions.

Before enterprise data, I built a baseball analytics SaaS and led analytics for USC Baseball. I still bring that same competitive, scrappy energy to every pipeline and dashboard I ship.

0+

Data Sources Integrated

0+

Executive Dashboards

$0M+

Opportunities Surfaced

0×

Pipeline Performance Gain

University of Southern California

B.S. Business Administration / Finance · Minor in Applied Analytics · GPA: 3.53

Dean's List · Cum Laude · USC Men's Basketball · USC Baseball

Technical Skills

Click a skill to see where it was used

Languages

Data Engineering

Cloud & DevOps

AI & ML

Visualization

PythonSQLTypeScriptPySparkMicrosoft FabricDatabricksDelta LakeReactNode.jsDockerKubernetesPower BIOpenAILangChainGraphQLAzure DevOpsPostgreSQLKafkaPythonSQLTypeScriptPySparkMicrosoft FabricDatabricksDelta LakeReactNode.jsDockerKubernetesPower BIOpenAILangChainGraphQLAzure DevOpsPostgreSQLKafka

Experience

TIDI Products – The Jordan Company

Chicago, IL / Remote
Senior Data EngineerAugust 2025 – Present
  • Led enterprise BI strategy, architecting Microsoft Fabric as the company's analytics platform with a medallion data architecture integrating 15+ sources into a governed foundation supporting 30+ executive dashboards.
  • Architected the company's first custom BI portal (Node.js, React, GraphQL) and MCP server with OpenAI integration, enabling executives to access dashboards, KPIs, and natural-language lakehouse queries through a unified interface.
Data Engineer (DP 600) / Data ScientistApril 2024 – July 2025
  • Built the company's first enterprise lakehouse in Microsoft Fabric from scratch, establishing standardized ingestion and transformation patterns across 15+ legacy systems.
  • Developed PySpark/Spark SQL ELT pipelines that reduced legacy data refresh from 6+ hours to under 45 minutes.

KPMG LLP

Chicago, IL
Deal Advisory & Strategy Analytics AssociateJune 2021 – August 2021 / July 2022 – April 2024
  • Led development of a digital command center for a $20B utility company; integrated SAP business warehouses into PowerBI delivering C-suite dashboards.
  • Built Core Schedules, an automated M&A due diligence product using PowerBI REST APIs and Databricks—cutting report generation from hours to minutes.

CloudHack.AI

Chicago, IL
Founder & Solutions ArchitectJuly 2022 – April 2024
  • Founded a baseball analytics SaaS for 2 D1 college teams; delivered AI-powered scouting reports via Azure, Databricks, and PowerBI that contributed to a 34-23-1 record.
  • Architected end-to-end data infrastructure: automated PySpark ETL from FTP/NCAA sources to Azure Blob Storage, integrated OpenAI API for personalized coaching insights.

USC Baseball

Los Angeles, CA
Director of AnalyticsJanuary 2019 – June 2022
  • Led student analytics team delivering PowerBI scouting reports from Trackman data for 55 games across 4 months.
  • Developed ad-hoc insights for coaches while supporting on-field operations including batting practice and pitcher sessions.

Data Pipeline Architecture

Interactive medallion lakehouse architecture built at TIDI Products

Baseball Analytics Dashboard

AI-powered scouting reports built for D1 college teams at CloudHack.AI

Pitch Velocity Distribution

Batting Average Trend

Player Skill Profile

Strike Zone Heatmap

Strike Zone
12
28
15
22
45
35
18
38
20
Low High

Star Schema Explorer

Conformed dimensional models for financial and commercial analytics

Ask My AI Assistant

Powered by OpenAI — ask anything about my experience, skills, or projects

ai-assistant — gpt-4o-mini

Try one of these to get started:

Let's Connect

I'm always open to discussing data engineering, analytics, or new opportunities.

Download Resume