I don't just move data.
I engineer intelligence.
Data Engineer & AI/ML Specialist with 5+ years building enterprise-scale pipelines, real-time test validation systems, and AI solutions that have saved $1.6M+ annually.
// ABOUT
The Engineer Behind the Systems
about_tanish.log
I started my journey at UC Riverside, where I earned both my BS and MS in Computer Engineering. What began as curiosity about how systems process information evolved into a passion for building intelligent data architectures at scale.
At American Express, I spent nearly 5 years on the Arena Testing Team, where I didn't just build pipelines — I engineered solutions that fundamentally changed how the team operated. My AI-based data scrubbing system eliminated $1.6M in annual vendor costs. My ETL optimizations cut processing time by 60%. My real-time streaming pipelines validated 130K+ daily test transactions at sub-second speed.
Now at Jorie AI, I'm applying that same systems-thinking to healthcare data, building RCM dashboards that turn complex billing data into actionable intelligence. My trajectory is clear: from moving data to engineering intelligence.
5+
Enterprise Experience
MS
UC Riverside
3
Domains
// EXPERIENCE
Systems I've Built & Scaled
Jorie AI
Associate Data Engineer — AI
Feb 2026 — Present • Oak Brook, IL
Designing interactive RCM dashboards tracking AR aging, denial rates, and reimbursement trends
Building secure SQL Server pipelines for healthcare billing analytics
Implementing automated data validation for billing and claims accuracy
Translating complex RCM requirements into scalable reporting solutions
American Express
Software Developer / Data Engineer
Apr 2021 — Feb 2026 • Phoenix, AZ
$1.6M saved annually — AI-based data scrubbing for banking personas
60% faster — ETL optimization with parallel processing using PySpark & AWS Glue
99.7% data accuracy — Automated 15+ manual workflows with real-time validation
130K+ daily test transactions — Real-time streaming test transaction validation via AWS Lambda & Kinesis
45% improvement AWS → GCP — Query performance boost through multi-cloud migration
67% fewer incidents — Comprehensive CI/CD implementation for data pipelines
CE-CERT, UC Riverside
Software Engineer Intern / Data Analyst
Jun 2020 — Jan 2021 • Riverside, CA
Built automated data collection for Eco-Vissim driving simulation
500GB+ — Vehicle telemetry data processing for fuel efficiency research
Intelligence Systems Portfolio
Real-world systems engineered to detect, predict, and optimize at enterprise scale.
AI Data Scrubbing Engine
Developed an AI-powered data scrubbing solution for banking and credit card personas, completely eliminating dependency on expensive external vendor contracts for test data generation.
Real-time Streaming Pipeline
Designed and deployed real-time streaming data pipelines using AWS Lambda and Kinesis for test environment validation, enabling sub-second processing of 130K+ daily test transactions at scale.
Enterprise Data Lake
Architected enterprise-scale test data lake on AWS, consolidating data from Oracle, Teradata, mainframes, and RDBMS sources into HDFS, supporting massive QA operations.
ETL Performance Optimizer
Optimized ETL pipeline performance through parallel processing and incremental loading strategies, cutting processing time from 8 hours to 3.2 hours.
Multi-Cloud Migration
Led migration from AWS to GCP for test data infrastructure, achieving significant cost reduction while improving query performance through BigQuery optimization.
// SKILLS
Technology Arsenal
AI / ML & Data Science
Cloud & Big Data
Engineering & Tools
Tools & Platforms
// INITIATE CONNECTION
Let's Build Something Intelligent Together
Whether you're looking for a Data Engineer who thinks in systems, an AI/ML specialist who delivers measurable impact, or a collaborator who turns complex data into strategic advantage — let's connect.
> tanish.arora
Engineered with precision. Powered by data.
© 2026 Tanish Arora