Shubham Goyal


Senior Data Scientist with 5+ years of experience building production ML systems, risk models, and LLM-powered agentic workflows in fintech and enterprise settings.

Currently:
Senior Data Scientist @ BILL, San Francisco Bay Area, CA

Previously:
Senior Data Scientist @ American Express · Data Science Intern @ Verato

Education:
MS Information Systems @ Northeastern University, Boston, MA
BSc/MSc Geophysics @ IIT Kharagpur, India

View My LinkedIn Profile

View My GitHub Profile

Portfolio

About Me

I’m a Senior Data Scientist based in the Bay Area, building production ML systems at the intersection of risk intelligence and applied AI. I’m a hacker/developer at heart and love getting my hands dirty with interesting problems.

At BILL, I work on credit risk models, agentic LLM workflows, and semantic search systems that drive real business outcomes, from $5M+ in revenue uplift to end-to-end automation of complex underwriting processes.

Before BILL, I spent 3 years at American Express working across credit modeling and risk strategy for the Global Collections and US Commercial teams, where I learned to build models that matter at scale.

I’m currently exploring new opportunities in Data Science (Product, Finance, Strategic Intelligence), MLE, and Applied AI roles in the Bay Area. Open to connecting and collaborating.

Download Resume

Latest Work

CineSphere - Movie Recommending Chatbot Using Knowledge Graphs

View on GitHub

LES Weather Forecasting - Multi-Modal Model Approach

View on GitHub


TravelBud - AI Tool for Travel Planning

View on GitHub


A/B Test - Understanding causal relationships in experiments

Medium


DIME (Database on Ideology, Money in Politics, and Elections) Network Analysis

View on GitHub


Founder - GrocerEase: Smart Digitization of Groceries

Google Drive


VaniVerse

View on GitHub


Hindi Seq2Seq Model - Generating Hindi poems with RNN

View on GitHub


Runner’s High: Reflecting on Runs in ’22

In 2022, I took up running as a hobby and set a target of running 500 km over the calendar year. To reflect on those runs, I used Python to visualize the geospatial data I logged on Strava over the year and share some thoughts and insights!

View on GitHub · Medium


Feel free to email - Gmail