Resume

Work

  • 2022.08 - 2024.08
    Data Engineer
    IBM - Chief Information Office (CIO)
    Led data engineering initiatives as Product Owner for the GHD business unit, optimizing and scaling data pipelines to drive predictive analytics and data mining capabilities.
    • Migrated IBM's transactional data warehouse to IBM Cloud using Apache Spark and Scala
    • Reduced costs by 81.3% through legacy system migration and decommissioning
    • Optimized SQL scripts from 600+ to 22 ETLs through new modeling and architecture
    • Improved processing speed by 63.6% with data lakes and data marts implementation
    • Reduced downtime by 37.5% through Jenkins CI/CD and Apache Airflow automation
    • Participated in CDC PoC using Apache Kafka and Debezium for streaming data pipelines
  • 2022.01 - 2022.07
    Software Developer Intern
    IBM - Chief Information Office (CIO)
    Assisted with 6+ cross-functional teams to streamline data workflows, integrating data pipelines that supported efficient ML algorithm development for optimal bid pricing.
    • Validated and refined 40+ SQL scripts into 6 Fact models
    • Documented 17 key business logic for bid pricing modules
    • Automated Datamart schema generation with Python script

Education

  • 2024.09 - 2026.05

    New York City, NY

    Master of Science
    New York University, Courant Institute of Mathematical Sciences
    Computer Science, AI Concentration
    GPA: 3.8/4.0
  • 2018.08 - 2022.06

    Coimbatore, IN

    Bachelor of Technology
    Amrita School of Engineering, Amrita Vishwa Vidyapeetham
    Computer Science
    GPA: 9.37/10.0

Volunteer

  • 2025.09 - 2025.12

    New York City, NY

    Recitation Leader
    New York University
    Recitation leader for Calculus-I course at CIMS Math department, held weekly teaching sessions for 60+ undergrad students.
    • CALC I, MATH-UA.121.019 & MATH-UA.121.020

Projects

  • 2025.09 - 2025.12
    From Baseline to DeepSeek - Single-GPU MoE Training Efficiency
    Conducted a systems-level study of MoE training efficiency on resource-constrained hardware using the FineWeb-10B dataset, comparing naive PyTorch, ScatterMoE, and MegaBlocks architectures.
    • Reproduced a DeepSeek-inspired MoE architecture with shared experts and top-k routing, achieving a validation loss of 3.93 (1.5% improvement over dense models)
    • Optimized training throughput using ScatterMoE fused kernels, reducing memory footprint by 18% and decreasing latency by 16% against a Naive MoE implementation
    • Benchmarked implementation variants including MegaBlocks and ScatterMoE to identify VRAM bottlenecks in single-GPU scaling
  • 2025.09 - 2025.12
    SmallGraphGCN - Accelerating GNN Training on Batched Small Graphs
    Engineered a specialized GCN framework optimized for training on large batches of small sized graphs by eliminating excessive kernel launches by kernel fusion and edge-centric parallelism.
    • Wrote custom CUDA kernels for computation aggregation, resulting in a 68% reduction in kernel launches and 4.9x lower memory transfer overhead compared to PyG baselines
    • Maintained high model accuracy on molecular datasets while achieving substantial gains in training throughput
  • 2025.04 - 2025.05
    Gaze-Guided Reinforcement Learning for Visual Search
    Implemented a novel RL framework integrating human gaze patterns with PPO algorithm in AI2-THOR using three integration methods and custom CNN architectures.
    • Achieved 26% better performance than random baselines
    • Improved sample efficiency in 3D visual search and object detection tasks
  • 2024.10 - 2024.12
    MTA Transit Time Prediction
    Designed robust regression models to predict NYC bus travel times using MTA BusTime and TomTom Traffic data.
    • Achieved MAE of 43.73 seconds using XGBoost with grid-based modeling
    • Evaluated LSTM architectures to optimize short-sequence temporal data predictions
  • 2021.03 - 2021.05
    Health Insurance Cross-Sell Prediction Case Study
    Engineered a predictive machine learning model to forecast customer propensity for purchasing additional insurance products.

Skills

Programming
Python
C/C++
Scala
SQL
DB2
Frameworks
CUDA
Apache Spark
Apache Airflow
TensorFlow
Keras
PyTorch
OpenAI Gym
FastAPI
dbt (Data Build Tool)
Cloud
AWS (EC2, S3)
GCP
Developer Tools
Git
Docker
Kubernetes
Bazel
Jenkins
LogDNA
VS Code
Jupyter Notebook
Anaconda
Libraries
Pandas
NumPy
Matplotlib
Scikit-learn
XGBoost
SpaCy