Portfolio

Rohan Isaac

Data Scientist · Atlanta, GA

Summary

I design and deploy data science solutions that transform complex data into actionable insights. Having worked across healthcare, manufacturing, CPG, and Hi-Tech, I focus on building robust, end-to-end pipelines from data curation to model deployment that support better decisions in real-world settings.

Work Experience

Emory University

Atlanta, GA

  • Medical Imaging AI
  • Data Analysis
  • Foundation Models
  • LLMs
  • Model Development
  • EHR
  • Data Pipeline

Data Scientist

Jul 2024 — Present

  • Built a data pipeline for a large-scale multi-modality breast imaging dataset of (2D and DBT mammograms, Ultrasound, MRI) for more than 1 million exams. Integrated multiple data sources such as patient workflow EHR data, risk scores, state registry data, clinical (encounters, labs, meds, notes, ICD, CPT), radiology and pathology reports extracted from EHRs.
  • Evaluated a commercially deployed 3D DBT breast-AI screening model, leveraging internal datasets to perform stratified analyses across outcomes, demographics, imaging characteristics and statistical evaluations to determine model optimization opportunities.
  • Leverage medical imaging foundation models (MammoCLIP, MedImageInsights, BiomedParse) to extract screening mammography embeddings and applied linear probing for downstream breast imaging and cancer outcome analysis.
  • Developed a module for structured data extraction from pathology reports by deploying open-source reasoning LLMs with a pydantic-based validation schema to extract over 30 labels spanning cancer outcomes, treatment details, and receptor status, achieving over 90% sensitivity. Applied prompt engineering techniques to improve parsing robustness and consistency.
  • Conducted multiple clinical analytics research for internal stakeholders specifically in breast imaging space, leveraging multiple data sources to generate data driven insights.

ClearObject

Remote

  • Computer vision
  • Edge deployment
  • PyTorch/TensorFlow
  • YOLO/U-net

Data Scientist Intern

Jan 2024 — Apr 2024

  • Integrated computer vision systems for manufacturing quality control, covering data collection, labeling, and model fine-tuning with PyTorch/TensorFlow (segmentation with U-net, object detection with YOLO). Built end-to-end workflows with Google Vertex AI.
  • Deployed models to a proprietary edge device for real-time inference using NVIDIA TensorRT and DeepStream to improve frame-rate efficiency.

Google Inc. – LibreHealth Radiology (Google Summer of Code)

Remote

  • Medical imaging AI
  • DICOM automation
  • Grad-CAM
  • Open source

Google Summer of Code Contributor

May 2023 — Aug 2023

  • Integrated computer vision models into LibreHealth Radiology AI for multilabel classification and object localization using Grad-CAM for chest and mammogram modalities.
  • Automated modality-specific model selection by reading DICOM properties (e.g., CheXpert for chest X-rays), reducing manual configuration.

Education

Master of Science, Applied Data Science

Indiana University-Indianapolis

Aug 2022 — May 2024

Post Graduate Program Data Science

Praxis Business School

Jul 2018 — Mar 2019

Bachelor of Engineering, Mechanical

Anna University

Oct 2012 — Jun 2016

Skills

Programming & AI

  • Python
  • R
  • SQL
  • PyTorch
  • TensorFlow

Cloud Tools

  • Google Cloud Platform (Cloud Run, Cloud Functions, BigQuery, VertexAI, Firebase)
  • Azure (Databricks, Synapse, DataFactory)

Domain Expertise

  • Medical imaging AI
  • Healthcare Analytics
  • Supply Chain Analytics
Publications
Filter

Subgroup Performance of a Commercial Digital Breast Tomosynthesis Model for Breast Cancer Detection

First Author

Nature Communications · 2025

B Brown-Mulry, RS Isaac, SH Lee, A Seth, KJ Min, T Dapamede, F Li, ...

Impact of label noise from large language models generated annotations on evaluation of diagnostic model performance

Radiology: Artificial Intelligence · 2025

M Chavoshi, H Trivedi, A Mansuri, J Newsome, CR Sanyika, ...

Evaluating Vision Language Models (VLMs) for Radiology: A Comprehensive Analysis

Preprint · 2025

F Li, H Trivedi, B Khosravi, T Dapamede, M Chavoshi, A Dere, RS Isaac, ...

Novel AI-Based Quantification of Breast Arterial Calcification to Predict Cardiovascular Risk

Preprint · 2025

T Dapamede, A Urooj, V Joshi, G Gershon, F Li, M Chavoshi, ...

Real-world performance evaluation of a commercial deep learning model for intracranial hemorrhage detection

npj Digital Medicine · 2025

M Chavoshi, A Mansuri, W Bala, B Brown-Mulry, T Dapamede, R Isaac, ...

Feature Quality and Adaptability of Medical Foundation Models: A Comparative Evaluation for Radiographic Classification and Segmentation

Preprint · 2025

F Li, T Dapamede, M Chavoshi, YS Jeon, B Khosravi, A Dere, ...

Measuring impact of radiologist-AI collaboration: Efficiency, accuracy, and clinical impact

First Author

ISBI · 2024

S Purkayastha, R Isaac, A Veldandi, R Saxena, P Singh, P Vaswani, ...

A general-purpose AI assistant embedded in an open-source radiology information system

First Author

AIME · 2023

S Purkayastha, R Isaac, S Anthony, S Shukla, EA Krupinski, JA Danish, ...

Abstracts
Filter

Comparing General Vision Transformer Embeddings versus Task-Specific Deep Learning Models for Knee Osteoarthritis Grading

SIIM · 2025

Leveraging Pre-trained Medical Image Embeddings for Knee Pain Score Prediction: A Comparative Analysis of Vision Transformer and CNN Approaches

SIIM · 2025

Systematic Detection and Correction of DICOM Label Discrepancies in Large-Scale Chest X-ray Datasets

SIIM · 2025

On the Feasibility of Chest X-ray Reconstruction from Foundation Model Vector Embeddings

SIIM · 2025

Report Interpretation Times for Radiology Residents and Attendings Before and After Deployment of a Triaging Model for Pulmonary Embolism and Intracranial Hemorrhage Detection in a Large Academic Center

SIIM · 2025

LLM Showdown: Benchmarking Performance, Cost, and Speed of Granular Ground Truth Extraction in Radiology Reports for Post-Deployment Monitoring of AI Models

SIIM · 2025

Chest Radiography and Mammography-Derived Imaging Biomarkers Enhance Cardiovascular Risk Prediction Beyond ASCVD Risk Score

SIIM · 2025

EdgeDeID: Advancing De-identification with Small LMs (SLMs) and Synthetic Data on Edge Devices

SIIM · 2025

Emory Breast Imaging Dataset (EMBED) v2 – A racially diverse, multi-modal dataset of 1.2M breast imaging exams and associated histopathology

First Author

SIIM · 2025

Score Change as a Marker of Elevated Risk in AI-Based Breast Cancer Screening

First Author

AIM-AHEAD · 2025

Association Between False Positive Mammograms and Return to Screening in a Racially Diverse Cohort

First Author

RSNA · 2025

Interests

  • AI and Gen AI
  • Health Informatics
  • Tech Trends
  • Travel
  • Cooking