Using Structured Query Language (SQL) to Stratify a Large-Scale De-Identified Patient Database: Addressing the Top-Down Versus Step-Up Treatment Debate in Inflammatory Bowel Disease (IBD)

Date of Award


Document Type

Undergraduate Capstone Project

Degree Name

Master of Professional Studies

Organizational Unit

University College, Healthcare Management


Healthcare Leadership

First Advisor

Steve Sorensen


Big data, Comparative effectiveness research, Inflammatory bowel disease, Step-up, Top-down, Structured query language


This Capstone Project addresses the debate between 'step-up' and 'top-down' treatment approaches common in Inflammatory Bowel Disease (IBD). In light of the ineffectiveness of conventional clinical research, the Project establishes another method of research using modern techniques in data science to build patient identification algorithms and separate the patient population into cohorts based on similar prognostic factors. Researchers can study the comparative effectiveness of alternative treatment approaches by querying de-identified disease-specific databases using a string of structured query language (SQL) commands. Patient cohorts are constructed according to initial treatment pathway, and a comparison is drawn between prescribing frequency of 'step-up' medications (5-ASAs and corticosteroids) and frequency of 'top-down' medications (immunomodulators and biologics) from a de-identified database of 26,000 patients.

Publication Statement

Copyright is held by the author. Permanently suppressed.

This document is currently not available here.