The application of complex data sources to pulmonary vascular diseases is an emerging and promising area of investigation. The use of -omics platforms, in silico modeling of gene networks, and linkage of large human cohorts with DNA biobanks are beginning to bear biologic insight into pulmonary hypertension. These approaches to high-throughput molecular phenotyping offer the possibility of discovering new therapeutic targets and identifying variability in response to therapy that can be leveraged to improve clinical care. Optimizing the methods for analyzing complex data sources and accruing large, well-phenotyped human cohorts linked to biologic data remain significant challenges. Here, we discuss two specific types of complex data sources—gene regulatory networks and DNA-linked electronic medical record cohorts—that illustrate the promise, challenges, and current limitations of these approaches to understanding and managing pulmonary vascular disease.