"NolinNPC: The Natural Products Collection Database" by Parisa Mershon

Off-campus University of Richmond users: To download campus access theses, please use the following link to log in to our proxy server with your university username and password.

Date of Award

5-2025

Document Type

Restricted Thesis: Campus only access

Degree Name

Bachelor of Science

Department

Chemistry

First Advisor

Dr. Kristine Nolin

Abstract

Natural products (NPs) are the cornerstone of traditional and modern pharmaceuticals, yet the process of discovering new bioactive compounds remains largely speculative. To address this, we developed the Nolin Natural Product Collection (NolinNPC), a unified database cataloging plant-derived NPs, their structural characteristics, biological activities, source organisms, extraction methods, and associated diseases or biochemical pathways. The initial stages of the NPC relied on manually curated Excel datasets, but the project has since transitioned to a modular relational database framework supported by R and Python. This structure allows for more rigorous data visualization, analysis, and expansion. Core external resources, including CrossRef, PubChem, and the Global Biodiversity Information Facility (GBIF), are accessed through API integration, streamlining metadata validation and enabling dynamic updates. Compounds are systematically classified according to chemical structure, supporting finer differentiation among novel NPs, while occurrence data linked to source organisms enhances the strategic selection of biodiversity targets. Maintaining many-to-many relational integrity through junction tables ensures flexibility in representing the complex interconnections between compounds, organisms, and biological effects. The NPC is currently undergoing migration to a PostgreSQL environment, with future plans to deploy a web-accessible platform using Django. Future development will also focus on implementing structured data entry pipelines using ChatGPT outputs, enabling standardized and semi-automated database expansion while minimizing human error. Additional goals include local storage of molecular files for compounds not yet cataloged in PubChem and the development of integrated 2D/3D structural visualization tools. By consolidating diverse streams of natural product data into a coherent, expandable platform, the NolinNPC seeks to transform NP discovery into a strategic, data-driven process, significantly accelerating the identification of candidates for therapeutic development.

Available for download on Monday, May 15, 2028

Share

COinS