I mapped the patent-vs-literature coverage for 24,746 plant compounds: the gap between commercial activity and published research is wider than expected [OC] Comparison

March 31, 2026
10 views
AC
By Alex Cartwright
I mapped the patent-vs-literature coverage for 24,746 plant compounds: the gap between commercial activity and published research is wider than expected [OC] Comparison
Click to enlarge

Data Analysis

What This Visualization Shows

This data visualization displays "I mapped the patent-vs-literature coverage for 24,746 plant compounds: the gap between commercial activity and published research is wider than expected [OC]" and provides a clear visual representation of the underlying data patterns and trends. The visualization focuses on A few weeks ago I posted a version of this chart with incorrect terminology. I called the gap "FTO whitespace" when it actually shows the opposite (high patent density, not freedom to operate). Thanks to the community for catching that. Here is the corrected version with proper axis labels and terminology.

The dataset is built from the USDA Dr. Duke's phytochemical database, denormalized from 16 CSV files and enriched against five public APIs: PubMed (citation counts), [ClinicalTrials.gov](http://ClinicalTrials.gov) (study registrations), ChEMBL (bioactivity measurements), USPTO PatentsView (patent grants since 2020), and PubChem (molecular identifiers).

What the chart shows:

Each dot is one of 24,746 unique compounds. The x-axis is PubMed mentions (log scale, original values shown), the y-axis is USPTO patent grants since 2020 (same scale). Red dots are compounds in the "Patent-Literature Gap": six or more patents but fewer than 50 published studies.

30 compounds fall into this zone. They are being commercially patented but have almost no peer-reviewed pharmacological characterization. For anyone doing target selection or competitive landscape analysis in botanical drug discovery, this is the signal that matters.

Source: Ethno-API v2.3, 76,907 compound-plant records. Full methodology and a 400-record sample are on GitHub.

[github.com/wirthal1990-tech/USDA-Phytochemical-Database-JSON](http://github.com/wirthal1990-tech/USDA-Phytochemical-Database-JSON)

[ethno-api.com](http://ethno-api.com), which allows us to understand complex relationships and insights within the data through visual storytelling.

Deep Dive into the Topic

Scientific data visualization is fundamental to modern research methodology, enabling researchers to explore complex datasets, identify patterns, and communicate findings effectively to both scientific and public audiences. This approach transforms abstract numerical data into comprehensible visual formats that reveal insights not apparent in raw data.

Research data often involves multiple variables, time series, and complex relationships that require sophisticated visualization techniques. Scientific visualizations can reveal correlations, outliers, and trends that guide hypothesis formation and experimental design. Medical research particularly benefits from data visualization for tracking treatment outcomes, understanding disease patterns, and presenting clinical trial results.

The impact of scientific data visualization extends beyond academic research. Public health officials use epidemiological visualizations to track disease outbreaks and plan interventions. Environmental scientists employ data visualization to communicate climate change impacts and conservation needs. Medical professionals use patient data visualizations to improve diagnosis accuracy and treatment planning. This broad application makes scientific data visualization a critical tool for evidence-based decision-making in research and practice.

Data Analysis and Insights

The patterns revealed in this visualization demonstrate the importance of systematic data analysis in understanding complex phenomena. By examining different data segments, time periods, and categorical breakdowns, we can identify trends that inform strategic planning and decision-making processes.

Statistical analysis of this data reveals variations across different dimensions that provide insights into underlying drivers and relationships. These patterns help identify areas of opportunity, potential risks, and key performance indicators that can guide future actions and resource allocation.

The analytical approach used in this visualization enables comparison across different categories, time periods, or geographic regions, revealing insights that support evidence-based decision-making. This type of analysis is essential for organizations seeking to optimize performance and understand complex market dynamics.

Significance and Applications

This data visualization has important implications for understanding trends and patterns that affect decision-making across multiple sectors. The insights derived from this analysis can inform policy development, business strategy, resource allocation, and operational improvements.

For analysts, researchers, and decision-makers, this type of data visualization provides essential insights for strategic planning and performance optimization. Whether addressing operational challenges, market analysis, or policy development, understanding data patterns helps create more effective strategies and solutions.

The broader significance lies in how this information contributes to our understanding of complex systems and relationships. This knowledge helps predict future trends, identify potential challenges, and develop more informed approaches to problem-solving and opportunity identification.

Comments

Loading comments...

Leave a Comment

0/500 characters

About the Author

Alex Cartwright

Alex Cartwright

Senior Data Visualization Expert

Alex Cartwright is a renowned data visualization specialist and infographic designer with over 15 years of experience in...

Infographic DesignData AnalysisVisual Communication
View Profile

Visualization Details

Published3/31/2026
CategoryData Analysis
TypeVisualization
Views10