Badapple

(Bioassay-Data Associative Promiscuity Pattern Learning Engine)

About

This web app analyzes each input query molecule by searching a database of bioactivity data experimentally produced by NIH screening centers (data from PubChem). For each scaffold in the query molecule, the Badapple promiscuity score (pScore) is computed according to the following scaffold scoring formula:

score = ((sActive) / (sTested + median(sTested)) *
        (aActive) / (aTested + median(aTested)) *
        (wActive) / (wTested + median(wTested)) *
        1e5)
        

where:

sTested = # tested substances containing this scaffold

sActive = # active substances containing this scaffold

aTested = # assays with tested compounds containing this scaffold

aActive = # assays with active compounds containing this scaffold

wTested = # wells (samples) containing this scaffold

wActive = # active wells (samples) containing this scaffold

The inDrug flag indicates whether the corresponding scaffold exists in any approved drug*. A high score for an inDrug scaffold thus represents conflicting evidence, but existence of an approved drug is normally much stronger evidence.

* This analysis applies only to drugs with 5 ring systems or less. Drugs with many ring systems (e.g., venetoclax) are excluded.

Note that benzene is not considered as a scaffold due to it being such a common substructure.

Interpreting Scores

The table below provides an overview of how to interpret different pScores provided by Badapple.

pScore rangeadvisory
~ unknown; no data
0-99 low pScore; no indication
100-299 moderate pScore; weak indication of promiscuity
≥300 high pScore; strong indication of promiscuity

Programmatic access

For programmatic access to Badapple please see the API Docs Page

For large use cases, or if one wishes to explore the Badapple data in more detail, please see the local installation guide.

Badapple Paper

For more information about Badapple please see the following paper:

Badapple: promiscuity patterns from noisy evidence

Although the original Badapple project was developed several years ago, we have re-created the original Badapple database and website using updated free and open-source software. See next section for more details.

Databases: badapple vs badapple_classic vs badapple2

Under the Badapple project there have been three unique databases developed so far. They can be summarized as follows: For more information on the differences between these databases please see the next two sections.

What's different between badapple and badapple_classic?

The only differences badapple_classic and badapple are:

We have performed several analyses to confirm that badapple_classic and badapple align closely. Details on the analyses we've done can be found here.

What's different between badapple_classic and badapple2?

The most significant differences between badapple_classic and badapple2 are:

In addition to the items above, badapple2 stores information not present in badapple_classic, including:

If you select badapple2 in the webapp, details on the associated biological targets and approved drugs are available for each scaffold.

Help

If you have any issues or questions please raise them here.

Code Availability

All of our code, including the DBs, API, and UI are publicly available. Please see the links below.

Authors and Acknowledgement

This project was developed within the UNM School of Medicine, Dept. of Internal Medicine, Translational Informatics Division.

We would like to thank Cristian Bologa for his guidance, as well as Oleg Ursu, Tudor Oprea, Christopher A. Lipinski, and Larry Sklar for their previous efforts on this project.

As well, we would like to acknowledge the developers of the many open-source software packages that have been vital to the success of this project. We'd like to especially acknowledge the following projects: