Abstract

Because undesirable pharmacokinetics and toxicity of candidate compounds are the main reasons for the failure of drug development, it has been widely recognized that absorption, distribution, metabolism, excretion and toxicity (ADMET) should be evaluated as early as possible. In silico ADMET evaluation models have been developed as an additional tool to assist medicinal chemists in the design and optimization of leads. Here, we announced the release of ADMETlab 2.0, a completely redesigned version of the widely used AMDETlab web server for the predictions of pharmacokinetics and toxicity properties of chemicals, of which the supported ADMET-related endpoints are approximately twice the number of the endpoints in the previous version, including 17 physicochemical properties, 13 medicinal chemistry properties, 23 ADME properties, 27 toxicity endpoints and 8 toxicophore rules (751 substructures). A multi-task graph attention framework was employed to develop the robust and accurate models in ADMETlab 2.0. The batch computation module was provided in response to numerous requests from users, and the representation of the results was further optimized. The ADMETlab 2.0 server is freely available, without registration, at https://admetmesh.scbdd.com/.

ADMETlab 2.0 assists medicinal chemists in the design and optimization of lead compounds.
Graphical Abstract

ADMETlab 2.0 assists medicinal chemists in the design and optimization of lead compounds.

INTRODUCTION

A successful drug should achieve a finely tuned combination of biochemical behavior, pharmacokinetics and safety. In addition to high potency and selectivity, desirable absorption, distribution, metabolism, excretion and toxicity (ADMET) profile is equally critical to the success of a drug candidate (1–3). More specifically, an ideal drug should be taken appropriately into the body, distributed reasonably to various tissues and organs, metabolized in a way that does not immediately remove its activity, eliminated in a suitable manner, and confirmed non-toxicity (4). These issues seem distinct but closely interrelated, covering the whole process from administration to elimination.

Traditionally, the measurement of the ADMET properties for a drug candidate was usually scheduled after its potency towards a specific target was determined (5). Unfortunately, undesirable adverse effects were often detected in this stage, and therefore a new round of molecular design and syntheses need to be conducted or even the project has to be terminated thoroughly. It was estimated that the contribution of ADMET deficiencies to attrition in drug development reached up to 50% in the 1990s (6,7), which reminded us of the importance of ADMET evaluation in the process from chemicals to drugs. Currently, the potency and ADMET profiles of molecules are usually tested at the same stage, and thus undesirable compounds can be excluded at an earlier stage of drug discovery and development (8,9). However, the testing capacity of in vitro and in vivo ADMET assays seems stretched thin when confronted with the sheer volume of biological screening data, although extensive efforts have been dedicated to ramping up the testing capacity (10). Additionally, the time and cost for experimental assays is another burden, especially for drug discovery pipelines with limited resources.

With the continuous accumulation of experimental ADMET data, a high number of in silico prediction models for many endpoints have been developed to assist ADMET evaluation efficiently. More concretely, they could assist medicinal chemists from two aspects: 1) excluding undesirable compounds at the drug design stage; 2) acquiring timely feedback of ADMET information for lead optimization. The past several years have witnessed many in silico studies concerning ADMET parameters. Meanwhile, a variety of web tools have been developed and used in drug discovery applications, such as ADMETlab (11), FAF-Drugs4 (12), admetSAR (13), SwissADME (14), ProTox-II (15), pkCSM (16), etc.

Thereinto, the webserver ADMETlab was released by our team in 2018. Armed with high-quality experimental data and tailored quantitative structure-property relationship (QSPR) models, it allows users to perform multiple drug-likeness analyses and the predictions of most ADMET-related properties. During the past three years, it has been widely used for ADMET assessment, serving >50 000 users around the world, with millions of entries computed. However, some shortcomings of ADMETlab still need to be improved, such as redundant modules, incomplete endpoints, unclear representation of results, and so on. As part of our continual efforts to provide the community with a comprehensive, accurate and efficient online platform for the evaluation of ADMET-related parameters for chemicals, we updated the server to version 2.0, overcoming all known shortcomings of the old version while maintaining its battle-tested advantages. ADMETlab 2.0 currently supports two computational modes: single-molecule evaluation and batch screening, allowing for the calculation of 88 ADMET-related parameters, including 17 physicochemical properties, 13 medicinal chemistry properties, 23 ADME properties, 27 toxicity endpoints and 8 toxicophore rules (751 substructures). It is backed up by the robust QSPR models trained by the multi-task graph attention (MGA) framework based on high-quality experimental ADMET data. To sum up, the upgraded version is believed to have greater capacity to assist medicinal chemists in accelerating the drug research and development (R&D) process. ADMETlab 2.0 is implemented as a publicly available web server with an intuitive interface and can be freely accessed at https://admetmesh.scbdd.com/.

PROGRAM DESCRIPTION AND METHODS

Software implementation

ADMETlab 2.0 was built using the Python web framework of Django and deployed on an elastic compute service from Aliyun running an Ubuntu Linux system. The web access was enabled via the Nginx web server and the interactions between Django and proxy server were supported by uwsgi. This application was developed based on the Model-View-Template (MVT) framework. The model layer maps the business objects to the database objects. The view layer is a business logic layer, responsible for performing the access to the deep learning models, delivering the data to be shown on the template layer, and handling the upload and download of files. The template layer provides the visualization of results, page rendering, integration of documentation, etc. The uploaded and downloaded files, pre-trained models and model predictions were stored in the server. The prediction models were built with the Python programming language. The deep learning packages, PyTorch and DGL, were used in model implementation. Additionally, the RDKit package was employed to provide various cheminformatics support. The server has been successfully tested on the recent version of Mozilla Firefox, Google Chrome and Apple Safari.

Input and output

ADMETlab 2.0 provides a convenient and easy-to-use interface for users. Two services, Evaluation and Screening, are designed to support single-molecule and batch evaluation, whose input parameters and output information will be elaborated respectively.

In the Evaluation pattern, two molecular submission approaches are provided by pasting the SMILES string or drawing the chemical structure with the help of JMSE molecule editor (17). Once a user submits the job, the webserver will automatically standardize the input SMILES strings and compute all the endpoints. The prediction results are mainly displayed in the tabular format in the browser, with the 2D molecular structure and a radar plot summarizing the physicochemical quality of the compound. For those endpoints predicted by the regression models, such as Caco-2 permeability, plasma protein binding, etc., concrete predictive values are provided. For the endpoints predicted by the classification models, such as Pgp-inhibitor, hERG Blocker, etc., the prediction probability values are transformed into six symbols: 0-0.1(−−−), 0.1-0.3(−−), 0.3-0.5(−), 0.5-0.7(+), 0.7-0.9(++), and 0.9-1.0(+++). Usually, the token ‘+++’ or ‘++’ represents the molecule is more likely to be toxic or defective, while ‘−−−’ or ‘−−’ represents nontoxic or appropriate. Here, we do not recommend trusting predictions symbolled by ‘+’ or ‘−' (probably values in 0.3-0.7), and corresponding molecules require further assessment. The substructural rules available in the webserver, such as PAINS, SureChEMBL Rule, etc., were implemented using the SMARTS recognition capability of RDKit function. And the calculation of physicochemical and medicinal chemistry endpoints was based on the python library Scopy (18), following the parameters reported in corresponding original papers strictly. If the number of alerts is not zero, users can click the DETAIL button to check the undesirable substructures in the molecule. Finally, the full result file can be downloaded from the website in CSV or PDF format.

In the Screening pattern, two molecular submission approaches are provided by entering a list of SMILES strings or uploading a SDF or TXT formatted file. It should be noted that the file should only contain molecules without column headers and molecular indexes, otherwise the server may declare invalid input type. After all the predictions are completed, the results for each input molecule will be presented on a separate row, containing the assigned index, SMILES string, 2D molecular structure, and a View button. The prediction details can be accessed by clicking the View button of the corresponding molecule that links to the single-molecule evaluation page. These results can also be downloaded as a CSV-formatted file to the user's computer, where concrete probably values of classification endpoints are provided to enable the users to define their own thresholds to filter out deficient compounds with different levels of reliability. A typical ADMETlab 2.0 task for 1000 molecules requires ∼84 s, but it may also depend on the complexity of molecules.

Data collection

To obtain as much data as possible for model training, we conducted a comprehensive data retrieval by using different ADMET-related keywords. The data sources included open-access bioactivity databases, such as ChEMBL (19), PubChem (20) and OCHEM (21), peer-reviewed literature, and freely accessible software Toxicity Estimation Software Tools (TEST) developed by the U.S. Environmental Protection Agency (22). In data curation, we filtered off organometallic compounds, isomeric mixtures and chemical mixtures, neutralized salts, eliminated counterions and transformed SMILES strings into canonical form. Subsequently, the molecules with more than 128 atoms (unsuitable for GNN model training) and duplicated entries were removed, leaving a high-quality dataset collection of 0.25M entries spanning 53 ADMET-related endpoints. The scaffold analysis indicated a high level of the structural diversity of the training sets, and the models developed with such datasets may have good prediction coverage for structurally diverse compounds. More details of the modeling data and scaffold analysis are provided in Supplementary Tables S1 and S2.

Model validation

In this update, a total of 53 prediction models were implemented, including 40 classification models and 13 regression models. For each endpoint, the dataset was split into the training, validation and test sets by a ratio of 8:1:1, and stratified sampling was used when partitioning the data for classification to keep the ratio of the positive and negative instances in the three subsets balanced. The larger part was used for training, and the validation and test sets were used to optimize the hyperparameters and test the predictive capacity of each model, respectively. As for the evaluation parameters, we selected accuracy (ACC), specificity (SP), sensitivity (SE), the area under the ROC curve (AUC), and the Matthews correlation coefficient (MCC) (23) for the classification models, and R-square (R2), mean absolute error (MAE), and root mean squared error (RMSE) for the regression models. To obtain robust and accurate prediction models, the model training process was repeated ten times with random data splitting. The best performing models were incorporated into the online platform, and different performance measures for the classification and regression models were separately summarized in Tables 1 and 2, respectively. For the classification models, most of them achieved an AUC of 0.85 or higher, except for some cytochrome enzyme endpoints such as CYP1A2 substrate and CYP2C9 substrate. Meanwhile, these models yielded satisfactory prediction accuracy, with 27 models reaching the ACC values above 0.8. The average MCC value of these models is 0.53 and 50% of models obtain a MCC of more than 0.5. The specificity and sensitivity of most models were relatively balanced, except for some endpoints in the Toxicology in the 21st Century (Tox21) dataset, which was probably due to the imbalanced labeled data. For the regression models, most of them achieved an R2 above 0.72. Nevertheless, some endpoints, such as Clearance and LC50DM, had relatively limited historical data for model training, rendering the underperformance of the corresponding models. The complete performance summary for the training and test sets are listed in Supplementary Tables S3 and S4. Generally, these models can give relatively accurate predictions for the ADMET-related properties of chemicals.

Table 1.

Performance of the classification models incorporated into the ADMETlab 2.0 platform

CategoryModelAUCACCMCCSpecificitySensitivity
AbsorptionPgp-inhibitor0.9220.8670.7230.8440.882
Pgp-substrate0.8400.7680.5380.7050.828
HIA0.8660.9240.6870.8000.942
F20%0.8330.7500.4140.6800.773
F30%0.8480.8020.5800.7940.806
DistributionBBB Penetration0.9080.8620.7180.8240.891
MetabolismCYP1A2 inhibitor0.9280.8520.7040.8480.857
CYP1A2 substrate0.7370.6490.2980.6320.667
CYP2C19 inhibitor0.9130.8390.6790.8130.869
CYP2C19 substrate0.7580.6540.3000.6670.636
CYP2C9 inhibitor0.9190.8410.6710.8230.878
CYP2C9 substrate0.7250.7070.3860.7760.606
CYP2D6 inhibitor0.8920.8240.5580.8230.828
CYP2D6 substrate0.8470.7750.5530.7330.818
CYP3A4 inhibitor0.9210.8320.6590.8250.841
CYP3A4 substrate0.7760.7130.4370.8200.608
ExcretionT1/20.8010.7270.4780.6580.827
ToxicityhERG Blockers0.9430.8890.7780.8690.909
H-HT0.8140.7200.4610.8140.650
DILI0.9240.8940.7930.8260.958
AMES Toxicity0.9020.8070.6060.7320.865
Rat Oral Acute Toxicity Toxicity0.8530.7780.5490.7690.793
FDAMDD0.8040.7360.4710.7340.737
Skin Sensitization0.7070.7750.4620.5390.889
Carcinogencity0.7880.7310.4760.6230.843
Eye Corrosion0.9830.9570.9080.9650.944
Eye Irritation0.9820.9520.8760.9180.964
Respiratory Toxicity0.8280.7640.5140.7320.786
NR-AR0.8860.8900.3480.8960.731
NR-AR-LBD0.9150.9360.4720.9420.783
NR-AhR0.9430.8620.5730.8580.896
NR-Aromatase0.8520.8490.2640.8590.615
NR-ER0.7710.8150.3200.8450.567
NR-ER-LBD0.8500.9030.3640.9180.618
NR-PPAR-gamma0.8930.8960.3440.9010.750
SR-ARE0.8630.8270.4690.8500.701
SR-ATAD50.8740.9190.3610.9290.640
SR-HSE0.9070.8680.3930.8750.750
SR-MMP0.9270.8970.6600.9080.835
SR-p530.8810.8410.3650.8490.723
CategoryModelAUCACCMCCSpecificitySensitivity
AbsorptionPgp-inhibitor0.9220.8670.7230.8440.882
Pgp-substrate0.8400.7680.5380.7050.828
HIA0.8660.9240.6870.8000.942
F20%0.8330.7500.4140.6800.773
F30%0.8480.8020.5800.7940.806
DistributionBBB Penetration0.9080.8620.7180.8240.891
MetabolismCYP1A2 inhibitor0.9280.8520.7040.8480.857
CYP1A2 substrate0.7370.6490.2980.6320.667
CYP2C19 inhibitor0.9130.8390.6790.8130.869
CYP2C19 substrate0.7580.6540.3000.6670.636
CYP2C9 inhibitor0.9190.8410.6710.8230.878
CYP2C9 substrate0.7250.7070.3860.7760.606
CYP2D6 inhibitor0.8920.8240.5580.8230.828
CYP2D6 substrate0.8470.7750.5530.7330.818
CYP3A4 inhibitor0.9210.8320.6590.8250.841
CYP3A4 substrate0.7760.7130.4370.8200.608
ExcretionT1/20.8010.7270.4780.6580.827
ToxicityhERG Blockers0.9430.8890.7780.8690.909
H-HT0.8140.7200.4610.8140.650
DILI0.9240.8940.7930.8260.958
AMES Toxicity0.9020.8070.6060.7320.865
Rat Oral Acute Toxicity Toxicity0.8530.7780.5490.7690.793
FDAMDD0.8040.7360.4710.7340.737
Skin Sensitization0.7070.7750.4620.5390.889
Carcinogencity0.7880.7310.4760.6230.843
Eye Corrosion0.9830.9570.9080.9650.944
Eye Irritation0.9820.9520.8760.9180.964
Respiratory Toxicity0.8280.7640.5140.7320.786
NR-AR0.8860.8900.3480.8960.731
NR-AR-LBD0.9150.9360.4720.9420.783
NR-AhR0.9430.8620.5730.8580.896
NR-Aromatase0.8520.8490.2640.8590.615
NR-ER0.7710.8150.3200.8450.567
NR-ER-LBD0.8500.9030.3640.9180.618
NR-PPAR-gamma0.8930.8960.3440.9010.750
SR-ARE0.8630.8270.4690.8500.701
SR-ATAD50.8740.9190.3610.9290.640
SR-HSE0.9070.8680.3930.8750.750
SR-MMP0.9270.8970.6600.9080.835
SR-p530.8810.8410.3650.8490.723
Table 1.

Performance of the classification models incorporated into the ADMETlab 2.0 platform

CategoryModelAUCACCMCCSpecificitySensitivity
AbsorptionPgp-inhibitor0.9220.8670.7230.8440.882
Pgp-substrate0.8400.7680.5380.7050.828
HIA0.8660.9240.6870.8000.942
F20%0.8330.7500.4140.6800.773
F30%0.8480.8020.5800.7940.806
DistributionBBB Penetration0.9080.8620.7180.8240.891
MetabolismCYP1A2 inhibitor0.9280.8520.7040.8480.857
CYP1A2 substrate0.7370.6490.2980.6320.667
CYP2C19 inhibitor0.9130.8390.6790.8130.869
CYP2C19 substrate0.7580.6540.3000.6670.636
CYP2C9 inhibitor0.9190.8410.6710.8230.878
CYP2C9 substrate0.7250.7070.3860.7760.606
CYP2D6 inhibitor0.8920.8240.5580.8230.828
CYP2D6 substrate0.8470.7750.5530.7330.818
CYP3A4 inhibitor0.9210.8320.6590.8250.841
CYP3A4 substrate0.7760.7130.4370.8200.608
ExcretionT1/20.8010.7270.4780.6580.827
ToxicityhERG Blockers0.9430.8890.7780.8690.909
H-HT0.8140.7200.4610.8140.650
DILI0.9240.8940.7930.8260.958
AMES Toxicity0.9020.8070.6060.7320.865
Rat Oral Acute Toxicity Toxicity0.8530.7780.5490.7690.793
FDAMDD0.8040.7360.4710.7340.737
Skin Sensitization0.7070.7750.4620.5390.889
Carcinogencity0.7880.7310.4760.6230.843
Eye Corrosion0.9830.9570.9080.9650.944
Eye Irritation0.9820.9520.8760.9180.964
Respiratory Toxicity0.8280.7640.5140.7320.786
NR-AR0.8860.8900.3480.8960.731
NR-AR-LBD0.9150.9360.4720.9420.783
NR-AhR0.9430.8620.5730.8580.896
NR-Aromatase0.8520.8490.2640.8590.615
NR-ER0.7710.8150.3200.8450.567
NR-ER-LBD0.8500.9030.3640.9180.618
NR-PPAR-gamma0.8930.8960.3440.9010.750
SR-ARE0.8630.8270.4690.8500.701
SR-ATAD50.8740.9190.3610.9290.640
SR-HSE0.9070.8680.3930.8750.750
SR-MMP0.9270.8970.6600.9080.835
SR-p530.8810.8410.3650.8490.723
CategoryModelAUCACCMCCSpecificitySensitivity
AbsorptionPgp-inhibitor0.9220.8670.7230.8440.882
Pgp-substrate0.8400.7680.5380.7050.828
HIA0.8660.9240.6870.8000.942
F20%0.8330.7500.4140.6800.773
F30%0.8480.8020.5800.7940.806
DistributionBBB Penetration0.9080.8620.7180.8240.891
MetabolismCYP1A2 inhibitor0.9280.8520.7040.8480.857
CYP1A2 substrate0.7370.6490.2980.6320.667
CYP2C19 inhibitor0.9130.8390.6790.8130.869
CYP2C19 substrate0.7580.6540.3000.6670.636
CYP2C9 inhibitor0.9190.8410.6710.8230.878
CYP2C9 substrate0.7250.7070.3860.7760.606
CYP2D6 inhibitor0.8920.8240.5580.8230.828
CYP2D6 substrate0.8470.7750.5530.7330.818
CYP3A4 inhibitor0.9210.8320.6590.8250.841
CYP3A4 substrate0.7760.7130.4370.8200.608
ExcretionT1/20.8010.7270.4780.6580.827
ToxicityhERG Blockers0.9430.8890.7780.8690.909
H-HT0.8140.7200.4610.8140.650
DILI0.9240.8940.7930.8260.958
AMES Toxicity0.9020.8070.6060.7320.865
Rat Oral Acute Toxicity Toxicity0.8530.7780.5490.7690.793
FDAMDD0.8040.7360.4710.7340.737
Skin Sensitization0.7070.7750.4620.5390.889
Carcinogencity0.7880.7310.4760.6230.843
Eye Corrosion0.9830.9570.9080.9650.944
Eye Irritation0.9820.9520.8760.9180.964
Respiratory Toxicity0.8280.7640.5140.7320.786
NR-AR0.8860.8900.3480.8960.731
NR-AR-LBD0.9150.9360.4720.9420.783
NR-AhR0.9430.8620.5730.8580.896
NR-Aromatase0.8520.8490.2640.8590.615
NR-ER0.7710.8150.3200.8450.567
NR-ER-LBD0.8500.9030.3640.9180.618
NR-PPAR-gamma0.8930.8960.3440.9010.750
SR-ARE0.8630.8270.4690.8500.701
SR-ATAD50.8740.9190.3610.9290.640
SR-HSE0.9070.8680.3930.8750.750
SR-MMP0.9270.8970.6600.9080.835
SR-p530.8810.8410.3650.8490.723
Table 2.

Performance of the regression models incorporated into the ADMETlab 2.0 platform

CategoryModelR2RMSEMAE
Physicochemical propertyLog S0.8540.8500.588
Log D7.40.8920.4620.347
Log P0.9570.3570.256
AbsorptionCaco-2 permeability0.7460.3070.222
MDCK permeability0.7310.2910.199
DistributionPPB0.7330.1350.834
VD0.7820.6700.457
Fu0.7630.3670.263
ExcretionCL0.6783.3752.240
ToxicityBioconcentration factor0.7860.6030.435
IGC500.7230.4960.335
LC50FM0.7450.8630.643
LC50DM0.5240.9940.692
CategoryModelR2RMSEMAE
Physicochemical propertyLog S0.8540.8500.588
Log D7.40.8920.4620.347
Log P0.9570.3570.256
AbsorptionCaco-2 permeability0.7460.3070.222
MDCK permeability0.7310.2910.199
DistributionPPB0.7330.1350.834
VD0.7820.6700.457
Fu0.7630.3670.263
ExcretionCL0.6783.3752.240
ToxicityBioconcentration factor0.7860.6030.435
IGC500.7230.4960.335
LC50FM0.7450.8630.643
LC50DM0.5240.9940.692
Table 2.

Performance of the regression models incorporated into the ADMETlab 2.0 platform

CategoryModelR2RMSEMAE
Physicochemical propertyLog S0.8540.8500.588
Log D7.40.8920.4620.347
Log P0.9570.3570.256
AbsorptionCaco-2 permeability0.7460.3070.222
MDCK permeability0.7310.2910.199
DistributionPPB0.7330.1350.834
VD0.7820.6700.457
Fu0.7630.3670.263
ExcretionCL0.6783.3752.240
ToxicityBioconcentration factor0.7860.6030.435
IGC500.7230.4960.335
LC50FM0.7450.8630.643
LC50DM0.5240.9940.692
CategoryModelR2RMSEMAE
Physicochemical propertyLog S0.8540.8500.588
Log D7.40.8920.4620.347
Log P0.9570.3570.256
AbsorptionCaco-2 permeability0.7460.3070.222
MDCK permeability0.7310.2910.199
DistributionPPB0.7330.1350.834
VD0.7820.6700.457
Fu0.7630.3670.263
ExcretionCL0.6783.3752.240
ToxicityBioconcentration factor0.7860.6030.435
IGC500.7230.4960.335
LC50FM0.7450.8630.643
LC50DM0.5240.9940.692

Besides, we also implemented more rigorous leave-cluster-out validation, where the molecules were grouped into clusters according to the Murcko scaffolds. The scaffolds of validation and test sets were excluded from the training set. The classification endpoints could obtain an average MCC of 0.47, and regression endpoints could obtain an average R2 of 0.65, suggesting that the knowledge learned from the training set has the potential to be transferred to other different chemical structures and the predictive ability of our webserver is reliable. More detailed results of leave-cluster-out validation are listed in Supplementary Tables S5 and S6.

Multi-task graph attention (MGA) framework

Traditional multitask graph neural network (GNN) methods usually handle homogeneous tasks, such as pure regression or classification tasks. However, in ADMET prediction, both regression tasks and classification tasks are needed. Therefore, a multi-task graph attention (MGA) framework was used to simultaneously learn the regression and classification tasks for ADMET predictions in this study. An overview of the Multi-task Graph Attention framework is shown in Supplementary Figure S1. MGA is composed of the input, relation graph convolution network (RGCN) layers (24), attention layer, and fully-connected (FC) layers. In the input, a node represents the information of an atom, and after passing the RGCN layers, the node represents the general features of the circular substructure centered on the atom. The attention layers can assign different attention weights to different substructures, and then generate the customized fingerprints from the general features for a specific task. It's worth noting that MGA has different attention layers for different tasks and can generate customized fingerprints for different tasks. The FC layers predict the corresponding tasks based on the customized fingerprints. The classification and regression tasks adopt different loss functions (loss_c and loss_r), and the loss function of MGA is a combination of loss_c and loss_r (25). To alleviate data-imbalance problem, we employed BCEWithLogitsLoss loss function and attached corresponding weights of positive examples (pos_weight) according to the proportion of positive and negative samples. We used the empirical hyperparameters of MGA to construct the ADMET prediction models and more information on the MGA method can be seen in Supplementary Data. The implementation and empirical hyperparameters of MGA can be found at https://github.com/wzxxxx/MGA.

NEW DEVELOPMENTS IN ADMETlab 2.0

To better facilitate the ADMET evaluation of chemicals, ADMETlab 2.0 is presented with significant updates to functional modules, predictive models, and explanations. Additionally, we also updated the user interface to improve the user experience. Figure 1 presents the workflow of ADMETlab 2.0.

Workflow scheme of ADMETlab 2.0.
Figure 1.

Workflow scheme of ADMETlab 2.0.

Comprehensively enhanced ADMET profiles

Originally, ADMETlab could support the predictions of 7 basic physicochemical properties, 28 ADMET-related properties and 6 drug-likeness rules, but some important endpoints were still not taken into consideration. Moreover, the drug-likeness rules supported by ADMElab had overlaps between their internal terms. These problems may lead to an inadequate understanding of the whole ADMET spectrum for a molecule. Therefore, ADMETlab needs to be updated to overcome the existing limitations.

The ADME-related endpoints could be roughly divided into seven different sections: (i) physicochemical properties, (ii) medicinal chemistry properties, (iii) absorption, (iv) distribution, (v) metabolism, (vi) excretion and (vii) toxicity. Most of the newly added endpoints in ADMETlab 2.0 belong to physicochemical properties, medicinal chemistry properties and toxicity. At first, we added a number of structural properties closely relevant to drug-likeness, such as formal charge, flexibility, and stereo centers (26–28). Then, the empirical formula of logP was replaced by a well-trained MGA model based on more than ten thousand molecules. For medicinal chemistry properties, we selected 4 complementary drug-likeness rules, including Lipinski's rule-of-five (29), Pfizer rule (30), GSK rule (31) and Golden Triangle (32). These rules were born within the long-term drug discovery practice of world-famous pharmaceutical companies. Some quantitative measures, including QED (33), Fsp3 (34), MCE-18 (35), SAscore (36), and NPscore (37), were also added to this section to evaluate molecular drug-likeness, medicinal chemistry evolution, synthetic feasibility, and natural product likeness, respectively. Thereinto, the Fsp3 measure is a simple and interpretable metric for molecular saturation, and it is defined as the fraction of sp3 carbon atoms (the number of sp3 hybridized carbons / total carbon count). This parameter is closely related to solubility and melting points. The QED measure is a publicly accepted drug-likeness parameter. It is derived from a multivariate nonlinear function, in which several properties were parameterized and combined, including those used by Lipinski's rule-of-five, PSA, Nrotb, counts of aromatic rings (NAr), and the presence of certain undesirable structural motifs. We also noticed the frequent hitters (38) revoked more and more attention, and therefore four kinds of substructural rules were added to facilitate the detection of frequent hitters, thereby avoiding false positive results in drug discovery. For the toxicity section, the involvement of Tox21 made the toxicity prediction of our platform extend to biological target-based pathways. The binding data for twelve different biological targets, belonging to two major groups (the nuclear receptor pathway and the stress response pathway), constitute the complete Tox21 dataset. The prediction results of these endpoints in Tox21 could give users information on how chemicals may affect human health. In addition to traditional QSPR models, eight different toxicophore rules were integrated into this section, including human toxicity, environmental toxicity and comprehensive toxicity. It was believed that the combined application of toxicity prediction models and toxicophore rules could greatly improve the predictivity and interpretability of models (18). Other sections were also expanded in different degrees, and the introduction of the supported endpoints was provided in Supplementary Data and the website.

Further, it should be noted that the enhancement of ADMElab was not only confined to the addition of ADMET-related endpoints. The quality and quantity of the experimental data for model building was significantly improved. For example, the number of the molecules in the logD 7.4 dataset increased roughly tenfold from 1031 to 10 370, and a dramatic data increase also occurred to the hERG dataset. Other datasets were also extended with different magnitudes, such as Clearance, T1/2, VD, Caco-2 permeability, etc. Meanwhile, the number of some datasets decreased due to more rigorous data curation, where loosely defined molecules were removed and irrelevant data consolidation was corrected. Collectively, our platform could support the calculation of 88 ADMET-related endpoints spanning seven different categories, which is currently the most integrated online platform of this kind. In terms of the data, compared with the initial version, the number of the entries for model training in the current release has almost tripled.

Re-engineered modules and batch evaluation support

In the previous release of ADMETlab, we designed three main modules: Drug-likenessAnalysis, ADMET Prediction, and Systematic Evaluation, whose meanings were self-explanatory. Drug-likeness analysis and ADMET evaluation were separated into different modules, and single-property prediction and all-sided ADMET prediction were both supported to meet the demands of users. Our original intention was to make the webserver well organized. However, according to the feedback from users, most people preferred to obtain the whole ADMET profile as the foundation of the decision-making that whether the chemicals were worth further exploration, rather than specific endpoints. The single-property evaluation module seemed redundant and embarrassed in most situations. Moreover, due to the absence of drug-likeness rules in Systematic Evaluation, the users had to switch back and forth between the two modules, leading to bad user experience. To solve these problems, we re-engineered and optimized the functional modules from three aspects. Firstly, the single-property prediction module was completely removed. Secondly, the independent drug-likeness module was incorporated into the Systematic Evaluation as the Medicinal Chemistry section. The combined module could provide an integral calculation and evaluation of ADMET profiles by entering molecules only once, and it was renamed to ADMET Evaluation to reflect its essence more intuitively. Further, ADMETlab also had a built-in database, which included >280 000 entries from multiple data sources, recording the basic information and ADMET profiles. In addition to accurate searching, it could implement similarity searching with the user-defined criterion and provide the most similar molecules to the query molecules. However, some public integrated databases have matured considerably in offering high-quality and up-to-date resources, such as ChEMBL, BindingDB, admetSAR, etc. Moreover, it is hard to measure to which extent similarity searching benefit the users in their structure optimization work. Considering all the factors together, the database will be no longer retained within the latest version. Nevertheless, the interested readers still could access this function in the previous version.

Another notable weakness of the previous version is the inability of batch evaluation. The application of in silico frameworks enables earlier ADMET evaluation, even before syntheses. It also means significantly higher computation demands, from dozens to thousands in number. Clearly, single-molecule evaluation was powerless to handle a large number of molecules unless drawing support from in-house scripts. During the past three years, there were numerous letters from different institutes asking for the possibility of adding additional batch evaluation function. As a response, we developed ADMET Screening as an independent module supporting batch uploading and downloading, which could effectively avoid laboring operations for fetching results one by one. By using this module, synthetic chemists and computational chemists could conveniently evaluate a series of empirically designed or virtually screened compounds before syntheses and biochemical assays.

In summary, ADMETlab 2.0 currently provides the users with concise, explicit, and efficient functional modules to facilitate the ADMET evaluation of chemicals.

Robust and accurate MGA models

As the core component of the web server, the accuracy of models determines the reliability of predictions. With fixed datasets, the performance of ADMET evaluation models depends heavily on the selection of algorithms and descriptors. Traditionally, researchers needed to select multiple types of descriptors as the proxy for molecular structures, each combined with several algorithms, to construct prediction models and evaluate prediction performance, thereby obtaining the optimum combination. Such orthogonal strategy was also employed in the development of ADMETlab, where a total of 11 molecular descriptors and 6 modeling algorithms were engaged in the comparison. Although the process was tedious, a series of high-quality models with satisfactory performance were successfully constructed and embedded into the webserver.

Nevertheless, we asked ourselves: can we do better? Currently, the use of deep neural networks (DNN) has become popular throughout computational chemistry. Supported by many publications (39,40), it could obtain great improvement over classical approaches using random forest and fingerprints for many endpoints, such as solubility, Caco-2 permeability, log D, etc. In this work, we employed the MGA framework to develop the classification and regression predictors simultaneously for further performance improvement. MGA operated on graph-structured data, where the input molecule was regarded as a graph, with atoms being the nodes and bonds the edges. The node features were learned by propagating the features from the neighboring nodes and learning affine transformations. By learning jointly over multiple endpoints, the parameters in the hidden layers were shared among all tasks to force the learning of useful representation of the input molecule, which could improve the model generalization ability and enable tasks with fewer measures to benefit from the chemical space coverage of the larger tasks. Finally, 40 classification endpoints and 13 regression endpoints were built and evaluated in a separate test set fashion. For the classification models, the AUC ranged from 0.707 to 0.983 with an average value of 0.863. For the regression models, the R2 mainly ranged from 0.678 to 0.957 with an average value of 0.783, except for LC50DM. We also employed the XGBoost algorithm and MOE2d descriptors to construct traditional single-task models using the same datasets. As expected, the MGA models could yield better performance relative to the corresponding XGBoost models, except for seven classification and two regression endpoints. However, the superiority of XGBoost is quite slender (about 0.012 on AUC and 0.018 on R2) and these models may also slow down the computation speed. We still provided the users with MGA models across 53 endpoints in the web server.

Also noteworthy, the benefit brought by new algorithms is also reflected in the faster computing speed. In the MGA framework, molecules are learned in the form of graphs universally. Therefore, the laborious descriptor calculation is no longer required. Additionally, compared with the independent models for individual endpoint, multitask learning enables one input with multiple outputs, thus greatly simplifying the calculation process. Currently, a typical ADMETlab 2.0 task for 1000 molecules requires about 84 seconds, while the time required to run 20 molecules in the initial version is about 1574 s.

Practical explanation and guidance

Unambiguous interpretation of predictions in ADMETlab had been extensively acclaimed by the scientific community. In this update, we maintained this battle-tested characteristic and made further improvement. We inherited the explicit classification of endpoints, which enabled the users to quickly find the section of interest. For example, researchers of central nervous system drugs would be particularly interested in the distribution parameters, especially blood-brain barrier penetration, which determined if the medications could have significant brain exposure for therapeutic effect. To avoid confusion, we designed different display manners. For the regression endpoints, the concrete predictive values are shown in the result page. For the classification models, the prediction probability values are represented with different symbols: 0-0.1(−−−), 0.1-0.3(−−), 0.3-0.5(−), 0.5-0.7(+), 0.7-0.9(++), and 0.9-1.0(+++). Usually, the token ‘+++’ or ‘++’ represents the molecule is more likely to be toxic or defective, while ‘−−’ or ‘−’ represents nontoxic or appropriate. Here, we do not recommend trusting predictive results symbolled by ‘+’ or ‘−' (probably values in 0.3-0.7), and corresponding molecules require further assessment. Along with the predictive results, the empirical decision state of each endpoint is visually represented with different colored dots, green for excellent, yellow for medium, and red for poor, whose definition criteria are summarized in Supplementary Data and the website. For substructural alerts, like PAINS and SureChEMBL, in addition to the alert counts, the users could check the substructures contained in the molecule through the DETAIL button. The information of each endpoint is folded in the information icon to facilitate the understanding of the predictive results, including endpoints explanation, empirical optimal range, and label definition of binary endpoints. Generally, ADMETlab 2.0 provides practical explanation and information to help the users to get a whole ADMET picture of the input molecule.

APPLICATION CASE

To further demonstrate the reliability of ADMETlab 2.0, we selected olaparib as the input molecule to predict its ADMET parameters and to discuss the results in detail. olaparib is a poly (ADP-ribose) polymerase inhibitor developed by AstraZeneca for the treatment of BRCA mutation-positive ovarian cancer (41). As a globally approved antitumor drug, some ADMET-related parameters have been reported in several studies. The results page of Olaparib is provided in Supplementary Figure S2.

It can be seen from the radar plot that all the physicochemical properties are in the proper scope except solubility (logS). Poor solubility also affects the Fsp3 score, barely 0.33 with a red empirical decision state. In reality, its poor solubility has troubled the pharmacist for a long time and many advanced drug research techniques have been proposed to ensure its bioavailability (42). In the absorption section, olaparib is predicted as P-glycoprotein (Pgp) substrate and inhibitor with high possibility, which is consistent with the publication demonstrating that increased Pgp drug efflux transporter expression enables an intrinsic resistance of olaparib in metaplastic breast carcinoma (43). For metabolism, it is predicted that olaparib does not inhibit CYP 1A2 and CYP 2D6, and causes inhibition of CYP3A4 and CYP 2C9, which is consistent with the results of the in vitro cytochrome P450 evaluation reported by AstraZeneca (44). For toxicology, olaparib may have a high risk of human hepatotoxicity and liver injury, which is a common side effect of antitumor drugs and has been reported in a case study (45). Moreover, olaparib is predicted active for the ARE toxicological pathway with high possibility, although this characteristic has not been reported in any publication. Actually, antioxidant capacity has been recognized as a potential mechanism of tumor malignancy (46), and it is possible that the antitumor action of olaparib is partly contributed from the interference with the Keap1-Nrf2-ARE signaling pathway. In summary, the predictive results of ADMETlab 2.0 basically correspond to the reported experimental data, highlighting the reliability of this server tool.

Comparison with other web-based tools

Currently, there are many online prediction servers for the evaluation of certain ADMET parameters, such as SuperCYPsPred (47) for cytochrome activity prediction, eMolTox (48) for potential toxicity prediction, ChemAGG (49) for colloidal aggregators identification, etc. Meanwhile, several excellent online platforms have been proposed for more systematic and convenient ADMET predictions, including SwissADME (14), admetSAR 2.0 (13), FAF-Drugs4 (12), pkCSM (16) and vNN-ADMET (50). Here, we compared these tools and ADMETlab 2.0, as well as its previous version, with details summarized in Table 3.

Table 3.

Comparison of the main features of ADMETlab (version 1.0–2.0) with other web-based tools. Symbol ‘+’ means feature support and more ‘+’ indicate batter support

ADMETlab
Features2.01.0SwissADMEadmetSAR 2.0FAF-Drugs4pkCSMvNN-ADME
Endpoints
Physicochemical property1771252060
Medicinal chemistry1361001600
ADME23219350209
Toxicity2770120106
Toxicophore rule8000400
PAINS includedYesNoYesNoYesNoNo
Batch evaluation++++++++++++
Explanation+++++++++++++
AvailabilityFreeFreeFreeFreeFreeFreeRegistration required
Computation time(1000 molecules)84 sMore than 2 h1560 s267 s967s1845 s2400 s
ADMETlab
Features2.01.0SwissADMEadmetSAR 2.0FAF-Drugs4pkCSMvNN-ADME
Endpoints
Physicochemical property1771252060
Medicinal chemistry1361001600
ADME23219350209
Toxicity2770120106
Toxicophore rule8000400
PAINS includedYesNoYesNoYesNoNo
Batch evaluation++++++++++++
Explanation+++++++++++++
AvailabilityFreeFreeFreeFreeFreeFreeRegistration required
Computation time(1000 molecules)84 sMore than 2 h1560 s267 s967s1845 s2400 s

*Medicinal chemistry contains drug-likeness rules, chemical friendly measures, and substructural rules of frequent hitters; ADME contains absorption, distribution, metabolism, and excretion related endpoints; Toxicity contains human toxicity, animal toxicity, environmental toxicity, and toxic pathways.

URL links:

Table 3.

Comparison of the main features of ADMETlab (version 1.0–2.0) with other web-based tools. Symbol ‘+’ means feature support and more ‘+’ indicate batter support

ADMETlab
Features2.01.0SwissADMEadmetSAR 2.0FAF-Drugs4pkCSMvNN-ADME
Endpoints
Physicochemical property1771252060
Medicinal chemistry1361001600
ADME23219350209
Toxicity2770120106
Toxicophore rule8000400
PAINS includedYesNoYesNoYesNoNo
Batch evaluation++++++++++++
Explanation+++++++++++++
AvailabilityFreeFreeFreeFreeFreeFreeRegistration required
Computation time(1000 molecules)84 sMore than 2 h1560 s267 s967s1845 s2400 s
ADMETlab
Features2.01.0SwissADMEadmetSAR 2.0FAF-Drugs4pkCSMvNN-ADME
Endpoints
Physicochemical property1771252060
Medicinal chemistry1361001600
ADME23219350209
Toxicity2770120106
Toxicophore rule8000400
PAINS includedYesNoYesNoYesNoNo
Batch evaluation++++++++++++
Explanation+++++++++++++
AvailabilityFreeFreeFreeFreeFreeFreeRegistration required
Computation time(1000 molecules)84 sMore than 2 h1560 s267 s967s1845 s2400 s

*Medicinal chemistry contains drug-likeness rules, chemical friendly measures, and substructural rules of frequent hitters; ADME contains absorption, distribution, metabolism, and excretion related endpoints; Toxicity contains human toxicity, animal toxicity, environmental toxicity, and toxic pathways.

URL links:

Specifically, admetSAR, pkCSM and vNN-ADMET were designed to assist researchers in understanding the ADMET characteristics of chemicals. Thereinto, admetSAR 2.0 included the most diverse metabolism properties, while the available predictions in vNN-ADMETb were relatively limited. FAF-Drugs4 and Swiss-ADME performed outstandingly in physicochemical and medicinal chemistry properties. FAF-Drugs4 made the first attempt to add toxicophore rules into ADMET evaluation, and Swiss-ADME could provide multiple calculation approaches for logP and logS and various drug-likeness rules. However, their pharmacokinetics predictions were inferior to other tools. In comparison, ADMETlab 2.0 is available for the prediction of every important aspect of molecular quality, covering major endpoints that medicinal chemists would be interested in. Some toxicophore rules, toxicity pathways, and some medicinal chemistry measures of our platform are unique among the tools of this kind. Backed up by the robust QSPR models trained by the MGA framework, it now can provide more reliable predictions relative to other tools based on traditional machine learning algorithms. Moreover, the updated version could provide more user-friendly design and efficient service, reflected in diverse input approaches, practical explanation and high computation speed. Collectively, ADMETlab 2.0 provides medicinal chemists a comprehensive, accurate, efficient, and user-friendly service for ADMET evaluation.

CONCLUSIONS

Here, we have introduced ADMETlab 2.0 which significantly enhanced the functionality of its predecessor. The new webserver provides the users easy access to comprehensive, accurate and efficient prediction of the ADMET profiles for chemicals, including absorption, distribution, metabolism, excretion and toxicity properties, as well as some important physicochemical and medicinal chemistry properties. In this update, the available ADMET profile is extended to 88 related characteristics, roughly twice the number of its predecessor. Meanwhile, the advanced MGA framework was employed to construct robust and accurate models as the foundation of our platform. Module restructure and batch computation support were implemented to improve usability and user experience. Optimized results explanation enables non-expert users to understand the hints behind the predicted values, thereby guiding the medicinal chemistry decision-making. We believe that ADMETlab 2.0 will prove useful in accelerating the process of drug R&D.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.

ACKNOWLEDGEMENTS

We would like to thank the user community for using ADMETlab and providing us with valuable suggestions regarding further improvement of this tool. The study was approved by the university's review board.

FUNDING

Changsha Municipal Natural Science Foundation [kq2014144]; Changsha Science and Technology Bureau project [kq2001034]; Key R&D Program of Zhejiang Province [2020C03010]; National Science Foundation of China [21575128, 81773632]; Zhejiang Provincial Natural Science Foundation of China [LZ19H300001]; HKBU Strategic Development Fund project [SDF19-0402-P02]. Funding for open access charge: HKBU Strategic Development Fund project [SDF19-0402-P02].

Conflict of interest statement. None declared.

REFERENCES

1.

Cumming
J.G.
,
Davis
A.M.
,
Muresan
S.
,
Haeberlein
M.
,
Chen
H.
Chemical predictive modelling to improve compound quality
.
Nat. Rev. Drug Discov.
2013
;
12
:
948
962
.

2.

van de Waterbeemd
H.
,
Gifford
E.
ADMET in silico modelling: towards prediction paradise?
.
Nat. Rev. Drug Discov.
2003
;
2
:
192
204
.

3.

Hou
T.
,
Wang
J.
Structure-ADME relationship: still a long way to go?
.
Expert Opin. Drug Metab. Toxicol.
2008
;
4
:
759
770
.

4.

Hodgson
J.
ADMET–turning chemicals into drugs
.
Nat. Biotechnol.
2001
;
19
:
722
726
.

5.

Selick
H.E.
,
Beresford
A.P.
,
Tarbit
M.H.
The emerging importance of predictive ADME simulation in drug discovery
.
Drug Discov. Today
.
2002
;
7
:
109
116
.

6.

Kennedy
T.
Managing the drug discovery/development interface
.
Drug Discov. Today
.
1997
;
2
:
436
444
.

7.

Kola
I.
,
Landis
J.
Can the pharmaceutical industry reduce attrition rates?
.
Nat. Rev. Drug Discov.
2004
;
3
:
711
715
.

8.

Waring
M.J.
,
Arrowsmith
J.
,
Leach
A.R.
,
Leeson
P.D.
,
Mandrell
S.
,
Owen
R.M.
,
Pairaudeau
G.
,
Pennie
W.D.
,
Pickett
S.D.
,
Wang
J.
et al. .
An analysis of the attrition of drug candidates from four major pharmaceutical companies
.
Nat. Rev. Drug Discov.
2015
;
14
:
475
486
.

9.

Hay
M.
,
Thomas
D.W.
,
Craighead
J.L.
,
Economides
C.
,
Rosenthal
J.
Clinical development success rates for investigational drugs
.
Nat. Biotechnol.
2014
;
32
:
40
51
.

10.

Goller
A.H.
,
Kuhnke
L.
,
Montanari
F.
,
Bonin
A.
,
Schneckener
S.
,
Ter Laak
A.
,
Wichard
J.
,
Lobell
M.
,
Hillisch
A.
Bayer's in silico ADMET platform: a journey of machine learning over the past two decades
.
Drug Discov. Today
.
2020
;
25
:
1702
1709
.

11.

Dong
J.
,
Wang
N.N.
,
Yao
Z.J.
,
Zhang
L.
,
Cheng
Y.
,
Ouyang
D.
,
Lu
A.P.
,
Cao
D.S.
ADMETlab: a platform for systematic ADMET evaluation based on a comprehensively collected ADMET database
.
J Cheminform
.
2018
;
10
:
29
.

12.

Lagorce
D.
,
Bouslama
L.
,
Becot
J.
,
Miteva
M.A.
,
Villoutreix
B.O.
FAF-Drugs4: free ADME-tox filtering computations for chemical biology and early stages drug discovery
.
Bioinformatics
.
2017
;
33
:
3658
3660
.

13.

Yang
H.
,
Lou
C.
,
Sun
L.
,
Li
J.
,
Cai
Y.
,
Wang
Z.
,
Li
W.
,
Liu
G.
,
Tang
Y.
admetSAR 2.0: web-service for prediction and optimization of chemical ADMET properties
.
Bioinformatics
.
2019
;
35
:
1067
1069
.

14.

Daina
A.
,
Michielin
O.
,
Zoete
V.
SwissADME: a free web tool to evaluate pharmacokinetics, drug-likeness and medicinal chemistry friendliness of small molecules
.
Sci. Rep.
2017
;
7
:
42717
.

15.

Banerjee
P.
,
Eckert
A.O.
,
Schrey
A.K.
,
Preissner
R.
ProTox-II: a webserver for the prediction of toxicity of chemicals
.
Nucleic. Acids. Res.
2018
;
46
:
W257
W263
.

16.

Pires
D.E.
,
Blundell
T.L.
,
Ascher
D.B.
pkCSM: predicting small-molecule pharmacokinetic and toxicity properties using graph-based signatures
.
J. Med. Chem.
2015
;
58
:
4066
4072
.

17.

Bienfait
B.
,
Ertl
P.
JSME: a free molecule editor in JavaScript
.
J Cheminform
.
2013
;
5
:
24
.

18.

Yang
Z.Y.
,
Yang
Z.J.
,
Lu
A.P.
,
Hou
T.J.
,
Cao
D.S.
Scopy: an integrated negative design python library for desirable HTS/VS database design
.
Brief. Bioinform.
2020
;
doi:10.1093/bib/bbaa194
.

19.

Mendez
D.
,
Gaulton
A.
,
Bento
A.P.
,
Chambers
J.
,
De Veij
M.
,
Felix
E.
,
Magarinos
M.P.
,
Mosquera
J.F.
,
Mutowo
P.
,
Nowotka
M.
et al. .
ChEMBL: towards direct deposition of bioassay data
.
Nucleic. Acids. Res.
2019
;
47
:
D930
D940
.

20.

Kim
S.
,
Chen
J.
,
Cheng
T.
,
Gindulyte
A.
,
He
J.
,
He
S.
,
Li
Q.
,
Shoemaker
B.A.
,
Thiessen
P.A.
,
Yu
B.
et al. .
PubChem in 2021: new data content and improved web interfaces
.
Nucleic Acids Res.
2021
;
49
:
D1388
D1395
.

21.

Sushko
I.
,
Novotarskyi
S.
,
Korner
R.
,
Pandey
A.K.
,
Rupp
M.
,
Teetz
W.
,
Brandmaier
S.
,
Abdelaziz
A.
,
Prokopenko
V.V.
,
Tanchuk
V.Y.
et al. .
Online chemical modeling environment (OCHEM): web platform for data storage, model development and publishing of chemical information
.
J. Comput. Aided Mol. Des.
2011
;
25
:
533
554
.

22.

Martin
T.
,
P.H.
D.Y
EPA/600/C-12/006, U.S. Environmental Protection Agency
.
2012
;
Washington, DC
Ver. 4.1
.

23.

Chicco
D.
,
Jurman
G.
The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation
.
BMC Genomics
.
2020
;
21
:
6
.

24.

Schlichtkrull
M.
,
Kipf
T.N.
,
Bloem
P.
,
Van Den Berg
R.
,
Titov
I.
,
Welling
M.
European Semantic Web Conference
.
2018
;
Springer
593
607
.

25.

Yang
X.
,
Kim
S.
,
Xing
E.
Heterogeneous multitask learning with joint sparsity constraints
.
Adv. Neural Inform. Process. Syst.
2009
;
22
:
2151
2159
.

26.

Oprea
T.I.
Property distribution of drug-related chemical databases
.
J. Comput. Aided Mol. Des.
2000
;
14
:
251
264
.

27.

Irwin
J.J.
,
Tang
K.G.
,
Young
J.
,
Dandarchuluun
C.
,
Wong
B.R.
,
Khurelbaatar
M.
,
Moroz
Y.S.
,
Mayfield
J.
,
Sayle
R.A.
ZINC20—a free ultralarge-scale chemical database for ligand discovery
.
J. Chem. Inf. Model.
2020
;
60
:
6065
6073
.

28.

Oprea
T.I.
,
Davis
A.M.
,
Teague
S.J.
,
Leeson
P.D.
Is there a difference between leads and drugs? A historical perspective
.
J. Chem. Inf. Comput. Sci.
2001
;
41
:
1308
1315
.

29.

Lipinski
C.A.
,
Lombardo
F.
,
Dominy
B.W.
,
Feeney
P.J.
Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings
.
Adv. Drug. Deliv. Rev.
2001
;
46
:
3
26
.

30.

Hughes
J.D.
,
Blagg
J.
,
Price
D.A.
,
Bailey
S.
,
Decrescenzo
G.A.
,
Devraj
R.V.
,
Ellsworth
E.
,
Fobian
Y.M.
,
Gibbs
M.E.
,
Gilles
R.W.
et al. .
Physiochemical drug properties associated with in vivo toxicological outcomes
.
Bioorg. Med. Chem. Lett.
2008
;
18
:
4872
4875
.

31.

Gleeson
M.P.
Generation of a set of simple, interpretable ADMET rules of thumb
.
J. Med. Chem.
2008
;
51
:
817
834
.

32.

Johnson
T.W.
,
Dress
K.R.
,
Edwards
M.
Using the Golden Triangle to optimize clearance and oral absorption
.
Bioorg. Med. Chem. Lett.
2009
;
19
:
5560
5564
.

33.

Bickerton
G.R.
,
Paolini
G.V.
,
Besnard
J.
,
Muresan
S.
,
Hopkins
A.L.
Quantifying the chemical beauty of drugs
.
Nat. Chem.
2012
;
4
:
90
98
.

34.

Lovering
F.
,
Bikker
J.
,
Humblet
C.
Escape from flatland: increasing saturation as an approach to improving clinical success
.
J. Med. Chem.
2009
;
52
:
6752
6756
.

35.

Ivanenkov
Y.A.
,
Zagribelnyy
B.A.
,
Aladinskiy
V.A.
Are we opening the door to a new era of medicinal chemistry or being collapsed to a chemical singularity?
.
J. Med. Chem.
2019
;
62
:
10026
10043
.

36.

Ertl
P.
,
Schuffenhauer
A.
Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions
.
J Cheminform
.
2009
;
1
:
8
.

37.

Ertl
P.
,
Roggo
S.
,
Schuffenhauer
A.
Natural product-likeness score and its application for prioritization of compound libraries
.
J. Chem. Inf. Model.
2008
;
48
:
68
74
.

38.

Yang
Z.Y.
,
He
J.H.
,
Lu
A.P.
,
Hou
T.J.
,
Cao
D.S.
Frequent hitters: nuisance artifacts in high-throughput screening
.
Drug Discov. Today
.
2020
;
25
:
657
667
.

39.

Feinberg
E.N.
,
Sur
D.
,
Wu
Z.
,
Husic
B.E.
,
Mai
H.
,
Li
Y.
,
Sun
S.
,
Yang
J.
,
Ramsundar
B.
,
Pande
V.S.
PotentialNet for molecular property prediction
.
ACS Cent Sci
.
2018
;
4
:
1520
1530
.

40.

Montanari
F.
,
Kuhnke
L.
,
Ter Laak
A.
,
Clevert
D.A.
Modeling physico-chemical ADMET endpoints with multitask graph convolutional networks
.
Molecules
.
2019
;
25
:
44
.

41.

Zhao
Y.
,
Zhang
L.X.
,
Jiang
T.
,
Long
J.
,
Ma
Z.Y.
,
Lu
A.P.
,
Cheng
Y.
,
Cao
D.S.
The ups and downs of poly(ADP-ribose) polymerase-1 inhibitors in cancer therapy—current progress and future direction
.
Eur. J. Med. Chem.
2020
;
203
:
112570
.

42.

Ottria
R.
,
Ravelli
A.
,
Miceli
M.
,
Casati
S.
,
Orioli
M.
,
Ciuffreda
P.
Quantitative characterization of Olaparib in nanodelivery system and target cell compartments by LC-MS/MS
.
Molecules
.
2019
;
24
:
989
.

43.

Henneman
L.
,
van Miltenburg
M.H.
,
Michalak
E.M.
,
Braumuller
T.M.
,
Jaspers
J.E.
,
Drenth
A.P.
,
de Korte-Grimmerink
R.
,
Gogola
E.
,
Szuhai
K.
,
Schlicker
A.
et al. .
Selective resistance to the PARP inhibitor olaparib in a mouse model for BRCA1-deficient metaplastic breast cancer
.
Proc. Natl. Acad. Sci. U.S.A.
2015
;
112
:
8409
8414
.

44.

McCormick
A.
,
Swaisland
H.
,
Reddy
V.P.
,
Learoyd
M.
,
Scarfe
G.
In vitro evaluation of the inhibition and induction potential of olaparib, a potent poly(ADP-ribose) polymerase inhibitor, on cytochrome P450
.
Xenobiotica
.
2018
;
48
:
555
564
.

45.

Tufoni
M.
,
Serena Ricci
C.
,
Zaccherini
G.
A case of immune-mediated liver injury induced by Olaparib
.
Hepatology
.
2018
;
68
:
2039
2041
.

46.

Xu
P.
,
Liu
Q.
,
Xie
Y.
,
Shi
X.
,
Li
Y.
,
Peng
M.
,
Guo
H.
,
Sun
R.
,
Li
J.
,
Hong
Y.
et al. .
Breast cancer susceptibility protein 1 (BRCA1) rescues neurons from cerebral ischemia/reperfusion injury through NRF2-mediated antioxidant pathway
.
Redox. Biol.
2018
;
18
:
158
172
.

47.

Banerjee
P.
,
Dunkel
M.
,
Kemmler
E.
,
Preissner
R.
SuperCYPsPred-a web server for the prediction of cytochrome activity
.
Nucleic Acids Res.
2020
;
48
:
W580
W585
.

48.

Ji
C.
,
Svensson
F.
,
Zoufir
A.
,
Bender
A.
eMolTox: prediction of molecular toxicity with confidence
.
Bioinformatics
.
2018
;
34
:
2508
2509
.

49.

Yang
Z.Y.
,
Yang
Z.J.
,
Dong
J.
,
Wang
L.L.
,
Zhang
L.X.
,
Ding
J.J.
,
Ding
X.Q.
,
Lu
A.P.
,
Hou
T.J.
,
Cao
D.S.
Structural analysis and identification of colloidal aggregators in drug discovery
.
J. Chem. Inf. Model.
2019
;
59
:
3714
3726
.

50.

Schyman
P.
,
Liu
R.
,
Desai
V.
,
Wallqvist
A.
vNN web server for ADMET predictions
.
Front. Pharmacol.
2017
;
8
:
889
.

Author notes

The authors wish it to be known that, in their opinion, the first three authors should be regarded as joint First Authors.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

Supplementary data

Comments

0 Comments
Submit a comment
You have entered an invalid code
Thank you for submitting a comment on this article. Your comment will be reviewed and published at the journal's discretion. Please check for further notifications by email.