Abstract
For the first time, a global regression quantitative structure–toxicity/activity relationship (QSTR/QSAR) model was developed for the toxicity of a large data set including 1236 chemicals towards Vibrio fischeri, by using random forest (RF) regression algorithm. The optimal RF model with RF parameters of mtry = 3, ntree = 150 and nodesize = 5 was based on 13 molecular descriptors. It can achieve accurate prediction for the toxicity of 99.1% of 1236 chemicals, and yield coefficients of determination R2 of 0.893 for 930 log(Mw/IBC50) in the training set, 0.723 for 306 log(Mw/IBC50) in the test se, and 0.865 for 1236 toxicity log(Mw/IBC50) in the total set. The optimal RF global model proposed in this work is comparable to other published local QSTR models on small datasets of the toxicity to Vibrio fischeri.
Similar content being viewed by others
References
Cronin MTD, Bowers GS, Sinks GD, Schultz TW (2000) Structure-toxicity relationships for aliphatic compounds encompassing a variety of mechanisms of toxic action to Vibrio fischeri. SAR QSAR Environ Res 11:31–312. https://doi.org/10.1080/10629360008033237
de Melo EB, Martins JPA, Miranda EH, Ferreira MMC (2016) A best comprehension about the toxicity of phenylsulfonyl carboxylates in Vibrio fischeri using quantitative structure activity/property relationship methods. J Hazard Mater 304:33–241. https://doi.org/10.1016/j.jhazmat.2015.10.047
Gajewicz-Skretna A, Gromelski M, Wyrzykowska E, Furuhama A, Yamamoto H, Suzuki N (2021) Aquatic toxicity (Pre)screening strategy for structurally diverse chemicals: global or local classification tree models? Ecotoxicol Environ Saf 208:111738
Isah HM, Sawyerr HO, Raimi MO, Bashir BG, Haladu S, Odipe OE (2020) Assessment of commonly used pesticides and frequency of self-reported symptoms on farmers health in Kura, Kano State, Nigeria. J Educ Learn Manag 1(1):31–54
Keshavarz MH, Shirazi Z, Sheikhabadi PK (2021) Risk assessment of organic aromatic compounds to Tetrahymena pyriformis in environmental protection by a simple QSAR model. Process Saf Environ Prot 150:137–147. https://doi.org/10.1016/j.psep.2021.04.011
Khan K, Benfenati E, Roy K (2019) Consensus QSAR modeling of toxicity of pharmaceuticals to different aquatic organisms: ranking and prioritization of the DrugBank database compounds. Ecotoxicol Environ Saf 168:287–297. https://doi.org/10.1016/j.ecoenv.2018.10.060
Kumar P, Kumar A, Singh D (2021) CORAL: Development of a hybrid descriptor based QSTR model to predict the toxicity of dioxins and dioxin-like compounds with correlation intensity index and consensus modelling. Environ Toxicol Pharmacol 93:103893. https://doi.org/10.1016/j.etap.2022.103893
Li JJ, Wang XH, Wang Y, Wen Y, Qin WC, Su LM, Zhao YH (2015) Discrimination of excess toxicity from narcotic effect: influence of species sensitivity and bioconcentration on the classification of modes of action. Chemosphere 120:660–673. https://doi.org/10.1016/j.chemosphere.2014.10.013
Lu H, Yang F, Liu W, Yuan H, Jiao Y (2020) A robust model for estimating thermal conductivity of liquid alkyl halides. SAR QSAR Environ Res 31(2):73–85. https://doi.org/10.1080/1062936X.2019.1695225
Masand VH, Zaki MEA, Al-Hussain SA, Ghorbal AB, Akasapu S, Lewaa I, Ghosh A, Jawarkar RD (2021) Identification of concealed structural alerts using QSTR modeling for Pseudokirchneriella subcapitata. Aquat Toxicol 239:105962. https://doi.org/10.1016/j.aquatox.2021.105962
Mit C, Bado-Nilles A, Daniele G, Giroud B, Vulliet E, Beaudouin R (2022) The toxicokinetics of bisphenol A and its metabolites in fish elucidated by a PBTK model. Aquat Toxicol 247:106174. https://doi.org/10.1016/j.aquatox.2022.106174
Mo L-Y, Yuan B-K, Zhu J, Qin L-T, Dai J-F (2022) QSAR models for predicting additive and synergistic toxicities of binary pesticide mixtures on Scenedesmus obliquus. Chin J Struct Chem 41(3):2203166–2203177
Montes C, Kapelan Z (2021) Predicting non-deposition sediment transport in sewer pipes using Random forest. Water Res 189:116639. https://doi.org/10.1016/j.watres.2020.116639
Oukawa GY, Krecl P, Targino AC (2022) Fine-scale modeling of the urban heat island: A comparison of multiple linear regression and random forest approaches. Sci Total Environ 815:152836. https://doi.org/10.1016/j.scitotenv.2021.152836
Qin WC, Su LM, Zhang XJ, Qin HW, Wen Y, Guo Z, Sun FT, Sheng LX, Zhao YH, Abraham MH (2010) Toxicity of organic pollutants to seven aquatic organisms: effect of polarity and ionization. SAR QSAR Environ Res 21:389–401. https://doi.org/10.1080/1062936X.2010.501143
Roy K, Das RN, Ambure P, Aher RB (2016) Be aware of error measures. Further studies on validation of predictive QSAR models. Chemometr Intell Lab Syst 152:18–33. https://doi.org/10.1016/j.chemolab.2016.01.008
Roy K, Ambur P, Aher RB (2017) How important is to detect systematic error in predictions and understand statistical applicability domain of QSAR models? Chemometr Intell Lab Syst 162:44–54. https://doi.org/10.1016/j.chemolab.2017.01.010
Rudén C, Hansson SO (2010) Registration, evaluation, and authorization of chemicals (REACH) Is but the first step—how far will it take us? Six further steps to improve the European chemicals legislation. Environ Health Perspect 118:6–10. https://doi.org/10.1289/ehp.0901157
Tinkov OV, Grigorev VY, Grigoreva LD (2021) QSAR analysis of the acute toxicity of avermectins towards Tetrahymena pyriformis. SAR QSAR Environ Res 32(7):541–571. https://doi.org/10.1080/1062936X.2021.1932583
Todeschini R, Consonni V, Mauri A, Pavan M (2012) DRAGON software for the calculation of molecular descriptors, revision 6.0 for Windows. Talete s.r.l., Milan. http://www.talete.mi.it/
Tropsha A (2010) Best practices for QSAR model development, validation, and exploitation. Mol Inform 29:476–488. https://doi.org/10.1002/minf.201000061
Wang YN, Chen JW, Li XH, Wang B, Cai XY, Huang LP (2009) Predicting rate constants of hydroxyl radical reactions with organic pollutants: algorithm, validation, applicability domain, and mechanistic interpretation. Atmos Environ 43:1131–1135
Wang C, Wei ZB, Wang LS, Sun P, Wang ZY (2015) Assessment of bromide-based ionic liquid toxicity toward aquatic organisms and QSAR analysis. Ecotox Environ Safe 115:112–118. https://doi.org/10.1016/j.ecoenv.2015.02.012
Wang XH, Yu Y, Huang T, Qin WC, Su LM, Zhao YH (2016a) Comparison of toxicities to Vibrio fischeri and fish based on discrimination of excess toxicity from baseline level. PLoS ONE 11:e0150028. https://doi.org/10.1371/journal.pone.0150028
Wang Y, Yang XH, Wang JY, Cong Y, Mu JL, Jin F (2016b) ADFT-based toxicity QSAR study of aromatic hydrocarbons to Vibrio fischeri: consideration of aqueous freely dissolved concentration. J Hazard Mater 308:149–156. https://doi.org/10.1016/j.jhazmat.2016.01.006
Wu F, Zhang X, Fang Z, Yu X (2023) Support vector machine-based global classification model of the toxicity of organic compounds to Vibrio fischeri. Molecules 28:2703. https://doi.org/10.3390/molecules28062703
Yap CW (2011) PaDEL-descriptor: an open source software to calculate molecular descriptors and fingerprints. J Comput Chem 32(7):1466–1474. https://doi.org/10.1002/jcc.21707
Yu X (2020a) Quantitative structure-toxicity relationships of organic chemicals against Pseudokirchneriella subcapitata. Aquat Toxicol 224:105496. https://doi.org/10.1016/j.aquatox.2020.105496
Yu X (2020b) Prediction of chemical toxicity to Tetrahymena pyriformis with four descriptor models. Ecotoxicol Environ Saf 190:110146. https://doi.org/10.1016/j.ecoenv.2019.110146
Yu X (2021) Support vector machine-based model for toxicity of organic compounds against fish. Regul Toxicol Pharmacol 123:104942. https://doi.org/10.1016/j.yrtph.2021.104942
Yu X, Acree WE Jr (2023) QSPR-based model extrapolation prediction of enthalpy of solvation. J Mol Liq 376:121455. https://doi.org/10.1016/j.molliq.2023.121455
Yu X, Zeng Q (2022) Random forest algorithm-based classification model of pesticide aquatic toxicity to fishes. Aquat Toxicol 251:106265. https://doi.org/10.1016/j.aquatox.2022.106265
Zhang S, Wang N, Su L, Xu X, Li C, Qin W, Zhao Y (2020) MOA-based linear and nonlinear QSAR models for predicting the toxicity of organic chemicals to Vibrio fischeri. Environ Sci Pollut Res 27:9114–9125. https://doi.org/10.1007/s11356-019-06681-y
Zhao YH, Ji GD, Cronin MTD, Dearden JC (1998) QSAR study of the toxicity of benzoic acids to Vibrio fischeri, Daphnia magna and carp. Sci Total Environ 216:205–215. https://doi.org/10.1016/s0048-9697(98)00157-0
Funding
This work was supported by the Open Project Program of Hunan Provincial Key Laboratory of Environmental Catalysis & Waste Regeneration (Hunan Institute of Engineering) (No. 2018KF11) and the Hunan Provincial Natural Science Foundation (Nos. 2020JJ6013, 2021JJ50111).
Author information
Authors and Affiliations
Contributions
XY provided conceptualization, methodology, software, validation, writing—original draft, writing—review and editing. MH and LS performed data curation and validation.
Corresponding authors
Ethics declarations
Conflict of interest
The authors have no relevant financial or non-financial interests to disclose.
Ethical Approval
Xinliang Yu declares that he has not received any research grants or honoraria from any commercial companies. Minghui He declares that she has not received any research grants or honoraria from any commercial companies. Limin Su declares that she has not received any research grants or honoraria from any commercial companies.
Human and Animal Rights
This article does not contain any studies with human or animal subjects.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Yu, X., He, M. & Su, L. Large Dataset-Based Regression Model of Chemical Toxicity to Vibrio fischeri. Arch Environ Contam Toxicol 85, 46–54 (2023). https://doi.org/10.1007/s00244-023-01010-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00244-023-01010-4