journal
https://read.qxmd.com/read/38627866/meta-learning-based-inductive-logistic-matrix-completion-for-prediction-of-kinase-inhibitors
#1
JOURNAL ARTICLE
Ming Du, XingRan Xie, Jing Luo, Jin Li
Protein kinases become an important source of potential drug targets. Developing new, efficient, and safe small-molecule kinase inhibitors has become an important topic in the field of drug research and development. In contrast with traditional wet experiments which are time-consuming and expensive, machine learning-based approaches for predicting small molecule inhibitors for protein kinases are time-saving and cost-effective, which are highly desired for us. However, the issue of sample scarcity (known active and inactive compounds are usually limited for most kinases) poses a challenge to the research and development of machine learning-based kinase inhibitors' active prediction methods...
April 16, 2024: Journal of Cheminformatics
https://read.qxmd.com/read/38627862/classification-of-substances-by-health-hazard-using-deep-neural-networks-and-molecular-electron-densities
#2
JOURNAL ARTICLE
Satnam Singh, Gina Zeh, Jessica Freiherr, Thilo Bauer, Isik Türkmen, Andreas T Grasskamp
In this paper we present a method that allows leveraging 3D electron density information to train a deep neural network pipeline to segment regions of high, medium and low electronegativity and classify substances as health hazardous or non-hazardous. We show that this can be used for use-cases such as cosmetics and food products. For this purpose, we first generate 3D electron density cubes using semiempirical molecular calculations for a custom European Chemicals Agency (ECHA) subset consisting of substances labelled as hazardous and non-hazardous for cosmetic usage...
April 16, 2024: Journal of Cheminformatics
https://read.qxmd.com/read/38622746/zombie-cheminformatics-extraction-and-conversion-of-wiswesser-line-notation-wln-from-chemical-documents
#3
JOURNAL ARTICLE
Michael Blakey, Samantha Pearman-Kanza, Jeremy G Frey
PURPOSE: Wiswesser Line Notation (WLN) is a old line notation for encoding chemical compounds for storage and processing by computers. Whilst the notation itself has long since been surpassed by SMILES and InChI, distribution of WLN during its active years was extensive. In the context of modernising chemical data, we present a comprehensive WLN parser developed using the OpenBabel toolkit, capable of translating WLN strings into various formats supported by the library. Furthermore, we have devised a specialised Finite State Machine l, constructed from the rules of WLN, enabling the recognition and extraction of chemical strings out of large bodies of text...
April 15, 2024: Journal of Cheminformatics
https://read.qxmd.com/read/38622663/permutedds-a-permutable-feature-fusion-network-for-drug-drug-synergy-prediction
#4
JOURNAL ARTICLE
Xinwei Zhao, Junqing Xu, Youyuan Shui, Mengdie Xu, Jie Hu, Xiaoyan Liu, Kai Che, Junjie Wang, Yun Liu
MOTIVATION: Drug combination therapies have shown promise in clinical cancer treatments. However, it is hard to experimentally identify all drug combinations for synergistic interaction even with high-throughput screening due to the vast space of potential combinations. Although a number of computational methods for drug synergy prediction have proven successful in narrowing down this space, fusing drug pairs and cell line features effectively still lacks study, hindering current algorithms from understanding the complex interaction between drugs and cell lines...
April 15, 2024: Journal of Cheminformatics
https://read.qxmd.com/read/38622648/mind-your-prevalence
#5
JOURNAL ARTICLE
Sébastien J J Guesné, Thierry Hanser, Stéphane Werner, Samuel Boobier, Shaylyn Scott
Multiple metrics are used when assessing and validating the performance of quantitative structure-activity relationship (QSAR) models. In the case of binary classification, balanced accuracy is a metric to assess the global performance of such models. In contrast to accuracy, balanced accuracy does not depend on the respective prevalence of the two categories in the test set that is used to validate a QSAR classifier. As such, balanced accuracy is used to overcome the effect of imbalanced test sets on the model's perceived accuracy...
April 15, 2024: Journal of Cheminformatics
https://read.qxmd.com/read/38582911/comprehensive-machine-learning-boosts-structure-based-virtual-screening-for-parp1-inhibitors
#6
JOURNAL ARTICLE
Klaudia Caba, Viet-Khoa Tran-Nguyen, Taufiq Rahman, Pedro J Ballester
Poly ADP-ribose polymerase 1 (PARP1) is an attractive therapeutic target for cancer treatment. Machine-learning scoring functions constitute a promising approach to discovering novel PARP1 inhibitors. Cutting-edge PARP1-specific machine-learning scoring functions were investigated using semi-synthetic training data from docking activity-labelled molecules: known PARP1 inhibitors, hard-to-discriminate decoys property-matched to them with generative graph neural networks and confirmed inactives. We further made test sets harder by including only molecules dissimilar to those in the training set...
April 7, 2024: Journal of Cheminformatics
https://read.qxmd.com/read/38576047/using-test-time-augmentation-to-investigate-explainable-ai-inconsistencies-between-method-model-and-human-intuition
#7
JOURNAL ARTICLE
Peter B R Hartog, Fabian Krüger, Samuel Genheden, Igor V Tetko
Stakeholders of machine learning models desire explainable artificial intelligence (XAI) to produce human-understandable and consistent interpretations. In computational toxicity, augmentation of text-based molecular representations has been used successfully for transfer learning on downstream tasks. Augmentations of molecular representations can also be used at inference to compare differences between multiple representations of the same ground-truth. In this study, we investigate the robustness of eight XAI methods using test-time augmentation for a molecular-representation model in the field of computational toxicity prediction...
April 4, 2024: Journal of Cheminformatics
https://read.qxmd.com/read/38556873/a-general-model-for-predicting-enzyme-functions-based-on-enzymatic-reactions
#8
JOURNAL ARTICLE
Wenjia Qian, Xiaorui Wang, Yu Kang, Peichen Pan, Tingjun Hou, Chang-Yu Hsieh
Accurate prediction of the enzyme comission (EC) numbers for chemical reactions is essential for the understanding and manipulation of enzyme functions, biocatalytic processes and biosynthetic planning. A number of machine leanring (ML)-based models have been developed to classify enzymatic reactions, showing great advantages over costly and long-winded experimental verifications. However, the prediction accuracy for most available models trained on the records of chemical reactions without specifying the enzymatic catalysts is rather limited...
March 31, 2024: Journal of Cheminformatics
https://read.qxmd.com/read/38553720/rxn-insight-fast-chemical-reaction-analysis-using-bond-electron-matrices
#9
JOURNAL ARTICLE
Maarten R Dobbelaere, István Lengyel, Christian V Stevens, Kevin M Van Geem
The challenge of devising pathways for organic synthesis remains a central issue in the field of medicinal chemistry. Over the span of six decades, computer-aided synthesis planning has given rise to a plethora of potent tools for formulating synthetic routes. Nevertheless, a significant expert task still looms: determining the appropriate solvent, catalyst, and reagents when provided with a set of reactants to achieve and optimize the desired product for a specific step in the synthesis process. Typically, chemists identify key functional groups and rings that exert crucial influences at the reaction center, classify reactions into categories, and may assign them names...
March 29, 2024: Journal of Cheminformatics
https://read.qxmd.com/read/38549134/dget-an-open-source-deuteration-calculator-for-mass-spectrometry-data
#10
JOURNAL ARTICLE
Thomas E Lockwood, Alexander Angeloski
DGet! is an open-source analysis package written in Python for calculating the degree of deuterium enrichment in isotopically labelled molecules using mass spectrometric data. The nuclear properties of deuterium make it a valuable tracer in metabolic studies and an excellent contrast agent in nuclear spectroscopies. Determination of molecular deuteration levels is typically performed using mass spectrometry, however software options to perform these calculations are scarce. The in-house scripts and spreadsheets currently used rarely account for isotopic interferences from 13 C or multi-isotopic elements that impact deuteration calculations...
March 28, 2024: Journal of Cheminformatics
https://read.qxmd.com/read/38528548/effectiveness-of-molecular-fingerprints-for-exploring-the-chemical-space-of-natural-products
#11
JOURNAL ARTICLE
Davide Boldini, Davide Ballabio, Viviana Consonni, Roberto Todeschini, Francesca Grisoni, Stephan A Sieber
Natural products are a diverse class of compounds with promising biological properties, such as high potency and excellent selectivity. However, they have different structural motifs than typical drug-like compounds, e.g., a wider range of molecular weight, multiple stereocenters and higher fraction of sp3 -hybridized carbons. This makes the encoding of natural products via molecular fingerprints difficult, thus restricting their use in cheminformatics studies. To tackle this issue, we explored over 30 years of research to systematically evaluate which molecular fingerprint provides the best performance on the natural product chemical space...
March 25, 2024: Journal of Cheminformatics
https://read.qxmd.com/read/38520014/a-numerical-compass-for-experiment-design-in-chemical-kinetics-and-molecular-property-estimation
#12
JOURNAL ARTICLE
Matteo Krüger, Ashmi Mishra, Peter Spichtinger, Ulrich Pöschl, Thomas Berkemeier
Kinetic process models are widely applied in science and engineering, including atmospheric, physiological and technical chemistry, reactor design, or process optimization. These models rely on numerous kinetic parameters such as reaction rate, diffusion or partitioning coefficients. Determining these properties by experiments can be challenging, especially for multiphase systems, and researchers often face the task of intuitively selecting experimental conditions to obtain insightful results. We developed a numerical compass (NC) method that integrates computational models, global optimization, ensemble methods, and machine learning to identify experimental conditions with the greatest potential to constrain model parameters...
March 22, 2024: Journal of Cheminformatics
https://read.qxmd.com/read/38515171/pocket-crafter-a-3d-generative-modeling-based-workflow-for-the-rapid-generation-of-hit-molecules-in-drug-discovery
#13
JOURNAL ARTICLE
Lingling Shen, Jian Fang, Lulu Liu, Fei Yang, Jeremy L Jenkins, Peter S Kutchukian, He Wang
We present a user-friendly molecular generative pipeline called Pocket Crafter, specifically designed to facilitate hit finding activity in the drug discovery process. This workflow utilized a three-dimensional (3D) generative modeling method Pocket2Mol, for the de novo design of molecules in spatial perspective for the targeted protein structures, followed by filters for chemical-physical properties and drug-likeness, structure-activity relationship analysis, and clustering to generate top virtual hit scaffolds...
March 21, 2024: Journal of Cheminformatics
https://read.qxmd.com/read/38486289/advancing-material-property-prediction-using-physics-informed-machine-learning-models-for-viscosity
#14
JOURNAL ARTICLE
Alex K Chew, Matthew Sender, Zachary Kaplan, Anand Chandrasekaran, Jackson Chief Elk, Andrea R Browning, H Shaun Kwak, Mathew D Halls, Mohammad Atif Faiz Afzal
In materials science, accurately computing properties like viscosity, melting point, and glass transition temperatures solely through physics-based models is challenging. Data-driven machine learning (ML) also poses challenges in constructing ML models, especially in the material science domain where data is limited. To address this, we integrate physics-informed descriptors from molecular dynamics (MD) simulations to enhance the accuracy and interpretability of ML models. Our current study focuses on accurately predicting viscosity in liquid systems using MD descriptors...
March 14, 2024: Journal of Cheminformatics
https://read.qxmd.com/read/38486231/learnt-representations-of-proteins-can-be-used-for-accurate-prediction-of-small-molecule-binding-sites-on-experimentally-determined-and-predicted-protein-structures
#15
JOURNAL ARTICLE
Anna Carbery, Martin Buttenschoen, Rachael Skyner, Frank von Delft, Charlotte M Deane
Protein-ligand binding site prediction is a useful tool for understanding the functional behaviour and potential drug-target interactions of a novel protein of interest. However, most binding site prediction methods are tested by providing crystallised ligand-bound (holo) structures as input. This testing regime is insufficient to understand the performance on novel protein targets where experimental structures are not available. An alternative option is to provide computationally predicted protein structures, but this is not commonly tested...
March 14, 2024: Journal of Cheminformatics
https://read.qxmd.com/read/38481269/a-new-workflow-for-the-effective-curation-of-membrane-permeability-data-from-open-adme-information
#16
JOURNAL ARTICLE
Tsuyoshi Esaki, Tomoki Yonezawa, Kazuyoshi Ikeda
Membrane permeability is an in vitro parameter that represents the apparent permeability (Papp) of a compound, and is a key absorption, distribution, metabolism, and excretion parameter in drug development. Although the Caco-2 cell lines are the most used cell lines to measure Papp, other cell lines, such as the Madin-Darby Canine Kidney (MDCK), LLC-Pig Kidney 1 (LLC-PK1), and Ralph Russ Canine Kidney (RRCK) cell lines, can also be used to estimate Papp. Therefore, constructing in silico models for Papp estimation using the MDCK, LLC-PK1, and RRCK cell lines requires collecting extensive amounts of in vitro Papp data...
March 14, 2024: Journal of Cheminformatics
https://read.qxmd.com/read/38475916/automated-molecular-structure-segmentation-from-documents-using-chemsam
#17
JOURNAL ARTICLE
Bowen Tang, Zhangming Niu, Xiaofeng Wang, Junjie Huang, Chao Ma, Jing Peng, Yinghui Jiang, Ruiquan Ge, Hongyu Hu, Luhao Lin, Guang Yang
Chemical structure segmentation constitutes a pivotal task in cheminformatics, involving the extraction and abstraction of structural information of chemical compounds from text-based sources, including patents and scientific articles. This study introduces a deep learning approach to chemical structure segmentation, employing a Vision Transformer (ViT) to discern the structural patterns of chemical compounds from their graphical representations. The Chemistry-Segment Anything Model (ChemSAM) achieves state-of-the-art results on publicly available benchmark datasets and real-world tasks, underscoring its effectiveness in accurately segmenting chemical structures from text-based sources...
March 12, 2024: Journal of Cheminformatics
https://read.qxmd.com/read/38475907/systematic-analysis-aggregation-and-visualisation-of-interaction-fingerprints-for-molecular-dynamics-simulation-data
#18
JOURNAL ARTICLE
Sabrina Jaeger-Honz, Karsten Klein, Falk Schreiber
Computational methods such as molecular docking or molecular dynamics (MD) simulations have been developed to simulate and explore the interactions between biomolecules. However, the interactions obtained using these methods are difficult to analyse and evaluate. Interaction fingerprints (IFPs) have been proposed to derive interactions from static 3D coordinates and transform them into 1D bit vectors. More recently, the concept has been applied to derive IFPs from MD simulations, which adds a layer of complexity by adding the temporal motion and dynamics of a system...
March 12, 2024: Journal of Cheminformatics
https://read.qxmd.com/read/38449058/prediction-of-compound-target-interaction-using-several-artificial-intelligence-algorithms-and-comparison-with-a-consensus-based-strategy
#19
JOURNAL ARTICLE
Karina Jimenes-Vargas, Alejandro Pazos, Cristian R Munteanu, Yunierkis Perez-Castillo, Eduardo Tejera
For understanding a chemical compound's mechanism of action and its side effects, as well as for drug discovery, it is crucial to predict its possible protein targets. This study examines 15 developed target-centric models (TCM) employing different molecular descriptions and machine learning algorithms. They were contrasted with 17 third-party models implemented as web tools (WTCM). In both sets of models, consensus strategies were implemented as potential improvement over individual predictions. The findings indicate that TCM reach f1-score values greater than 0...
March 7, 2024: Journal of Cheminformatics
https://read.qxmd.com/read/38444032/small-molecule-autoencoders-architecture-engineering-to-optimize-latent-space-utility-and-sustainability
#20
JOURNAL ARTICLE
Marie Oestreich, Iva Ewert, Matthias Becker
Autoencoders are frequently used to embed molecules for training of downstream deep learning models. However, evaluation of the chemical information quality in the latent spaces is lacking and the model architectures are often arbitrarily chosen. Unoptimized architectures may not only negatively affect latent space quality but also increase energy consumption during training, making the models unsustainable. We conducted systematic experiments to better understand how the autoencoder architecture affects the reconstruction and latent space quality and how it can be optimized towards the encoding task as well as energy consumption...
March 5, 2024: Journal of Cheminformatics
journal
journal
42601
1
2
Fetch more papers »
Fetching more papers... Fetching...
Remove bar
Read by QxMD icon Read
×

Save your favorite articles in one place with a free QxMD account.

×

Search Tips

Use Boolean operators: AND/OR

diabetic AND foot
diabetes OR diabetic

Exclude a word using the 'minus' sign

Virchow -triad

Use Parentheses

water AND (cup OR glass)

Add an asterisk (*) at end of a word to include word stems

Neuro* will search for Neurology, Neuroscientist, Neurological, and so on

Use quotes to search for an exact phrase

"primary prevention of cancer"
(heart or cardiac or cardio*) AND arrest -"American Heart Association"

We want to hear from doctors like you!

Take a second to answer a survey question.