Theoretical and Natural Science
- The Open Access Proceedings Series for Conferences
Vol. 33, 08 March 2024
* Author to whom correspondence should be addressed.
Alzheimer’s disease (AD) is a neurodegenerative disease with a worldwide prevalence of 760.5 per 100,000 inhabitants. Environment factors and genetic predispositions can influence an individual’s risk of developing AD. Recent research provided insights into the genetic mechanisms of AD, but a comprehensive database of drug-like molecules that can exacerbate or reduce AD risk remains unavailable. Here, we use machine learning to create a similarity map that reveals new putative drugs structurally similar to existing compounds known to be targeting pathways relevant in AD. We trained an autoencoder on a large drug database of over 14,000 drugs, with features derived from several modalities including molecular fingerprints. We then computed similarity scores based on these reduced dimensions. We show that our model is able to identify new compounds structurally similar to existing drugs linked to AD. We conclude that our model holds the potential to elucidate new compounds based on structural similarity and can be used to identify new drugs that affect critical pathways in AD.
Alzheimer’s Disease, drugs, risk factors, machine learning
1. Olazarán J., Carnero-Pardo C., Fortea J., Sánchez-Juan P., García-Ribas G., Viñuela F, Martínez-Lage P, Boada M (2023) Prevalence of treated patients with Alzheimer’s disease: current trends and COVID-19 impact. Alzheimer’s Research & Therapy 15:130.
2. Goedert, M. (1993). Tau protein and the neurofibrillary pathology of Alzheimer’s disease. Trends in neurosciences, 16(11), 460-465.
3. Baas, P. W., & Qiang, L. (2019). Tau: it’s not what you think. Trends in cell biology, 29(6), 452-461.
4. Gallacher, J., Elwood, P., Pickering, J., Bayer, A., Fish, M., & Ben-Shlomo, Y. (2012). Benzodiazepine use and risk of dementia: evidence from the Caerphilly Prospective Study (CaPS). J Epidemiol Community Health, 66(10), 869-873.
5. Landrum G. (2006). RDKit: Open-source cheminformatics. 2006. Google Scholar.https://cir.nii.ac.jp/crid/1370004237630036224 [Accessed October 2, 2023].
6. Morgan H.L. (1965). The Generation of a Unique Machine Description for Chemical Structures-A Technique Developed at Chemical Abstracts Service. J Chem Doc 5:107–113.
7. Jaeger S., Fulle S., Turk S.. (2018). Mol2vec: Unsupervised Machine Learning Approach with Chemical Intuition. J Chem Inf Model 58:27–35.
8. Abdi H., Williams L. J. (2010). Principal component analysis. WIREs Computational Statistics 2:433–459.
9. Niki K. (1989). Layered dynamic auto-associative memory with auto-encoder and feedback. In: International 1989 Joint Conference on Neural Networks, pp 571 vols.2- Available at: https://ieeexplore.ieee.org/abstract/document/118308 [Accessed October 4, 2023].
The datasets used and/or analyzed during the current study will be available from the authors upon reasonable request.
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. Authors who publish this series agree to the following terms:
1. Authors retain copyright and grant the series right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this series.
2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the series's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this series.
3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See Open Access Instruction).