Skip to main navigation Skip to search Skip to main content

Machine learning enhances prediction of plants as potential sources of antimalarials

  • Adam Richard-Bollans
  • , Conal Aitken
  • , Alexandre Antonelli
  • , Cássia Bitencourt
  • , David Goyder
  • , Eve Lucas
  • , Ian Ondo
  • , Oscar A. Pérez-Escobar
  • , Samuel Pironon
  • , James E. Richardson
  • , David Russell
  • , Daniele Silvestro
  • , Colin W. Wright
  • , Melanie Jayne R. Howes
  • Royal Botanic Gardens, Kew
  • EaStCHEM St Andrews
  • University of Gothenburg
  • University of Oxford
  • UN Environment Programme World Conservation Monitoring Centre (UNEP-WCMC)
  • Royal Botanic Garden Edinburgh
  • Universidad del Rosario
  • University of Fribourg
  • Swiss Institute of Bioinformatics
  • University of Bradford
  • King's College London

Research output: Contribution to journalArticlepeer-review

Abstract

Plants are a rich source of bioactive compounds and a number of plant-derived antiplasmodial compounds have been developed into pharmaceutical drugs for the prevention and treatment of malaria, a major public health challenge. However, identifying plants with antiplasmodial potential can be time-consuming and costly. One approach for selecting plants to investigate is based on ethnobotanical knowledge which, though having provided some major successes, is restricted to a relatively small group of plant species. Machine learning, incorporating ethnobotanical and plant trait data, provides a promising approach to improve the identification of antiplasmodial plants and accelerate the search for new plant-derived antiplasmodial compounds. In this paper we present a novel dataset on antiplasmodial activity for three flowering plant families – Apocynaceae, Loganiaceae and Rubiaceae (together comprising c. 21,100 species) – and demonstrate the ability of machine learning algorithms to predict the antiplasmodial potential of plant species. We evaluate the predictive capability of a variety of algorithms – Support Vector Machines, Logistic Regression, Gradient Boosted Trees and Bayesian Neural Networks – and compare these to two ethnobotanical selection approaches – based on usage as an antimalarial and general usage as a medicine. We evaluate the approaches using the given data and when the given samples are reweighted to correct for sampling biases. In both evaluation settings each of the machine learning models have a higher precision than the ethnobotanical approaches. In the bias-corrected scenario, the Support Vector classifier performs best – attaining a mean precision of 0.67 compared to the best performing ethnobotanical approach with a mean precision of 0.46. We also use the bias correction method and the Support Vector classifier to estimate the potential of plants to provide novel antiplasmodial compounds. We estimate that 7677 species in Apocynaceae, Loganiaceae and Rubiaceae warrant further investigation and that at least 1300 active antiplasmodial species are highly unlikely to be investigated by conventional approaches. While traditional and Indigenous knowledge remains vital to our understanding of people-plant relationships and an invaluable source of information, these results indicate a vast and relatively untapped source in the search for new plant-derived antiplasmodial compounds.

Original languageEnglish
Article number1173328
JournalFrontiers in Plant Science
Volume14
DOIs
Publication statusPublished - 2023

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • antiplasmodial activity
  • botany
  • ethnobotany
  • ethnopharmacology
  • machine learning
  • malaria
  • sampling bias
  • traditional and indigenous knowledge

Fingerprint

Dive into the research topics of 'Machine learning enhances prediction of plants as potential sources of antimalarials'. Together they form a unique fingerprint.

Cite this