In order to use machine learning, researchers must first convert the molecules into a computer-readable form. Many research groups have already tackled this problem, and consequently, there are various ways of performing this task. However, it is difficult to predict which of the available methods is best suited to answer a specific question - for example, to determine whether a chemical compound is harmful to humans. The new algorithm is designed to help find the optimal molecular fingerprint in each case. To do this, the algorithm gradually selects the molecular fingerprints that achieve the best results in the prediction from many randomly generated molecular fingerprints. "Following the example of nature, we use mutations, i.e. random changes to individual components of the fingerprints, or recombine components of two fingerprints," explains doctoral candidate Felix Katzenburg.
Philipp M. Pflüger, Marius Kühnemund, Felix Katzenburg, Herbert Kuchen and Frank Glorius (2024): An evolutionary algorithm for interpretable molecular representations. Chem, DOI: 10.1016/j.chempr.2024.02.004.