Sequence rules for gold-binding peptides
Abstract
Metal-binding peptides play a central role in bionanotechnology, wherein they are responsible for directing growth and influencing the resulting properties of inorganic nanomaterials. One of the key advantages of using peptides to create nanomaterials is their versatility, wherein subtle changes in the sequence can have a dramatic effect on the structure and properties of the nanomaterial. However, precisely knowing which position and which amino acid should be modified within a given sequence to enhance a specific property can be a daunting challenge owing to combinatorial complexity. In this study, classification based on association rules was performed using 860 gold-binding peptides. Using a minimum support threshold of 0.035 and confidence of 0.9, 30 rules with confidence and lift values greater than 0.9 and 1, respectively, were extracted that can differentiate high-binding from low-binding peptides. The test performance of these rules for categorizing the peptides was found to be satisfactory, as characterized by accuracy = 0.942, F1 = 0.941, MCC = 0.884. What stands out from the extracted rules are the importance of tryptophan and arginine residues in differentiating peptides with high binding affinity from those with low affinity. In addition, the association rules revealed that positions 2 and 4 within a decapeptide are frequently involved in the rules, thus suggesting their importance in influencing peptide binding affinity to AuNPs. Collectively, this study identified sequence rules that may be used to design peptides with high binding affinity.