Chunyang
Zhou
a,
Yiwei
Song
a,
Xiuyan
Jin
a,
Bei
Li
*bc and
Chunying
Pang
*a
aBiomedical Engineering, School of Life Science and Technology, Changchun University of Science and Technology, Changchun, 130031, China. E-mail: 1281957328@qq.com
bChangchun Institute of Optics, Precision Machinery and Physics, Chinese Academy of Sciences, Changchun, 130031, China. E-mail: beili@ciomp.ac.cn
cUniversity of Chinese Academy of Sciences, Beijing 100049, China
First published on 7th December 2022
The predictability of Watson–Crick base pairing provides unique structural programmability to DNA, facilitating the development and application of biomolecules in biocomputing. However, in DNA-based biocomputing, the scale of operation that can be achieved by an existing reaction system is very limited. How to expand the operation range of a logic circuit and realize the integration and extensibility of circuits is always the key problem to be solved in this field. In this work, by designing a multifunctional DNA–nanostructure-based reaction platform, which can realize an output of up to 2n scalable fluorescence signals, combined with the construction of an input “library” and a modular distribution strategy of output signals, for the first time, we successfully performed the calculation of both square roots and cube roots of consecutive integers within a decimal number of “10” and in each result of the operation, two digits after the decimal point are preserved (). We believe that the design concept presented in this work can help effectively solve the urgent problems of biological computing in terms of computational scaling, integration and scalability, and can open up new horizons for the design of new functional devices and complex computing circuits.
To get to the root of the problem, we try to explore the causes of the three basic components of DNA computing, including the coding of the solution, the processing of information through molecular interactions, and the extraction of the output signal after processing. Based on long-term experimental experience and literature analysis in this field, we summarize the following three reasons limiting the development of large-scale DNA computing. First, coding input DNA for increasing the scale of parallel computing is limited. The amount of DNA involved in encoding and the number of bases per DNA sequence directly affect the encoding efficiency. In the currently used DNA logic gates, there is usually only one DNA sequence encoded in each input channel.17–20 In addition, due to the limitation of synthesis technology, an excessive number of DNA bases lead to a reduction of purity and an increase in errors, which significantly limit the encoding range of DNA involved in the reaction and the scale of parallel computation. Second, the limitation of building a powerful reaction platform adapted to the large number of input signals used in highly parallel computing, as described above. The three basic components of DNA computing are interconnected, so an increase in the coding range of the input signal at the upper level requires a corresponding increase in the function of the reaction platform at the lower level. The third limitation is the specific recognition of large-scale output signals for the extraction of solutions. As the input becomes larger and the response platform becomes more powerful, the corresponding large-scale identifiable output signal is needed to extract the results obtained by the logic operation. However, the scale of the output signal that can be achieved is currently limited, usually no more than 5-bit binary digits.10–16
In order to overcome the limitations mentioned above and realize large-scale complex logic circuits, this work is devoted to the following three aspects of improvement. First of all, a DNA sequence “library” is constructed to support multi-channel signal input, replacing the traditional single-stranded DNA input mode. This improvement greatly increases the encoding capacity of the input DNA and provides a more flexible and large addressable range for downstream reaction platforms. In addition, because of DNA's powerful base-pairing properties and structural stability, the chain number of a DNA library can increase linearly or nonlinearly with an increase in the complexity of the solving problem. Second, in order to satisfy the increasing coding capacity of input DNA, a multi-component, multi-functional and highly stable DNA–nanostructure-based system is constructed as a reaction platform for large-scale complex logic operations. On the basis of our previous research,10,11 the scale of the reaction platform is enlarged in this work, and the range of molecular interactions is further increased. In this reaction system, each DNA–nanostructure monomer can provide up to six coding sites (Fig. S1, ESI†) with high parallelism and scalability. Therefore, such an improved reaction platform can not only be applied to a large-scale signal input and its encoding, but can also be enlarged with an increase in input capacity. Third, modular division of output fluorescence signals is carried out. The aim of this strategy is to overcome the problem that the fluorescence output channel cannot be improved due to the limitation of the types of fluorescence signals modified on DNA strands (generally no more than 10, and the emission peaks overlap, making it difficult to distinguish and extract the signals). Based on this, the strategy successfully extracts up to 2n fluorescence output channels from the reaction solution, and it is scalable.
Based on the above strategy, in this work, depending on the same reaction system, we successfully realize the calculation of both the square roots and cube roots of consecutive integers within a decimal number of “10”, and in each result of the operation, two digits after the decimal point are preserved (). This is by far the most computationally complex logical operation with the largest scale of output channels in DNA-based biological computing.
As shown in Fig. 1, the logical calculation ranges of square roots and cube roots are chosen to be any integer in the range “1 ≤ x ≤ 10” (or “1 ≤ y ≤ 10”), and the result value is reported up to two digits after the decimal point. The truth table begins with the conversion of the digits to be taken square root (decimal number) into a 4-bit binary number, which is used as input signals. The results of the computation reported to two digits after the decimal point (decimal number) are then converted to a 10-bit binary number, which is used as an output signal. Finally, according to the distribution position of the digits before and after the decimal point in the calculation result, the 10-bit output binary numbers are modularly divided into three parts, namely, the calculated integer bit of the decimal fraction (the decimal number of “M” corresponds to two binary numbers of “Y1” and “Y2”), the first digit after the decimal point (decimal number of “a” corresponds to four binary numbers of “Y3”, “Y4”, “Y5” and “Y6”), and the second digit after the decimal point (decimal number of “b” corresponds to four binary numbers of “Y7”, “Y8”, “Y9” and “Y10”) in sequence.
In the next step, relying on the powerful DNA hybridization and toehold-mediated DNA strand displacement (TDSD), a multi-functional reaction platform capable of implementing both square root calculation and cube root calculation needs to be constructed. The reaction platform is similar to the “central processing unit” of a traditional computer, which can read input instructions and perform certain operations on them (DNA hybridization or chain displacement reaction), and then release the signal as output for subsequent analysis. For complex biological computations, including the computing system in this work, the output scale of which is up to 10-bit binary digits, usually requires 10 fluorescent dyes with different emission peaks as output signal markers for detection and recognition. However, limited by the current development of science and technology, it is difficult to find 10 kinds of fluorescent output dyes that meet the conditions, which is also one of the difficulties faced in biocomputing in the expansion of the computing scale.
In order to overcome this problem, we construct a set of ten DNA–nanostructure-based monomers (P1–P10) as the reaction platform for this work, and evenly distributed these ten monomers in five tubes in turn, so that each pair of two monomers modified with different fluorescence signals (FAM and HEX) is divided into a subset, as shown in Fig. 2(a). Each monomer is composed of three DNA single strands, and two of the strands are modified with fluorescence groups and quenching groups, which hybridize with the third single strand. Based on this, the corresponding ten output signals are spatially isolated into five independent subsets too. Combined with the loading reaction platform of a 96-well plate and signal acquisition of a microplate reader, the multi-channel, high-efficiency, and distinguishable detection of the ten output signals labeled with only two fluorescent dyes is realized. The reaction system shows good scalability, which can be further expanded by increasing the channel of the reaction base and the corresponding encoded DNA input into the library. Also, the complexity of calculation can be overcome by directly increasing the number of reaction monomers and centrifuge the tubes arranged in parallel, and finally, an output scale of logic operation up to 2n can be realized. Although in this work we have accomplished an operation output scale of n = 5, in the subsequent research, we will continue to study in depth, aiming at achieving the complex logic operations with larger output.
Fig. 2 (a) Description of the construction of the reaction platform, which can be used to calculate both square root and cube root logic operations. The reaction platform is a combination of ten DNA-nanostructure-based monomers (P1–P10), which are successively separated into five tubes periodically with two different fluorescence (FAM and HEX). (b) Construction of an input DNA library for the logical operation of square roots. (c) Reaction mechanism of P1–P10 monomers in the platform. The fluorescence signal can be enhanced by hybridization with any of the active sites on the monomer. (d) Changes in the fluorescence intensity of ten monomers before and after adding the input sequence that can react with them. (e) Structural integrity of the synthetic DNA–nanostructure monomer is further demonstrated by a PAGE experiment. Taking the formation of P1 monomers as an example, P-1, Q-1, and F-1 are single-stranded DNA respectively, and their band positions are shown in Fig. 2(e). When they were mixed together and reacted for 1 hour, a band at P1 position was formed, indicating that the three single-stranded DNA had undergone sufficient hybridization reaction. |
The reaction mechanism of P1–P10 monomers in the reaction platform is shown in Fig. 2(c). The optimized concentrations of three single-stranded DNAs that consist of each DNA–nanostructure-based monomer has been discussed in Fig. S2 (ESI†). By editing the chain replacement hybridization reaction between the input and the reaction platform, the quenched fluorescence signal on the reaction platform can be effectively turned on. As shown in Fig. 2(d), changes in fluorescence intensity of ten monomers before and after adding the input sequence that can react with them are respectively shown, which proves that the reaction platform performs very well and has a very low background signal. In addition, we further prove that when ten monomers are divided into five subsets, each subset can still maintain an ideal fluorescence switching effect and a low background signal (Fig. S3, ESI†). As shown in Fig. 2(e), we further demonstrated the structural integrity of the synthetic DNA–nanostructure monomer by a PAGE experiment. Due to the short sequence of Q-DNA, the bands cannot be reflected in the PAGE.
The design rules of the reaction platform are closely related to the construction of input sequences. In order to increase the encoding capacity of the input DNA and provide a more flexible and large addressable range for the downstream reaction platform, two DNA sequence “libraries” are constructed for the operation of square root and cube root calculation in this work, as shown in Fig. 2(b) and Fig. S4 (ESI†). The construction of one DNA “library” used to calculate square roots is shown in Fig. 2(b), which includes four input sets (A, B, C, and D). According to the coding rules of the reaction system, each set contains a different number of subsets. In library-A, there are eight single strands of DNA serving as inputs, and two hairpin sequences are included. In library-B, up to 10 single strands of DNA serve as inputs. In library-C and library-D, a total of 6 and 8 DNA single strands serve as inputs, respectively. The red portion of each input sequence is the active toehold-site for TDSD reaction. The design of another input DNA “library” for cube root logical calculation is shown in Fig. S4 (ESI†).
Based on the successful construction of the reaction platform and its input library, in the next step, we need to determine the logical relationship between each subset of the input (or each single-stranded input DNA) and platform monomers. In this work, the square root (or cube root) logic operation of 4-bit-input and 10-bit-output is performed based on three basic logic gates, namely,the “OR” gate, “AND” gate, and “NAND” gate, as shown in Fig. 3(a). The input sequences involved in the operation of the three gates are defined as single-control switch (sc-SW), double-control switch (dc-SW), and blocking switch (bl-SW), respectively, as shown in Fig. 3(b). In the “OR” gate, based on the corresponding truth table, the coding relationship between the input DNA and reaction platform is as follows: any input sequence known as sc-SW can undergo TDSD reaction with the reaction platform, resulting in the recovery of the fluorescence signal (from low fluorescence intensity of “0” to high fluorescence intensity of “1”). In the “AND” gate, the logical relationship between the input and platform is as follows: only when two input DNAs exist at the same time can the fluorescence signal of the reaction platform be stimulated to restore it to the high level of “1”, and the two coexisting inputs involved in “AND” gate operation are called dc-SW. In the “NAND” gate, which is a series of the “AND” gate and “NOT” gate, the logical relationship between its corresponding inputs and the platform is that only when two inputs exist at the same time can the fluorescence signal of the reaction platform be suppressed, otherwise, the fluorescence signal output will be high (“1”). In this case, the two existing inputs involved in the “NAND” gate operation are called bl-SW. Based on this, similar to traditional electronic circuits, the logic circuit diagrams composed of DNA inputs, DNA-based reaction platform and the logical relationship between them are shown in Fig. 3(c) and Fig. S5 (ESI†), which are used to realize the calculation of square roots and cube roots. Different from the traditional circuit, the DNA-based logic circuit in this work uses fluorescence signals as an output indicator.
Next, we systematically describe the successful implementation of square root logic operation based on the response of each subset in the input “library” and the reaction platform, as shown in Fig. 4. Due to a large amount of computation and the similarity of the reaction mechanism, we select the calculation of “”, “”, “” and “” as typical examples to illustrate. In this work, to run for the calculation of any object, each of the input sequence is only involved in the following three conditions of reaction, namely, as a sc-SW directly reacting with the reaction platform to light it up, a dc-SW together with other sequence to “light up” the platform, and a bl-SW to ensure that an input sequence can hinder the active site of the sc-SW, which prevents it from “lighting up” the platform. First, in order to operate the calculation of “”, only input-A set is encoded as participating in the operation reaction according to the truth table, that is, its subset-input-A1 performs TDSD reaction with the P-2 nanoswitch in the reaction platform, restoring the quenched HEX fluorescence signal modified on P-2 (i.e., from output binary “0” to “1”). Other nanoswitches in the reaction platform do not participate in the calculation, and the fluorescence signal remains binary “0”. Second, in order to operate the calculation of “”, the input-A set (subsets A1, A3, A4, A6) and input-B set (subsets B1, B2, B4, B5, B8, and B10) are encoded as participating in the operation reaction. In this operation, the recovery of fluorescence signals (P-2, P-4, P-5, P-6, P-9, and P-10) depends on two switch modes, namely, sc-SW and dc-SW. In the sc-SW, any input DNA can undergo TDSD reaction with the corresponding platform and restore the fluorescence intensity of “1”. For example, input-A1 or B1 can react with P-2 and result in an enhanced HEX fluorescence signal. In the dc-SW, only when all the input DNAs are added can the TDSD reaction take place with the corresponding nanoswitch platform and the fluorescence intensity is restored to “1”. For example, only when input-A3 and B5 are added simultaneously, TDSD reaction can occur with P-5, resulting in an enhanced FAM fluorescence signal.
Third, in order to operate the calculation of “”, the input-A set (subsets A1, A3, A4, A5, and A6), input-B set (subsets B1, B2, B4, B5, B6, B8, and B10) and input-C set (subsets C1, C3, C4, C5, and C6) are encoded as participating in the operation reaction. In this calculation, the recovery of fluorescence signals (P-1, P-4, P-5, P-8, and P-10) depends on three switching modes, namely, sc-SW, dc-SW and bl-SW, in which the sc-SW and dc-SW modes are committed to enhancing the fluorescence intensity from “0” to “1”, as described above. However, the bl-SW mode in this operation works to prevent the fluorescence signal from becoming stronger (detailed description is shown in Fig. S6, ESI†). For example, the added input-C6 can preferentially hybridize with input-A4, blocking its active site that can open the hairpin structure of B4, which can “light up” the HEX fluorescence of P-6. Finally, in order to operate the calculation of “”, the input-B set (subsets B1, B2, B3, B4, B7, B9, and B10) and input-D set (subsets D1, D2, D3, D4, D5, D6, D7, and D8) are encoded as participating in the operation. In this calculation, the recovery of fluorescence signals (P-1, P-2, P-6, P-8, and P-9) depends on three switching modes, namely, sc-SW, dc-SW and bl-SW. The detailed working mechanism of bl-SW in this operation is shown in Fig. S6 (ESI†). All computed output signals are analysed in sequence by modular segmentation, testing, and analysis. Based on this reaction mechanism, in this work, we successfully implement the square root and cube root logic operation of decimal integers up to 10, which is almost the most complex logical calculation with the largest scale of fluorescence outputs based on DNA computation so far.
As shown in Fig. 5 and Fig. S7 (ESI†), the fluorescence intensity values output by the system was normalized and the threshold was defined to confirm the successful operation of the 10-bit output square root and cube root logic circuit. Thresholding is actually the digital abstraction of the output signal, namely, binarization operation.10 It is the basic principle of digital logic in electronics—by pushing essentially analog signals toward ideal ON or OFF values, which is the binary number of “0” or “1”.27 Similar to the approach applied in electronic circuits, in this work, the output threshold value is defined as “1” when the normalized fluorescence intensities are higher than 0.5. On the contrary, the output threshold value can be defined as “0” when the normalized fluorescence intensities are lower than 0.5. Due to thresholding, the problem of signal interference caused by the presence of multiple input signals can be effectively avoided to ensure the ideal OFF and ON states of output signals, and maintain the digital abstraction.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d2nh00525e |
This journal is © The Royal Society of Chemistry 2023 |