Jiang F.
Zhong
*abc,
Yan
Chen
c,
Joshua S.
Marcus
c,
Axel
Scherer
c,
Stephen R.
Quake
d,
Clive R.
Taylor
b and
Leslie P.
Weiner
a
aDepartment of Neurology, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA. E-mail: jzhong@usc.edu
bDepartment of Pathology, Keck School of Medicine, University of Southern California, Los Angeles, CA 90033, USA
cDepartment of Bioengineering and Electrical Engineering, California Institute of Technology, Pasadena, CA 91125, USA
dDepartment of Bioengineering, Stanford University, Stanford, CA 94305, USA
First published on 2nd November 2007
The gene expression of human embryonic stem cells (hESC) is a critical aspect for understanding the normal and pathological development of human cells and tissues. Current bulk gene expression assays rely on RNA extracted from cell and tissue samples with various degree of cellular heterogeneity. These ‘cell population averaging’ data are difficult to interpret, especially for the purpose of understanding the regulatory relationship of genes in the earliest phases of development and differentiation of individual cells. Here, we report a microfluidic approach that can extract total mRNA from individual single-cells and synthesize cDNA on the same device with high mRNA-to-cDNA efficiency. This feature makes large-scale single-cell gene expression profiling possible. Using this microfluidic device, we measured the absolute numbers of mRNA molecules of three genes (B2M, Nodal and Fzd4) in a single hESC. Our results indicate that gene expression data measured from cDNA of a cell population is not a good representation of the expression levels in individual single cells. Within the G0/G1 phase pluripotent hESC population, some individual cells did not express all of the 3 interrogated genes in detectable levels. Consequently, the relative expression levels, which are broadly used in gene expression studies, are very different between measurements from population cDNA and single-cell cDNA. The results underscore the importance of discrete single-cell analysis, and the advantages of a microfluidic approach in stem cell gene expression studies.
Single-cell gene expression profiling from early embryos has suggested the transient expression of critical regulatory genes, again underscoring the importance of systematical single-cell expression profiling.1,2 Currently, multiple color fluorescent-activated cell sorting (FACS) has a limited utility for isolation of pure stem cell populations due to a lack of discriminating cell markers. Single-cell gene expression profiling studies with laser capture microdissection (LCM),3–12 patch-clamp analysis,2,13,14 as well as in situmRNA amplification13,15 have been reported, but analysis of a large number of cells has proven to be very difficult with these methods. Single-cell whole genome microarray gene expression screening4,5,16 and single-cell cDNA library construction1,17,18 also have been conducted on a limited number of cells. While these studies demonstrate the potential value of single-cell gene expression profiling, they also showed the limitation of these methods in processing a large number of samples. Material loss and low biochemical reaction efficiency (mRNA capture and RT reaction) are other major challenges for single-cell analysis. A single mammalian cell contains 20–40 pg of total RNA,19,20 but only 0.5–1.0 pg of mRNA (105 to106mRNA molecules).21 Therefore, detecting single-cell mRNA is difficult with current methods.
The microfluidic device described here significantly increases the mRNA-to–cDNA processing efficiency ∼5 fold to 54% compared to bulk reactions (∼12%).22,23 With this device, we measured the absolute copy number of three genes in individual single hESCs and compared these data to those obtained from the cDNA of FACS sorted hESCs in the G0/G1 phase. Our results indicate that the hESC colony is a heterogeneous cell population and many single cells do not express all three of the interrogated genes. This result suggests that pluripotent hESC colonies are not homogeneous cell populations, rather they are highly heterogeneous cell populations regulated by different gene networks. Besides stochastic factors of individual cells, heterogeneity of cells with respect to cell cycle and other factors may be a major contributor to observed variations in mammalian single-cell gene expression.
The molds were first exposed to chlorotrimethylsilane (TMCS, Aldrich) vapor for 2 min to promote elastomer release after the baking steps. For the flow layer of the device, a mixture of PDMS (5 parts RTV615A : 1 part RTV615B) was poured onto the flow mold. After degassing, the flow molds were baked for 45 min at 80 °C. For the control layer of the device, a mixture of PDMS (20 parts RTV615A : 1 part RTV615B) was spun on the control mold at 1800 rpm and baked for 30 min at 80 °C. The flow layer was separated from the flow mold and flow channel access holes were then punched. Next, the flow and control layers were aligned and baked for 45 min at 80 °C. The two layer structure was peeled from the control mold with control channel access holes punched, and was mounted to another thin PDMS layer made by spinning 20 : 1 PDMS mixture on a blank wafer. After baking for 3 hours at 80 °C, the three layer structure was bonded to a clean microscope slide and baked overnight at 80 °C.
Fig. 1 The setting of the microfluidic device for single hESC mRNA extraction. A. The system includes a microscope, a computer to control air pressure with pressure regulators, and a heating stage to heat the microfluidic chip to desire temperatures. B. A typical microfluidic chip. C. Merged image of immunofluorescent stained (Oct-3/4) and light microscope images from a pluripotent hESC colony. The hESC colony was labeled with mouse α human Oct-3/4 IgG and PE-conjugated rabbit anti-mouse IgG antibodies. Only cells in the center of the hESC colony expressed Oct-3/4. The intensity of the labeling indicates the Oct-3/4 positive cells expressed Oct-3/4 at different levels. The spontaneously differentiated cells around the colony do not express Oct-3/4. |
MEF were prepared from the embryos of 13–14 day pregnant CF-1 mice (Charles River Labs) and stocks were cryopreserved until required for culture of hESCs. hESCs were consistently observed as large clumps that appeared on the matrigel-coated surface, consistent with the published observations of others. Periodic karyotyping confirmed their human diploid chromosomal character. After injection into severe combined immunodeficient (SCID) mice, these cells produced teratomas which include tissues of ectodermal, mesodermal and endodermal origin. Immunostaining of the hESC colonies for alkaline phosphatase and with antibodies specific for SSEA-4, TRA-1-60, TRA-1-81 or Oct-3/4 confirmed the pluripotency of these cells.
Single-cell lysis, mRNA capture and RT were performed in the same microfluidic device to convert mRNA into cDNA. Dynabeads with oligo (dT)25 (Invitrogen, USA) were used to capture mRNA. Sensiscript RT kit (Qiagen, USA) was used for converting captured mRNA to cDNA. The oligo (dT)25 sequence in Dynabeads serves as both mRNA capture sequence and primer for cDNA synthesis. After oligo (dT)25 beads with attached cDNA were flushed to the collection wells, the wells were cut off from the chip. Centrifugation was used to transfer the beads from individual wells into PCR tubes. These beads with attached cDNA from individual cells were subjected to realtime qPCR for measurement of the number of molecules of interrogated mRNA in IQ5 (Bio-Rad, USA).
Fig. 2 Single-cell mRNA extraction microfluidic device filled with food dye for illustration. All flow channels are filled with yellow food dye, multiplexer control channels are filled with red dye, collection and waste channels are in blue. The inserts show enlargements of four important areas of the chip. After loading cell suspension from the cell input inlet, single-cells are captured in cell lysis module (Insert 1) within the flow channels (blue). The pump valves are green. The separation valve is black. The lysis buffer is yellow. A captured single hESC is labeled with a fluorescent dye (green) and shown in Insert 2. Cell lysis is performed by opening the portion valve and pumping to mix lysis buffer (yellow) with the captured cell. The resulting cell lysate is pushed through oligo-dT bead columns for mRNA capture. Oligo-dT beads are stacked into columns by closing the sieve valve while loading bead suspension. Insert 3 shows six stacked oligo-dT bead columns next to the sieve valve. After washing beads with buffers, RT reaction master mix is flown through the bead columns to synthesize cDNA from the captured mRNA at 40 °C. After RT reaction, beads with attached cDNA are pushed to collection wells (Insert 4) by opening the sieve valve. The beads are recovered by cutting the wells off the chips and centrifuging a flipped-well in a microcentrifuge tube. |
Three sets of primers were used to amplify 905 bp, 982 bp and 890 bp fragments of B2M, Nodal and Fzd4 respectively from human brain cDNA (Invitrogen, USA).The primer sets were: B2M: ggcattcctgaagctgaca (forward) and ccagattaaccacaaccatgc (reverse) ; Nodal: cttcctgagccaacaagagg (forward) and cagactccactgagcccttc (reverse); Fzd4: gggacgtctaaaatcccaca (forward) and ggcagtggagatgaaacaca (reverse)
In order to measure the absolute number of molecules of mRNA with standard curves, the mRNA-to-cDNA efficiency of the processor must be obtained. An artificial Poly-A RNA standard (Affymetrix, USA) was used to compare the input mRNA copy number to the cDNA copy number which was measured by qPCR at the end of the processes. This RNA standard contains known amounts of artificial RNAs with poly-A tails from 4 B. subtilisgenes, lys, phe, thr and dap. The lysgene was used as RNA standard for calculation of mRNA-to-cDNA efficiency. A standard curve was generated with a known amount of lysDNA at 10 fold dilutions. The mRNA capture and RT reaction were carried out at both conventional micro-litre level and at nano-litre scale with the microfluidic processor.
In the conventional reaction, a 50 µL RT reaction was carried out after capturing 4 µL of artificial mRNA (7.3 nM) with 30 µL oligo (dT)25 beads as instructed by the manufacturer. One micro litre of the RT product (corresponding to 2.16 × 108 molecules of lys artificial mRNA) was used for realtime quantitative PCR (qPCR) detection. Based on the standard curve, the final detected number of molecules was 2.5 ± 0.5 × 107 molecules (Table 1). The processing efficiency of conventional micro-litre scale reaction is 12%.
Input LysRNA (molecule number) | Threshold Cycle | Detected cDNA (molecule number) | Efficiency (%) | |
---|---|---|---|---|
Microlitre Reaction (Bench-top) | 2.16 × 108 | 13.22 ± 0.37 (n = 20) | 2.6 ± 0.6 × 107 | 12 |
Nanolitre Reaction (Microfluidic) | 1.84 × 106 | 18.15 ± 0.46 (n = 20) | 1.0 ± 0.3 × 106 | 54 |
To compare the efficiency of conventional and microfluidic approaches, the same artificial RNA sample was introduced into the cell capturing chamber, processed into cDNA and detected with realtime qPCR. Each of the 20 cell-capture chambers in the microfluidic device has a volume of 3.4 nL resulting in 1.84 × 106 molecules of input lys artificial mRNA. The absolute lys cDNA molecule number detected by qPCR with the standard curve is 1.0 + 0.3 × 106. The total processing efficiency was 54% which is approximately 5 times the efficiency of the conventional approach. A series of 10 fold dilutions of the artificial RNA were used and they verified that the processing efficiencies of both approaches do not vary significantly in different concentrations of the input artificial RNA. The standard deviations of both bench-top and microfluidic reactions were also calculated and are similar (Table 1). The small standard deviation of the 20 reactions in the same device indicates consistent processing efficiency among the 20 reaction chambers.
Based on the mRNA-to-cDNA efficiency calculated above, the numbers of three mRNAs in a single hESC were measured. Standard curves were constructed with known amounts of full length B2M, Fzd 4 and Nodal cDNA. The three standard curves overlap each other, indicating similar qPCR efficiency for each set of the primers (Fig. 3A). These standard curves have a range of 2 to 2 × 106 molecules that allow the direct comparison of the expression levels of these three genes from their amplification curves (Fig. 3B). The minimal detectable level is 4 molecules (2/54%) for our microfluidic processor, and 17 molecules (2/12%) for bulk assays with these standard curves.
Fig. 3 Measuring the absolute molecule number of the three genes in a single hESC with multiplex quantitative PCR. A. Standard curves are generated with known amounts of plasmid DNA containing the full sequence of the genes. The curves cover from 2 to 2 × 106 copies of the respective genes. With our primer design, all the curves overlap each other, and indicate similar PCR efficiency. B. The multiplex quantitative PCR amplification curves obtained from cDNA of hESC colonies are plotted with curves obtained from cDNA of a representative hESC. Because the standard curves of the three genes are very similar, these amplification curves show that the expression ratio of B2M and Nodal is similar in population cDNA and this single-cell cDNA. However, the expression of Fzd4 and Nodal is very similar in this particular single hESC, but very different in the hESC population. Unlike this single hESC, some single hESC do not express all three genes. This result suggests the heterogeneity of hESC and underscores the importance of single-cell analysis. |
The pluripotent hESC colonies are often considered and treated as a homogenous population. However, our data indicate that this is not the case. As Fig. 3B shows, the absolute expression levels (mRNA molecule number) measured from cDNA equivalents to 1000 cells are not even close to 1000 fold of the levels measured from a typical single-cell. With cDNA equivalent to 1000 G1/G0 phase hESC, the absolute molecule numbers of B2M, Nodal and Fzd4 are 6,034 ± 660, 402 ± 55 and 69 ± 19 respectively. However, the distributions of the molecule numbers of these 3 genes are diverse in single hESCs (Fig. 4). Among the 54 interrogated single hESCs, 14.8%, 37% and 37% of hESC did not express detectable levels (less than 4 copies) of B2M, Fzd4 and Nodal respectively. The expression of B2M ranged from 4 to 76 copies, with the majority between 30 and 50 copies. The expression of Fzd4 and Nodal ranged from 6 to 548 copies, and 22 to 504 copies respectively. All cells expressed at least one of the 3 genes.
Fig. 4 Expression of B2M, Nodal and Fzd4 in single hESCs. A. A single-hESC expresses both Nodal and Fzd4, but does not express B2M in a detectable level. B. A single hESC expresses only Nodal. The mRNAs of B2M and Fzd4 are undetectable. C, D and E shows the distribution of B2M, Fzd4 and Nodal in single hESCs. Among the 54 interrogated single hESCs, 14.8%, 37% and 37% of cells (indicated by black strait pattern) do not express detectable levels (less than 4 copies) of B2M, Fzd4 and Nodal respectively. The distribution pattern of expressions is narrower for B2M compared to the other 2 genes. The discontinuous distribution of Nodal and Fzd4 suggest a high heterogeneity of the 54 cells. |
Many gene expression studies, such as microarray or quantitative PCR, are based on the relative expression level of genes with normalization to a house keeping gene which is assumed to have a similar expression in all samples. Our data show that even the relative expression level of the three genes are very different between measurements derived from cDNA of a cell population and separately from individual cells. In the measurement from population cDNA, the expression level of Fzd4 is approximately 8 fold lower than the expression of Nodal after being normalized to B2M. However, in a representative single hESC, the expression levels of Nodal and Fzd4 are very similar after normalization with B2M (Fig. 4).
Due to the nanolitre scale nature of the microfluidic device, the input number of mRNA in conventional reaction and microfluidic reaction is ∼100 fold different when same RNA standard is used. To rule out the possibility of RNA concentration in processing efficiency, serial 10-fold dilutions of the RNA standard were used for the same experiments. The mRNA-to-cDNA efficiencies are not significantly different for both bench-top and microfluidic reactions. Therefore, the high mRNA-to-cDNA efficiency is a desirable characteristic of microfluidic devices. With this 54% processing efficiency, the microfluidic device described potentially can detect 2 mRNA copies of interrogated genes.
In our device, there are 20 individual single-cell processing reactors for simultaneously processing. The standard deviation of threshold cycles of the 20 reactors is small (0.37) and similar to those from conventional reactions (∼0.5). These data indicate that the processing efficiencies among the 20 reactors are very similar. Similar results are also obtained in multiple lots of devices. This aspect is critical for comparing results from individual cells intra- and inter-devices.
With this device, we profiled the expression of three genes in hESC. Our data indicated that even for FACS-sorted G1/G0 phase hESC, the expression data from population cDNA can not be used to estimate the gene expression level in individual cells. Therefore, the population averaging effect distorts the expression levels of the three genes in bulk assays conducted with population cDNA from hESC colonies. Because the population averaging effects are not the same for all three genes, the relative expression levels are also distorted. The qPCR with cDNA from 1000 hESC indicates that Nodal is expressed approximately 8 times more than Fzd4 after normalization to B2M. However, in individual cells the expressions of these genes are very diverse. The difference observed in pooled cDNA may be due to the different number of cells expressing the respective genes.
It has been reported that gene expression is highly variable in the single-cell levels.28,29 A speculation is that the high variability is due to experimental variation. In the present study, we showed that experimental variation may not be the main factor. The stochastic gene expression behavior of single-cells (biological noise) has been reported in various studies of prokaryote cells.30–34 Biochemical processes such as transcription, translation, RNA and protein degradation have been thought of as the primary contributors. However, heterogeneity of the mammalian cell populations may be a factor related to the observed expression variations in single-cell analysis of mammalian cells. When gene expression studies are conducted at the single-cell level, we must recognize that no two cells are identical. In a particular mammalian cell population, two cells always are different in cell cycle, differentiation stages, and environmental stimulation. The variation of gene expression at the single-cell levels is expected, and the reason may be due to stochastic expression fluctuation, or due to heterogeneity of the cell populations. In mammalian cells which are regulated by more complex gene networks than yeast and bacteria, cell heterogeneity may be the major contributor of variation of gene expression at single-cell levels.
This journal is © The Royal Society of Chemistry 2008 |