RNA interference a technology platform for target validation, drug discovery and therapeutic development. Summer 03
This elegant and revolutionary reverse genetic approach has a tremendous commercial promise to develop new drugs and therapeutics for several human diseases.This review outlines and forecasts some of the potential applications of gene suppression strategies in the pre-clinical drug discovery process at biotech and pharmaceutical industries.
Most of you are aware of the terms ‘Genome’ (study of expression of all the genes in an organism called as Genomics), ‘Proteome’ (study of expression of all the proteins in an organism called as Proteomics), and ‘Glycome’ (study of expression of all the glycoproteins in an organism called as Glycomics). Another scientific buzz word is spreading fast in the research community, that is ‘RNome’, the RNA equivalent of the ‘proteome’, ‘genome or ‘glycome’. The subject is referred to as ‘RNomics’ (Figure 1). RNomics is a newly emerging field that categorically studies the structure, function and processes of noncoding RNAs (ncRNAs) in a cell1. The first ncRNAs were identified in the 1960s.
At that time, biologists thought these ncRNAs had no function in the cell. However, these ncRNAs (in general about 20 to 500 nucleotides in length, some are as long to be as 17kb) have been shown to be involved in the processes of replication, transcription, RNA processing, modification, mRNA translation, gene silencing, protein stability and protein translocation2. Using comparative genomics, computational, cDNA library construction and microarray expression approaches different groups have discovered several hundreds of small, single-stranded RNAs in organisms including archaeons, parasites, worms, fruit flies, yeast, plants, and mammals3. These ncRNAs are divided into two classes based on their functions: first are microRNAs (miRNA), usually ranging from 21 to 23nt in length and believed to specifically regulate translation of target mRNA in a stage-and/or tissue- specific manner. The second class of miRNAs are generated by enzymatic cleavage of long double- stranded RNAs and tends to degrade the target mRNA by a process known as interference4. Therefore, these specific ncRNAs are referred to as small interfering RNAs (siRNAs), and the process mediated by siRNAs is RNA interference (RNAi)4.
We are now beginning to exploit the information gleaned from genome sequencing projects of human and several organisms. However, this massive genetic information opens new challenges to decipher the complete list of protein-coding genes. In addition, transcriptional events such as RNA splicing and post-translational modifications make it difficult to predict the exact number of genes or proteins (Figure 1). With this degree of complexity, monitoring the entire proteome expression levels as a means to elucidate their functions and develop them as drug targets is a challenging paradigm in the bio-industry. Despite the ‘proteome’ sequencing efforts, the ‘RNome’ also has to be studied indepth to fully understand and tally the number of genes encoded by a genome and their regulation2. The challenge for scientists in both academia and industry is to identify the whole complement of ncRNAs and elucidate their functions in gene expression and regulation. In terms of healthcare, it is important to identify the disease relevant genes from these functional ‘OMES’. Although, the human genome sequencing was completed, the total number of genes in the human genome is still in debate5. A possible hypothetical number of genes associated with various diseases are shown in Figure 1.
Five years ago, Mello and his colleagues discovered the phenomenon of RNAi4. Since then, it erupted like a volcano in the cell and molecular biology communities as a tool to understand gene expression and regulation. In addition, scientists have begun to take an ‘RNomics’ approach to understanding the nature and function of microRNAs and siRNAs in order to utilise them as a gene silencing mechanism. In general, RNAi processing involves the cleavage of longer dsRNAs present in the cytoplasm by an enzyme called DICER into small interfering RNAs, roughly 21-23 nucleotides in length. These siRNAs then become incorporated into an RNA/protein complex (known as RNA-induced silencing complex, (RISC)), which acts to recognise a target mRNA for subsequent degradation6,7 (Figure 2A). The siRNAs present in the cytoplasm can also cause post-transcriptional silencing in cytoplasm, and also enter the nucleus, and affect DNA methylation (Figure 2B). In broad terms, ‘Post-transcriptional gene silencing’, ‘co-suppression’, ‘quelling’ and ‘siRNA’ are collectively included in the phenomenon of ‘RNA interference’. Although, the mechanisms and processes are similar, but not quite identical, common sets of proteins and short RNAs are utilised7.
The function of this process in the cell is believed to serve as a protective mechanism for the genome against viruses and transposable elements and to eliminate defective mRNAs. This process is highly conserved in the evolution and observed in viruses, parasites, worms, fruit-flies, plants and animals8. A couple of years ago, Tuschl and his colleagues for the first time demonstrated gene silencing in mammalian cells by transfecting the synthetic siRNA molecules9. Presence of dsRNA in mammalian cells provokes strong cytotoxic response, and the effect is transient. Therefore, to overcome this limitation, several groups developed DNA-based siRNA vector systems to analyse gene function in a variety of mammalian cell types10. As an example, a DNA-based vector that carries fluorescently (cyanine 3) labelled siRNA sequences for human glyceraldehydes-3-phosphate dehydrogenase (GAPDH) was exogenously transfected into mammalian cells and scored for its protein expression using an antibody against GAPDH. The results shown in Figure 2 II (green) indicated that the expression of GAPDH was completely inhibited in the experimental (A) compared to its control (B). Current approaches to create stable phenotypes in mammalian cells have generally met with limited success. However, newly developed RNAi methods and the availability of DNA-based vectors have the potential to provoke a revolution in molecular biology.
Applications of RNAi
The discovery of siRNAs and RNAi mechanism has a tremendous commercial potential in the bioindustry. This molecular tool will permit investigators to routinely implement ‘loss-of-function’ screens and helps to develop rapid tests for genetic interactions in mammalian cells, which up to this point have been quite difficult to perform quickly. Some of the applications are shown in Figure 3, and described as follows:
The RNAi approach has been applied to study the function of several essential genes involved in cell growth, cell cycle, cytoskeleton, signalling, membrane trafficking, transcription and DNA methylation. The functions of these genes were studied in about 25 different mammalian cells either by incorporating synthetic siRNAs or using cloned plasmids that carry siRNA sequences11,12. Using an RNAi-based strategy one can target the gene product itself that might be central to the cellular or disease process. Several biotech companies have started or adopted RNAi platform technology for functional genomic studies (Table 1A). Several reagent companies now offer products ranging from synthesis of siRNA molecules, cloning vectors and transfection reagents (Table 1B). In addition, several pharmaceutical companies are racing ahead in launching this technology platform in their existing functional genomics divisions. The biggest advantage of the RNAi platform is that it is highly adoptable to a high throughput format significantly reducing the product development cycle and enhancing the target validation and drug development process (Figure 4).
A whole genome RNAi strategy has been used in high throughput phenotypic screens to identify several hundred genes that are involved in the cell cycle, embryonic or germ-line development, ovary and vulva specific in Caenorhabditis elegans. Concurrently, four independent groups looked at the function of nearly all the genes in C. elegans using high throughput RNAi analysis at the whole genome level13. Instead of knocking down a single gene at a time, which could take a year of effort, the RNAi method allows scientists for the first time to knockout every gene in an organism in a few months. Recently, in a collaborative approach, scientists constructed a double-stranded RNAi bacterial library with 86% of the 19,000 C. elegans and disrupted the expression of 16, 757 worm genes by high throughput RNAi method. This method allowed them to isolate several hundred genes involved in body fat gene regulation14. Using a fluorescent dye in the worm’s bug diet, this group also identified the human counterpart genes involved in the signal transduction process and their targets. Some of the newly identified targets/genes will be ideal candidates for developing drugs to treat obesity and diabetes. The high throughput capacity of RNAi makes it a particularly attractive method for rapid screening and validation of targets identified by microarray analyses, protein-protein interactions or in silico gene prediction. In a nutshell, this ‘genome-wide RNAi screens’ strategy can be combined with cell-based assays and other methods to elucidate the functions of all the human genes (Figure 4).
Target validation is one of the biggest problems for the biopharmaceutical industry. RNAi offers the prospect of reducing this bottleneck and speeding the drug development process. Target validation determines whether a known candidate gene is responsible for a disease and whether altering expression of the gene is likely to result in a therapeutic effect. Functional genomics and target validation are critical to providing pharmaceutical and biotechnology companies with new gene targets involved in the disease processes. These companies then look to modify the gene or gene products of such targets to treat disease using their drug discovery and development platforms. These products will include high-throughput applications with the potential to industrialise gene function analysis, which should dramatically improve the pharmaceutical industry’s ability to identify ‘druggable’ gene families or targets. RNAi-based target validation will enable companies to fast track the discovery of drug targets in short period in a more cost-effective approach (Figure 4).
Drug screening and development
Selection and validation of molecular targets is of great importance for drug development in the post-genomic era. Although phenotypes of many diseases are well known, the identification of the genes responsible for these phenotypes is a major challenge in the drug development process. The RNAi technology offers an alternative method to achieve this goal in a rapid and more economical way. One can use a library of several hundred to thousands of chemical compounds and identify candidate target genes through transcriptional expression profiling in a chemical genomics approach. Subsequently, the function of the several target genes identified for a specific chemical compound can be evaluated in a high throughput manner using RNAi transfections directly into micro-titer plates, seeded with mammalian cells. In addition, RNAi could facilitate drug screening and development by identifying genes that can confer drug resistance or genes whose mutant phenotypes are ameliorated by drug treatment. This approach will not only allow for determining the modes of action for novel compounds, but also helps to develop a new generation of antibiotics. RNAi methods could be extended to study gene expression of insect and parasite genomes and subsequently develop better gene-based insecticides or infection controlling drugs.
The gene silencing approach holds great promise for selectively inhibiting virus-specific genes or host genes for the treatment of viral infections or autoimmune disorders. Some of the examples are stated here.
HPV: Human papilloma viruses cause cervical cancer in women. Cervical cancer is the second most common form of cancer in women after breast cancer. In general, during the virus lifecycle, the virus produces proteins that suppress the activity of genes in the human anti-cancer defence system. Therefore, suppressing the HPV encoded viral gene products could help to inhibit the growth of cancer simply by allowing the virally infected cell to undergo apoptosis or cell death. This approach is followed in conjunction with RNAi to knock-down the function of several HPV coded viral proteins15.
HCV: Hepatitis C virus (HCV) infection is an emerging global epidemic. Since adequate animal models or tissue culture systems for the propagation of HCV are not available, the development of therapeutic and preventive strategies is an alarming challenge for biotechnology and pharmaceutical companies. It is now possible for scientists to suppress selectively the host/viral genes involved in the replication of virus. Using RNAi, scientists systematically suppressed the function of cellular genes (those required for HCV replication), which are involved in host-cell interactions and viral morphogenesis16. Very recently, the siRNA approach has also been adopted to target the host Fas protein to reduce severe forms of hepatitis in a mice model17.
HIV: Several groups applied an RNAi approach to specifically inhibit the replication of human immunodeficiency virus (HIV) by targeting siRNAs to viral (p24, vif, nef, tat and rev) or cellular genes (CD4, CXCR4, CCR5) and expressing them in human cell lines, primary lymphocytes, and primary macrophages18-20. This RNAi-based gene therapy for HIV infection is not only an effective way to inhibit viral replication, but also can be extended to block the infection of several other animal viruses. Therefore, this particular area opens new avenues for gene-based therapeutics.
Anti-cancer cancer therapeutics
Gene expression profiling methods brought a new revolution in the classification of tumours and helped to develop new prognostic indicators for studying various forms of cancers21. However, the detailed study of individual genes and proteins remains critical in terms of basic science and in generating new therapeutics. Gene suppression by siRNAs is a powerful tool to analyse the function of proteins in vitro, especially, for the rational design of drugs to block the tumour-relevant genes. Several oncogenes have already been cloned into siRNA-based vectors and stably expressed, and gene suppression was studied in detail22. RNAi can be easily applied to hormone-regulated growth of breast cancer cells and estrogen-induced cell cycle progression, specifically targeting the inhibition of transcription factors (such as Sp1, NFkB) and expressing them in human breast cancer cell lines.
The ability to engineer siRNA vectors for stable expression of mutated tumour suppressor genes, oncogenes and transcription factors in human cancer cell lines will certainly sparkle a conflagration of effort to evaluate their advantage as a cancer prevention method. The ability to create ‘permanent’ ‘knock-down’ cancer cell lines will help us to understand the ‘loss-of-function phenotype’ and subsequently develop commercially important cancer preventive targets. In addition, this study will dramatically facilitate the dissection of signalling pathways and the study of cell growth and division in order to understand the biology of cancer. Several researchers have already demonstrated expression of exogenously infused siRNA in living mouse and embryonic chick models. Very recently, this reverse-genetics approach has been adopted to study modulation of the polyglutamine repeat associated with Huntington’s disease, a neuro-degenerative disorder using viral promoter- based vectors and direct injection into mice embryos. The foreseeable challenge is for us to analyse how these gene suppression systems work directly in human cancer tissues, and ultimately to develop gene-specific therapeutics.
In the not too distant future, the research community intends to integrate functional genomics and proteomic mapping approaches to reveal the biological functions of all the coding genes in the human genome. To reach that level, it is pre-requisite to combine the ‘interactome’ (protein-protein interactions) mapping data with ‘phenome’ (largescale phenotypic analysis) mapping data. Recently, this approach was demonstrated by integrating the transcriptome data (that was gathered by microarray experiments) with that of phenome mapping data collected by high throughput RNAi analysis of the germ line genes from C. elegans23. The combination of informatically-driven gene identification, established functional genomics methods, and now the RNAi transfection of mammalian cells can be extend to study the integrated networks in human cells or tissues with the potential of deducing the functions of dozens to thousands of proteins at a time.
RNAi in agri-biotech industry
The RNAi work carried out in the plant Arabidopsis opened new avenues to produce not only new varieties of plants but also to prevent plant virus infections24. This strategy is expected to be especially highly useful in the agri-biotech industry to study plant host-virus interactions.
Bottlenecks in RNAi research
siRNA primer design and sequence specificity:
siRNAs may be the best tools for target validation in biomedical research today because of their exquisite specificity, efficiency and endurance of gene-specific silencing. However, design of siRNAs and the secondary structure of the mRNA target strongly play a role in the genesilencing phenomenon. In addition, incorporation of mismatches in the siRNA sequence will also affect gene suppression25. A single point mutation in the targeted region abolishes the mRNA degradation and may cause RNAi resistance in tumour cell lines.
Potency, efficacy and duration: The dosage and concentrations of siRNAs can also play a significant role in the gene expression inhibition. Another interesting concern is time duration that siRNAs can inhibit the expression of a target genes in particular tissues or cell lines also varies.
siRNA delivery problem: Although, several companies (Table 1B) market different types of transfection reagents for the in vivo and in vitro delivery of siRNA molecules into tissues and mammalian cells, the efficiency of transfection varies greatly from cell line to cell line and tissue to tissue. Therefore, a new generation of transfection reagents based on either novel cationic lipids or beta cyclodextrin-containing polymers are needed to increase this efficiency. Probably, siRNA molecules complexed with these new reagents could better penetrate through cell membranes and reach target sites efficiently. Most importantly, a new generation of transfection reagents needs to be developed that shows less toxicity to cells. Electroporation methods can be employed to deliver siRNA-containing plasmids directly into tissues, though this is very difficult in vivo. Gene silencing is possible in brain cells, the genes those specifically expressed in neurons are difficult to silence.
Endogeneous: In a therapeutic context, the neutralisation of exogenously transfected siRNAs by the immune system in the cells may also be a foreseeable problem.
We hope that all these approaches will help to develop new diagnostic reagents and novel molecular interventions for several human diseases. Within 10 years RNAi will undoubtedly emerge as a routine molecular tool to study the problems in biomolecular medicine and potentially treat diseases.
The opinions expressed in this article are exclusively those of the author and do not reflect those of GeneExpression Systems, Inc. Due to the space constraint the author has omitted several papers of others to cite and limited to recent reviews. Although several other companies are involved in RNAi research, the author has cited few representative companies only but not intentionally ignored others. The author thanks Professor Steven R. Gullans of Harvard and CSO of USGenomics, Woburn, MA for reading the manuscript and providing valuable comments. The author also thanks Dr Tom Tuschl of Rockefeller University, New York for providing thoughtful insights on some of the bottleneck issues in the present RNAi research.
Krishnarao Appasani is presently with GeneExpression Systems, Inc, Waltham, MA and collaborates with investigators at Harvard Medical School. Until recently, he was a Staff Scientist in the R&D department of PerkinElmer Life Sciences and was involved in the development of cDNA arrays and nucleic acid labelling chemistries. At PerkinElmer he was also responsible for organising the Global Biomics Guest Lecture Series. Before that he served as an Application Specialist in the New England Sales division of Carl Zeiss Imaging, Inc. From 1995-2000 he was a member of the faculty of Harvard Medical School as a Blum scholar in the Thoracic Oncology Program at Dana-Farber Cancer Institute and Brigham and Women’s Hospital in Boston. He received his PhD in 1986 from the Molecular Biology Unit of Banaraus Hindu University, India. Subsequently, he did postdoctoral research training in three laboratories including one at the Massachusetts Institute of Technology with Nobel laureate H. Gobind Khorana during 1991-94. He received an MBA in 1998 from Bryant College, Smithfield, RI. Dr Appasani has received many awards for his contributions in the gene expression field and has just completed a book, Perspective in Gene Expression, for Eaton Publishing Company.
1 Filipowicz,W 2000). Imprinted expression of small nucleolar RNAs in brain: time for RNomics. Proc. Natl.Acad. Sci. USA. 97:14035-14037.
2 Storz, G (2002).An expanding universe of noncoding RNAs. Science. 296:1260-121263.
3 Huttenhofer,A, Brosius, J, Bachellerie, JP (2002). RNomics: identification and function of small, non-messenger RNAs. Curr.Op. Chem. Biol. 6: 835-843.
4 Fire,A, Xu, S, Montgomery, MK, Kostas, SA, Driver, SE, Mello, CC (1998). Potent and specific genetic interference by double-stranded RNA in C. elegans. Nature. 391:806-811.
5 Martin, K, Pardee,AB (2000). Identifying expressed genes. Proc. Natl.Acad. Sci. U S A. 97:3789-791.
6 Hammond, SM, Bernstein, E, Beach, D, Hannon, GJ (2000). An RNA-directed nuclease mediates post-transcriptional gene silencing in Drosophila cells. Nature. 404:293-296.
7 Zamore, PD (2002).Ancient pathways programmed by small RNAs. Science. 296:1265-1269.
8 Tijsterman, M, Ketting, RF, Plasterk, RHA (2002).The genetics of RNA silencing.Ann. Rev. Genet. 36:489-519.
9 Elbashir, SM, Harborth, J, Lendeckel,W,Yalcin,A,Weber, K,Tuschl,T (2001). Duplexes of 21-nucleotide RNAs mediate RNA interference in cultured mammalian cells. Nature. 411:494-498.
10 Shi,Y (2003). Mammalian RNAi for masses.Trends in Genet. 19:9-12.
11 Tuschl,T, Borkhardt,A (2002). Small interfering RNAs:A revolutionary tool for the analysis of gene function and gene therapy. Mol. Innterventions. 2:158-167.
12 McManus,MT, Sharp,PA (2002). Gene silencing in mammals by small interferening RNAs. Nature Rev Genetics. 3:737-747.
13 Hope, IA (2001). Broadcast interference-functional genomics. Trends in genetics, 17: 297-299.
14 Ashrafi, K, Chang, FY,Watts, JL, Fraser,AG, Kamath, RS, Ahringer, J, Ruvkun, G (2003). Genome-wide RNAi analysis of C. elegans fat regulatory genes. Nature. 421:268-272.
15 Jiang, M et al (2002). Selective silencing of viral gene expression in HPV-positive human cervical carcinoma cells treated with siRNA, a primer of RNA interference. Oncogene, 21: 6041-6048.
16 McCaffrey, AP et al (2002). Nature 418: 38-39.
17 Song, E, Lee, SK,Wang, J, Ince, N, Ouyang, N, Min, J, Chen, J, Shankar, P, Lieberman, J (2003). RNA interference targeting Fas protects mice from fulminant hepatitis. Nature Med. 3:347-351.
18 Lee, NK, Dohjima,T, Bauer, G, Li, H, Li, MJ, Ehsani,A, Rossi, J (2002). Expression of small interfering RNAs targeted against HIV-1 rev transcripts in human cells. Nature Biotechnology. 20:500-505.
19 Martinez, M, Conaventura, C, Este, JA (2002). RNA interference of HIV replication. Trends in Immunology. 1-3.
20 Novina, CD, Murray, MF, Dykxhoorn, DM, Beresford, PJ, Riess, J, Lee, SK, Collman, RG, Lieberman, J, Shankar, P, Sharp, PA (2002). siRNA-directed inhibition of HIV-1 infection. Nature Medicine. 8:681-686.
21 Golub,TR, Slonim, DK, Tamayo, P, Huard, C et al (1999). Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring. Science. 286:531-537.
22 Brummelkamp,TR, Bernards, R,Agami, R (2002). Stable suppression of tumorigenicity by virus-mediated RNA interference. Cancer Cell. 2:243-247.
23 Walhout,AJM, Reboul, J, Shtanko, O, Bertin, N et. al (2002). Integrating interactome, phenome, and transcriptome mapping data for the C. elegans germline. Current Biology. 12:1-20.
24 Llave, C, Xie, Z, Kasschau, KD, Carrington, JC (2002). Clevage of Scarecrow-like mRNA targets directed by a class of Arabidopsis miRNA. Science, 297:2053-2060.
25 Vickers,TA,Koo, S, Bennett, CF,Crooke, ST, Deam,NM,Baker, BF (2003). Efficient reduction of target mRNAs by small interfering RNA and RNase Hdependent antisense agents. J. Biol. Chem. 278:7108-7118.