-
In the next five years, molecular biology will get its first look at the complete genetic code of a multicellular animal. The Caenorhabditis elegans genome sequencing project, a collaboration between Robert Waterston's group in St. Louis and John Sulston's group in Cambridge, is currently on schedule towards its goal of obtaining the complete sequence of this organism and all its estimated 15,000 to 20,000 genes by 1998. By that time, we should also know the complete genome sequence of a few other organisms as well, including the prokaryote Escherichia coli and the single-celled eukaryote Saccharomyces
-
[
WormBook,
2005]
The C. elegans genome contains approximately 1300 genes that produce functional noncoding RNA (ncRNA) transcripts. Here we describe what is currently known about these ncRNA genes, from the perspective of the annotation of the finished genome sequence. We have collated a reference set of C. elegans ncRNA gene annotation relative to the WS130 version of the genome assembly, and made these data available in several formats.
-
[
WormBook,
2007]
Heterorhabditis bacteriophora is an entomopathogenic nematode (EPN) mutually associated with the enteric bacterium, Photorhabdus luminescens, used globally for the biological control of insects. Much of the previous research concerning H. bacteriophora has dealt with applied aspects related to biological control. However, H. bacteriophora is an excellent model to investigate fundamental processes such as parasitism and mutualism in addition to its comparative value to Caenorhabditis elegans. In June 2005, H. bacteriophora was targeted by NHGRI for a high quality genome sequence. This chapter summarizes the biology of H. bacteriophora in common and distinct from C. elegans, as well as the status of the genome project.
-
[
1994]
The current interest in the nematode Caenorhabditis elegans began approximately 25 years ago when Sidney Brenner selected this species as the most suitable for studies of metazoan development and nervous system. The basis of this selection rested on the anatomical simplicity of nematodes, which nevertheless possess the major differentiated cell types of higher animals, and the tractability of C. elegans to the genetic approach. Over the past two decades or so, progress has been impressive: the cell lineage from egg to adult and the anatomy of the nervous system have been completely described, genetic investigations of numerous developmental problems are co-ordinated within a universally-agreed, systematic nomenclature, a physical map of the C. elegans genome is nearing completion and a project to sequence the entire genome is underway. Furthermore, the number of laboratories seeking to understand the mechanisms controlling animal development through genetic and molecular investigations of C. elegans is rising rapidly as the advantages of this organism become
-
A previous chapter in this series (1) described, primarily, the physical mapping of the 100 Mb Caenorhabditis elegans genome by fingerprinting of cosmid clones, and the linking of the contigs thus derived by YAC hybridization. At that time, the primary function of the map was to enhance the molecular genetics of the organism by facilitating the cloning of known genes, and to serve as an archive for genomic information. However, a clonal physical map - even with good alignment to the genetic map - carries only a tiny proportion of the information present in the genome. Consequently, the current objective of the C. elegans genome project (2) is to establish of the entire genomic sequence. The bacterial clone map, although incomplete by virtue of the uncloneability of regions of the genome in cosmid vectors (a factor which we shall discuss later in this chapter), has proved a sound basis for the systematic sequence analysis. The sevenfold cosmid coverage has a resolution sufficient to enable the selection of a subset of cosmids for sequencing such that, on average, each clone contributes 30 kb of unique sequence to the whole. Sequencing projects based on bacterial clone maps (3-5) of a number of other genomes of a range of sizes are also well advanced, in particular Saccharomyces cerevisiae (15 Mb; complete), Schizosaccharomyces pombe (15Mb), and Drosohpila melanogaster (150 Mb). Although it has recently been demonstrated that small bacterial genomes can be sequenced by direct shotgun sequence analysis of the entire genome with no prior mapping (6), the ability to interrelate and map clone sets, whether derived by random selection of in a directed manner, is still the most convenient route to the sequence analysis of larger genomes.
-
[
2020]
Onchocerciasis, also known as the African river blindness, is the second most important cause of infectious blindness worldwide after trachoma. It is caused by the filarial nematode, <i>Onchocerca volvulus</i>, and transmitted by repeated bites of the vector, female black fly of the genus <i>Simulium damnosum</i>. The vector breeds in fast-flowing and oxygen-rich rivers in affected areas with transmission and disease prevalence usually stretching along these river basins and thereby the name river blindness.[1]Aside from blindness, onchocerciasis results in a troubling chronic dermatitis.[1]
-
[
WormBook,
2006]
The completion of the C. elegans genome sequence permits the comprehensive examination of the expression and function of genes. Annotation of virtually every encoded gene in the genome allows systematic analysis of those genes using high-throughput assays, such as microarrays and RNAi. This chapter will center on the use of microarrays to comprehensively identify genes with enriched expression in the germ line during development. This knowledge provides a database for further studies that focus on gene function during germline development or early embryogenesis. Additionally, a comprehensive overview of germline gene expression can uncover striking biases in how genes expressed in the germ line are distributed in the genome, leading to new discoveries of global regulatory mechanisms in the germ line.
-
[
WormBook,
2006]
Transposons are discrete segments of DNA capable of moving through the genome of their host via an RNA intermediate in the case of class I retrotransposon or via a "cut-and-paste" mechanism for class II DNA transposons. Since transposons take advantage of their host''s cellular machinery to proliferate in the genome and enter new hosts, transposable elements can be viewed as parasitic or "selfish DNA". However, transposons may have been beneficial for their hosts as genome evolution drivers, thus providing an example of molecular mutualism. Interactions between transposon and C. elegans research were undoubtedly mutualistic, leading to the advent of needed genomic tools to drive C. elegans research while providing insights into the transposition field. Tc1, the first C. elegans transposon to be identified, turned out to be the founding member of a widespread family of mobile elements: the Tc1/ mariner superfamily. The investigation into transposition regulation in C. elegans has uncovered an unforeseen link between transposition, genome surveillance and RNA interference. Conversely, transposons were utilized soon after their identification to inactivate and clone genes, providing some of the first molecular identities of C. elegans genes. Recent results suggest that transposons might provide a means to engineer site-directed mutations into the C. elegans genome. This article describes the different transposons present in the C. elegans genome with a specific emphasis on the ones that proved to be mobile under laboratory conditions. Mechanisms and control of transposition are discussed briefly. Some tools based on the use of transposons for C. elegans research are presented at the end of this review.
-
[
Adv Exp Med Biol,
2010]
Neuropeptides are short sequences ofamino acids that function in all multicellular organisms to communicate information between cells. The first sequence ofa neuropeptide was reported in 1970' and the number of identified neuropeptides remained relatively small until the 1990s when the DNA sequence of multiple genomes revealed treasure troves ofinformation. Byblasting away at the genome, gene families, the sizes ofwhich were previously unknown, could now be determined. This information has led to an exponential increase in the number of putative neuropeptides and their respective gene families. The molecular biology age greatly benefited the neuropeptide field in the nematode Caenorhabditis elegans. Its genome was among the first to be sequenced and this allowed us the opportunity to screen the genome for neuropeptide genes. Initially, the screeningwas slow, as the Genefinder and BLAST programs had difficulty identifying small genes and peptides. However, as the bioinformatics programs improved, the extent of the neuropeptide gene families in C. elegans gradually emerged.
-
[
Lecture Notes in Computer Science,
2005]
The OMA project is a large-scale effort to identify groups of orthologs from complete genome data, currently 150 species. The algorithm relies solely on protein sequence information and does not require any human supervision. It has several original features, in particular a verification step that detects paralogs and prevents them from being clustered together. Consistency checks and verification are performed throughout the process. The resulting groups, whenever a comparison could be made, are highly consistent both with EC assignments, and with assignments from the manually curated database HAMAP. A highly accurate set of orthologous sequences constitutes the basis for several other investigations, including phylogenetic analysis and protein classification.