Two-for-one misconceptions about genomes from the New York Times.

To date, two identified human beings have had their genomes sequenced: J. Craig Venter and James D. Watson. Venter’s was completed in draft form in 2001 and the final version was completed recently. Watson received his genome sequence on disk (a hard drive, not a DVD as reported) from Jonathan Rothberg, founder of 454 Life Sciences, at Baylor College of Medicine yesterday. You can watch the presentation here.

The notion that individual people can have their genomes sequenced (still for about $2 million, but the cost will fall precipitously in the future) is sure to elicit some interesting discussions about medical applications, ethical implications, and intriguing research into human variation. Certainly, the completion of Watson’s genome sequence has already gained media attention. Unfortunately, the same old catchphrases and errors abound. Apparently, even the mighty combined forces of Genomicron, Evolgen, and Sandwalk are insufficient to stop this.

Today, both RPM of Evolgen and Jonathan Badger at T. taxus take aim at the New York Times, who not only confuse sequencing with “deciphering”, but think that Watson discovered DNA in 1953 (Genome of DNA Discoverer Is Deciphered by Nicholas Wade).

To clarify, DNA (“nuclein”) was discovered by Friedrich Miescher in 1869. Watson and Crick elucidated the double helix structure of DNA in the 1950s, based on the results of decades of work on the chemical properties of the molecule by a large number of researchers.

I give full credit to Watson and Crick for their monumental contribution, which rightly garnered them the 1962 Nobel Prize. But credit is also due to Miescher and the countless others whose work was integral to the subsequent rise of molecular genetics and genome sequencing.

Here are two headlines announcing the same story, one inaccurate and the other fine:

Genome of DNA Discoverer is Deciphered (New York Times)

Nobel Laureate James Watson Receives Personal Genome (ScienceDaily)

Is one less catchy than the other? It seems to me that getting the history and the science right would be relatively simple and would only add to the strength of a story.

____________

Updates:

The Genetic Genealogist mentions the story and argues that Nicholas Wade may not be responsible for the headline. Fair enough — my criticism is about the entire presentation, whether that be the fault of the author, editor, or other. It does bear noting, however, that Wade has used this terminology several times previously, including describing it in the main text as the “project to sequence, or decode, the genome.”

Sandwalk has opened a discussion about whether readers would (or, like Larry, would not) want to have their genomes sequenced.

DNADirectTalk repeats the standard inaccuracies.

I don’t think we’re going to be rid of the “decoding” analogy any time soon, especially since sequencers themselves use it. Venter has a book coming out in October, with the unfortunate title A Life Decoded: My Genome: My Life. (Wouldn’t The Sequence of My Life or My Life’s Sequence have been catchier anyway?). The US Department of Energy (which financed much of the Human Genome Project) still has it on their website Human Genome Research: Decoding DNA also. To be fair to science writers, we can’t hold them to a higher standard of terminological accuracy than applies to scientists. In other words, we need to clean it up on our side first and then, hopefully, reporters will follow our lead.


Professors who blog.

Technology & Learning has a recent story about “Professors who blog“. They make the point that many profs who blog view it as part of their “service” component in the form of outreach*. Their research is instead published in peer-reviewed journals, books, conference proceedings, and other traditional media. As readers know, this is also how I see the role of blogs by scientists. Blogs are no substitute for peer reviewed publications, but they are a useful medium in which to discuss science from various perspectives and to make the information more accessible to non-scientists around the globe. I suspect that, in time, more scientists will come to see the value of blogs as an outreach apparatus and will make time for them despite being chronically overcommitted.

[Hat tip: Pharyngula]

___________

* Most science faculty have three components to their jobs. Research, teaching, and service, often distributed at 40 : 40 : 20 (though this varies). Students sometimes are unaware that teaching a given class may represent only 10-20% of a professor’s job description (though it invariably takes up much more time than this, especially if he or she strives to be an effective educator). Something to consider next time you feel the prof isn’t giving you enough personal attention or doesn’t respond to your emails fast enough. 🙂


Pioneers of genome size: Prof. Michael D. Bennett.

Prof. Mike Bennett is well known in the genome size community for his work in conducting and compiling genome size estimates in plants and as the originator of the “nucleotypic theory” in which DNA content exerts a causative influence on nucleus and cell size and is therefore of adaptive significance.

He completed his undergraduate degree in 1965 in the
Department of Agricultural Botany at the University College of Wales, Aberystwyth, UK, where he first learned to estimate nuclear DNA contents. He earned his PhD in 1968 under the supervision of Prof. Huw Rees, also in Aberystwyth.

Prof. Bennett in the lab, circa 1996.

Part of his PhD research involved investigating the work of Pearce (1937, Bull.Torrey Bot. Club 64: 345-355) who in 1937 had reported how chromosomes of Viola conspersa varied in size by over 300% depending on the amount of phosphate in the culture solution. One of Prof. Bennett’s roles was to investigate whether such changes in chromosome size were also accompanied by changes in genome size. In an early paper arising from his PhD (Bennett MD, Rees H. 1967. Natural and induced changes in chromosome size and mass in meristems. Nature 215: 93-94), he showed that in rye (Secale cereale) there was no change in genome size despite changes in chromosome volume of 50% depending on the phosphate level.

Prof. Bennett’s first post-doctoral fellowship was
with Sir Ralph Riley at the Plant Breeding Institute, Cambridge, UK, looking at the mechanisms of meiosis in cereals. Part of this work involved measuring the duration of meiosis in a number of cereals. He correlated this with genome size to produce the well-known paper (Bennett MD. 1971. The duration of meiosis. Proceedings of the Royal Society of London B 178: 277-299) in which he coined the term “nucleotype” (cf. “genotype”) to reflect”that condition of the nucleus [most notably, DNA content] that affects the phenotype independently of the informational content of the DNA”. He also carried out large scale analysis of relationship between genome size and minimum generation time (Bennett MD. 1972. Nuclear DNA content and minimum generation time in herbaceous plants. Proceedings of the Royal Society of London B 181: 109-135).

It soon became evident that there was a need to collate widely spread plant genome size data into one accessible source, and so he began producing the lists of DNA amounts (the first one in 1976, followed by 7 others since then) which together contain data for over 4400 angiosperm species. These papers have been cited more than 1,500 times. This was later followed by the electronic databases (first release of Angiosperm DNA C-values database in 1997, followed by the Plant DNA C-values Database in 2000).

In 1987, Prof. Bennett became Keeper of the Jodrell Laboratory at the Royal Botanic Gardens , Kew, where he remained until his retirement in 2006. Throughout his scientific career, Prof. Bennett has authored and coauthored more than 320 publications, many of which are on genome size.

Prof. Mike Bennett in 1988, soon after becoming Keeper of the Jodrell Lab at Kew.

I first met Mike at a the 2003 Genome Size Meeting at Kew, and it has been my privilege to work with him on a few projects since that time, including having him author a chapter in The Evolution of the Genome and coauthoring a paper about our genome size databases published earlier this year.

Group photo at the 2003 Genome Size Meeting at Kew.

In light of his recent “retirement” (no one who knows him believes for a second that he will stop working, in fact he estimates that he still has at least another 50 papers to write), it seems fitting to have him as the first person profiled in the Pioneers of Genome Size series. Congratulations on a job well done, my friend, and cheers to the future.

[Special thanks to Dr. Ilia Leitch for information and photos]


Science’s "big questions"

The July 1st, 2005, issue of Science included a list of 25 “hard questions” in celebration of the 125th anniversary of the journal, one of the most prestigious on the planet. (A more detailed discussion is currently underway at Sandwalk — I thought I would throw in my 2 cents).

Here is how they came up with the list, as described by Donald Kennedy and Colin Norman in the introductory article “What Don’t We Know?“:

We began by asking Science‘s Senior Editorial Board, our Board of Reviewing Editors, and our own editors and writers to suggest questions that point to critical knowledge gaps. The ground rules: Scientists should have a good shot at answering the questions over the next 25 years, or they should at least know how to go about answering them. We intended simply to choose 25 of these suggestions and turn them into a survey of the big questions facing science. But when a group of editors and writers sat down to select those big questions, we quickly realized that 25 simply wouldn’t convey the grand sweep of cutting-edge research that lies behind the responses we received. So we have ended up with 125 questions, a fitting number for Science‘s 125th anniversary.

[…]

We selected 25 of the 125 questions to highlight based on several criteria: how fundamental they are, how broad-ranging, and whether their solutions will impact other scientific disciplines. Some have few immediate practical implications–the composition of the universe, for example. Others we chose because the answers will have enormous societal impact–whether an effective HIV vaccine is feasible, or how much the carbon dioxide we are pumping into the atmosphere will warm our planet, for example. Some, such as the nature of dark energy, have come to prominence only recently; others, such as the mechanism behind limb regeneration in amphibians, have intrigued scientists for more than a century. We listed the 25 highlighted questions in no special order, but we did group the 100 additional questions roughly by discipline.

The questions are:

What Is the Universe Made Of?
Charles Seife

What Is the Biological Basis of Consciousness?
Greg Miller

Why Do Humans Have So Few Genes?
Elizabeth Pennisi

To What Extent Are Genetic Variation and Personal Health Linked?
Jennifer Couzin

Can the Laws of Physics Be Unified?

Charles Seife

How Much Can Human Life Span Be Extended?

Jennifer Couzin

What Controls Organ Regeneration?
R. John Davenport

How Can a Skin Cell Become a Nerve Cell?
Gretchen Vogel

How Does a Single Somatic Cell Become a Whole Plant?
Gretchen Vogel

How Does Earth’s Interior Work?
Richard A. Kerr

Are We Alone in the Universe?
Richard A. Kerr

How and Where Did Life on Earth Arise?

Carl Zimmer

What Determines Species Diversity?
Elizabeth Pennisi

What Genetic Changes Made Us Uniquely Human?
Elizabeth Culotta

How Are Memories Stored and Retrieved?
Greg Miller

How Did Cooperative Behavior Evolve?
Elizabeth Pennisi

How Will Big Pictures Emerge From a Sea of Biological Data?
Elizabeth Pennisi

How Far Can We Push Chemical Self-Assembly?
Robert F. Service

What Are the Limits of Conventional Computing?
Charles Seife

Can We Selectively Shut Off Immune Responses?
Jon Cohen

Do Deeper Principles Underlie Quantum Uncertainty and Nonlocality?
Charles Seife

Is an Effective HIV Vaccine Feasible?
Jon Cohen

How Hot Will the Greenhouse World Be?
Richard A. Kerr

What Can Replace Cheap Oil–and When?
Richard A. Kerr and Robert F. Service

Will Malthus Continue to Be Wrong?
Erik Stokstad

(The other 100 questions are listed here).

Overall, I think there are some good questions in there. I am also happy to see so many of them being about biology, given my interests. However, in some ways the list is rather disappointing. Many of the questions are simply about technology and not science as I understand the term. Others are simply “wait and see” types of questions that involve only continued measurements and no real innovations. Some are “wait and see” questions about technology, in fact.

A more significant point relates to the packaging. Several of the questions are indeed intriguing, but they are not big enough because they focus on one species. Here’s what I mean:

“Why Do Humans Have So Few Genes?”
Humans have what will have to be recognized as the standard amount of genes for a mammal. The real question is, how do those genes relate to proteomes (e.g., do most genes encode multiple proteins, for example via alternative splicing?), how do genes interact, how are genes regulated, and how did the regulatory systems evolve. All very important questions. The fact that humans have what was a somewhat surprisingly low gene number is very much secondary to all of this, and there is really nothing particularly relevant about humans having this number rather than, say, a cow.

“How Much Can Human Life Span Be Extended?”
The real issue is, what are the inherent limitations on longevity? This applies to all animals, not just humans, and can be studied from a variety of perspectives, including evolutionary theories of senescence, physiological work involving DNA damage (e.g., by oxygen radicals), links with diet, and genetic input. As phrased, it is a technology question, but viewed more broadly it is a very interesting and active area of research touching on multiple fields.

“How Can a Skin Cell Become a Nerve Cell?”
The real question is about how stem cells become specialized somatic cells in general, which is a fundamental question in developmental biology that happens to have major medical implications. The “biological alchemy” described in the article comes after the key processes are understood, and of course these insights extend well outside the boundaries of our own species and are relevant to the evolution of morphology in multicellular organisms as a whole.

In general, I think this is a useful exercise and it is always interesting to get people to think about the big picture in science. I just wish the picture were a little more scientific and a little bigger in this particular case.

________________

Incidentally, I am glad to see that two questions that relate to my work made it on the secondary list of 100 questions. So glad, in fact, that I won’t even complain about anything they say, though readers can likely guess what my comments might be…

Why are some genomes really big and others quite compact?
The puffer fish genome is 400 million bases; one lungfish’s is 133 billion bases long. Repetitive and duplicated DNA don’t explain why this and other size differences exist.

What is all that “junk” doing in our genomes?
DNA between genes is proving important for genome function and the evolution of new species. Comparative sequencing, microarray studies, and lab work are helping genomicists find a multitude of genetic gems amid the junk.



Human gene number: surprising (at first) but not paradoxical.

In 2001, when the draft sequences were announced, it was revealed that the human genome contains somewhere between 30,000 and 35,000 protein-coding genes (International Human Genome Sequencing Consortium 2001; Venter et al. 2001). The completed sequence, published in 2004, provided an even lower estimate of 20,000 to 25,000 genes (International Human Genome Sequencing Consortium 2004). At present, Ensembl gives the number of protein-coding genes in the human genome as 21,724 known genes plus 1,017 novel genes. (“Known genes” correspond to an identifiable protein, “novel genes” look like they probably correspond to a protein but not yet a known one).

As I have discussed in a previous post, there is quite a bit of interest in comparisons of gene number among species. Part of the reason is that there has long been an expectation that gene number (and prior to the 1970s, genome size) should be linked to some measure of organismal complexity. More often than not, complexity is defined in such a way as to place humans at the top of the scale, but objective metrics also have been attempted. (See the excellent post entitled “Step away from that ladder” by PZ Myers for discussion on this).

Prior to the human genome sequence, the expected gene number most commonly cited was 100,000, even though lower estimates were becoming increasingly common (e.g., Aparicio 2000) and the basis of this figure was somewhat dubious to begin with. As a result, the finding of 20,000-25,000 genes in the human genome has inspired extensive commentary. Some authors even characterized this as a new “G-value paradox” or “N-value paradox”, in reference to the “C-value paradox” of yesteryear (Claverie 2001; Betrán and Long 2002; Hahn and Wray 2002).

Two questions are relevant to this topic: Is the “low” number of protein-coding genes really surprising? If so, is this “paradoxical”?

Between 2000 and 2003, a light-hearted betting pool known as “GeneSweep” was run in which genome researchers could guess at the number of genes in the human genome. A bet placed in 2000 cost $1, but this rose to $5 in 2001 and $20 in 2002 as information about the human genome sequence increased. One had to physically enter the bet in a ledger at Cold Spring Harbor, and all told 165 bets were registered. Bets ranged from 25,497 to 153,438 genes, with a mean of 61,710, as indicated by the plot below.

It has been argued that this shows that a substantial percentage of scientists expected a low gene number and were not surprised by the human gene count estimates. I interpret these data differently, for several reasons.

First, this was a betting pool, and as a result there would have been additional factors influencing the entries. For example, in a sports pool, people may assume that everyone will pick the top-ranked teams and therefore intentionally select an underdog that they hope, but do not necessarily expect, will win. If the most commonly repeated gene count estimate was 100,000 at the time, then this would be the last bet I would have placed. The decision would therefore be to either go higher or lower than this. Personally, I probably would have gone lower rather than higher, because more than 100,000 genes might be problematic due to mutational load. So, based purely on the dynamics of informed betting, I would have expected most people to pick a number substantially lower than 100,000 even if they still believed that to be the most likely number.

Second, it is important to consider when the different bets were placed (I am looking into this out of curiosity). It is entirely possible that the high values were picked first, and then lower numbers were mostly chosen later for two key reasons. One, people had to physically enter their bets at Cold Spring Harbor, so they would have seen what others were guessing and could adjust accordingly (see above). Two, new estimates came out around 2000 that put the value well above 100,000, followed by other estimates that were much closer to 40,000. If the betting trends simply tracked these data, then one could not argue that people always expected a low number. Indeed, it may be that few people would have guessed a low number until very shortly before the release of the sequence.

Third, the winning estimates were higher than the probable total by several thousand genes. (The contest ended in a three way tie, with half of the $1,200 in prize money going to Lee Rowen [who bet 25,947 in 2001] and the other half shared by Paul Dear [27,462 in 2000] and Olivier Jaillon [26,500 in 2002]; see Pennisi 2003, 2007). No one guessed too low. In fact, most entries were far above the high end of the initial draft sequence estimates of 35,000, even though betting continued for at least another year. Likewise, no estimates based on molecular data prior to the close of betting gave a value of 23,000 either.

We may also ask what genome sequencers had to say at the time. James Watson, co-discoverer of the double helix structure of DNA and the original director of the Human Genome Project, wrote the following in 2001:

Until we saw the first DNA scripts underlying multicellular existence, it seemed natural that increasing organismal complexity would involve corresponding increases in gene numbers. So, I and virtually all of my scientific peers were surprised last year when the number of genes of the fruit fly, Drosophila melanogaster, was found to be much lower than that of a less complex animal, the roundworm Caenorhabditis elegans (13,500 vs. 18,500). More shocking still was the recent finding that the small mustard plant, Arabadopsis thaliana, contains many thousand more genes (~28,000) than does C. elegans. Now we are jolted again by the conclusion that the number of human genes may not be much more than 30,000. Until a year ago, I anticipated that human existence would require 70,000-100,000 genes.

J. Craig Venter, who led the private initiatives to sequence the fruit fly and human genomes, was quoted by The Observer in 2001 as saying “When we sequenced the first genome of … the fruit fly, we found it had about 13,000 genes, and we all thought, well we are much bigger and more complicated and so we must have a lot more genes. Now we find that we only have about twice what they have. It makes it a bit difficult to explain the human constitution.” In the same piece, Venter is quoted as noting that “Certainly, it shows that there are far fewer genes than anyone imagined.”

The Human Genome Project Information page said the following in 2004: “This lower estimate came as a shock to many scientists because counting genes was viewed as a way of quantifying genetic complexity. With around 30,000, the human gene count would be only one-third greater than that of the simple roundworm C. elegans at about 20,000 genes”.

Lee Rowen, co-winner of GeneSweep, noted that her estimate was inspired by Jean Weisenbach of Genoscope, who had suggested a few years earlier that the human gene number might be low. Rowen noted that, “at the time, everybody nearly fell of their chair” upon hearing this proposition (Pennisi 2003).

The list could go on, but I think it is evident that, in the light of pre-genomic views of genetics, most scientists were surprised by the low gene count in humans, especially when compared to other species to which we intuitively attribute lesser complexity. Notably, the nematode Caenorhabditis elegans appears to possess 20,069 known and novel genes even though it consists only of ~1,000 cells, the fly Drosophila melanogaster has 14,039 genes, the sea urchin Strongylocentrotus purpuratus 23,300 on first pass, and rice Oryza sativa upwards of 50,000. I don’t know if anyone expected this pattern prior to the dawn of genome sequencing, but I personally have not met him or her.

The question, then, is how will this discrepancy between expectation and reality be resolved? Is it truly “paradoxical”, meaning that it is self-contradictory? Or is it simply a matter of updating our understanding of how genetics works? Coming from the field of genome size evolution, which underwent the same transition decades earlier, I am of the view that “paradox” is not an appropriate descriptor. A complex puzzle — a “G-value enigma” — it may be, but it is one that can be resolved with a broader approach to genetics and much additional research. It took several decades for it to become widely acknowledged that genome size and gene number are unrelated (and indeed, a few authors still argue against it based on a biased dataset consisting exclusively of small, sequenced genomes; see Gregory 2005 for discussion), but we are now developing a reasonable understanding of the non-coding elements of the genome that make up the difference, their effects and (sometimes) functions, and their evolutionary dynamics. Similarly, there are many reasons why gene number and complexity need not be correlated. A list of possibilities is available here (though it was compiled for rather different reasons).

The expectation seems to have been that humans should have comparatively high gene numbers. We do not, and at first this was surprising. Now let us move on to a post-genomic understanding of genetics, and focus less on counting one-dimensional parameters and more on appreciating and ultimately deciphering the complexity inherent within the genome.

_____________

References

Anonymous. 2000. The nature of the number. Nature Genetics 25: 127-128.

Aparicio, S.A.J.R. 2000. How to count…human genes. Nature Genetics 25: 129-130.

Betrán, E. and M. Long. 2002. Expansion of genome coding regions by acquisition of new genes. Genetica115: 65-80.

Claverie, J.-M. 2001. What if there are only 30,000 human genes? Science 291: 1255-1257.

Dunham, I. 2000. The gene guessing game. Yeast 17: 218-224.

Gregory, T.R. 2005. Synergy between sequence and size in large-scale genomics. Nature Reviews Genetics 6: 699-708.

Hahn, M.W. and G.A. Wray. 2002. The g-value paradox. Evolution & Development 4: 73-75.

International Human Genome Sequencing Consortium. 2001. Initial sequencing and analysis of the human genome. Nature 409: 860-921.

International Human Genome Sequencing Consortium. 2004. Finishing the euchromatic sequence of the human genome. Nature 431: 931-945.

Pennisi, E. 2000. And the gene number is …? Science 288: 1146-1147.

Pennisi, E. 2003. A low gene number wins the GeneSweep pool. Science 300: 1484.

Pennisi, E. 2007. Working the (gene count) numbers: finally, a firm answer? Science 316: 1113.

Pennisi, E. 2007. Why do humans have so few genes? Science 309: 80.

Semple, C.A.M., K.L. Evans, and D.J. Porteous. 2001. Twin peaks: the draft human genome sequence. Genome Biology 2: comment2003.2001-comment2003.2005.

Venter, J.C., et al. 2001. The sequence of the human genome. Science 291: 1304-1351.

Watson, J.D. 2001. The human genome revealed. Genome Research 11: 1803-1804.

Give us the title!

I have made this point in passing before, but I will reiterate it in its own post in the vague hope that science writers will get the message (or perhaps that other bloggers will pick up the issue).

When you write a story about a recent discovery, whether for a magazine, an online news service, or a blog, please give us the title and as much other information as you can about the article so that we can look up the original. A footnote at the end would go a long way. Online, there are no constraints on page space, so this should be straightforward to implement.

I am getting frustrated with the usual “… which will appear this week in Nature” or “… to be published online in the next issue of PNAS”. Don’t you know that this almost instantly becomes dated and uninformative? Are you unaware that readers may come across your story even years from now, and that it is a substantial pain to go from the date of your entry and try to find which paper came out in the online pre-publication version of the journal shortly thereafter? Yes, we can search author names, but we shouldn’t have to.

Maybe this is not the writers’ fault. It could be that journals provide only summaries to writers without any information on the actual reference. However, many stories in the larger media include interviews with the author(s). Maybe the last question could be “Hey, what’s your paper called?”. And to press offices: you should give the title too.

I am sure that many, maybe most, readers of science news stories do not look up the original article. But some of us do. In this sense, having the summary stories serving as a gateway to the actual paper would be helpful.


Cracking the code?

I was at a scientific conference last week, and am only now catching up on email, journal publications, and science news. In the case of the latter, I am noticing a striking resurgence (or maybe just persistence) of the description of genome sequencing and analysis as “cracking the genetic code”. Sigh.

I can understand the attraction to the analogy, in that a great deal remains to be done before we will have anything approaching a comprehensive understanding of how complex phenotypes are generated from the combined influence of genes, their regulation and interaction, and the environment. However, the “genetic code” has a specific meaning in science, and it was “cracked” in the 1960s.

[Based on an astute comment by RPM of Evolgen, it bears updating this post to include “mapping the genome” as an equally inappropriate description of modern comparative genomics. Genetic mapping, in proper terms, refers to identifying the relative proximities of genes on chromosomes as first accomplished in 1913 by Alfred Sturtevant, a student of Nobel Prize-winner Thomas Hunt Morgan in the famous Fly Room at Columbia University. Physical mapping, which is the identification of physical locations of genes on chromosomes, remains an important component in genomics, but this generally is not what journalists are referring to when they use the term; what they mean is sequencing.]

I think this is another symptom of too much journalism and not enough science in science journalism. Instead of resorting to the standard catchphrases and clichés, why not introduce your readers to some accurate terms and concepts with which they may not be familiar? You can catch the interest of readers and educate them on the basics rather than appealing to their misconceptions or lack of prior knowledge.


A quick word about species names.

This is a very quick post about something that is annoying to many biologists — the misuse of species names. It is prompted by a news headline I just saw, which is about the 106th time I have seen this problem (“T. Rex”).

The binomial naming system was developed by Linnaeus in the 18th century and is still in use in modern biology. It consists of two names for each species — or, more specifically, a genus and species name. Every species should have only one genus/species name, although it takes work to correct multiple names for the same species (synonyms) and to split different species that are grouped together into a single name (cryptic species). The genus is a broader category and is (usually) comprised of many species. I will not get into the difficulty in defining what species are (though this is an interesting issue), only a few points about the use of terminology.

  • The plural of “species” is “species”. The singular of “species” is also “species”, not “specie”. “Specie” refers to coins.
  • The plural of “genus” is “genera”.
  • Genus and species names are always written in italics.
  • Genus names are capitalized, species designations are not. For example, it should be “Tyrannosaurus rex“, not “Tyrannosaurus Rex“.
  • Once you have defined the species name (or if it is very well known), you can abbreviate the genus. For example, “T. rex“. The same capitalized/lowercase and italics rules apply (it does not become “T. Rex“, though watch for automated spell checkers to un-correct this for you).
  • Some species are further partitioned into subspecies (although this is a more nebulous category than species); subspecies designations follow the same rules as species names (e.g., “Canis lupus familiaris“).
  • Humans are categorized as Homo sapiens, the name given by Linnaeus in 1758. It means “man, the wise”. Homo sapiens is a proper noun and not a common noun, such that one human is not a “Homo sapien”.


Upcoming issues of Gene Genie.

Here is an update on the next two editions of the bi-weekly Gene Genie blog carnival:

Issue #8 will be hosted by Eye on DNA and will be up June 3.


Issue #10, the Canada Day ultraspectacular edition, will be hosted here at Genomicron (and that’s July 1 for you non-Canucks out there).

Participation is not in any way restricted to blogs that are part of the new DNA Network — if you write about genetics and would like to be included, please feel welcome to submit your post here.

Gene Genie is particularly relevant in my experiment to use blogging as a form of public outreach by scientists in light of its mention recently, along with Mendel’s Garden, in the prestigious journal Cell.

Gene Genie and the DNA Network.

Here are some of the positive developments among blogs that I am happy to discuss.

The latest edition of Gene Genie is now up on Gene Sherpas, in which I have two contributions. This is my first blog carnival, and I want to thank our host and everyone else involved. The next round will be located at Eye on DNA on June 3, so remember to submit your links that you would like to have included. See here for earlier entries.

The second piece of news, as many people have already noted, describes a new network of genetics blogs entitled The DNA Network, to which you are welcome to subscribe by Feedburner. This is the outcome of efforts by Rick Vidal of My Biotech Life and Hsien-Hsien Lei from Eye on DNA.

Current members include:

Enjoy!