Nova: Science in the news
Published by the Australian Academy of Science
Back to the normal view

The Human Genome Project – discovering the human blueprint


In what has been called the Everest of modern biology, scientists from around the world have worked together to unravel and record the entire set of human genetic instructions.
Contents

Key text

Box 1. Genes – the basic facts
Box 2. Gene mapping and DNA sequencing
Activities Activity 1. Changes in genetic research and knowledge
Activity 2. Creating a table of human genetic diseases
Activity 3. The Human Genome Project: Advantages and disadvantages
Further reading
Useful sites
Glossary

Back to basics
You will get more from this topic if you have mastered the basics of DNA and genes – this link will take you to an annotated list of sites with helpful background information.

Key text

How the project began

The idea of the Human Genome Project first began in a vague way in the 1970s when biologists started to investigate human genes at the molecular level. As biochemical analysis of DNA (deoxyribonucleic acid) became possible, it became clear that certain segments of DNA (called markers) were associated with particular conditions.

Various countries started to map parts of the human genome in the 1980s but the international project really got under way when the USA became involved. In 1989, the Human Genome Organisation (HUGO) was founded by leading scientists in the field to coordinate the massive international effort involved in unravelling the secrets of our genes.

The Human Genome Project – A feat so vast that at first it seemed unachievable

The project aimed to map the position of every human gene and to read and decipher every message encoded in the the twisted double helix of our DNA (Box 1: Genes – the basic facts). It was a stupendous and very costly undertaking, involving advanced biotechnology, and took many years to complete. A first draft of the human genome was announced in June 2000.

In February 2001, the publicly funded Human Genome Project and the private company Celera jointly announced that they had mapped the bulk of the human genome. These maps show that there are only about 30,000 genes – many fewer than the 100,000 expected.

In April 2003, the 50th anniversary of the publication of the structure of DNA, the completed map, was announced. The final sequence covers 99 per cent of the gene-containing regions of the genome.

Australia plays its part

Despite its strong contributions to biological and medical research, Australia has been slow to become involved in large-scale genome research. Few of our institutions have had the funding or facilities to undertake such projects. The establishment of the Australian Genome Research Facility in 1995 provided Australia with a facility for DNA sequencing and is now Australia's largest provider of genomic services (Box 2: Gene mapping and DNA sequencing).

The controversy

There is no doubt that information from the Human Genome Project will provide huge benefits to human health with the diagnosis and possible treatment of genetic diseases (eg, cystic fibrosis and Huntington's chorea). However, some people feel that the huge amounts of money being spent on the project could have been used to improve the human condition in more effective ways.

Genetic information can be misused; for example, through genetic discrimination by employers or insurance companies. The ethical, legal and social issues (ELSI) associated with genetic information have been considered by the US Department of Energy and National Institutes of Health under the world's largest bioethics program.

The completion of the project and the issues associated with it will be an essential part of modern biology for years to come.

Related Nova topics:

Biology meets industry – genomics, proteomics, phenomics


Box 1. Genes – the basic facts

Genes are made of DNA

To understand genes, we must first understand DNA (deoxyribonucleic acid) – the stuff of which genes are made. DNA is an enormously long, thin molecule, composed of compounds called bases strung together like beads in a necklace.

Each DNA molecule is made of two strands of DNA intertwined together. Within the DNA are four different types of bases (adenine, thymine, guanine and cytosine – often referred to by their first letters: A, T, G and C), each base is paired with another base on the opposing strand. The order in which the base pairs are arranged can be used to send a message – usually an instruction to other biological molecules about what to do and what to make. That is why bases are sometimes said to comprise the genetic alphabet.

Some stretches of DNA have no obvious function

Just as the letters in a whole book are not all stuck together, so DNA bases are organised into the equivalent of individual words, paragraphs or chapters. There is also punctuation and, it appears, plenty of gibberish that conveys nothing – probably the result of past errors in the copying process that takes place during reproduction. Stretches of DNA with no obvious function are sometimes referred to as junk DNA or non-coding DNA. The discovery of microRNA and its role in development has led to a new appreciation of parts of the genome that were once considered ‘junk DNA’.

Genes contain information to make proteins

A gene is a sequence of DNA bases that typically codes for a protein. Proteins in turn manufacture and control all the living processes. For example, the gene for brown eyes is an instruction that gets the cells in the iris of the eye to make a dark pigment. A different sequence of bases would spell a different message with different consequences – rather like spelling out a different sentence using the same letters of the alphabet.

DNA molecules are organised into chromosomes

Each normal cell in an individual’s body (apart from egg or sperm cells) contains identical DNA molecules, organised into manageable units called chromosomes. In humans each cell contains 22 pairs of autosomes plus two sex chromosomes. Males have an X and a Y; females have two Xs. But not every gene is necessarily switched on in every cell. For example, the cells that make pigment in the eye also contain the genes for making tooth enamel or liver tissue but, fortunately, don’t do so because those genes are inactive.

The genome refers to all the genes within a species

Taken together, all the genes within a living thing or within a species are called the genome. Naturally, there are slight differences in the sequence of DNA bases between individuals of a species, but it’s obvious that all humans have more characteristics in common than they do differences.

The amount of information needed to make a living thing is staggering

If you’ve ever tried to follow do-it-yourself instructions to assemble a piece of furniture, you’ll realise how nightmarishly complicated would be the task of putting together a living thing. Although scientists are not yet in the business of much organism-building, they are already at the stage of trying to read and puzzle out all the instructions.

Very simple organisms, such as tiny viruses, can be made with only a few genes. Even though they are simple and small, each virus gene is still hundreds of thousands of DNA bases long. The first complete viral genome was worked out in 1977 in the United Kingdom by Dr Fred Sanger, a double Nobel laureate in science. Since then, molecular biologists have succeeded in reading the entire message of several viral genomes. Recently it was announced that the genome of a common yeast species has also been sequenced (ie, all the bases have been read). The information to make a yeast cell is a staggering 15 million bases.

But when we move to organisms larger than a single cell, the amount of information we are faced with makes a multi-volume encyclopaedia look like a comic book.

Consider this: a tiny worm has 80 million letters in its genetic library; the fruit fly has 155 million. And humans? Well, we don’t have the most by any means, but we do weigh in with a hefty 2800 million. That’s why the Human Genome Project is such a massive undertaking.

Related sites


Box 2. Gene mapping and DNA sequencing

The position occupied by a gene on a chromosome is its locus (plural loci), which is like its address. Although it is usual to talk about gene mapping, in many cases it is the loci of the genes that are being mapped. For all of the human gene loci, about a third are variable – that is, the genes on such loci can exist in different forms or variants. For example, the locus for the gene that controls ear lobe shape may have on it the arrangement of DNA that produces attached ear lobes, or the arrangement of DNA that produces free ear lobes. Variations on the same gene locus are called alleles. While humans share identical genes, individuals differ in the specific alleles they possess. So the Human Genome Project is identifying the loci for all human genes and some of the alleles for those genes that have them.

Where loci for different genes are on the same chromosome they are said to be linked because usually they will all be passed on together to the offspring. However, sometimes chromosomes can break and re-join, and the genes that are usually linked become separated. The chance of separation of linked genes depends on how close together the loci of the genes are. If they are close together crossing over between them is rare. The further apart they are the greater is the chance of crossing over happening between them. Therefore the frequency of crossing over can be used as a measure of distance between genes. From these kinds of data it is possible to produce a genetic linkage map of chromosomes.

Scientists are now producing other kinds of gene maps. After cloning large amounts of DNA from the genome, they use markers to connect the cloned pieces correctly. This information is then used to produce a physical map. The object of physical mapping is to create complete sets of cloned DNA pieces that span every region of a chromosome and even the whole human genome.

DNA sequence maps are also a kind of gene map. Knowing the sequence of bases – the chemical building blocks that make up the DNA strand – is important because it tells scientists what kind of genetic information the DNA carries in that particular segment. Scientists also use sequence information to sort out which stretches of DNA contain genes and to analyse genes for changes in sequence that may cause disease.

Related sites


Activities

  • Intimate Strangers (Public Broadcasting Service, USA)

  • National Human Genome Research Institute (USA)

  • Oak Ridge National Laboratory (USA)

  • Science NetLinks (American Association for the Advancement of Science)
    • DNA chips – explores issues surrounding DNA microarray technology.

  • Microsoft, USA
    • The Human Genome Project – explores the importance of genetics. Students prepare reports that track scientific advancements and project possible outcomes.

  • The Wellcome Trust (UK)
    • Beyond the genome – Provides background information for students to prepare a play in three acts, 'Whose cells line is it anyway?'


Activity 1. Changes in genetic research and knowledge

It is less than 100 years since the significance of Mendel's work was first realised.
  1. Describe how our understanding of inheritance has changed since Mendel's work in the 1860s:

  2. Using examples from genetics, explain how the nature of scientific research has changed.

Teachers notes

  1. Mendel's work showed that genetic traits were discrete, indivisible units which were inherited independently of each other. Other workers then found that some genes were inherited with others (linkage groups). Cytological work in the early 1900s then related these linkage groups to chromosomes. At this time traits were given the name 'genes'.

    Subsequent work showed that Mendel's traits can indeed be divided and are actually segments of DNA (a sequence of nucleotide bases) and that most genes code for the structure of a polypeptide. So a sequence of nucleotides codes for a sequence of amino acids that is a protein. Proteins are essential components of all cells as an integral part of cell scaffolding and membranes as well as cellular enzymes. Genes therefore control an organism's structure and function.

  2. The major changes are:
    • that research has shifted from interested amateurs to highly specialised research scientists and from individuals working alone to scientists working in teams (Mendel versus the teams involved in the Human Genome Project);
    • from very simple measuring devices to sophisticated 'black boxes' (The difference between Mendel's tall and dwarf peas could easily have been measured with a ruler while machines obtain most of the information for DNA analysis);
    • from hand-done calculations to computerised recording, calculation and interpretation of results (DNA scientists analyse much of their data using computers);
    • from the news about discoveries taking months or even years to spread across the world to being almost instantaneous (in Mendel's time communication was restricted to letters, the delivery of which was unreliable and slow whereas modern scientist working in similar fields are in daily contact by phone or e-mail;
    • from whole organism investigation to investigations at the sub-cellular (or molecular) level (Mendel looked at the size and shape of pea plants and seeds). Modern geneticists now look at morphology and attempt to relate appearance to DNA sequence.

    The Human Genome Project has accelerated these changes in the nature of scientific research from manual tasks to computer and machine-driven tasks. This is producing a rapid increase in our understanding of our genes and how they work.


Activity 2. Creating a table of human genetic diseases

Draw up a three-column table with one column headed 'Disease', one headed 'Cause' and one headed 'Comments'.

Do library research to fill in the table for at least five genetic diseases.

In the comments column you could refer to such things as the advantages and disadvantages of diagnosis, the possibility of treatment and the likely impact of the Human Genome Project.

Teachers notes

Students should be able to find information on the following diseases: Huntington disease; cystic fibrosis; thalassaemia; phenylketonuria; haemophilia; Turner syndrome and Klinefelter syndrome.


Activity 3. The Human Genome Project: Advantages and disadvantages

Hold a small group discussion on the advantages and disadvantages of the Human Genome Project.

Use ideas raised by other people in your group to help you decide whether you think the Human Genome Project should have been established.

Take a class vote to find out the overall opinion of the class.

Teachers notes

As the Human Genome Project is a controversial issue there is no right answer. Taking a class vote is only useful as an indicator of the most common opinion in the class. Encourage students who disagree with the majority opinion to realise that, provided their opinions are based on well-reasoned arguments, they are just as acceptable as the majority opinion.

Arguments for could include the unforeseen benefits that come from pure research, and increasing our understanding of human genes and human health by:

  • improving our knowledge of gene expression,
  • elucidating the function of the large proportion of DNA we know little about
  • discovering possible means of diagnosis for some genetic diseases,
  • discovering possible treatments for currently untreatable genetic diseases
  • discovering new tools and techniques for genetic research,
  • generating the ability to go directly from a trait to a gene,
  • identifying genetically validated therapeutic targets which would increase the cost-benefit ratio in pharmaceutical discovery,
  • investigating the development of drug resistance in bacteria,
  • investigating antigenic variation and host-parasite interaction at both the host and parasite level.

In addition there are other spin-offs such as:

  • producing a better understanding of biology (procaryotic and eucaryotic),
  • allowing the rapid development of new crops,
  • increasing the productivity of livestock,
  • breeding pest-resistant plants and animals,
  • documenting commercially useful micro-organisms,
  • improving the quality control of our food and environment, and
  • increasing our taxonomic understanding.

Arguments against could include:

  • the cost – the money could be spent elsewhere,
  • the anguish resulting from knowing that a person has an untreatable genetic disease,
  • the use or misuse of genetic information by such organisations as insurance companies and employers,
  • the ownership of genetic test results,
  • the patenting of human genes and DNA,
  • the increasing gap between rich and poor countries in the quality of life and the level of health and disease treatment,
  • the exploitation of isolated populations in the search for disease genes,
  • the ethics of accumulating genotypic profiles of people - are they able to be used for anything that the researcher wants,
  • decisions about the ownership of data by 'affected' or donor individuals,
  • the ethics of germline gene therapy,
  • the ethics of somatic gene therapy,
  • the costs of genetic treatment versus benefit to the community.


Further reading


Australasian Science
September 2003, pages 38-40
The common thread (by Stephen Luntz)
An interview with Sir John Sulston about how close society came to losing control over ownership of our genes.


August 2003, pages 20-22
The legal helix (by Matthew Rimmer)
Explains the legal issues over access to sequence data that are unravelling public research.


Nature
A collection of Nature articles on human genomics is available.


New Scientist
A collection of articles on genetics and genes is available.


6 December 2006
Genomic atlas of the mouse brain revealed (by Peter Aldhous)
Reports on a genomic atlas of the mouse brain.


22 November 2006
Human DNA is far more varied than thought (by Debora MacKenzie)
Research suggests that humans may differ due to differences in the number of copies of genes.


18 November 2006
Human genome: The end of the beginning (by Andy Coghlan and Nell Boyce)
Looks back on the announcement of the completion of the human genome project.


15 November 2006
Neanderthals have genome chunk sequenced (by Dan Jones)
Announces the sequencing of over a million base pairs of fossilised Neanderthal DNA.


11 October 2006
Neanderthal DNA illuminates split with humans (by Roxanne Khamsi)
Sequencing of Neanderthal DNA suggests that humans and Neanderthals are distant relatives.


19 August 2006, ages 29-36
All about me (by Dan Jones)
Discusses the role of genes in the development of the mind, personality and behaviour.


15 April 2006, page 32
Genome-in-a-day promised as DNA is put through hoops (by Andy Coghlan)
Describes future rapid DNA sequence analysis using magnetic codes.


8 April 2006, pages 38-41
Magic numbers (by Bob Holmes)
Looks at the role of chromosome deletions or duplications – gene copy number – in diseases.


7 March 2006
Many human genes evolved recently (by Melissa Lee Phillips)
Suggests that human genes involved in metabolism, skin pigmentation, brain function and reproduction evolved in response to recent environmental changes.


21 January 2006, pages 8-9
One million people, one medical gamble (by Andy Coghlan)
Describes the ‘Biobank’ project in the UK and US to study the interaction between genes and the environment.


13 April 2005
Gene project will map humans’ global spread (by Michael Le Page and Will Knight)
Describes a project to map the migration of humans across the globe using genetic information.


19 July 2003, page 21
DIY approach to drug discovery (Rachel Nowark)
Reports on a chemical genomics project that would allow academics to carry out early stages of drug development as part of existing research.


RTD info
April 2005
DNA: life's memory
The hundreds of millions of chemical letters in a genome sequence hold the memory of the evolution that led to a species' appearance. The skill lies in deciphering them.


Science
1 July 2005, page 80
Why do humans have so few genes? (by Elizabeth Pennisi)
Explains how humans function with fewer genes than expected.


Scientific American
December 2006, page 35
DNA sequencing on the cheap (Charles Choi)
Reports on low cost DNA sequencing technology.


February 2006, pages 58-65
Owning the stuff of life (by Gary Stix)
Covers the patenting of human genes by companies and organisations.


May 2005, pages 70-73
Molecular treasure hunt (by Gary Stix)
Describes GeneWays software which reads information in scientific literature to find new molecular interactions or pathways.


April 2005, pages 40-47
Alternative genome (by Gil Ast)
Explains alternative gene splicing, which allows one gene to have more than one protein or function.


October 2004, pages 30-37
The hidden genetic program of complex organisms (by John Mattick)
Explains the role of junk DNA and RNA in the control of gene expression.


April 2004, pages 56-63
Evolution encoded (by Stephen Freeland and Laurence Hurst)
Looks at the natural balance between maintaining the integrity of the genome while allowing mistakes for evolution.


Useful sites

The Human Genetic Code – The Human Genome Project and beyond (The Centre for Genetics Education, Australia)

Provides an overview of the Human Genome Project – its history, achievements to date, expected benefits and ethical issues.
http://www.genetics.com.au/pdf/factsheets/fs24.pdf


About the Human Genome Project (National Human Genome Research Institute)

Includes an explanation of the Human Genome Project, the significance of the draft genome published in 2001, and a brief history of the project.
http://www.genome.gov/page.cfm?pageID=10001772


Human Genome Project information (Oak Ridge National Laboratories, USA)

Up-to-date information about a range of aspects of the project. Click on 'About the HGP' for an overview of the project and links to more detailed information. Click on 'New primer now available' for information about genomics and its impact on medicine and society. Scroll down to Project Information and Click on 'Science behind the project' for basic genetics, insights from the draft sequence and post-genome science.
http://www.ornl.gov/techresources/Human_Genome/home.html


Australian Broadcasting Corporation

  • Human DNA surprisingly diverse (News in Science, 23 November 2006)
    Suggests that genetic diversity in humans lies in the number of copies of genes.
    http://abc.net.au/science/news/stories/2006/1795367.htm

  • Neanderthal DNA shows we're quite separate (News in Science, 16 November 2006)
    Sequencing reveals that Neanderthals are distant relatives of modern humans who interbred rarely, if at all, with our own immediate ancestors.
    http://www.abc.net.au/science/news/stories/2006/1790014.htm?ancient

  • Human Genome Project – legal and ethical implications (Science Show, 26 August 2006)
    Discusses the plausibility of some of the scenarios relating to the use of genetic information.
    http://www.abc.net.au/rn/scienceshow/stories/2006/1724056.htm

  • Technology for reading human genomes (Science Show, 8 July 2006)
    Describes the technology used to sequence the human genome and the information obtained from sequencing the DNA of Neanderthal organelles.
    http://www.abc.net.au/rn/scienceshow/stories/2006/1680989.htm#

  • It’s over! Last chromosome sequenced (News in Science, 18 May 2006)
    Announces the completion of the sequencing of human chromosome 1.
    http://www.abc.net.au/science/news/stories/s1641588.htm

  • New DNA map shows variety is spice of life (News in Science, 27 October 2005)
    Scientists have produced a ‘HapMap’ that reveals the genetic differences between individuals.
    http://www.abc.net.au/science/news/tech/InnovationRepublish_1491634.htm

  • Global DNA (The Lab, 22 September 2005)
    Describes the Genographic Project to collect genetic information from over 100,000 people and some of the culturally sensitive issues it raises.
    http://www.abc.net.au/science/features/globaldna/

  • Genius of junk DNA (Catalyst, 10 July 2003)
    An interview with the Australians who hold the international patent on junk DNA.
    http://www.abc.net.au/catalyst/stories/s898887.htm

  • The state of the genome 2001
    From this site you can access information on 'The race', 'The findings', and 'The future'. Click on 'DNA for dummies' for a brief introduction to DNA, chromosomes and genes.
    http://www.abc.net.au/science/slab/genome2001/default.htm

  • What is the human genome and why do we need to sequence it?
    Dr Simon Foote explains what is meant by mapping the human genome, covering in very general terms the approaches that are used.
    http://www.abc.net.au/science/slab/genome/story.htm

  • Human genome sequencing (The Science Show, 1 July 2000)
    Discusses how, and when, the information gained from sequencing the human genome might be used.
    http://www.abc.net.au/rn/science/ss/stories/s146511.htm

  • Implications of the human genome project (7.30 Report, 14 June 2000)
    Explains that mapping the human genome could help scientists develop ways to diagnose, prevent and treat human diseases.
    http://www.abc.net.au/7.30/stories/s140129.htm


A gene map of the human genome (US National Centre for Biotechnology)

Illustrates the genetic map for each of the human chromosomes. Several genes on each chromosome are illustrated with electron-micrographs or diagrams. There is a link to an article in Science, describing new mapping techniques in detail. A technical site.
http://www.ncbi.nlm.nih.gov/SCIENCE96/


Genetics home reference – your guide to understanding genetic conditions (National Institutes of Health, USA)

Provides information about genetic conditions and the genes or chromosomes responsible for those conditions.
http://ghr.nlm.nih.gov/


Human gene testing (Beyond Discovery, National Academy of Sciences, USA)

This article explores the trail of basic research (eg, discovery of the structure of DNA, breaking the code, and mapping the DNA to locate specific genes) that opened the door to gene testing. (A PDF file of the complete article is available.)
http://www.beyonddiscovery.org/content/view.article.asp?a=239


Glossary

allele. Genes can exist in more than one form. Each different form of the same gene is called an allele. For example, in the case of seed shape, there is one allele that determines wrinkled seeds and another allele that determines round seeds.

base (in DNA). Any one of four nitrogenous (nitrogen-containing) bases (adenine, thymine, guanine and cytosine). The sequence of the bases in DNA determines the sequence of amino acids in all proteins found in living things.

base pairs. Two bases held together by weak chemical bonds. The double helix shape of DNA is dependent on its two strands being held together by the bonds between the base pairs. In DNA, the bases that pair are adenine with thymine and guanine with cytosine.

chromosome. A long DNA molecule that contains the genes of the organism. Chromosomes are visible in cells during cell division.

DNA (deoxyribonucleic acid). The nucleic acid forming the genetic material of all organisms with the exception of some viruses which have RNA. DNA is present in the nucleus and other organelles such as mitochondria and chloroplasts.

gene. The basic unit of inheritance. A gene is a segment of DNA that specifies the structure of a protein or an RNA molecule.

genetic diseases. Those diseases (malfunctions) that result from abnormalities in chromosomes or DNA, and are inherited.

genetic map (linkage map). A map showing the sequence of genes on chromosomes.

genome. The total genetic material of an individual or species.

linked. The association of traits that occurs when the genes coding for them are on the same chromosome.

locus (plural loci). The position on a chromosome of a gene or other chromosome marker.

map. A plan of the linear sequence of chromosomes.

mapping. Constructing a plan (or map) of the linear sequence of chromosomes.

physical map. A map showing the location of sites (loci) on a chromosome.


External sites are not endorsed by the Australian Academy of Science.
Posted February 1997.

The Australian Foundation for Science is also a supporter of Nova.

This topic is sponsored by the Australian Government's National Innovation Awareness Strategy.


© Australian Academy of Science