Major flaw found in standard approach to gene expression analysis

Friday, 26 October, 2012

Common assumptions employed in the generation and interpretation of data from global gene expression analyses can lead to seriously flawed conclusions about gene activity and cell behaviour, according to Whitehead Institute researchers.

“Expression analysis is one of the most commonly used methods in modern biology,” said Whitehead Member Richard Young. “So we are concerned that flawed assumptions may affect the interpretation of many biological studies.”

Much of today’s interpretation of gene expression data relies on the assumption that all cells being analysed have similar total amounts of messenger RNA (mRNA), the roughly 10% of a cell’s RNA that acts as a blueprint for protein synthesis. However, some cells, including aggressive cancer cells, produce several times more mRNA than other cells. Traditional global gene expression analyses have typically ignored such differences.
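
A toy calculation shows how this assumption can mask a real difference. The sketch below uses invented transcript counts for two hypothetical cells, one of which is assumed to produce three times as much of every mRNA. The gene names, numbers and function name are illustrative assumptions, not data or methods from the study.

```python
def fraction_of_total(counts):
    """Scale each gene's count by the sample total, as total-based normalisation does."""
    total = sum(counts.values())
    return {gene: n / total for gene, n in counts.items()}

# Hypothetical transcripts per cell (invented numbers, for illustration only).
low_cell = {"geneA": 100, "geneB": 300, "geneC": 600}
# Assumption: the second cell makes three times as much of every mRNA.
high_cell = {gene: 3 * n for gene, n in low_cell.items()}

low_norm = fraction_of_total(low_cell)
high_norm = fraction_of_total(high_cell)

for gene in low_cell:
    true_ratio = high_cell[gene] / low_cell[gene]
    apparent_ratio = high_norm[gene] / low_norm[gene]
    print(f"{gene}: true fold change {true_ratio:.1f}, "
          f"apparent fold change {apparent_ratio:.1f}")
```

Because every gene rises by the same factor, each gene's share of the total is unchanged, so the two cells look identical once the data are scaled to the same total.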

“We’ve highlighted this common assumption in gene expression analysis that potentially affects many researchers,” said Tony Lee, a scientist in Young’s lab and a corresponding author of the article published in Cell. “We provided a concrete example of the problem and a solution that can be implemented by investigators.”

Members of the Young lab recently uncovered the flaw while investigating gene expression in cancer cells with high levels of c-Myc, a gene regulator known to be highly expressed in aggressive cancer cells. When comparing cells with high and low c-Myc levels, they were surprised to find that different approaches to gene expression analysis gave very different results. Further investigation revealed striking differences in the total amounts of RNA in the high-c-Myc and low-c-Myc cells, yet these differences were masked by commonly used experimental and analytical methods.

“The different results we saw from different methods of gene expression analysis were shocking and led us to reinvestigate the whole process on several platforms,” said Jakob Lovén, postdoctoral researcher in Young’s lab and co-author of the Cell paper. “We then realised that the common assumption that cells contain similar levels of mRNA is badly flawed and can lead to serious misinterpretations, particularly with cancer cells that can have very different amounts of RNA.”

As well as delineating the problem, the Whitehead scientists describe a remedy. By using synthetically produced mRNAs, called RNA spike-ins, as standardised controls, researchers can compare experimental data without making assumptions about total cell RNA amounts. The remedy applies to all three gene expression analysis platforms they studied.
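
The idea behind a spike-in control can be sketched with a small simulation. It assumes, for illustration, that the same known amount of synthetic RNA is added per cell to each sample and that the measurement reports every RNA as a share of a fixed signal budget. The counts, the `spike_in` label and the function names are hypothetical, and this is not the authors' published protocol.

```python
SPIKE = "spike_in"  # hypothetical label for the synthetic control RNA

def measure(sample, depth=1_000_000):
    """Simulate a measurement that reports each RNA as a share of a fixed budget."""
    total = sum(sample.values())
    return {rna: depth * n / total for rna, n in sample.items()}

def spike_in_normalise(signal):
    """Express each gene's signal relative to the spike-in signal in the same sample."""
    reference = signal[SPIKE]
    return {rna: s / reference for rna, s in signal.items() if rna != SPIKE}

# Hypothetical transcripts per cell (invented numbers, for illustration only).
low_cell = {"geneA": 100, "geneB": 300, "geneC": 600}
high_cell = {gene: 3 * n for gene, n in low_cell.items()}

# Assumption: the same amount of spike-in RNA is added per cell to both samples.
low_sample = {**low_cell, SPIKE: 50}
high_sample = {**high_cell, SPIKE: 50}

low_rel = spike_in_normalise(measure(low_sample))
high_rel = spike_in_normalise(measure(high_sample))

fold_changes = {gene: high_rel[gene] / low_rel[gene] for gene in low_cell}
for gene, fold_change in fold_changes.items():
    print(f"{gene}: fold change relative to spike-in {fold_change:.1f}")
```

Because the spike-in amount per cell is fixed, dividing by its signal cancels the sample-wide scaling, and the threefold difference between the two hypothetical cells becomes visible again.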

Although the researchers believe the use of RNA spike-ins should become the new standard for global gene expression analyses, questions are likely to persist about the interpretations of much prior research.

“There are over 750,000 expression datasets in public databases, and because they generally lack information about the cell numbers used in the analysis, it is unclear whether they can be re-examined in order to validate the original interpretation,” said David Orlando, a scientist in the Young lab. “It may be necessary to reinvestigate some important concepts.”
