Microarray Data Standardisation


Microarray Gene Expression Database group - MGED
December, 2000

Public data repositories for microarray data
There is a growing consensus in the life science community on the need for public repositories of gene expression data, analogous to DDBJ/EMBL/GenBank for sequences.

Some of the reasons:
- Gradually building up gene expression profiles for various organisms, tissues, cell types, developmental stages and states, and under the influence of various compounds
- Building up systematic knowledge about gene functions and networks through links to other genomics databases
- Comparison of profiles, and access to and analysis of data by third parties
- Cross-validation of results and platforms - quality control

Systematic gene expression profiling initiatives in public domain
The International Life Sciences Institute (ILSI) is coordinating a program undertaken by 25 pharmaceutical and food companies to:
- generate toxicity-related gene expression data under defined experimental conditions
- evaluate gene expression profiles in standardised test systems following exposure to toxicants
- relate changes in gene expression to other measures of toxicity

Microarray data handling and analysis - a major bottleneck
(Calculations by Jerry Lanfear)
Experiments:
- 100 000 genes in human
- 320 cell types
- 2000 compounds
- 3 time points
- 2 concentrations
- 2 replicates
Data:
- 8 x 10^11 data-points
- 1 x 10^15 bytes = 1 petabyte of data

Expression data repository projects
- Public repositories in the making: GEO - NCBI; GeneX - NCGR; ArrayExpress - EBI
- In-house databases - Stanford, MIT, University of Pennsylvania
- Organism-specific databases - mouse at Jackson
- Proprietary databases - Gene Logic, NCI

Difficulties
- Raw data are images
- What is needed for higher-level analysis and mining is a gene expression matrix (genes/samples/gene expression levels)
- Lack of standard measurement units for gene expression
- Lack of standards for sample annotation

Raw data - images
- Treated sample labeled red (Cy5)
- Control sample labeled green (Cy3)
- Competitive hybridization onto chip
- Red dot - gene overexpressed in treated sample
- Green dot - gene underexpressed in treated sample
- Yellow - equally expressed
- Intensity - "absolute" level
- red/green - ratio of expression: 2 means 2x overexpressed, 0.5 means 2x underexpressed
- log2(red/green) - "log ratio": 1 means 2x overexpressed, -1 means 2x underexpressed
[Image: cDNA spotted microarray, Stanford University (yeast, 1997)]

Gene expression matrix
[Figure: gene expression matrix; axes: samples and genes; cell values: gene expression levels]

Gene expression levels
What we would like to have:
- gene expression levels expressed in some standard units (e.g. molecules per cell)
- a reliability measure associated with each value (e.g. standard deviation)
What we do have:
- each experiment using different units
- no reliability information

Comparing expression data
[Figure-only slides; no text recovered]

Measurement units
In perspective:
- standard controls for experiments (on chips and in the samples)
- replicate measurements
Temporary solution:
- storing intermediate analysis results (including the images) and annotations of how they were obtained - i.e., the evidence

Comparing expression data - problem 2
- How do gene names relate across different data matrices?
- How do samples relate across different data matrices?

Sample annotation
- Gene expression data have meaning only in the context of the experimental conditions of the target system
- Controlled vocabularies and ontologies (species, cell types, compound nomenclature, treatments, etc.) are needed for unambiguous sample annotation
- Sample annotations in current public databases are typically useless

In perspective
- Standard units for gene expression measurements
- Standards for sample annotation

More immediate actions
- Understand what information about microarray experiments should be captured to make the descriptions reasonably self-contained
- Develop a data exchange format able to capture this minimum information
- Develop recommendations on how data should be normalised and what controls should be used

MGED group
The MGED group is an open discussion group initially established at the Microarray Gene Expression Database meeting MGED 1 (14-15 November, 1999, Cambridge, UK). The goal of the group is to facilitate the adoption of standards for DNA-array experiment annotation and data representation, as well as the introduction of standard experimental controls and data normalisation methods. The underlying goal is to facilitate the establishment of gene expression data repositories, the comparability of gene expression data from different sources, and the interoperability of different gene expression databases and data analysis software. Since 1999 the group has had two general meetings, and the third one is planned for 2001. For more see www.mged.org

MGED participants include
Affymetrix, Berkeley, DDBJ, DKFZ, EMBL, Gene Logic, Incyte, Max Planck Institute, NCBI, NCGR, NHGRI, Sanger Centre, Stanford, Uni Pennsylvania, Uni Washington, Whitehead Institute
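
The arithmetic behind the "major bottleneck" slide can be checked with a short sketch. The six experiment factors and the 1 x 10^15 byte total come from the slide. The bytes-per-data-point figure is only derived here from those two numbers; the slides do not state it, and the variable names are illustrative.

```python
from math import prod

# Experiment factors as listed on the slide.
factors = {
    "genes": 100_000,
    "cell_types": 320,
    "compounds": 2_000,
    "time_points": 3,
    "concentrations": 2,
    "replicates": 2,
}

# Every combination of factors yields one measurement.
data_points = prod(factors.values())

# Total storage quoted on the slide (1 petabyte taken as 1e15 bytes).
total_bytes = 1e15

# Derived, not stated in the slides: implied storage per data point.
bytes_per_point = total_bytes / data_points

print(f"data points: {data_points:.2e}")
print(f"implied bytes per data point: {bytes_per_point:.0f}")
```

The product is 7.68 x 10^11, which the slide rounds to 8 x 10^11. The implied budget of roughly 1.3 kB per data point is consistent with the raw data being images rather than single numbers, although the slides do not spell out how the petabyte estimate was reached.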
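
The ratio and log ratio described on the "Raw data - images" slide can be illustrated as follows. This is a minimal sketch: the function name `log_ratio` and the example intensities are assumptions made for illustration, and real pipelines would apply background correction and normalisation first, which the slides do not specify.

```python
import math


def log_ratio(red: float, green: float) -> float:
    """Return log2(red/green) for one spot.

    red   - intensity of the treated sample (Cy5)
    green - intensity of the control sample (Cy3)
    """
    if red <= 0 or green <= 0:
        raise ValueError("intensities must be positive to take a log ratio")
    return math.log2(red / green)


# Hypothetical intensities chosen to reproduce the slide's examples.
two_fold_up = log_ratio(200.0, 100.0)    # ratio 2   -> log ratio 1
two_fold_down = log_ratio(100.0, 200.0)  # ratio 0.5 -> log ratio -1
unchanged = log_ratio(150.0, 150.0)      # ratio 1   -> log ratio 0 (yellow spot)

print(two_fold_up, two_fold_down, unchanged)
```

The log scale makes over- and underexpression symmetric around zero, whereas the plain ratio squeezes all underexpression into the interval between 0 and 1.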
