Tissue heterogeneity

From Wikipedia, the free encyclopedia

Tissue heterogeneity refers to the fact that data generated with biological samples can be compromised by cells originating from other tissues or organs than the target tissue or organ of profiling.[1] It can be caused by biological processes (such as immune cell infiltration), sample contamination, or mistakes in sample labelling. Tissue heterogeneity affects commonly used, reference gene expression datasets such as the Genotype-Tissue Expression Project (GTEx).[2]

Cancer samples often display varying degree of heterogeneity, because they consist of tumor cells of multiple subclones, immune cells, and other cell types. Beyond cancer, many gene expression studies are affected by tissue heterogeneity. The prevalence of tissue heterogeneity in publicly available gene-expression studies is estimated between 1% and 40%, varying by tissues of origin.[3]

Detected tissue heterogeneity may be used to weight samples in differential gene-expression analysis to reduce the impact of the heterogeneity. Alternatively, the gene expression profile may be analyzed by cellular deconvolution algorithms to infer the composition of cell types.

References[edit]

  1. ^ Zhang, Jitao David; Hatje, Klas; Sturm, Gregor; Broger, Clemens; Ebeling, Martin; Burtin, Martine; Terzi, Fabiola; Pomposiello, Silvia Ines; Badi, Laura (2017). "Detect tissue heterogeneity in gene expression data with BioQC". BMC Genomics. 18: 277. doi:10.1186/s12864-017-3661-2. ISSN 1471-2164. PMC 5379536. Retrieved 2017-04-10.
  2. ^ Nieuwenhuis, Tim O.; Yang, Stephanie Y.; Verma, Rohan X.; Pillalamarri, Vamsee; Arking, Dan E.; Rosenberg, Avi Z.; McCall, Matthew N.; Halushka, Marc K. (22 April 2020). "Consistent RNA sequencing contamination in GTEx and other data sets". Nature Communications. 11 (1): 1933. doi:10.1038/s41467-020-15821-9. PMC 7176728.
  3. ^ Sturm, Gregor; List, Markus; Zhang, Jitao David (23 June 2021). "Tissue heterogeneity is prevalent in gene expression studies". NAR Genomics and Bioinformatics. 3 (3): lqab077. doi:10.1093/nargab/lqab077. PMC 8415427. PMID 34514392.