Deciphering colorectal cancer genetics through multi-omic analysis of 100,204 cases and 154,587 controls of European and East Asian ancestries

  1. Fernandez-Rozadilla, Ceres 1
  2. Timofeeva, Maria 1
  3. Zhishan Chen
  4. Law, Philip
  5. Minta Thomas
  6. Schmit, Stephanie
  7. Díez-Obrero, Virginia
  8. Hsu, Li
  9. Fernandez-Tajes, Juan
  10. Palles, Claire
  11. Sherwood, Kitty
  12. Briggs, Sarah
  13. Svinti, Victoria
  14. Donnelly, Kevin
  15. Farrington, Susan
  16. Blackmur, James
  17. Vaughan-Shaw, Peter
  18. Xiao-Ou Shu
  19. Jirong Long
  20. Qiuyin Cai
  21. Xingyi Guo
  22. Yingchang Lu
  23. Broderick, Peter
  24. Studd, James
  25. Huyghe, Jeroen
  26. Harrison, Tabitha
  27. Conti, David
  28. Dampier, Christopher
  29. Devall, Mathew
  30. Schumacher, Fredrick
  31. Melas, Marilena
  32. Rennert, Gad
  33. Obón-Santacana, Mireia
  34. Martín-Sánchez, Vicente
  35. Moratalla-Navarro, Ferran
  36. Oh, Jae Hwan
  37. Jeongseon Kim
  38. Jee, Sun Ha
  39. Jung, Keum Ji
  40. Sun-Seog Kweon
  41. Shin, Min-Ho
  42. Aesun Shin
  43. Ahn, Yoon-Ok
  44. Kim, Dong-Hyun
  45. Oze, Isao
  46. Wanqing Wen
  47. Keitaro Matsuo
  48. Matsuda, Koichi
  49. Tanikawa, Chizu
  50. Zefang Ren
  51. Yu-Tang Gao
  52. Jia, Wei-Hua
  53. Hopper, John
  54. Jenkins, Mark
  55. Aung Ko Win
  56. Rish Pai
  57. Figueiredo, Jane
  58. Haile, Robert
  59. Gallinger, Steven
  60. Woods, Michael
  61. Newcomb, Polly
  62. Duggan, David
  63. Cheadle, Jeremy
  64. Kaplan, Richard
  65. Maughan, Timothy
  66. Kerr, Rachel
  67. Kerr, David
  68. Kirac, Iva
  69. Böhm, Jan
  70. Lukka-Pekka Mecklin
  71. Jousilahti, Pekka
  72. Knekt, Paul
  73. Aaltonen, Lauri
  74. Rissanen, Harri
  75. Pukkala, Eero
  76. Eriksson, Johan
  77. Cajuso, Tatiana
  78. Hänninen, Ulrika
  79. Kondelin, Johanna
  80. Palin, Kimmo
  81. Tanskanen, Tomas
  82. Renkonen-Sinisalo, Laura
  83. Zanke, Brent
  84. Männistö, Satu
  85. Albanes, Demetrius
  86. Weinstein, Stephanie
  87. Ruiz-Narvaez, Edward
  88. Palmer, Julie
  89. Buchanan, Daniel
  90. Platz, Elizabeth
  91. Visvanathan, Kala
  92. Ulrich, Cornelia
  93. Siegel, Erin
  94. Brezina, Stefanie
  95. Gsur, Andrea
  96. Campbell, Peter
  97. Chang-Claude, Jenny
  98. Hoffmeister, Michael
  99. Brenner, Hermann
  100. Slattery, Martha
  101. Potter, John
  102. Tsilidis, Konstantinos
  103. Schulze, Matthias
  104. Gunter, Marc
  105. Murphy, Neil
  106. Castells, Antoni
  107. Castellví-Bel, Sergi
  108. Moreira, Leticia
  109. Arndt, Volker
  110. Shcherbina, Anna
  111. Stern, Mariana
  112. Bens Pardamean
  113. Bishop, Timothy
  114. Giles, Graham
  115. Southey, Melissa
  116. Idos, Gregory
  117. McDonnell, Kevin
  118. Zomoroda Abu-Ful
  119. Greenson, Joel
  120. Shulman, Katerina
  121. Lejbkowicz, Flavio
  122. Offit, Kenneth
  123. Su, Yu-Ru
  124. Steinfelder, Robert
  125. Temitope Keku
  126. Van Guelpen, Bethany
  127. Hudson, Thomas
  128. Hampel, Heather
  129. Pearlman, Rachel
  130. Berndt, Sonja
  131. Hayes, Richard
  132. Martinez, Marie Elena
  133. Thomas, Sushma
  134. Corley, Douglas
  135. Pharoah, Paul
  136. Larsson, Susanna
  137. Yen, Yun
  138. Heinz-Josef Lenz
  139. White, Emily
  140. Li, Li
  141. Doheny, Kimberly
  142. Pugh, Elizabeth
  143. Shelford, Tameka
  144. Chan, Andrew
  145. Cruz-Correa, Marcia
  146. Lindblom, Annika
  147. Hunter, David
  148. Joshi, Amit
  149. Schafmayer, Clemens
  150. Scacheri, Peter
  151. Anshul Kundaje
  152. Nickerson, Deborah
  153. Schoen, Robert
  154. Hampe, Jochen
  155. Zsofia Stadler
  156. Vodicka, Pavel
  157. Vodickova, Ludmila
  158. Vymetalkova, Veronika
  159. Papadopoulos, Nickolas
  160. Chistopher Edlund
  161. Gauderman, William
  162. Thomas, Duncan
  163. Shibata, David
  164. Toland, Amanda
  165. Markowitz, Sanford
  166. Kim, Andre
  167. Chanock, Stephen
  168. Franzel Van Duijnhoven
  169. Feskens, Edith
  170. Sakoda, Lori
  171. Gago-Dominguez, Manuela
  172. Wolk, Alicja
  173. Naccarati, Alessio
  174. Pardini, Barbara
  175. FitzGerald, Liesel
  176. Lee, Soo Chin
  177. Ogino, Shuji
  178. Bien, Stephanie
  179. Kooperberg, Charles
  180. Li, Christopher
  181. Lin, Yi
  182. Prentice, Ross
  183. Conghui Qu
  184. Bézieau, Stéphane
  185. Tangen, Catherine
  186. Mardis, Elaine
  187. Yamaji, Taiki
  188. Sawada, Norie
  189. Iwasaki, Motoki
  190. Haiman, Christopher
  191. Loic Le Marchand
  192. Wu, Anna
  193. Chenxu Qu
  194. McNeil, Caroline
  195. Coetzee, Gerhard
  196. Hayward, Caroline
  197. Deary, Ian
  198. Harris, Sarah
  199. Evropi Theodoratou
  200. Reid, Stuart
  201. Walker, Marion
  202. Ooi, Li Yin
  203. Moreno, Victor
  204. Casey, Graham
  205. Gruber, Stephen
  206. Tomlinson, Ian
  207. Zheng, Wei
  208. Dunlop, Malcolm 1
  209. Houlston, Richard
  210. Peters, Ulrike
  211. Mostrar todos los/as autores/as +
  1. 1 UoE

Editor: Zenodo

Año de publicación: 2022

Tipo: Dataset

CC BY 4.0

Resumen

<strong>Colorectal cancer (CRC) is a leading cause of mortality worldwide. We conducted a genome-wide association study meta-analysis of 100,204 CRC cases and 154,587 controls of European and Asian ancestry, identifying 205 independent risk associations, of which 50 were unreported. We performed integrative genomic, transcriptomic and methylomic analyses across large bowel mucosa and other tissues. Transcriptome- and methylome-wide association studies revealed an additional 53 risk associations. We identified 155 high confidence effector genes functionally linked to CRC risk, many of which had no previously established role in CRC. These have multiple different functions, and specifically indicate that variation in normal colorectal homeostasis, proliferation, cell adhesion, migration, immunity and microbial interactions determines CRC risk. Cross-tissue analyses indicated that over a third of effector genes most likely act outside the colonic mucosa. Our findings provide insights into colorectal oncogenesis, and highlight potential targets across tissues for new CRC treatment and chemoprevention strategies.</strong> <strong>The data submitted here are expression and methylation models with LD reference data for the transcriptome-wide (TWAS), methylome-wide (MWAS) and transcript isoform-wide association study (TIsWAS) as described in the manuscript "Deciphering colorectal cancer genetics through multi-omic analysis of 100,204 cases and 154,587 controls of European and East Asian ancestries". Details of the methods are presented in the method section and supplementary information file. </strong> <strong>TWAS analysis </strong> Gene expression models for the six in-house expression datasets were generated using the PredictDB v7 pipeline for a total of 1,077 participants. Elastic net model building with 10-fold cross-validation was performed independently for each dataset. The elastic net models for GTEx v8 Colon Transverse were obtained from the PredictDB data repository (http://predictdb.org/) and had been generated using the same pipeline. Models were computed using HapMap2 SNPs ±1Mb from each gene, together with covariate factors estimated using PEER32, clinical covariates when appropriate (age, sex and, where appropriate, case-control status, type of polyp and anatomic location in the colorectum), and three PCs from the individual dataset’s SNP genotype data. Transcript-based TWAS analyses (TIsWAS) were likewise performed by using transcript-level data from the SOCCS, BarcUVa-Seq and GTEx Colon Transverse datasets. <strong>MWAS analysis </strong> Methylation beta values were calculated based on the manufacturer’s standard, ranging from 0 to 1. Quality control and data normalization were performed in R using the ChAMP software pipeline for the EPIC and 450K arrays. Briefly, we filtered out failed probes with detection P &gt; 0.02 in &gt;5% of samples, probes with &lt;3 reads in &gt;5% of samples per probe and all non-CpG probes. Samples with failed probes &gt;0.1 were also excluded from downstream analyses. We discarded all probes with SNPs within 10bp of the interrogated CpG (from 1,000 Genomes Project, CEU population)34, and probes that ambiguously mapped to multiple locations in the human genome with up to two mismatches33. We only considered probes mapping to autosomes and those overlapping between the EPIC and the 450K arrays. Normalization was achieved using the Beta MIxture Quantile (BMIQ) method. Per probe methylation models were created using the PredictDB pipeline on the normalized methylation matrix and the genotypes as per TWAS eQTL analysis. To optimize power, we restricted our analysis to 263,341-238,443 (for the 450K array) and 377,678 (for the EPIC array) probes annotated to Islands, Shores and Shelves, and discarded “Open Sea” regions.