JBDJournal Of Bioinformatics And Diabetes2374-9431Open Access PubUnited StatesJBD-13-22610.14302/issn.2374-9431.jbd-13-226research-articleBioinformatic Resources for Diabetic NephropathyAmyJayne McKnight1*AlexanderPeter Maxwell1Nephrology Research, Centre for Public Health, Queen’s University of BelfastCorresponding authorLiXia1Department of Medicine, Stanford University School of MedicineDr A.J. McKnight; Nephrology Research, Centre for Public Health ; c/o Regional Genetics Centre, Level A, Tower Block ; Belfast City Hospital, Lisburn Road, Co. Antrim, BT9 7AB ; Northern Ireland, United Kingdom ; Tel: +44 (0)2890 63480 ; Fax: 028 90 235900 ;a.j.mcknight@qub.ac.ukDr Amy Jayne McKnight is a senior lecturer in bioinformatics (genetic epidemiology) and Professor Alexander Peter Maxwell is a consultant nephrologist and professor of renal medicine.
The authors have declared that no competing interests exist.
The number of individuals with diabetes is increasing worldwide and a large subset of those affected will develop diabetic nephropathy. Diabetic nephropathy is the leading cause of end-stage renal disease, has serious health consequences for affected individuals, and represents a major monetary cost to healthcare providers.
Technological and analytical developments have enabled large-scale, collaborative studies that are revealing risk factors associated with diabetic nephropathy. However, much of the inherited predisposition and biological mechanisms underpinning risk of this disease remain to be identified. Meta-analyses and integrated pathway studies are becoming an increasingly important part of research for diabetic nephropathy including, genetic, epigenetic, transcriptomic, proteomic research, clinical observations and the development of animal models.
This report highlights current bioinformatic resources and standards of reporting to maximise interdisciplinary research for diabetic nephropathy. The identification of an -Omics profile that can lead to earlier diagnosis and / or offer improved clinical evaluation of individuals with diabetes would not only provide significant health benefits to affected individuals, but may also have major utility for the efficient use of healthcare resources.
Diabetes is a major public health concern with rates of diabetes increasing globally and approximately 40% of affected individuals developing diabetic nephropathy 1234. Diabetic nephropathy is the leading cause of end-stage renal disease and represents a substantial cost to healthcare providers 56. Strategies that can help predict those individuals at higher risk of developing diabetic nephropathy, improve understanding of the pathogenesis of this disease, or suggest novel targets for optimised therapies are urgently required. With the increasing size and complexity of research studies, bioinformatics has become an essential discipline to help unravel the biological mechanisms that lead to diabetic nephropathy and end-stage renal disease in individuals with diabetes. Clinically-based resources such as routine laboratory measurements, hospitalisation records, treatment regimens and patient outcomes may help inform strategic planning, change healthcare policy, and contribute to ‘basic’ research discoveries 78. Epidemiological studies confirm inherited risk factors influence the development and progression of diabetic nephropathy, however identifying clinically useful biomarkers and effective therapies is proving to be considerably more challenging. Recent technological advances enable cost-effective investigations of functional risk factors for diabetic nephropathy including genetic, epigenetic, transcriptomic, proteomic and metabolomic pathways coupled with data from clinical observations and animal models of diabetic kidney disease. Analysing integrated networks and pathways from rich and diverse data sources, often using systems biology-based approaches, is becoming an important component of diabetic nephropathy research.
Genetic Studies:
Genetic epidemiology is moving away from single SNP studies, towards an emphasis on the comprehensive analysis of a candidate gene [candidate gene 9,10 or systematic literature reviews and meta-analyses 11,12,13,14,15. Several genome-wide association studies (GWAS) have been performed for diabetic nephropathy 16,17,18,19,20,21, but only two independent GWAS datasets are publicly available via dbGaP: (i) GoKinD US 19 and (ii) All Ireland-Warren3-GoKinD UK 21 collections. Recently, the GENIE consortium completed the first meta-analysis of GWAS for diabetic nephropathy with subsequent replication in more than 12,500 individuals, 21. Ongoing projects involve more comprehensive association studies in larger discovery cohorts together with deep next-generation resequencing to identify more elusive rare variants that may contribute to diabetic nephropathy. These types of studies maximise the chance of finding true genetic signals that influence risk of diabetic kidney disease, or the more extreme end-stage renal disease in individuals, but pose substantial challenges in terms of archiving the data so that it is usefully accessible to other researchers. ‘Raw’ datasets that are available to bona fide researchers are ideal in that they facilitate downstream analyses by whichever methods are most appropriate for individual applications (Table 1). Several older resources such as T2D-db 22 (http://t2ddb.ibab.ac.in, last updated for type 2 diabetes in 2009) and corgi 23 (http://go.qub.ac.uk/kidney-corgi, last updated for kidney genes in 2011) also contain useful data that support and promote interdisciplinary research.
Web-based resources
Resource
Description
Link
GUDMAPGenitoUrinary Development Molecular Anatomy Project
Curated, gene expression datasets in development transgenic mice
www.gudmap.org
KUPKBThe Kidney and Urinary Pathway Knowledge Base
-Omics datasets from scientific publications and other renal databases
http://www.kupkb.org
Nephromine
Comprises renal gene expression profiles
www.nephromine.org
T1DBASEType 1 Diabetes Database
Curated, integrated datasets informing genetics across species
www.t1dbase.org
DiaCompDiabetic complications consortium
Data on animal models for diabetic complications, including nephropathy.
www.diacomp.org
dkCOINNational Institute of Diabetes, Digestive and Kidney Diseases (NIDDK) Consortium Interconnectivity Network
Toolkit of interconnected resources (datasets, reagents, and protocols)
generated from individual consortia
www.dkcoin.org
dbGAP: Genotype-phenotype association studies
Case-control study for nephropathy in type 1 diabetes with 1801 participants using Illumina Omni1-quad
http://www.ncbi.nlm.nih.gov/proects/gap
phs000088.v1.p1
phs000018.v1.p1
Susceptibility Genes for Diabetic Nephropathy in Type 1 Diabetes (GoKinD study participants and parents), NIDDK
Case-control study for nephropathy in type 1 diabetes with 1825 participants using Affymetrix 500K set
phs000302.v1.p1
Genetic Study on Nephropathy in Type-2 Diabetes
CC study for nephropathy in type 2 diabetes with 350 participants using Illumina 370CNV array.
phs000333.v1.p1
Family Investigation of Nephropathy and Diabetes (FIND) Study
CC study for nephropathy in type 2 diabetes with 2622 participants using Affymetrix 6.0
Case-control study for nephropathy in type 1 diabetes with 1801 participants using Illumina Omni1-quad
GEO: Gene expression omnibus
http://www.ncbi.nlm.nih.gov/geo/
GSE20067
Case-control approach on 192 individuals using Illumina’s Infinium 27k methylation beadchip
GSE1009
Expression profiling on 6 kidney samples using Affymetrix Human Genome U95 Version 2
GDS3649
Analysis of HK2 proximal tubular cells using Illumina HumanWG-6 v3.0 expression beadchip
GDS961
Case-control comparison of glomeruli
Comprehensive clinical and demographic information is very important when researchers combine data from multiple studies 24. Knowing the precise phenotype, inclusion/exclusion criteria, and potential confounding factors such as duration of diabetes and ancestry are critical to derive robust findings from meta-analyses.
Quality control is another essential element of all genetic studies, particularly in larger-scale studies where systematic bias may substantially affect the results; stringent quality control was highlighted as very important for a diabetic nephropathy GWAS study 25. Standardised guidelines have been suggested to help evaluate published genetic association studies and improve transparency of reporting; STrengthening the REporting of Genetic Association Studies (STREGA): an extension of the statement 26.
Gene Expression Studies:
Multiple studies have been reported that suggest transcriptomic differences between individuals with and without diabetic nephropathy. Traditionally larger-scale studies of the transcriptome were conducted using DNA microarrays that comply with reporting standards designed to improve reliability and confidence in outcomes such as MIAME 27 and MAQC 28. Several transcriptomic studies are publicly available in the Gene Expression Omnibus 29, Nephromine 30, GUDMAP 31, and KUPKB 32 RNA-seq is a powerful sequence-based method that enables researchers to discover RNA biomarkers, novel isoforms, and to profile and quantify entire RNA transcripts across the transcriptome. RNA-seq may also provide insights into the potential functional impact of epigenetic modification to DNA and histones 33,34. RNA-Seq will provide more reliable, precise and informative measurements of the transcriptome, however challenges remain in terms of the sheer quantity of data generated and researcher’s unfamiliarity with this rapidly developing technique. Nonetheless, RNA-Seq is generating novel insights for the kidney transcriptome that are relevant for diabetic nephropathy 35.
Epigenetic Studies:
Epigenetic modifications of the genome contribute to disease susceptibility, however much of the “inherited” epigenetic architecture remains unexplained. Emerging evidence for epigenetic phenomena has transformed investigations of heritable influences on disease and, complementary to genome-wide association studies (GWAS), it is now cost-effective to perform population-based studies of the epigenome 36,37. Epigenetic modifications modulate gene expression without changing the DNA sequence; these may be either stably inherited or dynamic epigenetic marks. Methylation is a key epigenetic feature that plays an important role in chromosomal integrity and regulation of gene expression with different methylation profiles now being associated with many complex diseases, including diabetes 38,39. Initial studies support an important role for differential methylation in diabetic nephropathy40,41, however as yet only one dataset is publicly available via the Gene Expression Omnibus 29 It is feasible that methylation profiles may lead to clinically useful biomarkers or direct researchers to novel therapeutic targets in individuals with diabetes. The identification of a genetic-epigenetic profile that can lead to earlier diagnosis and / or offer improved clinical evaluation would not only provide significant healthcare benefits to affected individuals, but may also have major utility for the efficient use of monetary resources
Other epigenetic features include chromatin regulation and RNA interference. Histone modifications do play a role in diabetic nephropathy 42, but large scale studies are not yet available. MicroRNAs have been an area of intense interest in recent years, with several markers highlighted with functionally important to modulate diabetic nephropathy 43,44,45. Non-protein coding RNAs are attractive targets for therapeutic intervention and as clinically useful biomarkers in the development of diabetic nephropathy. It is possible that epigenetic regulation of gene expression may represent a major contribution for diabetic nephropathy. An epigenomics resource at the National Center for Biotechnology Information (NCBI) has been created to serve as a comprehensive public repository for whole-genome epigenetic data sets 46.
Proteome Studies: Diabetic nephropathy involves a complex interaction of biological processes and proteomic analysis represents a potentially powerful approach to identify clinically relevant biomarkers. Centralised repositories exist for proteomic data such as the PRIDE (PRoteomics IDEntifications database; www.ebi.ac.uk/pride), and the Human Metabolome Database 47 has been developed for metabolomic data, but broadly accepted experimental and reporting standards for large-scale studies are still under development 48,49,50. Promising biomarkers for diabetic nephropathy are being suggested from multicentre collaborations and the integration of experimental and clinical data 51,52.
An Integrated Approach:
Efficient bioinformatic tools are becoming increasingly important to maximise the outcomes from individual and collaborative multi-centre research programmes. Web-based resources that store, organise and present complex information from diverse datasets enhance effective research. One such example that facilitates access to multidisciplinary information is dkCOIN 53 this collaborative resource was recently launched to share information from the Beta Cell Biology Consortium, the Nuclear Receptor Signalling Atlas, the Diabetic Complications Consortium, and Mouse Metabolic Phenotyping Centres. A systematic, multidisciplinary approach that combines clinical insight with basic biological research is not yet publicly available for diabetic nephropathy, but the use of integrated datasets is increasing (Figure 1). SysKid (systems biology towards novel chronic kidney disease diagnosis and treatment) is a consortium-driven effort that aims to define a comprehensive picture of the consequences of diabetes on kidney function (www.syskid.eu), although data is not publicly available. Systems biology is providing novel insights for diabetes 54,55,56 and for diabetic nephropathy in particular 57,58,59.
With the development of population based registries and biobank information, it is possible that clinical and research oriented databases will be integrated to form a rich, linked information resource, however multiple ethical and legal challenges need to be overcome before this becomes practical 60,61,62,63. The identification of an -Omics profile that can lead to earlier diagnosis and / or offer improved clinical evaluation would not only provide significant health benefits to affected individuals, but may also have major utility for the efficient use of healthcare resources. Bioinformatics is a key discipline that can aid our understanding of the initiation and progression of diabetic nephropathy. In addition, relevant education of healthcare providers is also important to ensure clinically relevant outcomes from –Omics projects that will help patient evaluation and management.
An integrated approach for diabetic nephropathy
AfkarianMMC SachsKestenbaumBIB HirschTuttleK.R.et al 2013, Kidney Disease and Increased Mortality Risk in Type2302308HossainPKawarBMEl NahasObesity and diabetes in the developing world--a growing challenge2007356213215DanaeiGFinucaneM MLuYGM SinghCowanM.J.et al 2011, National, regional, and global trends in fasting plasma glucose and diabetes prevalence since 1980: systematic analysis of health examination surveys and epidemiological studies with 370 country-years and 2.7 million participants3783140RitzERychlikILocatelliFHalimiSEnd-stage renal failure in type 2 diabetes: A medical catastrophe of worldwide dimensions199934795808KA McBrienBJ MannsChuiBSW KlarenbachRabiD.et al 2012, Health Care Costs in People With Diabetes and Their Association With Glycemic Control and Kidney FunctionDiabetes CareHexNBartlettCWrightDTaylorMVarleyDEstimating the current and future costs of Type 1 and Type 2 diabetes in the UK, including direct health costs and indirect societal and productivity costs201229855862BelloAHemmelgarnBMannsBTonelliMUse of administrative databases for health-care planning in CKD20121218LiZWenJZhangXWuCLiZ2012ClinData Express - A Metadata Driven Clinical Research Data Management System for Secondary Use of Clinical Data, AMIA. Annu. Symp. Proc.2012552557KeeneK LJC MychaleckyjSmithS GTS LeakPerlegas, P.S.et al 2008, Comprehensive evaluation of the estrogen receptor alpha gene reveals further evidence for association with type 2 diabetes enriched for nephropathy in an African American population123333341AJ McKnightPattersonC CKA PettigrewDA SavageKilnerJ.et al 2010, A GREM1 gene variant associates with diabetic nephropathy21773781AJ McKnightPattersonC CSandholmNKilnerJBuckhamT.A.et al 2010, Genetic polymorphisms in nitric oxide synthase 3 gene and implications for kidney disease: a meta-analysis32476481CuiWDuBZhouWJiaYSunG.et al 2012, Relationship between five GLUT1 gene single nucleotide polymorphisms and diabetic nephropathy: a systematic review and meta-analysis3985518558YangSZhangJFengCHuangGMTHFR 677T variant contributes to diabetic nephropathy risk in Caucasian individuals with type 2 diabetes: A meta-analysis2012MetabolismAL MooyaartEJ ValkEsL A vanJA BruijnHeerdeE.et al 2011, Genetic associations in diabetic nephropathy: a meta-analysis54544553WilliamsW WRM SalemAJ McKnightSandholmNForsblomC.et al 2012, Association testing of previously reported variants in a large case-control meta-analysis of diabetic nephropathy6121872194AJ McKnightAP MaxwellSawcerSCompstonASetakisE.et al 2006, A genome-wide DNA microsatellite association screen to identify chromosomal regions harboring candidate genes in diabetic nephropathy17831836DW CraigMPJK DiStefanoGenome-wide SNP genotyping study using pooled DNA to identify candidate markers mediating susceptibility to end-stage renal disease attributed to Type 1 diabetes20092610901098CW McDonoughND PalmerPJ HicksBH RohAnS.S.et al 2011, A genome-wide association study for diabetic nephropathy genes in African Americans79563572MG PezzolesiGD PoznikJC MychaleckyjAD PatersonBaratiM.T.et al 2009, Genome-wide association scan for diabetic nephropathy susceptibility genes in type 1 diabetes5814031410RL HansonDW CraigMillisM PKA YeattsKobesS.et al 2007, Identification of PVT1 as a candidate gene for end-stage renal disease in type 2 diabetes using a pooling-based genome-wide single nucleotide polymorphism association study56975983SandholmNRM SalemAJ McKnightEP BrennanForsblomC.et al 2012, New susceptibility loci associated with kidney disease in type 1 diabetes81002921AgrawalSDimitrovaNNathanPUdayakumarKLakshmiS.S.et al 2008, T2D-Db: an integrated platform to study the molecular basis of Type 2 diabetes9320AJ McKnightO’DonoghueDMA PeterAnnotated chromosome maps for renal disease200930314320ZaitlenNLindstromSPasaniucBCornelisMGenoveseG.et al 2012, Informed conditioning on clinical covariates increases power in case-control association studies81003032PluzhnikovAJE BelowKonkashbaevATikhomirovAKistner-GriffinE.et al 2010, Spoiling the whole bunch: quality control aimed at preserving the integrity of high-throughput genotyping87123128LittleJJP HigginsJP IoannidisMoherDGagnonF.et al 2009, STrengthening the REporting of Genetic Association Studies (STREGA): an extension of the STROBE statement622BrazmaAHingampPQuackenbushJSherlockGSpellmanP.et al 2001, Minimum information about a microarray experiment (MIAME)-toward standards for microarray data29365371ShiLCampbellGWD JonesCampagneFWenZ.et al 2010, The MicroArray Quality Control (MAQC)-II study of common practices for the development and validation of microarray-based predictive models28827838BarrettTSE WilhiteLedouxPEvangelistaCKimI.F.et al 2013, NCBI GEO: archive for functional genomics data sets--update41991995MartiniSEichingerFNairVKretzlerMDefining human diabetic nephropathy on the molecular level: integration of transcriptomic profiles with biological knowledge20089267274SD HardingArmitCArmstrongJBrennanJChengY.et al 2011, The GUDMAP database--an online resource for genitourinary research13828452853KleinJJuppSMoulosPFernandezMBuffin-MeyerB.et al 2012, The KUPKB: a novel Web application to access multiomics data on kidney disease2621452153WangZGersteinMSnyderMRNA-Seq: a revolutionary tool for transcriptomics2009105763SinicropiDQuKCollinFCragerMLiuM.L.et al 2012, Whole transcriptome RNA-Seq analysis of breast cancer recurrence risk using formalin-fixed paraffin-embedded tumor tissue740092EP BrennanMorineM JWalshD WRoxburghS ALindenmeyerM.T.et al 2012, Next-generation sequencing identifies TGF-beta1-associated gene expression profiles in renal epithelial cells reiterated in human diabetic nephropathy1822589599AP FeinbergEpigenetics at the epicenter of modern medicine200829913451350VK RakyanTA DownDJ BaldingBeckSEpigenome-wide association studies for common human diseases201112529541VK RakyanBeyanHTA DownMI HawaMaslauS.et al 2011, Identification of type 1 diabetes-associated DNA methylation variable positions that precede disease diagnosis71002300NEl HajjPliushchGSchneiderEDittrichMMullerT.et al 2012, Metabolic Programming of MEST DNA Methylation by Intrauterine Exposure to Gestational Diabetes Mellitus, DiabetesCG BellAE TeschendorffVK RakyanAP MaxwellBeckS.et al 2010, Genome-wide DNA methylation analysis for diabetic nephropathy in type 1 diabetes mellitus333SapienzaCLeeJPowellJErinleOYafaiF.et al 2011, DNA methylation profiling identifies epigenetic differences between diabetes patients with ESRD and diabetes patients without nephropathy62028RE GilbertHuangQThaiKSL AdvaniLeeK.et al 2011, Histone deacetylase inhibition attenuates diabetes-associated kidney growth: potential role for epigenetic modification of the epidermal growth factor receptor7913121321ML AlvarezJK DiStefanoTowards microRNA-based therapeutics for diabetic nephropathy2012DiabetologiaML AlvarezJK DiStefanoThe role of non-coding RNAs in diabetic nephropathy: Potential applications as biomarkers for disease development and progression, Diabetes Res201399111HW KhellaBakhetMLichnerZAD RomaschinJewettM.A.et al 2012, MicroRNAs in Kidney Disease: An Emerging UnderstandingIM FingermanZhangXRatzatWHusainNCohenR.F.et al 2013, NCBI Epigenomics: What’s new for 2013, Nucleic Acids Res41221225DS WishartJewisonTAC GuoWilsonMKnoxC.et al 2013, HMDB 3.0--The Human Metabolome Database in 201341801807JA Medina-AunonMartinez-BartolomeSMA Lopez-GarciaSalazarENavajasR.et al 2011, The ProteoRed MIAPE web toolkit: a user-friendly framework to connect and share proteomics standards10111JL GriffinSteinbeckCSo what have data standards ever done for us? The view from metabolomics2010238PosteGBiospecimens, biomarkers, and burgeoning data: the imperative for more rigorous research standards201218717722HirayamaANakashimaESugimotoMAkiyamaSSatoW.et al 2012, Metabolic profiling reveals new serum biomarkers for differentiating diabetic nephropathy40431013109RaimondoFCorbettaSMorosiLChinelloCGianazzaE.et al 2013, Urinary exosomes and diabetic nephropathy: a proteomic approachMol. BiosystNJ McKennaCL HowardAufieroMEaston-MarksJSteffenD.L.et al 2012, Research resource: dkCOIN, the National Institute of Diabetes, Digestive and Kidney Diseases (NIDDK) consortium interconnectivity network: a pilot program to aggregate research resources generated by multiple research consortia2616751681JainPVigSDattaMJindelDMathurA.K.et al 2013, Systems biology approach reveals genome to phenome correlation in type 2 diabetes853522MengQVP MakinenLukHYangX2013Systems Biology Approaches and Applications in Obesity, Diabetes, and Cardiovascular Diseases, Curr. Cardiovasc. Risk Rep77383WangJSunYZhengSXS ZhangZhouH.et al 2013, APG: an Active Protein-Gene Network Model to Quantify Regulatory Signals in31097CV KomorowskyBrosiusFC Pennathur SKretzlerMPerspectives on systems biology applications in diabetic kidney disease20125491508JimBSantosJSpathFHJ CijiangBiomarkers of diabetic nephropathy, the present and the future20128317328MayerPMayerBMayerGSystems biology: building a useful model from multiple markers and profiles20122739954002PrainsackBBuyxA2013McCartyC ARL ChisholmChuteC GIJ KulloJarvikG.P.et al 2011, The eMERGE Network: a consortium of biorepositories linked to electronic medical records data for conducting genomic studies413SC DenaxasGeorgeJHerrettEAD ShahKalraD.et al 2012, Data Resource Profile: Cardiovascular disease research using linked bespoke studies and electronic health records (CALIBER)4116251638GaskellGGottweisHStarkbaumJMBroerseJ.et al 2013, Publics and biobanks: Pan-European diversity and the challenge of responsible innovation21142