Translational research today is data-intensive and requires multi-stakeholder collaborations to generate and pool data together for integrated analysis. This leads to the challenge of harmonizing data from different sources with different formats and standards, which is often overlooked during project planning and thus becomes a bottleneck for research progress. We report on our experience and lessons learnt about data curation for translational research garnered over the course of the eTRIKS program (https://www.etriks.org), a unique, 5-year, cross-organizational, cross-cultural collaboration project funded by the Innovative Medicines Initiative of the EU. Here, we discuss the obstacles and suggest what steps are needed for effective data curation in translational research, especially for projects involving multiple organizations from academia and industry.

A rare loss-of-function variant of ADAM17 is associated with late-onset familial Alzheimer disease
Hartl, Daniela; May, Patrick; Gu, Wei et al. in Molecular Psychiatry (2020), 25(3), 629-639

Common variants of about 20 genes contributing to AD risk have so far been identified through genome-wide association studies (GWAS). However, there is still a large proportion of heritability that might be explained by rare but functionally important variants. One of the genes identified so far with rare AD-causing variants is ADAM10. Using whole-genome sequencing, we now identified a single rare nonsynonymous variant (SNV) rs142946965 [p.R215I] in ADAM17 co-segregating with an autosomal-dominant pattern of late-onset AD in one family.
Subsequent genotyping and analysis of available whole-exome sequencing data of additional case/control samples from Germany, the UK and the USA identified five variant carriers among AD patients only. The mutation inhibits pro-protein cleavage and the formation of the active enzyme, thus leading to loss-of-function of ADAM17 α-secretase. Further, we identified a strong negative correlation between ADAM17 and APP gene expression in human brain and present in vitro evidence that ADAM17 negatively controls the expression of APP. As a consequence, the p.R215I mutation of ADAM17 leads to elevated Aβ formation in vitro. Together, our data support a causative association of the identified ADAM17 variant in the pathogenesis of AD.

Data and knowledge management in translational research: implementation of the eTRIKS platform for the IMI OncoTrack consortium
Gu, Wei; Yildirimman, Reha; Van der Stuyft, Emmanuel et al. in BMC Bioinformatics (2019), 20(1), 164

For large international research consortia, such as those funded by the European Union’s Horizon 2020 programme or the Innovative Medicines Initiative, good data coordination practices and tools are essential for the successful collection, organization and analysis of the resulting data. Research consortia are attempting ever more ambitious science to better understand disease, by leveraging technologies such as whole genome sequencing, proteomics, patient-derived biological models and computer-based systems biology simulations.

Genetic meta-analysis of diagnosed Alzheimer's disease identifies new risk loci and implicates Aβ, tau, immunity and lipid processing
Kunkle, Brian W.; Grenier-Boley, Benjamin; Sims, Rebecca et al. in Nature Genetics (2019), 51(3), 414

Risk for late-onset Alzheimer's disease (LOAD), the most prevalent dementia, is partially driven by genetics. To identify LOAD risk loci, we performed a large genome-wide association meta-analysis of clinically diagnosed LOAD (94,437 individuals). We confirm 20 previous LOAD risk loci and identify five new genome-wide loci (IQCK, ACE, ADAM10, ADAMTS1, and WWOX), two of which (ADAM10, ACE) were identified in a recent genome-wide association (GWAS)-by-familial-proxy of Alzheimer's or dementia. Fine-mapping of the human leukocyte antigen (HLA) region confirms the neurological and immune-mediated disease haplotype HLA-DR15 as a risk factor for LOAD. Pathway analysis implicates immunity, lipid metabolism, tau binding proteins, and amyloid precursor protein (APP) metabolism, showing that genetic variants affecting APP and Aβ processing are associated not only with early-onset autosomal dominant Alzheimer's disease but also with LOAD. Analyses of risk genes and pathways show enrichment for rare variants (P = 1.32 × 10⁻⁷), indicating that additional rare variants remain to be identified. We also identify important genetic correlations between LOAD and traits such as family history of dementia and education.

Fractalis: A scalable open-source service for platform-independent interactive visual analysis of biomedical data
Herzinger, Sascha; Groues, Valentin; Gu, Wei et al. in GigaScience (2018)

Background: Translational research platforms share the aim to promote a deeper understanding of stored data by providing visualization and analysis tools for data exploration and hypothesis generation. However, such tools are usually platform-bound and are not easily reusable by other systems. Furthermore, they rarely address access restriction issues when direct data transfer is not permitted. In this article we present an analytical service that works in tandem with a visualization library to address these problems.
Findings: Using a combination of existing technologies and a platform-specific data abstraction layer we developed a service that is capable of providing existing web-based data warehouses and repositories with platform-independent visual analytical capabilities. The design of this service also allows for federated data analysis by eliminating the need to move the data directly to the researcher. Instead, all operations are based on statistics and interactive charts without direct access to the dataset.
Conclusion: The software presented in this article has the potential to help translational researchers achieve a better understanding of a given dataset and quickly generate new hypotheses. Furthermore, it provides a framework that can be used to share and reuse explorative analysis tools within the community.

Presenting and Sharing Clinical Data using the eTRIKS Standards Master Tree for tranSMART
Barbosa-Silva, Adriano; Bratfalean, Dorina; Gu, Wei et al. in Bioinformatics (2018)

Motivation: Standardization and semantic alignment have been considered one of the major challenges for data integration in clinical research. The inclusion of the CDISC SDTM clinical data standard into the tranSMART i2b2 via a guiding master ontology tree positively impacts and supports the efficacy of data sharing, visualization and exploration across datasets.
Results: We present here a schema for the organization of SDTM variables into the tranSMART i2b2 tree along with a script and test dataset to exemplify the mapping strategy. The eTRIKS master tree concept is demonstrated by making use of fictitious data generated for four patients, including 16 SDTM clinical domains. We describe how the usage of correct visit names and data labels can help to integrate multiple readouts per patient and avoid ETL crashes when running a tranSMART loading routine.
Availability: The eTRIKS Master Tree package and test datasets are publicly available at https://doi.org/10.5281/zenodo.1009098 and a functional demo installation at https://public.etriks.org/transmart/datasetExplorer/ under eTRIKS - Master Tree branch, where the discussed examples can be visualized.

SmartR: An open-source platform for interactive visual analytics for translational research data.
Herzinger, Sascha; Gu, Wei; Satagopam, Venkata et al. in Bioinformatics (Oxford, England) (2017)

In translational research, efficient knowledge exchange between the different fields of expertise is crucial. An open platform that is capable of storing a multitude of data types such as clinical, pre-clinical, or OMICS data combined with strong visual analytical capabilities will significantly accelerate scientific progress by making data more accessible and hypothesis generation easier. The open data warehouse tranSMART is capable of storing a variety of data types and has a growing user community including both academic institutions and pharmaceutical companies. tranSMART, however, currently lacks interactive and dynamic visual analytics and does not permit any post-processing interaction or exploration. For this reason, we developed SmartR, a plugin for tranSMART, that not only equips the platform with several dynamic visual analytical workflows, but also provides its own framework for the addition of new custom workflows. Modern web technologies such as D3.js or AngularJS were used to build a set of standard visualizations that were heavily improved with dynamic elements.
Contact: reinhard.schneider@uni.lu.
Supplementary information: Supplementary data are available at Bioinformatics online.
Availability: The source code is licensed under the Apache 2.0 License and is freely available on GitHub: https://github.com/transmart/SmartR.

Rare coding variants in PLCG2, ABI3, and TREM2 implicate microglial-mediated innate immunity in Alzheimer's disease
Sims, Rebecca; van der Lee, Sven J.; Naj, Adam C.
et al. in Nature Genetics (2017), 49

IDENTIFICATION OF A RARE GENE VARIANT THAT IS ASSOCIATED WITH FAMILIAL ALZHEIMER DISEASE AND REGULATES APP EXPRESSION
Hartl, Daniela; May, Patrick; Gu, Wei et al. in Alzheimer's & Dementia : The Journal of the Alzheimer's Association (2017), 13(7, Supplement), 648

Background: Genetic mutations leading to familial forms of Alzheimer disease (AD) have so far been reported for a few genes including APP, PSEN1 and PSEN2, UNC5C, PLD3, ABCA7, TTC3, and possibly ADAM10. With the advent of whole exome and whole genome sequencing approaches new genes and mutations are likely to be identified.
Methods: We analyzed the genetic cause of AD in a large multiplex family with an autosomal-dominant pattern of inheritance with LOAD. The family lacked pathogenic mutations of known AD genes. We performed whole-genome sequencing (WGS) in six family members (two affected and four unaffected) and prioritized rare, potentially damaging variants that segregated with disease. Variants were further characterized by subsequent molecular analyses in human brain and cell culture models.
Results: We identified a single rare nonsynonymous variant co-segregating with AD. The mutation inhibits pro-protein cleavage and the formation of the active enzyme, thus leading to a loss-of-function of the gene. We further found a strong negative correlation between the identified gene and APP gene expression in human brain and in cells over-expressing the gene. The negative regulation of APP expression was only observed for the wt gene, but not for mutated forms, thus causing, besides the loss of enzyme function, a decoupling of both APP expression and subsequent beta-amyloid formation.
The identity of the gene will be presented at the conference.
Conclusions: This novel pathway strongly supports a causative association of the identified gene with the pathogenesis of AD.

The miRNome of Alzheimer's disease: consistent downregulation of the miR-132/212 cluster
Pichler, Sabrina; Gu, Wei; Hartl, Daniela et al. in Neurobiology of Aging (2016), 50

MicroRNAs (miRNAs) are small noncoding RNA molecules, with essential functions in RNA silencing and post-transcriptional regulation of gene expression. miRNAs appear to regulate the development and function of the nervous system. Alterations of miRNA expression have been associated with Alzheimer's disease (AD). To characterize the AD miRNA signature, we examined genome-wide miRNA and mRNA expression patterns in the temporal cortex of AD and control samples. We validated our miRNA results by semiquantitative real-time polymerase chain reaction (PCR) in independent prefrontal cortex. Furthermore, we separated gray and white matter brain sections to identify the cellular origin of the altered miRNA expression. We observed genome-wide downregulation of hsa-miR-132-3p and hsa-miR-212-3p in AD with a stronger decrease in gray matter AD samples. We further identified 10 differentially expressed transcripts achieving genome-wide levels of significance. Significantly deregulated miRNAs and mRNAs were correlated and examined for potential binding sites (in silico). This miRNome-wide study in AD provides supportive evidence and corroborates an important contribution of miR-132/212 and corresponding target mRNAs to the pathogenesis of AD.

Integration and Visualization of Translational Medicine Data for Better Understanding of Human Diseases.
Satagopam, Venkata; Gu, Wei; Eifes, Serge et al. in Big data (2016), 4(2), 97-108

Translational medicine is a domain turning results of basic life science research into new tools and methods in a clinical environment, for example, as new diagnostics or therapies. Nowadays, the process of translation is supported by large amounts of heterogeneous data ranging from medical data to a whole range of -omics data. It is not only a great opportunity but also a great challenge, as translational medicine big data is difficult to integrate and analyze, and requires the involvement of biomedical experts for the data processing. We show here that visualization and interoperable workflows, combining multiple complex steps, can address at least parts of the challenge. In this article, we present an integrated workflow for exploring, analysis, and interpretation of translational medicine data in the context of human health. Three Web services (tranSMART, a Galaxy Server, and a MINERVA platform) are combined into one big data pipeline. Native visualization capabilities enable the biomedical experts to get a comprehensive overview and control over separate steps of the workflow. The capabilities of tranSMART enable a flexible filtering of multidimensional integrated data sets to create subsets suitable for downstream processing. A Galaxy Server offers visually aided construction of analytical pipelines, with the use of existing or custom components. A MINERVA platform supports the exploration of health and disease-related mechanisms in a contextualized analytical visualization system.
We demonstrate the utility of our workflow by illustrating its subsequent steps using an existing data set, for which we propose a filtering scheme, an analytical pipeline, and a corresponding visualization of analytical results. The workflow is available as a sandbox environment, where readers can work with the described setup themselves. Overall, our work shows how visualization and interfacing of big data processing services facilitate exploration, analysis, and interpretation of translational medicine data.

Amyloid-β Protein Precursor Cleavage Products in Postmortem Ventricular Cerebrospinal Fluid of Alzheimer’s Disease Patients
Hartl, Daniela; Gu, Wei; Mayhaus, Manuel et al. in Journal of Alzheimer's Disease [=JAD] (2015), 47(2), 365-372

Reproducible Research Results R3
Trefois, Christophe; Jarosz, Yohan; Gu, Wei et al. Poster (2014, December)

Gene-wide analysis detects two new susceptibility genes for Alzheimer's disease.
Escott-Price, Valentina; Bellenguez, Celine; Wang, Li-San et al. in PloS one (2014), 9(6), 94661

BACKGROUND: Alzheimer's disease is a common debilitating dementia with known heritability, for which 20 late onset susceptibility loci have been identified, but more remain to be discovered. This study sought to identify new susceptibility genes, using an alternative gene-wide analytical approach which tests for patterns of association within genes, in the powerful genome-wide association dataset of the International Genomics of Alzheimer's Project Consortium, comprising over 7 million genotypes from 25,580 Alzheimer's cases and 48,466 controls.
PRINCIPAL FINDINGS: In addition to earlier reported genes, we detected genome-wide significant loci on chromosomes 8 (TP53INP1, p = 1.4 × 10⁻⁶) and 14 (IGHV1-67, p = 7.9 × 10⁻⁸) which indexed novel susceptibility loci.
SIGNIFICANCE: The additional genes identified in this study have an array of functions previously implicated in Alzheimer's disease, including aspects of energy metabolism, protein degradation and the immune system, and add further weight to these pathways as potential therapeutic targets in Alzheimer's disease.

Frontotemporal dementia and its subtypes: a genome-wide association study.
Ferrari, Raffaele; Hernandez, Dena G.; Nalls, Michael A. et al. in Lancet neurology (2014), 13(7), 686-99

BACKGROUND: Frontotemporal dementia (FTD) is a complex disorder characterised by a broad range of clinical manifestations, differential pathological signatures, and genetic variability. Mutations in three genes (MAPT, GRN, and C9orf72) have been associated with FTD. We sought to identify novel genetic risk loci associated with the disorder.
METHODS: We did a two-stage genome-wide association study on clinical FTD, analysing samples from 3526 patients with FTD and 9402 healthy controls. To reduce genetic heterogeneity, all participants were of European ancestry. In the discovery phase (samples from 2154 patients with FTD and 4308 controls), we did separate association analyses for each FTD subtype (behavioural variant FTD, semantic dementia, progressive non-fluent aphasia, and FTD overlapping with motor neuron disease [FTD-MND]), followed by a meta-analysis of the entire dataset.
We carried forward replication of the novel suggestive loci in an independent sample series (samples from 1372 patients and 5094 controls) and then did joint phase and brain expression and methylation quantitative trait loci analyses for the associated (p < 5 × 10⁻⁸) single-nucleotide polymorphisms.
FINDINGS: We identified novel associations exceeding the genome-wide significance threshold (p < 5 × 10⁻⁸). Combined (joint) analyses of discovery and replication phases showed genome-wide significant association at 6p21.3, HLA locus (immune system), for rs9268877 (p = 1.05 × 10⁻⁸; odds ratio = 1.204 [95% CI 1.11-1.30]), rs9268856 (p = 5.51 × 10⁻⁹; 0.809 [0.76-0.86]) and rs1980493 (p = 1.57 × 10⁻⁸; 0.775 [0.69-0.86]) in the entire cohort. We also identified a potential novel locus at 11q14, encompassing RAB38/CTSC (the transcripts of which are related to lysosomal biology), for the behavioural FTD subtype for which joint analyses showed suggestive association for rs302668 (p = 2.44 × 10⁻⁷; 0.814 [0.71-0.92]). Analysis of expression and methylation quantitative trait loci data suggested that these loci might affect expression and methylation in cis.
INTERPRETATION: Our findings suggest that immune system processes (link to 6p21.3) and possibly lysosomal and autophagy pathways (link to 11q14) are potentially involved in FTD. Our findings need to be replicated to better define the association of the newly identified loci with disease and to shed light on the pathomechanisms contributing to FTD.
FUNDING: The National Institute of Neurological Disorders and Stroke and National Institute on Aging, the Wellcome/MRC Centre on Parkinson's disease, Alzheimer's Research UK, and Texas Tech University Health Sciences Center.

Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer's disease
Lambert, Jean-Charles; Ibrahim-Verbaas, Carla A.; Harold, Denise et al. in Nature Genetics (2013), 45

Hydrogen-Bonded Networks Along and Bifurcation of the E-Pathway in Quinol:Fumarate Reductase
Herzog, Elena; Gu, Wei; Juhnke, Hanno D. et al. in Biophysical Journal (2012), 103(6), 1305-1314

The E-pathway of transmembrane proton transfer has been demonstrated previously to be essential for catalysis by the diheme-containing quinol:fumarate reductase (QFR) of Wolinella succinogenes. Two constituents of this pathway, Glu-C180 and heme bp ring C (b(D)-C-) propionate, have been validated experimentally. Here, we identify further constituents of the E-pathway by analysis of molecular dynamics simulations. The redox state of heme groups has a crucial effect on the connectivity patterns of mobile internal water molecules that can transiently support proton transfer from the b(D)-C-propionate to Glu-C180. The short H-bonding paths formed in the reduced states can lead to high proton conduction rates and thus provide a plausible explanation for the required opening of the E-pathway in reduced QFR. We found evidence that the b(D)-C-propionate group is the previously postulated branching point connecting proton transfer to the E-pathway from the quinol-oxidation site via interactions with the heme bp ligand His-C44. An essential functional role of His-C44 is supported experimentally by site-directed mutagenesis resulting in its replacement with Glu. Although the H44E variant enzyme retains both heme groups, it is unable to catalyze quinol oxidation.
All results obtained are relevant to the QFR enzymes from the human pathogens Campylobacter jejuni and Helicobacter pylori.

Design of a Gated Molecular Proton Channel
Gu, Wei; Zhou, Bo; Geyer, Tihamer et al. in Angewandte Chemie International Edition (2011), 50(3), 768-771

Adhesive water networks facilitate binding of protein interfaces
Ahmad, Mazen; Gu, Wei; Geyer, Tihamer et al. in Nature Communications (2011), 2

Water structure has an essential role in biological assembly. Hydrophobic dewetting has been documented as a general mechanism for the assembly of hydrophobic surfaces; however, the association mechanism of hydrophilic interfaces remains mysterious and cannot be explained by simple continuum water models that ignore the solvent structure. Here we study the association of two hydrophilic proteins using unbiased extensive molecular dynamics simulations that reproducibly recovered the native bound complex. The water in the interfacial gap forms an adhesive hydrogen-bond network between the interfaces stabilizing early intermediates before native contacts are formed. Furthermore, the interfacial gap solvent showed a reduced dielectric shielding up to distances of a few nanometres during the diffusive phase. The interfacial gap solvent generates an anisotropic dielectric shielding with a strongly preferred directionality for the electrostatic interactions along the association direction.

Carbon Nanotube Wins the Competitive Binding over Proline-Rich Motif Ligand on SH3 Domain
Zuo, Guanghong; Gu, Wei; Fang, Haiping et al. in JOURNAL OF PHYSICAL CHEMISTRY C (2011), 115(25), 12322-12328

The binding competition between a proline-rich motif (PRM) ligand and a hydrophobic nanoparticle, the single-wall carbon nanotube (SWCNT), at the binding pocket of the SH3 domain has been investigated by molecular dynamics simulations. It is found that the SWCNT has a very high probability of occupying the binding pocket of the SH3 domain, which prevents the PRM ligand from binding to the pocket. The binding free energy landscapes show that the SWCNT has ~0.6 kcal/mol stronger binding affinity than the ligand in the three-way binding competition (SWCNT + ligand + protein). The potent binding affinity between the SWCNT and the SH3 domain is shown to be mainly from the π-π stacking interactions between the CNT and aromatic residues in the binding pocket. Our findings show that the existence of hydrophobic particles can greatly reduce the possibility of the regular binding of the ligand with the target protein, suggesting potential toxicity to proteins by hydrophobic nanoscale particles.