This query concerns only resources held in libraries.
19 results
Sort by:
Add to the list:
    • Several versions

    The L-CSC cluster: Optimizing power efficiency to become the greenest supercomputer in the world in the Green500 list of November 2014

    Rohr, David, Neskovic, Gvozden, Lindenstruth, Volker
    arXiv.org, Nov 28, 2018 [Peer Reviewed Journal]

    • Several versions

    Relativistic Hydrodynamics on Graphic Cards

    Gerhard, Jochen, Lindenstruth, Volker, Bleicher, Marcus
    arXiv.org, Sep 9, 2012 [Peer Reviewed Journal]

    • Article
    Select

    Optimized HPL for AMD GPU and multi-core CPU usage

    Bach, Matthias, Kretz, Matthias, Lindenstruth, Volker, Rohr, David
    Computer Science - Research and Development, 2011, Vol.26(3), pp.153-164 [Peer Reviewed Journal]
    Springer Science & Business Media B.V.
    Available
    More…
    Title: Optimized HPL for AMD GPU and multi-core CPU usage
    Author: Bach, Matthias; Kretz, Matthias; Lindenstruth, Volker; Rohr, David
    Subject: Heterogeneous computing ; Linpack ; HPL ; DGEMM ; CALDGEMM ; GPGPU
    Description: The installation of the LOEWE-CSC ( http://csc.uni-frankfurt.de/csc/?51 ) supercomputer at the Goethe University in Frankfurt lead to the development of a Linpack which can fully utilize the installed AMD Cypress GPUs. At its core, a fast DGEMM for combined GPU and CPU usage was created. The DGEMM library is tuned to hide all DMA transfer times and thus maximize the GPU load. A work stealing scheduler was implemented to add the remaining CPU resources to the DGEMM. On the GPU, the DGEMM achieves 497 GFlop/s (90.9% of the theoretical peak). Combined with the 24-core Magny-Cours CPUs, 623 GFlop/s (83.6% of the peak) are achieved. The HPL () benchmark was modified to perform well with one MPI-process per node. The modifications include multi-threading, vectorization, use of the GPU DGEMM, cache optimizations, and a new Lookahead algorithm. A Linpack performance of 70% theoretical peak is achieved and this performance scales linearly to hundreds of nodes.
    Is part of: Computer Science - Research and Development, 2011, Vol.26(3), pp.153-164
    Identifier: 1865-2034 (ISSN); 1865-2042 (E-ISSN); 10.1007/s00450-011-0161-5 (DOI)

    • Article
    Select

    Exploiting the alice hlt for proof by scheduling of virtual machines

    Meoni, Marco, Boettger, Stefan, Zelnicek, Pierre, Lindenstruth, Volker, Kebschull, Udo
    Journal of Physics: Conference Series, 2011, Vol.331(7), p.072054 (6pp) [Peer Reviewed Journal]
    IOPscience (IOP Publishing)
    Available
    More…
    Title: Exploiting the alice hlt for proof by scheduling of virtual machines
    Author: Meoni, Marco; Boettger, Stefan; Zelnicek, Pierre; Lindenstruth, Volker; Kebschull, Udo
    Subject: Physics;
    Description: The HLT (High-Level Trigger) group of the ALICE experiment at the LHC has prepared a virtual Parallel ROOT Facility (PROOF) enabled cluster (HAF - HLT Analysis Facility) for fast physics analysis, detector calibration and reconstruction of data samples. The HLT-Cluster currently consists of 2860 CPU cores and 175TB of storage. Its purpose is the online filtering of the relevant part of data produced by the particle detector. However, data taking is not running continuously and exploiting unused cluster resources for other applications is highly desirable and improves the usage-cost ratio of the HLT cluster. As such, unused computing resources are dedicated to a PROOF-enabled virtual cluster available to the entire collaboration. This setup is especially aimed at the prototyping phase of analyses that need a high number of development iterations and a short response time, e.g. tuning of analysis cuts, calibration and alignment. HAF machines are enabled and disabled upon user request to start or complete analysis tasks. This is achieved by a virtual machine scheduling framework which dynamically assigns and migrates virtual machines running PROOF workers to unused physical resources. Using this approach we extend the HLT usage scheme to running both online and offline computing, thereby optimizing the resource usage.
    Is part of: Journal of Physics: Conference Series, 2011, Vol.331(7), p.072054 (6pp)
    Identifier: 1742-6588 (ISSN); 1742-6596 (E-ISSN); 10.1088/1742-6596/331/7/072054 (DOI)

    • Article
    Select

    Lattice QCD based on OpenCL

    Bach, Matthias, Lindenstruth, Volker, Philipsen, Owe, Pinke, Christopher
    arXiv.org, Sep 26, 2012 [Peer Reviewed Journal]
    © ProQuest LLC All rights reserved, Engineering Database, Publicly Available Content Database, ProQuest Engineering Collection, ProQuest Technology Collection, ProQuest SciTech Collection, Materials Science & Engineering Database, ProQuest Central (new), ProQuest Central Korea, SciTech Premium Collection, Technology Collection, ProQuest Central Essentials, ProQuest One Academic, Engineering Collection (ProQuest)
    Available
    More…
    Title: Lattice QCD based on OpenCL
    Author: Bach, Matthias; Lindenstruth, Volker; Philipsen, Owe; Pinke, Christopher
    Contributor: Pinke, Christopher (pacrepositoryorg)
    Subject: Algorithms ; Quantum Chromodynamics ; Monte Carlo Method ; Fermions
    Description: We present an OpenCL-based Lattice QCD application using a heatbath algorithm for the pure gauge case and Wilson fermions in the twisted mass formulation. The implementation is platform independent and can be used on AMD or NVIDIA GPUs, as well as on classical CPUs. On the AMD Radeon HD 5870 our double precision dslash implementation performs at 60 GFLOPS over a wide range of lattice sizes. The hybrid Monte-Carlo presented reaches a speedup of four over the reference code running on a server CPU.
    Is part of: arXiv.org, Sep 26, 2012
    Identifier: 2331-8422 (E-ISSN); 10.1016/j.cpc.2013.03.020 (DOI)

    • Article
    Select

    Lattice QCD based on OpenCL

    Bach, Matthias, Lindenstruth, Volker, Philipsen, Owe, Pinke, Christopher
    Cornell University
    Available
    More…
    Title: Lattice QCD based on OpenCL
    Author: Bach, Matthias; Lindenstruth, Volker; Philipsen, Owe; Pinke, Christopher
    Subject: High Energy Physics - Lattice
    Description: We present an OpenCL-based Lattice QCD application using a heatbath algorithm for the pure gauge case and Wilson fermions in the twisted mass formulation. The implementation is platform independent and can be used on AMD or NVIDIA GPUs, as well as on classical CPUs. On the AMD Radeon HD 5870 our double precision dslash implementation performs at 60 GFLOPS over a wide range of lattice sizes. The hybrid Monte-Carlo presented reaches a speedup of four over the reference code running on a server CPU. Comment: 19 pages, 11 figures
    Identifier: 1209.5942 (ARXIV ID)

    • Several versions

    How stable are transport model results to changes of resonance parameters? A UrQMD model study

    Gerhard, Jochen, Bäuchle, Bjørn, Lindenstruth, Volker, Bleicher, Marcus
    arXiv.org, May 7, 2012 [Peer Reviewed Journal]

    • Several versions

    BioEM: GPU-accelerated computing of Bayesian inference of electron microscopy images

    Cossio, Pilar, Rohr, David, Baruffa, Fabio, Rampp, Markus, Lindenstruth, Volker, Hummer, Gerhard
    arXiv.org, Sep 21, 2016 [Peer Reviewed Journal]

    • Several versions

    GPU-accelerated track reconstruction in the ALICE High Level Trigger

    Rohr, David, Gorbunov, Sergey, Lindenstruth, Volker
    arXiv.org, Dec 26, 2017 [Peer Reviewed Journal]

    • Several versions

    Online Reconstruction and Calibration with feed back loop in the ALICE High Level Trigger

    Rohr, David, Shahoyan, Ruben, Zampolli, Chiara, Krzewicki, Mikolaj, Wiechula, Jens, Gorbunov, Sergey, Chauvin, Alex, Schweda, Kai, Lindenstruth, Volker
    arXiv.org, Dec 26, 2017 [Peer Reviewed Journal]