Dr.-Ing. Jan Eitzinger
Publikationen
2023
MD-Bench: A Generic Proxy-App Toolbox for State-of-the-Art Molecular Dynamics Algorithms
In: Parallel Processing and Applied Mathematics. PPAM 2022., Springer, Cham, 2023, S. 321-332 (Lecture Notes in Computer Science (LNCS), Bd.13826)
ISBN: 978-3-031-30441-5
DOI: 10.1007/978-3-031-30442-2_24 , , , :
MD-Bench: A performance-focused prototyping harness for state-of-the-art short-range molecular dynamics algorithms
In: Future Generation Computer Systems-The International Journal of Grid Computing Theory Methods and Applications 149 (2023), S. 25-38
ISSN: 0167-739X
DOI: 10.1016/j.future.2023.06.023 , , , , , :
MD-Bench: A performance-focused prototyping harness for state-of-the-art short-range molecular dynamics algorithms
In: Future Generation Computer Systems-The International Journal of Grid Computing Theory Methods and Applications (2023)
ISSN: 0167-739X
DOI: 10.1016/j.future.2023.06.023 , , , , , :
2021
An instrumentation framework for performance analysis of halide schedules
In: Journal of Computer Languages (2021), S. 101065
ISSN: 2590-1184
DOI: 10.1016/j.cola.2021.101065 , , , :
tinyMD: Mapping molecular dynamics simulations to heterogeneous hardware using partial evaluation
In: Journal of Computational Science 54 (2021), Art.Nr.: 101425
ISSN: 1877-7503
DOI: 10.1016/j.jocs.2021.101425 , , , , , , , , :
2019
ClusterCockpit-A web application for job-specific performance monitoring
2019 IEEE International Conference on Cluster Computing, CLUSTER 2019 (Albuquerque, NM, 23. September 2019 - 26. September 2019)
In: Proceedings - IEEE International Conference on Cluster Computing, ICCC 2019
DOI: 10.1109/CLUSTER.2019.8891017 , , , , :
2018
Unified Code Generation for the Parallel Computation of Pairwise Interactions using Partial Evaluation
17th International Symposium on Parallel and Distributed Computing (ISPDC) (Geneva, 25. Juni 2018 - 28. Juni 2018)
In: IEEE (Hrsg.): Proceedings of the 17th International Symposium on Parallel and Distributed Computing (ISPDC) 2018
DOI: 10.1109/ISPDC2018.2018.00012
URL: https://ieeexplore.ieee.org/document/8452015/ , , , :
Unified Code Generation for the Parallel Computation of Pairwise Interactions using Partial Evaluation
International Symposium on Computational Science at Scale (CoSaS) (Erlangen, 5. September 2018 - 7. September 2018)
DOI: 10.1109/ispdc2018.2018.00012
URL: https://graphics.cg.uni-saarland.de/papers/schmitt-2018-cosas-poster.pdf , , , , :
2017
Kerncraft: A Tool for Analytic Performance Modeling of Loop Kernels
10th International Workshop on Parallel Tools for High Performance Computing (Stuttgart, Germany, 4. Oktober 2016 - 5. Oktober 2016)
In: Niethammer C, Gracia J, Hilbrich T, Knüpfer A, Resch MM, Nagel WE (Hrsg.): Tools for High Performance Computing 2016, Cham: 2017 , , , :
Performance analysis of the Kahan-enhanced scalar product on current multi-core and many-core processors
In: Concurrency and Computation-Practice & Experience 29 (2017)
ISSN: 1532-0626
DOI: 10.1002/cpe.3921 , , , , , :
LIKWID monitoring stack: A flexible framework enabling job specific performance monitoring for the masses
2017 IEEE International Conference on Cluster Computing, CLUSTER 2017
DOI: 10.1109/CLUSTER.2017.115 , , , :
2016
Exploring performance and power properties of modern multi-core chips via simple machine models
In: Concurrency and Computation-Practice & Experience 28 (2016), S. 189-210
ISSN: 1532-0626
DOI: 10.1002/cpe.3180 , , , :
Analysis of intel’s haswell microarchitecture using the ECM model and microbenchmarks
Springer Verlag, 2016
ISBN: 9783319306940
DOI: 10.1007/978-3-319-30695-7_16 , , , , :
Analysis of Intel's Haswell Microarchitecture Using the ECM Model and Microbenchmarks
29th International Conference on Architecture of Computing Systems (Nuremberg)
In: Architecture of Computing Systems -- ARCS 2016: 29th International Conference, Nuremberg, Germany, April 4-7, 2016, Proceedings, Cham: 2016
DOI: 10.1007/978-3-319-30695-7_16 , , , , :
Performance analysis of the Kahan-enhanced scalar product on current multi-corecore and many-core processors
In: Concurrency and Computation-Practice & Experience 28 (2016)
ISSN: 1532-0626
DOI: 10.1002/cpe.3921 , , , , , :
2015
Automatic loop kernel analysis and performance modeling with kerncraft
6th International Workshop in Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems, PMBS 2015 - Held as part of the 27th ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2015
DOI: 10.1145/2832087.2832092 , , , :
Automatic Loop Kernel Analysis and Performance Modeling With Kerncraft
SC15 The International Conference for High Performance Computing, Networking, Storage and Analysis (Austin, TX, USA, 15. November 2015)
In: Proceedings of the 6th International Workshop on Performance Modeling, Benchmarking, and Simulation of High Performance Computing Systems, New York, NY, USA: 2015
DOI: 10.1145/2832087.2832092
URL: http://dl.acm.org/citation.cfm?id=2832087&preflayout=flat , , , :
Execution-Cache-Memory Performance Model: Introduction and Validation
(2015)
URL: https://arxiv.org/abs/1509.03118
(Techreport) , , :
Performance analysis of the Kahan-enhanced scalar product on current multicore processors
the 11th International Conference on Parallel Processing and Applied Mathematics (Krakow, Poland)
In: Accepted for PPAM 2015 2015
URL: http://arxiv.org/abs/1505.02586 , , , , :
Overhead Analysis of Performance Counter Measurements
43rd International Conference on Parallel Processing Workshops, ICPPW 2014
DOI: 10.1109/ICPPW.2014.34 , , , :
2014
Quantifying performance bottlenecks of stencil computations using the Execution-Cache-Memory model
DOI: 10.1145/2751205.2751240
URL: http://arxiv.org/abs/1410.5010 , , , :
Comparing the Performance of Different x86 SIMD Instruction Sets for a Medical Imaging Application on Modern Multi- and Manycore Chips
2014 1st ACM SIGPLAN Workshop on Programming Models for SIMD/Vector Processing, WPMVP 2014 - Co-located with PPoPP 2014 (Orlando, USA, 16. Februar 2014 - 16. Februar 2014)
In: Proceedings of the 2014 Workshop on Programming models for SIMD/Vector processing, New York, NY, USA: 2014
DOI: 10.1145/2568058.2568068
URL: http://dl.acm.org/citation.cfm?doid=2568058.2568068 , , , :
Performance Engineering for a Medical Imaging Application on the Intel Xeon Phi Accelerator
In: ARCS Workshops'14 2014
URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=6775080&isnumber=6775071 , , , :
2013
Pushing the limits for medical image reconstruction on recent standard multicore processors
In: International Journal of High Performance Computing Applications 27 (2013), S. 162-177
ISSN: 1094-3420
DOI: 10.1177/1094342012442424 , , , , :
2012
Performance Patterns and Hardware Metrics on Modern Multicore Processors: Best Practices for Performance Engineering
5th Workshop on Productivity and Performance (PROPER 2012) (Rhodes Island, Greece)
In: Euro-Par 2012, -: 2012
URL: http://arxiv.org/abs/1206.3738 , , :
Exploring performance and power properties of modern multicore chips via simple machine models
In: Concurrency and Computation-Practice & Experience Submitted (2012), S. 22
ISSN: 1532-0626
URL: http://arxiv.org/abs/1208.2908 , , , :
High performance smart expression template math libraries
High Performance Computing and Simulation (HPCS) 2012 (Madrid, 2. Juli 2012 - 6. Juli 2012)
In: High Performance Computing and Simulation (HPCS) 2012, International Conference on 2012
DOI: 10.1109/HPCSim.2012.6266939
URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=06266939 , , , :
Evaluation of the Coarray Fortran Programming Model on the Example of a Lattice Boltzmann Code
The 6th Conference on Partitioned Global Address Space Programming Models (Santa Barbara, CA, USA)
In: PGAS12, In Press: 2012 , , , , :
2011
Efficient multicore-aware parallelization strategies for iterative stencil computations
In: Journal of Computational Science 2 (2011), S. 130137
ISSN: 1877-7503
DOI: 10.1016/j.jocs.2011.01.010
URL: http://www.sciencedirect.com/science/article/pii/S1877750311000172 , , :
Expression Templates Revisited: A Performance Analysis of the Current ET Methodology
In: SIAM Journal on Scientific Computing (2011), S. 1-15
ISSN: 1064-8275
URL: http://arxiv.org/abs/1104.1729 , , , :
2010
Introducing a Performance Model for Bandwidth-Limited Loop Kernels
8th International Conference, PPAM 2009 , Revised Selected Papers, Part I (Wroclaw, Poland, 13. September 2009 - 16. September 2009)
In: Parallel Processing and Applied Mathematics, Berlin Heidelberg: 2010
DOI: 10.1007/978-3-642-14390-8_64
URL: http://www.springerlink.com/content/m720118145140122/ , :
Complexities of Performance Prediction for Bandwidth-Limited Loop Kernels on Multi-Core Architectures
Transactions of the Fourth Joint HLRB and KONWIHR Review and Results Workshop (Leibniz Supercomputing Centre, Garching/Munich, Germany)
In: High Performance Computing in Science and Engineering, Garching/Munich 2009, Berlin Heidelberg: 2010
DOI: 10.1007/978-3-642-13872-0_1
URL: http://www.springerlink.com/content/m1288m0174021600/ , , :
LIKWID: A Lightweight Performance-Oriented Tool Suite for x86 Multicore Environments
39th International Conference on Parallel Processing Workshops (San Diego, CA, USA, 13. September 2010 - 16. September 2010)
In: Proceedings of PSTI2010, the First International Workshop on Parallel Software Tools and Tool Infrastructures, IEEE: 2010
DOI: 10.1109/ICPPW.2010.38
URL: http://arxiv.org/abs/1004.4431 , , :
LIKWID performance tools
URL: http://inside.hlrs.de/pdfs/inSiDE_spring2010.pdf , , , :
Leveraging shared caches for parallel temporal blocking of stencil codes on multicore processors and clusters
In: Parallel Processing Letters 20 (2010), S. 359-376
ISSN: 0129-6264
DOI: 10.1142/S0129626410000296
URL: http://arxiv.org/abs/1006.3148 , , , :
2009
Efficiency Improvements of Iterative Numerical Algorithms on Modern Architectures (Dissertation, 2009)
URL: https://www10.cs.fau.de/publications/dissertations/Diss_Treibig_2009.pdf :- Eitzinger, Jan:
Efficiency Improvements of Iterative Numerical Algorithms on Modern Architectures (Dissertation, 2009)
URL: https://www10.cs.fau.de/publications/dissertations/Diss_2009-Treibig.pdf
2008
Optimising a 3D multigrid algorithm for the IA-64 architecture
In: International Journal of Computational Science and Engineering 4 (2008), S. 29-35
ISSN: 1742-7185
DOI: 10.1504/IJCSE.2008.021110
URL: https://www10.informatik.uni-erlangen.de/Publications/Papers/2008/Stuermer_IJCSE_4_1.pdf , , :
2006
ORCAN: A platform for complex parallel simulation software
8th Workshop 'Parallel Programs and Algorithms' (Frankfurt)
In: ARCS 2006 - 19th International Conference on Architecture of Computing Systems, Workshops Proceedings, March 16, 2006, Frankfurt am Main, Germany 2006 , , :
Optimizing performance on modern HPC systems: learning from simple kernel benchmarks
The 2nd Russian-German Advanced Research Workshop (Stuttgart, Germany)
In: Computational Science and High Performance Computing II, Berlin Heidelberg: 2006
DOI: 10.1007/3-540-31768-6_23
URL: http://www.springerlink.com/content/8401n54088177483/ , , , :
Optimizing a 3D Multigrid Algorithm for the IA-64 Architecture
ASIM 2006 - 19. Symposium Simulationstechnik (Hannover)
In: Simulationstechnique - 19th Symposium in Hannover, September 2006, Erlangen: 2006
URL: https://www10.informatik.uni-erlangen.de/Publications/Papers/2006/Stuermer_ASIM2006.pdf , , :
2005
Performance analysis of the Lattice Boltzmann Method on x86-64 Architectures
18th Symposium Simulationstechnique ASIM 2005 (Erlangen)
In: 18th Symposium Simulationstechnique ASIM 2005 Proceedings, Erlangen: 2005
URL: https://www10.informatik.uni-erlangen.de/Publications/Papers/2005/Treibig_ASIM05.pdf , , :
2003
Simulation of highly loaded gas-solid two-phase flows by combining a cellular automata for the particles with a lattice Boltzmann flow solver
In: International Journal of Modern Physics B 17 (2003), S. 201-204
ISSN: 0217-9792
DOI: 10.1142/S0217979203017333 , , , :
Auszeichnungen
Es wurden leider keine Auszeichnungen gefunden.