2017 IXPUG US Annual Meeting at TACC
Experts from around the world are invited to the Texas Advanced Computing Center (TACC) in Austin, TX for the
IXPUG 2017 Annual US Meeting (IXPUG2017).
Share experiences with Xeon Phi-based systems, and learn how to optimize software for manycore machines.
Location: The Texas Advanced Computing Center (TACC)
The University of Texas at Austin, JJ Pickle Research Campus, Austin, TX USA
Date: September 26-28, 2017
SURVEY!
Did you attend this event? Please fill out the event survey.
Keynote Speakers:
James R. Reinders
Nishanth Dandapanthu (Dell EMC)
Sameer Shende (ParaTools, Inc., U of Oregon)
Intel Roadmap/Technologies Speakers:
Barry Davis (Intel)
Day 1: Tuesday, Sept. 26, 2017 - IXPUG Annual Fall Conference
Day 1: Tuesday, Sept. 26, 2017 - IXPUG Annual Fall Conference
Start |
End |
Title |
Author(s) |
|
8:00 |
8:30 |
Registration & Continental Breakfast |
|
|
|
|
Opening Session [Chair: Lisa Smith] |
|
|
8:30 |
8:35 |
Welcome from TACC |
Dan Stanzione |
|
8:35 |
8:45 |
Welcome from IXPUG Leadership Team |
David Martin |
|
8:45 |
9:45 |
Keynote1: "Supercomputing" is the best description of the future of HPC |
James Reinders (Intel Corporation, retired 2016), Parallel Programming and HPC Enthusiast (and Expert) |
Video_k1 |
9:45 |
10:30 |
Center Updates (~15 minutes each) |
- NERSC, Richard Gerber
- TACC, Dan Stanzione
- ALCF (ANL), David Martin
|
Video |
10:30 |
11:00 |
Coffee Break |
|
|
|
|
Session 1 [Chair:David Martin] |
|
|
11:00 |
12:00 |
11 Advancing MPI Libraries to the Many-core Era: Designs and Evaluations with MVAPICH2
9 Performance of PGAS Models on KNL: A Comprehensive Study with MVAPICH2-X
|
Sourav Chakraborty, Mohammadreza Bayatpour, Hari Subramoni and Dhabaleswar Panda
Jahanzeb Maqbool Hashmi, Mingzhe Li, Hari Subramoni and Dhabaleswar Panda
|
Video |
12:00 |
1:00 |
Lunch |
|
|
|
|
Session 2 [Chair:Lars Koesterke] |
|
|
1:00 |
2:30 |
4 Software modernization strategies adopted in Genesis – a Molecular Dynamics Application for Biological Materials targeting LANL’s Trinity Phase 2 Cray KNL platform
13 Improving the Performance of the MILC Code on Intel Knights Landing, An Overview
14 Performance Portability of the Wilson Dslash Operator for Lattice QCD
|
Adetokunbo Adedoyin
Douglas Doerfler, Karthik Raman and Ruizi Li
Balint Joo, Thorsten Kurth, Jack Deslippe and Kate Clark
|
video4
video13
video14
|
2:30 |
3:00 |
Coffee Break |
|
|
|
|
Session 3 [Chair: Richard Gerber] |
|
|
3:00 |
4:30 |
5 Early Results of Deep Learning on the Stampede2 Supercomputer
12 Operating JLab’s SciPhi-XVI KNL cluster
21 OpenMP Affinity in Many-core Computing
|
Zhao Zhang, Weijia Xu, Niall Gaffney and Daniel Stanzione
Sandra Philpott
Kent Milfeld
|
video5
video12
video21
|
Evening |
4:30-6:00 |
TACC Tour and Reception |
Tommy Minyard and Melyssa Fratkin |
|
Day 2: Wednesday, Sept. 27, 2017 - IXPUG Annual Fall Conference
Day 2: Wednesday, Sept. 27, 2017 - IXPUG Annual Fall Conference
Start |
End |
Title |
Author(s) |
|
8:00 |
8:30 |
Registration & Continental Breakfast |
|
|
|
|
Session [Chair: Lisa Smith] |
|
|
8:30 |
9:30 |
Keynote: Pursuit in the simplification of HPC and Deep Learning |
Nishanth Dandapanthu (Dell EMC) |
video_k2 |
9:30 |
10:30 |
Intel Compiler and Tools Updates |
James Tullos (Intel) |
|
10:30 |
11:00 |
Break |
|
|
|
|
Lightning Talk Session 4 [Chair: Clayton Hughes] |
|
|
11:00 |
12:00 |
10 Reducing OS noise using offload driver on Intel Xeon Phi x200 Processor
8 A Comparative Evaluation of Xeon Phi Platforms Based on a Hodgkin-Huxley Neuron Simulator
22 Using an Interactive Tool to adapt codes to KNL
7 In Situ Visualization on Stampede2 - an IXPUG Workshop Report
Feedback on IXPUG Working Groups
|
Grzegorz Andrejczuk and Jarek Kogut
George Chatzikonstantis, Diego Jimenez, Esteban Meneses, Christos Strydis, Harry Sidiropoulos and Dimitrios Soudris
Lars Koesterke and Ritu Arora
Paul Navratil and Jim Jeffers
John Penneycook
|
video10
video8
video22
video7
|
12:00 |
1:00 |
Lunch |
|
|
|
|
Session 5 [Chair: John Pennycook] |
|
|
1:00 |
2:30 |
1 MPI/OpenMP parallelization of the Hartree-Fock method for the second generation of Intel Xeon Phi processor
16 Performance optimization of WEST and Qbox on Intel Knights Landing
18 Rapid evaluation of scalar and vector fields of molecular charge density properties on KNL
|
Vladimir Mironov, Yuri Alexeev, Kristopher Keipert, Michael Dmello, Alexander Moskovsky and Mark Gordon
Huihuo Zheng, Christopher Knight, Giulia Galli, Marco Govoni and Francois Gygi
Alvaro Vazquez Mayagoitia, Raymundo Hernandez-Esparza and Jorge Garza
|
video1
video16
video18
|
2:30 |
3:00 |
Break |
|
|
|
|
Session 6 [Chair:Kent Milfeld] |
|
|
3:00 |
4:00 |
Intel Roadmap/Technologies |
Barry Davis (Intel Corporation) |
|
4:00 |
4:45 |
Panel: Where are we going next? |
James Reinders, Tommy Minyard (TACC), Nishanth Dandapanthu (Dell EMC), Richard Gerber (NERSC), David Martin (ANL), Sameer Shende (ParaTools, UofOregon)
|
|
Evening |
6:00 |
TEXAS BBQ Dinner at County Line On The Lake |
Meet at Hyatt House for bus ride (6:00). |
|
Day 3: Thursday, Sept. 28, 2017 - IXPUG Annual Fall Conference
Day 3: Thursday, Sept. 28, 2017 - IXPUG Annual Fall Conference
Start |
End |
Title |
Author(s) |
|
8:00 |
8:30 |
Registration & Continental Breakfast |
|
|
|
|
Opening Session [Chair: Lisa Smith] |
|
|
8:30 |
9:30 |
Keynote: Portable Performance Tools to Observe, Optimize, and Scale Application |
Sameer Shende (ParaTools, Inc. U. of Oregon) |
video_k3 |
|
|
Session 7 [Chair: John Cazes] |
|
|
9:30 |
10:30 |
2 Harnessing the Intel Xeon Phi x200 Processor for Earthquake Simulations
20 Optimization Strategies for WSM6 on KNL
|
Alexander Breuer, Yifeng Cui, Alexander Heinecke, Josh Tobin and Charles Yount
Timbwaoga Ouermi, Martin Berzins and Robert M. Kirby
|
video2
video20
|
10:30 |
10:45 |
Break |
|
|
|
|
Session 8 [Chair: Doug Doerfler ] --- 26min presentations |
|
|
10:45 |
12:30 |
15 WARP3D Implementation of MKL Cluster Pardiso Solver
17 Scaling and optimization results of the real-space DFT solver PARSEC on Haswell and KNL systems
6 KNL Disabled Tiles and Performance Variability
19 Vectorization for non-trivial data structures
|
Jeremy Nicklas, Karen Tomko, Robert Dodds and Kevin Manalo
Kevin Gott, Charles Lena, Ariel Biller, Josh Neitzel, Kai-Hsin Liou, Jack Deslippe and James R Chelikowsky
Phillip Romero
Ivo Kabadshow and Andreas Beckmann
|
video15
video17
video6
video19
|
12:30 |
1:00 |
Lunch |
|
|
|
|
TAU Tutorial |
Sameer Shende |
|
1:00 |
3:00 |
TAU |
The complex nature of HPC platforms and the application development environment, combining multiple languages, programming paradigms, hardware, and compilers, make effective performance engineering a challenging task. To meet the needs of computational scientists in performance engineering their codes, we present a tutorial with hands-on sessions on the TAU Performance System. TAU is a powerful profiling and tracing toolkit that covers multiple aspects of performance instrumentation, measurement, and analysis. After describing and demonstrating how performance data is collected using TAU’s automated instrumentation, the workshop will present ways to analyze the performance data collected and to drill down to find performance bottlenecks. Topics will cover generating performance profiles and traces with memory and system load utilization metrics, I/O, communication, and hardware performance counter data using PAPI. The workshop will cover instrumentation of Hybrid MPI and OpenMP codes on the Stampede 2 Intel Xeon Phi (KNL) platform at TACC. |
video_tau |
|
|
KNL Tutorial ROOM ACB 1.104 |
Lars Koesterke, Todd Evans, Kent Milfeld |
|
1:00 |
2:30 |
KNL Tutorial |
[Kicking the tires-- Hardware Experiences] 15 min Lect, 35min Lab LECTURE 1: Overview of the Stampede2 KNL system and programming environment.
Lab1: Login on to a Stampede2 KNL compute node interactively (compile & OpenMP/MPI/hybrid execution). Hardware Experiences: run some basic benchmarks|utilities that illustrate the numa node configurations, DDR and MCDRAM speeds, and cluster modes. (Download your own code, compile and run -- compare to your own system)
[Hybrid Computing on the KNL] 20 min Lect, 20 min Lab
LECTURE2: How to merge MPI and OpenMP methods.
LAB2: Setting up hybrid runs; exploring mpi-task/OMP-thread ratios and other concerns. Hybrid computing in cluster modes such as SNC-4.
|
|
2:30 |
2:45 |
Break |
|
|
2:45 |
4:45 |
KNL Tutorial (cont.) |
[Many-core Affinity] 30min Lect, 30 min Lab
LECTURE3: Learn the basics about the kernel (affinity) map, and how to control affinity (the map) through environment variables.
LAB3: Evaluate and explore simple and explicit settings with the AMASK tool.
[Tools-- VTune] 30min Lecture, 30 min Lab
LECTURE4: KNL-centric hands-on experience with VTune. LAB4: A few examples
|
|
Important Dates:
Abstract Submission Deadline |
Aug 6 -> Aug 13 |
2017 |
Abstracts Reviewed by IXPUG Committee |
Aug 16-22 |
2017 |
Acceptance Notification |
Aug 15 ->Aug 23 |
2017 |
Preliminary Agenda Posted to IXPUG Website |
August 25 |
2017 |
Registration Deadline |
extended Sept 25 |
2017 |
Agenda Finalized. -- Registration Deadline -- |
September 15 |
2017 |
Final Presentations Due from Speakers |
September 19 |
2017 |
IXPUG Sessions |
September 26-28 |
2017 |
Call for Presentations:
IXPUG welcomes submissions on innovative work from KNL users in academia, industry and government labs, describing original discoveries and experiences that will promote and prescribe efficient use of manycore and multicore systems. The authors of the best scored abstracts and draft presentations will be selected for a full 30 minute presentation; others may be offered an opportunity to present shorter Lightning Talks.
Submission Guidelines:
A short Extended Abstract and Draft Presentation should be submitted by Sunday, August 6th 13th. The suggested organization of the extended abstract is: short abstract summarizing the work, benefits and accomplishments; an introduction with an objective; accomplishments (analysis, optimization, algorithm/software design, tool design, experience, new concepts, etc.) with results; and a summary. An abstract submission must be a PDF file, in a convenient text format; two or three pages should be sufficient. The Draft Presentation does not need to be complete by this date. It should reflect the overall intent of the presentation and contain placeholders for the remaining content to be completed by the Final Presentation. Presentations describing application results and work on KNL-specific features (e.g. use of MCDRAM, multi-node messaging (MPI) performance and configurations, and new performance tools exploitation) will be prioritized.
For presentation format, please use the IXPUG presentation template; submit Abstract (pdf) and Presentation (pptx or pdf) through EasyChair. (Please indicate Lightning Talk at the top of your abstract if you only intend to submit a short presentation.)
Topics of interest are (but not limited to):
- Vectorization: SIMD operations and directives, data layout
- Memory: DDR/MCDRAM partitioning, memory affinity, prefetching, latency, streams, etc.
- Communication: MPI inter-/intra-node performance, scaling and tuning for Omni-Path/IB
- IO:Local disk vs global (Lustre, etc.)
- Thread and Process Management: Affinity, resource sharing in SMTs (simultaneous multi-threading) and Tiles
- System Management: Memory/Cluster Modes, Large Pages, Node Stats, XPPSL, OpenHPC
- Hybrid Computing: MPI Process/Thread Partitioning, On-node/Off-node Scaling
- Programming Models: OpenMP, TBB, Cilk, hStreams, MPI, --others
- Algorithms and Methods:Application Scaling and Vectorizable Algorithms
- Tools: Benchmarking, Profiling, Performance Analysis, Affinity
- Visualization: Software Performance, Algorithms, Methods
- Deep Learning:Application to speech, image, bioinformatics, natural language, ...
Travel and Hotel Information:
- Transportation, Parking, etc.: Information on how to get to the venue can be found here.
- Lodging: Special conference rates will be available at the Hyatt House hotel (shuttles available). Use this site for finding other accommodations.
Hotel |
Hotel Address/Phone |
Conference Rate |
Cut-off Date |
Hyatt House Arboretum
|
10001 N Capital of TX Hwy
Phone: 512 342-8080 |
$119
|
August 26,2017
|
IXPUG2017 Program Committee:
Kent |
Milfeld (co-Chair) |
Texas Advanced Computing Center (TACC) |
Richard |
Gerber |
NERSC/Lawrence Berkeley National Laboratory |
Gilles |
Civario |
Dell Inc. (DELL) |
Douglas |
Doerfler |
NERSC/Lawrence Berkeley National Laboratory |
Helen |
He |
NERSC/Lawrence Berkeley National Laboratory |
Clayton |
Hughes |
Sandia National Laboratories |
Juha |
Jaykka |
University of Cambridge |
Michael |
Klemm |
Intel Corporation |
Lars |
Koesterke (co-Chair) |
Texas Advanced Computing Center (TACC) |
David |
Martin |
Argonne National Laboratory |
Hai Ah |
Nam |
Los Alamos National Laboratory |
John |
Pennycook |
Intel Corporation |
Thomas |
Steinke |
Zuse Institute Berlin |
Estela |
Suarez |
Forschungszentrum Juelich |
Sameer |
Shende |
ParaTools, Inc.,/University of Oregon |
Jerome |
Vienne |
Texas Advanced Computing Center (TACC) |