2020 IXPUG US Annual Meeting
Experts from around the world are invited to join us online
for the
IXPUG 2020 Annual Meeting
October 13-16, 2020 – Hosted by TACC via Zoom
Agenda
Tuesday, October 13 -- All times listed for Austin/CDT (UTC -5)
Start |
End |
Title |
Author |
Presentation |
Video |
10:00 |
10:15 |
Welcome |
Melyssa Fratkin, IXPUG Secretary Stephen Harrell & Amit Ruhela, Meeting Co-Chairs
|
|
Recording (host welcome)
Recording (keynote intro)
|
10:15 |
11:00 |
Keynote: The Landscape of Modern Parallel Programming Using Open Standards |
Michael Wong |
Presentation |
Recording |
11:00 |
11:30 |
Intel® DPC++ Compatibility Tool – Porting SPECFEM3D GLOBE to DPC++ |
Sunny Gogar, Konstantinos Krommydas, Rama Kishan V Malladi and Philippe Thierry |
Presentation |
Recording |
11:30 |
12:00 |
Custom-Precision Mathematical Library Explorations for Code Profiling and Optimization |
David Defour, Pablo de Oliveira Castro, Matei Istoan and Eric Petit |
Presentation |
Recording |
12:00 |
12:30 |
Break |
|
|
|
12:30 |
13:00 |
Simulating quantum algorithms on HPC systems: a performance perspective |
Luigi Iapichino and Fabio Baruffa |
Presentation |
Recording |
13:00 |
13:30 |
User-space thin file system coupled with an ultra-fast and low-latency IO stack as an alternative for use by database storage engines |
Jan Lisowiec |
Presentation |
Recording |
13:30 |
14:00 |
Optane PMem as an Enabler for Large DNN Models with Homomorphic Encryption |
Guillermo Lloret-Talavera, Marc Jorda, Harald Servat, Fabian Boemer, Chetan Chauhan, Shigeki Tomishima, Nilesh N. Shah and Antonio J. Peña |
Presentation |
Recording |
14:00 |
14:15 |
Break |
|
|
|
14:15 |
15:15 |
Lightning Talks (15 Mins each) |
|
|
|
|
|
Overlapping communication and computation using the Intel MPI library's asynchronous progress control |
Sebastian Ohlmann, Fabio Baruffa and Markus Rampp |
Presentation |
Recording |
|
|
Mixed-Precision Arithmetic for 3DGAN to Simulate High Energy Physics Detectors |
John Osorio Rios, Adrià Armejach, Gulruk Khattak, Eric Petit, Sofia Vallecorsa and Marc Casas |
Presentation |
Recording |
|
|
Porting NAQMD kernels to GPU via OpenMP Offload |
Pankaj Rajak, Ye Luo, Ken-Ichi Nomura and Aiichiro Nakano |
Presentation |
Recording |
|
|
Performance analysis of OpenMP Offload on IntelGen9 |
Neil Mehta, Rahul Gayatri, Yasaman Ghadar and Jack Deslippe |
Presentation |
Recording |
15:15 |
16:15 |
Tutorial: Accelerating Deep Learning workloads by using Intel® AI Analytics Toolkit and 3rd generation Xeon Scalable processors |
Louie Tsai |
Presentation |
Recording |
16:15 |
|
Closing Remarks |
|
|
|
Wednesday, October 14 -- All times listed for Austin/CDT (UTC -5)
Start |
End |
Title |
Author |
Presentation |
Video |
10:00 |
10:15 |
Welcome |
Stephen Harrell & Amit Ruhela, Meeting Co-Chairs |
|
|
10:15 |
11:00 |
Keynote: How HPC Developers Using SYCL Need to Deal with Data Movement |
Michael Wong |
Presentation |
Recording |
11:00 |
11:30 |
Characterizing simulation and machine learning workloads |
Chris Lishka |
Presentation |
Recording |
11:30 |
12:00 |
SYCL Performance and Portability |
Kumudha Narasimhanm |
(Not available) |
Recording |
12:00 |
12:30 |
Break |
|
|
|
12:30 |
12:45 |
Lattice QCD on CPU and GPU with OpenMP |
Patrick Steinbrecher |
(Not available) |
(Not available) |
12:45 |
14:00 |
Site Updates: TACC, ZIB, Argonne, Hartree Centre, University of Tsukuba |
John Cazes, TACC Thomas Steinke, ZIB David Martin, Argonne Luke Mason, Hartree Centre Taisuke Boku, University of Tsukuba
|
Presentation (TACC)
Presentation (ZIB)
Presentation (Argonne)
Presentation (Tsukuba)
|
Recording (TACC)
Recording (ZIB)
Recording (Argonne)
Recording (Hartree)
Recording (Tsukuba)
|
14:00 |
14:15 |
Break |
|
|
|
14:15 |
15:15 |
Lightning Talks (15 minutes each) |
|
|
|
|
|
A geometric multigrid method kernel on Intel GPU with Performance Portable Programming Models |
Jaehyuk Kwack |
Presentation |
Recording |
|
|
From CUDA to DPC++ back to Nvidia GPUs... and FPGAs - An oneAPI case study with the tsunami simulation easyWave |
Steffen Christgau and Marius Knaust |
Presentation |
Recording |
|
|
No Instruction Computing Using Pointer and Operation in Registers for Adaptable Architecture |
Nagi Mekhiel |
Presentation |
Recording |
|
|
Simple use of oneMKL for high performance |
Matthew Cordery, Dahai Guo, Michael D'Mello |
Presentation |
Recording |
15:15 |
16:15 |
Tutorial: Traditional Machine Learning with oneDAL and XGBoost* |
Rachel Oberman |
Presentation |
Recording |
16:15 |
|
Closing Remarks |
|
|
|
Thursday, October 15 -- All times listed for Austin/CDT (UTC -5)
Start |
End |
Title |
Author |
Presentation |
Video |
10:00 |
10:15 |
Welcome |
Stephen Harrell & Amit Ruhela, Meeting Co-Chairs |
|
|
10:15 |
12:00 |
Tutorial: Performance Tuning and Best Practices to leverage Intel MPI on Multicore Processors |
Amit Ruhela |
Presentation |
(Not available) |
12:00 |
12:30 |
Break |
|
|
|
12:30 |
14:00 |
Tutorials: OneAPI/ DPC++ Essential Series hands on (Through Friday)
oneAPI Intro Module: (This module is used to introduce oneAPI, DPC++ Hello World and Intel DevCloud)
DPC++ Program Structure: (Classes - device, device_selector, queue, basic kernels and ND-Range kernels, Buffers-Accessor memory model, DPC++ Code Anatomy)
|
Praveen Kundurthy |
Presentation |
Recording |
14:00 |
14:15 |
Break |
|
|
|
14:15 |
15:15 |
Tutorial: DPC++ New Features - Unified Shared Memory (USM), Sub-Groups (Intel oneAPI DPC++ Library -Usage of oneDPL, Buffer Iterators and oneDPL with USM ) |
Rakshith Krishnappa |
Presentation |
Recording |
15:15 |
|
Closing Remarks |
|
|
|
Friday, October 16 -- All times listed for Austin/CDT (UTC -5)
Start |
End |
Title |
Author |
Presentation |
Video |
10:00 |
10:15 |
Welcome |
Stephen Harrell & Amit Ruhela, Meeting Co-Chairs |
|
|
10:15 |
11:30 |
Tutorial: Migration of GPGPU Applications to DPC++ with Lab - Compatibility tool |
Edward Mascarenhas and Sunny Gogar |
Presentation |
Recording |
11:30 |
12:00 |
Break |
|
|
|
12:00 |
13:30 |
Tutorial: Running DPC++ code on GPGPU |
Ruyman Reyes and Stuart Adams |
Presentation |
(Not available) |
13:30 |
13:45 |
Break |
|
|
|
13:45 |
14:30 |
Tutorial: Cache-Aware Roofline Model: Performance, Power and Energy-Efficiency |
Aleksandar Ilic |
Presentation |
Recording |
14:30 |
15:30 |
Tutorial: Profile DPC++ and GPU Workload Performance-VTune, Advisor, Roofline analysis |
Vladimir Tsymbal |
Presentation |
Recording |
15:30 |
15:45 |
Break |
|
|
|
15:45 |
16:45 |
Tutorial: FPGA offload on DPC++ |
Ricardo Menotti |
Presentation |
Recording |
16:45 |
|
Conference Closing Remarks |
|
|
|
Just Announced! oneAPI Bring-Your-Own-Code (BYOC) Workshop
Have a code that will benefit from heterogenous hardware? In this workshop on Monday, October 19th 8:00AM-12:00PM PT (10:00AM-2:00PM CT) you will have an opportunity to work directly with oneAPI experts to make the code heterogenous. Register via Webex HERE. Participants present their workload and instructors answer questions. Each team gets half an hour and follow up sessions scheduled individually as needed based on topics below:
DPC++ essentials – Praveen Kundurthy and Rakshit Krishnappa
DPCT – Sunny Gogar and Edward Mascarenhas
DPC++ GPGPU – Ruyman Reyes
Advisor – Vladimir Tsymbal and Prof Aleksandar Ilic
FPGA – Ricardo Menotti
AI/oneDAL/XGBoost – Louie Tsai, Rachael Oberman
We will consider BYOC workshop successful if we are able to help you in porting and tuning of the participants’ codes, either during the workshop or up to two weeks after the workshop.
Call for Presentations:
This Intel eXtreme Performance User Group (IXPUG) conference is focused on all aspects of adopting and employing state-of-the-art technologies and practices for optimal application execution. This includes accelerators (e.g., co-processors, FPGAs, GPUs), as well as topics related to system hardware beyond the processor (memory, interconnects, etc.), software tools, programming models, HPC workloads, troubleshooting, and more — all with a focus on Intel platforms. The conference will provide an interactive experience, organized around key themes associated with high-performance computing, data analytics, artificial intelligence (machine learning and deep learning), cloud computing, and more. The conference will feature keynote presentations, invited expert talks, 30-minute technical sessions, 10-to-15-minute lightning talks, and two-hour hands-on tutorials. Speakers come from a wide variety of HPC organizations and share their real-world experiences. Attendees will encounter an open forum, through which Intel and renowned industry experts will share best practices and techniques for maximizing software productivity and efficiency. The conference will provide an opportunity to share experiences in leveraging Intel architectures and technologies with all attendees, including expert software developers, scientists, researchers, academics, systems analysts, students, and end-users, etc. The challenges surrounding application performance and scalability will be covered across at all levels, including tuning and optimization of diverse sets of applications on large-scale HPC systems.
Abstract Submission Guidelines:
A short abstract should be submitted by August 21, 2020 (updated!) via EasyChair and the content should reflect the topics of interest that are listed below. All final presentations are due by October 9, 2020. We would like you to include keywords that pertain to the techniques, Intel products, and associated domains that pertain to your technical work. Please see the full list below:
Techniques: Artificial Intelligence (Machine Learning/Deep Learning), Algorithms & Methods, Compiler Flags, Software Environment & Tools, Libraries & Tools, Parallel- Programming (Communications, Thread & Process Management Experience, All), Multi-node, Memory Management, Vectorization, etc.
Products: Intel® Xeon Scalable processor, Intel® Omni Path Fabric, Intel® FPGA, Intel® SSDs/NVMe Solutions, Intel® Lustre Software and DAOS, Visualization Technology, Intel® SW Tools, OneAPI, Intel Xe graphics, Intel Processor Graphics
Domains: Astrophysics, Bioinformatics, Chemistry, Climate & Weather, Computational Fluid Dynamics, Data Analytics, Energy/Oil & Gas, Financial Services, Geophysics, Life Sciences, Material Science, Medical imaging, Molecular Dynamics, Nanotechnology, Physics, Visualization, High Energy Physics, etc.
Abstract Topics of Interest:
- Implications of workload behavior on system design at extreme scale (Power, Reliability, Scalability, Performance, Processor Design, Memory System, I/O)
- Software environments and tools for computing at extreme scale (Instrumentation, Debugging/Correctness, Thread and Process Management, Libraries and Language Development)
- Experience using extreme scale systems: Usability, In-situ Visualization, Programming Challenges, Algorithms and Methods, etc.
- Application characterization on emerging technologies: Novel Memories (NVM), processors (Intel® Xeon Scalable processor, Intel® FPGA, etc.)