Tuowen Zhao
Systems Control (United States)
Research Areas
Parallel Computing and Optimization Techniques, Advanced Data Storage Technologies, Distributed and Parallel Computing Systems, Scientific Computing and Data Management, Tensor decomposition and applications
Most-Cited Works
- Exploiting reuse and vectorization in blocked stencil computations on CPUs and GPUs (2019), cited 44 times
- Delivering Performance-Portable Stencil Computations on CPUs and GPUs Using Bricks (2018), cited 37 times
- SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts (2024), cited 22 times
- Improving communication by optimizing on-node data movement with data layout (2021), cited 18 times
- Polyhedral Specification and Code Generation of Sparse Tensor Contraction with Co-iteration (2022), cited 18 times
- Performance Portability Evaluation of Blocked Stencil Computations on GPUs (2023), cited 12 times
- Bricks: A high-performance portability layer for computations on block-structured grids (2024), cited 3 times
- SIMD code generation for stencils on brick decompositions (2018), cited 3 times
- Maximizing Performance Through Memory Hierarchy-Driven Data Layout Transformations (2022), cited 2 times