Publications
Publications grouped by year, in reverse chronological order.
2024
- Distributed transformer for high order epistasis detection in large-scale datasets. Scientific Reports, 2024
- IPU-EpiDet: Identifying Gene Interactions on Massively Parallel Graph-Based AI Accelerators. In 2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2024
2023
- A Performance Modelling-Driven Approach to Hardware Resource Scaling. In European Conference on Parallel Processing, 2023
- Sparse-Aware CARM: Rooflining Locality of Sparse Computations. In European Conference on Parallel Processing, 2023
- Interpreting High Order Epistasis Using Sparse Transformers. In Proceedings of the 8th ACM/IEEE International Conference on Connected Health: Applications, Systems and Engineering Technologies, 2023
- Performance modelling-driven optimization of RISC-V hardware for efficient SpMV. In International Conference on High Performance Computing, 2023
2022
- Tensor-Accelerated Fourth-Order Epistasis Detection on GPUs. In Proceedings of the 51st International Conference on Parallel Processing, 2022
- Unlocking Personalized Healthcare on Modern CPUs/GPUs: Three-way Gene Interaction Study. In 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2022
2021
- Mansard roofline model: Reinforcing the accuracy of the roofs. ACM Transactions on Modeling and Performance Evaluation of Computing Systems, 2021
- Fourth-order exhaustive epistasis detection for the xPU Era. In Proceedings of the 50th International Conference on Parallel Processing, 2021
- HEDAcc: FPGA-based accelerator for high-order epistasis detection. In 2021 IEEE 29th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM), 2021
- Retargeting tensor accelerators for epistasis detection. IEEE Transactions on Parallel and Distributed Systems, 2021
2020
- Heterogeneous CPU+iGPU processing for efficient epistasis detection. In European Conference on Parallel Processing, 2020
- Application-driven cache-aware roofline model. Future Generation Computer Systems, 2020
- Exploring the binary precision capabilities of tensor cores for epistasis detection. In 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2020
- Parallel evolutionary computation for multiobjective gene interaction analysis. Journal of Computational Science, 2020
2018
- Cache-aware roofline model and medical image processing optimizations in GPUs. In International Conference on High Performance Computing, 2018
- Modeling non-uniform memory access on large compute nodes with the cache-aware roofline model. IEEE Transactions on Parallel and Distributed Systems, 2018
2017
- Analyzing performance of multi-cores and applications with cache-aware roofline model. In 2017 International Conference on High Performance Computing & Simulation (HPCS), 2017
- Performance analysis with cache-aware roofline model in Intel Advisor. In 2017 International Conference on High Performance Computing & Simulation (HPCS), 2017
- Exploring GPU performance, power and energy-efficiency bounds with Cache-aware Roofline Modeling. In 2017 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2017
2016
- Beyond the roofline: Cache-aware power and energy-efficiency modeling for multi-cores. IEEE Transactions on Computers, 2016
2013
- Cache-aware roofline model: Upgrading the loft. IEEE Computer Architecture Letters, 2013