Publications
All Publications - Courtesy of the University of Delaware CAPSL group
EDA/FPGA
"DEEP: An Iterative FPGA-based Many-Core Emulation System for Chip Verification and Architecture Research"
In Proceedings of 19th ACM/SIGDA International Symposium on Field-Programmable Gate Arrays (FPGA'11), Monterrey, CA, USA. February 27 - March 1, 2011.
Juergen Ributzka, Yuhei Hayashi, Fei Chen and Guang R. Gao
"FAST: A Functionally Accurate Simulation Toolset for the Cyclops-64 Cellular Architecture"
In Proceedings of Workshop on Modeling, Benchmarking and Simulation (MoBS), held in conjunction with the 32nd Annual International Symposium on Computer Architecture (ISCA 2005), Madison, WI, USA. June 4, 2005.
Juan del Cuvillo, Weirong Zhu, Ziang Hu, and Guang R. Gao
Also available in pdf format
Oil and Gas
"Locality Optimization of Stencil Applications using Data Dependency Graphs"
In Proceedings of the 23rd International Workshop on Languages and Compilers for Parallel Computing (LCPC 2010), Houston, TX, USA. October 7-9, 2010.
Daniel Orozco, Elkin Garcia and Guang R. Gao
Many-Core benchmarking
"Optimized Dense Matrix Multiplication on a Many-Core Architecture"
In Proceedings of International European Conference on Parallel and Distributed Computing (Euro-Par'10), Ischia, Italy. August 31- September 3, 2010.
Elkin Garcia, Ioannis E. Venetis, Rishi Khan and Guang R. Gao
"Performance Analysis of Cooley-Tukey FFT Algorithms for a Many-core Architecture "
In Proceedings of The High Performance Computing Symposium (HPC 2010), Orlando, FL, USA. April 12-15, 2010.
Long Chen and Guang R. Gao
"Mapping the FDTD Application to Many-Core Chip Architectures"
In Proceedings of the 38th International Conference on Parallel Processing (ICPP 2009), Vienna, Austria. September 22-25, 2009.
Daniel Orozco and Guang R. Gao
"Mapping the LU Decomposition on a Many Core Architecture: Challenges and Solutions"
In Proceedings of ACM International Conference on Computing Frontiers (CF 2009), Ischia, Italy. May 18-20, 2009
Ioannis E. Venetis and Guang R. Gao
"Experience of Optimizing FFT on Intel Core Architecture"
In Proceedings of Workshop on Performance Optimization for High-Level Languages and Libraries in the 21st IEEE International Parallel and Distributed Processing Symposium (IPDPS 2007), Long Beach, CA, USA. March 26 - 30, 2007.
Daniel Orozco, Liping Xue, Murat Bolat, Xiaoming Li and Guang Gao
Also available in pdf format
"Optimizing Fast Fourier Transform on a Multi-core Architecture"
In Proceedings of Workshop on Performance Optimization for High-Level Languages and Libraries in the 21st IEEE International Parallel and Distributed Processing Symposium (IPDPS 2007), Long Beach, CA, USA. March 26 - 30, 2007.
Long Chen and Ziang Hu
Also available in pdf format
"Energy efficient tiling on a Many-Core Architecture"
In Proceedings of 4th Workshop on Programmability Issues for Heterogeneous Multicores (MULTIPROG 2011); 6th International Conference on High-Performance and Embedded Architectures and Compilers (HiPEAC), Heraklion, Greece. January 23, 2011.
Elkin Garcia, Daniel Orozco and Guang R. Gao
Cyclops Hardware
"Exploring a multithreaded Methodology to Implement a Network Communication Protocol on the Cyclops-64 Multithreaded Architecture"
In Proceedings of First Workshop on Multithreaded Architectures and Applications in the 21st IEEE International Parallel and Distributed Processing Symposium (IPDPS 2007), Long Beach, CA, USA. March 26 - 30, 2007.
Ge Gan, Ziang Hu, Juan del Cuvillo, and Guang R. Gao
Also available in pdf format
"Optimization of Dense Matrix Multiplication on IBM Cyclops-64: Challenges and Experiences"
In Proceedings of the 12th International European Conference on Parallel Processing (Euro-Par 2006), Dresden, Germany. August 29 - September 1, 2006.
Ziang Hu, Juan del Cuvillo, Weirong Zhu, and Guang R. Gao
Also available in pdf format
"Towards a Software Infrastructure for the Cyclops-64 Cellular Architecture"
In Proceedings of the 20th International Symposium on High Performance Computing Systems and Applications (HPCS'06), St. John's, Canada. May 14 - 17, 2006.
Juan del Cuvillo, Weirong Zhu, Ziang Hu, and Guang R. Gao
Also available in pdf format
"A Study of the On-Chip Interconnection Network for the IBM Cyclops-64 Multi-Core Architecture"
In Proceedings of 20th International Parallel and Distributed Processing Symposium (IPDPS2006), Rhodes Island, Greece. April 25 - 29, 2006.
Ying M. P. Zhang, Taikyeong Jeong, Fei Chen, Haiping Wu, Ronny Nitzsche, and Guang R. Gao
Also available in pdf format
"Performance Modelling and Optimization of Memory Access on Cellular Computer Architecture Cyclops-64"
In Proceedings of Network and Parallel Computing (NPC 2005), Beijing, China. November 30 - December 3, 2005.
Yanwei Niu, Ziang Hu, Kenneth Barner, Guang R. Gao
Also available in pdf format
TnT Software model
"TiNy threads on BlueGene/P: Exploring many-core parallelisms beyond The traditional OS"
In Proceedings of Workshop on Multithreaded Architecures and Applications (MTAAP); 24th IEEE International Parallel and Distributed Processing Symposium (IPDPS 2010), Atlanta, GA, USA. April 23, 2010.
Handong Ye, Robert Pavel, Aaron Landwehr and Guang Gao
"TiNy Threads: a Thread Virtual Machine for the Cyclops-64 Cellular Architecture"
In Proceedings of the Fifth Workshop on Massively Parallel Processing (WMPP), held in conjunction with the 19th International Parallel and Distributed Processing Symposium (IPDPS 2005), Denver, CO, USA. April 3 - 8, 2005
Juan del Cuvillo, Weirong Zhu, Ziang Hu, and Guang R. Gao
Also available in pdf format
MPI and OpenMP
"A Study of a Software Cache Implementation of the OpenMP Memory Model for Multicore and Manycore Architectures"
In Proceedings of International European Conference on Parallel and Distributed Computing (Euro-Par'10), Ischia, Italy. August 31- September 3, 2010.
Chen Chen, Joseph B Manzano, Ge Gan, Guang R. Gao and Vivek Sarkar
"Tile Percolation: an OpenMP Tile Aware Parallelization Technique for the Cyclops-64 Multicore Processor"
In Proceedings of International European Conference on Parallel and Distributed Computing (Euro-Par'09), Delft, The Netherlands. August 25-28, 2009
Ge Gan, Xu Wang, Joseph Manzano and Guang R. Gao
"Tile reduction: the first step towards Openmp tile aware parallelization"
In Proceedings of the 5th International Workshop on OpenMP (IWOMP'09), Dresden, Germany, June 3-5, 2009
Ge Gan, Xu Wang, Joseph Manzano, Guang R. Gao
"Landing OpenMP on Cyclops-64: An Efficient Mapping of OpenMP to a many-core System-on-a-chip"
In Proceedings of the 3rd ACM International Conference on Computing Frontiers, Ischia, Italy. May 2-5, 2006.
Juan del Cuvillo, Weirong Zhu, Guang R. Gao
Also available in pdf format
"Performance Characteristics of OpenMP Language Constructs on a Many-core-on-a-chip Architecture"
In Proceedings of the 2nd International Workshop on OpenMP (IWOMP2006), Remis, France. June 12-15 2006.
Weirong Zhu, Juan del Cuvillo, and Guang R. Gao
Also available in pdf format
"Synchronization State Buffer: Supporting Efficient Fine-Grain Synchronization for Many-Core Architectures"
In Proceedings of the 34th International Symposium on Computer Architecture (ISCA 2007), San Diego, CA, USA. June 9-13, 2007
Weirong Zhu, Vugranam C. Sreedhar, Ziang Hu, and Guang R. Gao
