Design and application of reconfigurable circuits and systems by Cheung, Peter
  
 
 
 
 
 
	
	
	
	
	
Design	and	Application	of		
Reconfigurable	Circuits	and	Systems		
	
	
	
	
	
	
Submission	for	the	Award	of	the	Degree	of	Doctor	of	Science	(DSc)	
Imperial	College	London	
	
	
	
	
	
	
	
Peter	Y.	K.	Cheung	
	
	
	
	
	
	
	
21	December	2015	
   2	
 
    1	
Acknowledgements	
	
	
Being	 research	 partner	 to	 Professor	Wayne	 Luk	 is	 like	 a	 dream.	 	 Since	 his	 arrival	 at	
Imperial	 College	 in	 1995,	 we	 have	 been	 collaborating	 closely	 in	 our	 research.	 The	
synergistic	relationship	between	us	has	enable	us	to	create	at	Imperial	College	one	of	
the	strongest	teams	in	the	world	working	on	reconfigurable	systems.		Ours	remains	the	
best	 collaborative	effort	between	 the	Department	of	EEE	and	Computing	at	 Imperial	
College.	I	am	particularly	grateful	for	Wayne’s	infinite	patience,	unbounded	energy	and	
deep	insights	into	many	aspects	of	FPGAs	and	custom	computing	research.		I	am	also	
fully	of	 admirations	 for	his	 ability	 to	 answer	email	 at	 anytime	 from	anywhere	 in	 the	
world!	
	
I	am	thankful	for	the	opportunity	to	work	with	some	of	the	best	research	students	who	
passed	through	my	laboratory.		In	particular,	Professor	George	Constantinides	and	Dr	
Christos	Bouganis,	who	have	since	 join	the	Department	as	academic	staff,	were	 ideal	
research	students	once,	but	are	now	wonderful	colleagues.				
	
I	 also	owe	a	debt	of	 gratitude	 to	 Imperial	 College,	 the	 Faculty	of	 Engineering	 and	 in	
particular,	to	the	Department	of	Electrical	and	Electronic	Engineering.	 	Together	they	
provided	me	with	a	supportive	and	 inspiring	environment	 for	the	past	three	decades	
to	pursue	scholarship	instead	of	government	metrics,	offered	me	resources,	freedom	
and	encouragements	to	become	world-class	in	my	research.	
	
Finally	 I	 thank	my	 long-suffering	wife,	Dr	Miranda	 Lai-Kit	Cheung,	who	has	 tolerated	
my	 absences	 due	 to	 conference	 trips,	 College	 commitments	 and	 professional	
engagements	for	too	many	years	and	far	too	frequently.				
    2	
	
	
	
	
This	page	is	intentionally	left	blank.	
    3	
	
	
	
	
Table	of	Contents	
	
	
	
	
	
Acknowledgements	 Page	1	
Table	of	Contents	 Page	3	
Curriculum	Vitae	 Page	5	
List	of	Submitted	Publications	 Page	31	
Statement	of	Contribution	 Page	35	
Statement	of	Conjoint	Work	 Page	47	
Publications	 Page	51	
	
	
	
	
	
	
	
    4	
	
	
	
	
	
This	page	is	intentionally	left	blank.	
    5	
	
	
	
	
	
	
	
	
Curriculum	Vitae
    6	
	
 
 
 
Name: Peter, Ying-Kay CHEUNG Nationality: British 
 
Date of Birth: 4th May 1953 
 
Contact Details: Department of Electrical & Electronic Engineering 
 Imperial College London, SW7 2BT. 
 
Email Address: p.cheung@imperial.ac.uk  Daytime no: +44 20 7594 
6200 
 
 
Title of Current Appointment:    
 Professor of Digital Systems 
 Vice Dean (Education), Faculty of Engineering 
 Imperial College London 
 
Education: 1970 - 1973 BSc (Eng) (1st class Hons) Department of Elec Eng, 
  Imperial College of Science & Technology  
  University of London 
 
Scholarships & Awards: 1971,72 Imperial College Scholarship Award 
 1973 IEE Prize for overall distinction 
 1994 Imperial College Teaching Fellow 
 
Professional Membership:  
 Senior Member of IEEE, Fellow of IET 
 
Employment: 
2010 – Now Vice Dean (Education)  
 Faculty of Engineering, Imperial College London 
2008 – Now Professor of Digital Systems 
2008 – 2015 Head of Electrical & Electronic Engineering, Imperial College London 
2002 – 2008 Director of Undergraduate Studies 
Deputy Head of Electrical & Electronic Engineering, Imperial College London 
1997 – 2002 Reader of Digital Systems, Director of Undergraduate Studies 
Deputy Head of Electrical & Electronic Engineering, Imperial College London 
1992 - 1997 Senior Lecturer, Imperial College 
1980 - 1992 Lecturer & Director of Microprocessor Teaching Unit 
Department of Electrical & Electronic Engineering, Imperial College 
1976 - 1980 Research assistant, Department of Electrical Engineering, Imperial College 
1973 - 1976 Design Engineer, Hewlett Packard Ltd., Scotland 
 
 
    7	
LIST OF PUBLICATIONS 
Books Authored (2), Edited (2) and Contributions to Books Edited by Others (7) 
[1] C.Bouganis, S.A.Fahmy and P.Y.K. Cheung. From Algorithms to Hardware 
Implementation, Chapter 13. In M. Petrou and A. Bharath (Eds.), "Next Generation 
Artificial Vision Systems: Reverse Engineering the Human Visual System", Artech House 
Series Bioinformatics & Biomedical Imaging, 2008 
[2] Y.Liu, C.Bouganis and P.Y.K. Cheung. Real-Time Spatiotemporal Saliency, Chapter 14. 
In M. Petrou and A. Bharath (Eds.), "Next Generation Artificial Vision Systems: Reverse 
Engineering the Human Visual System", Artech House Series Bioinformatics & 
Biomedical Imaging, 2008. 
[3] Dong-U Lee, W.Luk, John Villasenor and P.Y.K. Cheung. The Effects of Polynomial 
Degrees. In “New Algorithms, Architectures and Applications for Reconfigurable 
Computing”, Eds. Wolfgang Rosenstiel, Patrick Lysaght, pp.301-313, Kluwer Academic 
Press, 2005. 
[4] Theerayod Wiangtong, P.Y.K. Cheung and Wayne Luk. A Unified Codesign Environment 
for the UltraSONIC Reconfigurable Computer. In “New Algorithms, Architectures and 
Applications for Reconfigurable Computing”, Eds. Wolfgang Rosenstiel, Patrick Lysaght, 
pp.81-92, Kluwer Academic Publishers, 2005. 
[5] G. Constantinides, P.Y.K. Cheung, and W. Luk. Synthesis and optimization of DSP 
algorithms. Kluwer Academic Publishers,  pp. 1-164, 2004. 
[6] P.Y.K. Cheung, G. A. Constantinides and J. de Sousa (ed.). Field-Programmable Logic 
and Applications, LNCS 2778, 1179 pages, Springer-Verlag, 2003. 
[7] J. de Sousa and P.Y.K. Cheung. Boundary-scan Test and Diagnosis: Interconnect Defects. 
Kluwer Academic Publishers. 169 pages, ISBN 0792373146, 2001. 
[8] W. Luk, N. Shirazi and P.Y.K. Cheung. Configurable computing. In Electrical 
Engineering Handbook, W. K. Chen (Ed.) ISBN: 978-0-12-170960-0.. Academic Press, 
2004. 
[9] G. A. Constantinides, P.Y.K. Cheung and W. Luk. Multiprocessor systems, Electrical 
Engineering Handbook, W. K. Chen (Ed.). ISBN: 978-0-12-170960-0. Academic Press, 
2004. 
[10] W. Luk, P.Y.K. Cheung and M. Glesner (ed.). Field-Programmable Logic and 
Applications. LNCS 1304, Springer-Verlag, 1997. 
[11] D.J. Ostrowski and P.Y.K. Cheung. A Fuzzy Logic Approach to Handwriting 
Recognition. In Fuzzy Logic Implementation and Applications, Ed. M.J. Patryra, D.M. 
Mlynek, 299-314, Wiley 1996. 
Patents (4) 
[12] P.Y.K. Cheung, N. P. Sedcole, and J. S. J. Wong. Method of measuring delay in an 
integrated circuit, Oct. 8 2013. US Patent 8,552,740.  
[13] W. Luk, P.Y.K. Cheung, and S. P. Seng. Flexible instruction processor systems and 
methods, June 2 2009. US Patent 7,543,283.  
[14] P.Y.K. Cheung and N. P. Sedcole. Random generation of pld configurations to 
compensate for delay variability, Feb. 15 2008. US Patent App. 12/526,927.  
[15] P.Y.K. Cheung and S. D. Haynes. Reconfigurable multiplier array, Apr. 9 2002. US 
Patent 6,369,610.  
    8	
Journal (65) and Conference (223) Articles 
[16] B.H.K. Chen, P.Y.S. Cheung, P.Y.K. Cheung, and Y.K. Kwok. CypherDB: A Novel 
Architecture for Outsourcing Secure Database Processing. Computers, IEEE Transactions 
on, (accepted for publication) 2016.   
[17] E. Stott, J. M. Levine, P.Y.K. Cheung, and N. Kapre. Timing fault detection in FPGA-
based circuits. In Field-Programmable Custom Computing Machines (FCCM), 2014 
IEEE 22nd Annual International Symposium on, pages 96–99. IEEE, 2014.  
[18] J. Liu, C. Bouganis, and P.Y.K. Cheung. Image progressive acquisition for hardware 
systems. In Proceedings of the conference on Design, Automation & Test in Europe 
(DATE’14), pages 355-360. European Design and Automation Association, 2014.  
[19] J. M. Levine, E. Stott, and P.Y.K. Cheung. Dynamic voltage & frequency scaling with 
online slack measurement. In Proceedings of the 2014 ACM/SIGDA international 
symposium on Field-programmable gate arays, pages 65–74. ACM, 2014.  
[20] Z. Guan, J. S. Wong, S. Chaudhuri, G. Constantinides, and P.Y.K. Cheung. Mitigation of 
process variation effect in FPGAs with partial rerouting method. IEICE Electronics 
Express, 11(3):20140011–20140011, 2014.  
[21] Z. Guan, J. S. Wong, S. Chaudhuri, G. Constantinides, and P.Y.K. Cheung. Classication 
on variation maps: a new placement strategy to alleviate process variation on FPGA. 
IEICE Electronics Express, 11(3):20130912– 20130912, 2014.  
[22] T. C. Chau, J. S. Targett, M. Wijeyasinghe, W. Luk, P.Y.K. Cheung, B. Cope, A. Eele, 
and J. Maciejowski. Accelerating sequential Monte Carlo method for real-time air traffic 
management. ACM SIGARCH Computer Architecture News, 41(5):35–40, 2014.  
[23] T. C. Chau, X. Niu, A. Eele, J. Maciejowski, P.Y.K. Cheung, and W. Luk. Mapping 
adaptive particle filters to heterogeneous reconfigurable systems. ACM Transactions on 
Reconfigurable Technology and Systems (TRETS), 7(4):9, 2014.  
[24] J. S. Wong and P.Y.K. Cheung. Timing measurement platform for arbitrary black-box 
circuits based on transition probability. Very Large Scale Integration (VLSI) Systems, 
IEEE Transactions on, 21(12):2307–2320, 2013.  
[25] E. Stott, Z. Guan, J. M. Levine, J. S. Wong, and P.Y.K. Cheung. Variation and reliability 
in FPGAs. IEEE Design & Test, 6(30):50–59, 2013.  
[26] Adam Powell, C. Savvas-Bouganis, and P.Y.K. Cheung. High-level power and 
performance estimation of FPGA-based soft processors and its application to design 
space exploration. Journal of Systems Architecture, 59(10):1144– 1156, 2013.  
[27] J. Liu, C. Bouganis, and P.Y.K. Cheung. Domain-specific progressive sampling of face 
images. In Global Conference on Signal and Information Processing (GlobalSIP), 2013 
IEEE, pages 1021–1024. IEEE, 2013.  
[28] J. M. Levine, E. Stott, G. Constantinides, P.Y.K. Cheung, et al. SMI: Slack 
Measurement Insertion for online timing monitoring in FPGAs. In Field Programmable 
Logic and Applications (FPL), 2013 23rd International Conference on, pages 1–4. IEEE, 
2013.  
[29] Z. Guan, J. S. Wong, S. Chaudhuri, G. Constantinides, and P.  Y. K. Cheung. A 
variation-adaptive retiming method exploiting reconfigurability. In Field Programmable 
Logic and Applications (FPL), 2013 23rd International Conference on, pages 1–4. IEEE, 
2013.  
[30] Z. Guan, J. S. Wong, S. Chaudhuri, G. Constantinides, and P.Y.K. Cheung. Exploiting 
stochastic delay variability on FPGAs with adaptive partial rerouting. In Field-
    9	
Programmable Technology (FPT), 2013 International Conference on, pages 254–261. 
IEEE, 2013.  
[31] J.J. Davis and P.Y.K. Cheung. Datapath fault tolerance for parallel accelerators. In Field-
Programmable Technology (FPT), 2013 International Conference on, pages 366–369. 
IEEE, 2013.  
[32] P.Y.K. Cheung, N. P. Sedcole, and J. S. J. Wong. Method of measuring delay in an 
integrated circuit, Oct. 8 2013. US Patent 8,552,740.  
[33] T. C. Chau, X. Niu, A. Eele, W. Luk, P.Y.K. Cheung, and J. M. Maciejowski. 
Heterogeneous reconfigurable system for adaptive particle filters in real-time 
applications. In ARC, pages 1–12, 2013.  
[34] T. C. Chau, K.-W. Kwok, G. C. Chow, K. H. Tsoi, K.-H. Lee, Z. Tse, P.Y.K. Cheung, 
and W. Luk. Acceleration of real-time proximity query for dynamic active constraints. In 
Field-Programmable Technology (FPT), 2013 International Conference on, pages 206–
213. IEEE, 2013.  
[35] A.Powell, C.-S. Bouganis, and P.Y.K. Cheung. Early performance estimation of image 
compression methods on soft processors. In Field Programmable Logic and Applications 
(FPL), 2012 22nd International Conference on, pages 587–590. IEEE, 2012.  
[36] J. M. Levine, E. Stott, G. Constantinides, P.Y.K. Cheung, et al. Online measurement of 
timing in circuits: For health monitoring and dynamic voltage & frequency scaling. In 
Field-Programmable Custom Computing Machines (FCCM), 2012 IEEE 20th Annual 
International Symposium on, pages 109–116. IEEE, 2012.  
[37] Z. Guan, J. S. Wong, S. Chaudhuri, G. Constantinides, and P.Y.K. Cheung. A two-stage 
variation-aware placement method for FPGAs exploiting variation maps classification. In 
Field Programmable Logic and Applications (FPL), 2012 22nd International Conference 
on, pages 519–522. IEEE, 2012.  
[38] T. C. Chau, W. Luk, P.Y.K. Cheung, A. Eele, and J. Maciejowski. Adaptive sequential 
Monte Carlo approach for real-time applications. In Field Programmable Logic and 
Applications (FPL), 2012 22nd International Conference on, pages 527–530. IEEE, 2012.  
[39] T. C. Chau, W. Luk, and P.Y.K. Cheung. Roberts: reconfigurable platform for 
benchmarking real-time systems. ACM SIGARCH Computer Architecture News, 
40(5):10–15, 2012.  
[40] J. S. Wong and P.Y.K. Cheung. Improved delay measurement method in FPGA based on 
transition probability. In Proceedings of the 19th ACM/SIGDA international symposium 
on Field programmable gate arrays, pages 163–172. ACM, 2011.  
[41] E. Stott and P.Y.K. Cheung. Improving FPGA reliability with wear-levelling. In Field 
Programmable Logic and Applications (FPL), 2011 International Conference on, pages 
323–328. IEEE, 2011.  
[42] T. Mak, P.Y.K. Cheung, K.-P. Lam, and W. Luk. Adaptive routing in network-on-chips 
using a dynamic-programming network. Industrial Electronics, IEEE Transactions on, 
58(8):3701–3716, 2011.  
[43] B. Cope, P.Y.K. Cheung, W. Luk, and L. Howes. A systematic design space exploration 
approach to customising multi-processor architectures: Exemplified using graphics 
processors. In Transactions on High-Performance Embedded Architectures and 
Compilers IV, pages 63–83. Springer, 2011.  
[44] S. Chaudhuri, J. S. Wong, and P.Y.K. Cheung. Timing speculation in FPGAs: 
Probabilistic inference of data dependent failure rates. In Field-Programmable 
Technology (FPT), 2011 International Conference on, pages 1–8. IEEE, 2011.  
    10	
[45] M. E. Angelopoulou, C.-S. Bouganis, and P.Y.K. Cheung. Blur identification with 
assumption validation for sensor-based video reconstruction and its implementation on 
field programmable gate array. Computers & Digital Techniques, IET, 5(4):271–286, 
2011.  
[46] Y. Wu, P. Kuvinichkul, P.Y.K. Cheung, and Y. Demiris. Towards anthropomorphic 
robot thereminist. In Robotics and Biomimetics (ROBIO), 2010 IEEE International 
Conference on, pages 235–240. IEEE, 2010.  
[47] E. A. Stott, J. S. Wong, P.  Sedcole, and P.Y.K. Cheung. Degradation in FPGAs: 
measurement and modelling. In Proceedings of the 18th annual ACM/SIGDA 
international symposium on Field programmable gate arrays, pages 229–238. ACM, 
2010.  
[48] E. Stott, J. S. Wong, and P.Y.K. Cheung. Degradation analysis and mitigation in FPGAs. 
In Field Programmable Logic and Applications (FPL), 2010 International Conference on, 
pages 428–433. IEEE, 2010.  
[49] E. Stott, P.  Sedcole, and P.Y.K. Cheung. Fault tolerance and reliability in field-
programmable gate arrays. Computers & Digital Techniques, IET, 4(3):196–210, 2010.  
[50] A.M. Smith, G. A. Constantinides, and P.Y.K. Cheung. An automated flow for 
arithmetic component generation in field-programmable gate arrays. ACM Transactions 
on Reconfigurable Technology and Systems (TRETS), 3(3):13, 2010.  
[51] A.M. Smith, G. Constantinides, P.Y.K. Cheung. FPGA architecture optimization using 
Geometric Programming. Computer-Aided Design of Integrated Circuits and Systems, 
IEEE Transactions on, 29(8):1163–1176, 2010.  
[52] T. Mak, P.  Sedcole, P.Y.K. Cheung, and W. Luk. Wave-pipelined intra-chip signaling 
for on-fpga communications. Integration, the VLSI Journal, 43(2):188–201, 2010.  
[53] S. L´opez, R. Sarmiento, P. G. Potter, W. Luk, and P.Y.K. Cheung. Exploration of 
hardware sharing for image encoders. In Design, Automation & Test in Europe 
Conference & Exhibition (DATE), 2010, pages 1737–1742. IEEE, 2010.  
[54] A.Kahoul, A. M. Smith, G. A. Constantinides, and P.Y.K. Cheung. Efficient 
heterogeneous architecture floorplan optimization using analytical methods. ACM 
Transactions on Reconfigurable Technology and Systems (TRETS), 4(1):3, 2010.  
[55] D. H. Jones, A. Powell, C.-S. Bouganis, and P.Y.K. Cheung. A salient region detector 
for GPU using cellular automata architecture. In Neural Information Processing. Models 
and Applications, pages 501–508. Springer Berlin Heidelberg, 2010.  
[56] D. H. Jones, A. Powell, C.-S. Bouganis, and P.Y.K. Cheung. GPU versus FPGA for high 
productivity computing. In Field Programmable Logic and Applications (FPL), 2010 
International Conference on, pages 119–124. IEEE, 2010.  
[57] P. Jamieson, T. Becker, P.Y.K. Cheung, W. Luk, T. Rissa, and T. Pitkanen. 
Benchmarking and evaluating reconfigurable architectures targeting the mobile domain. 
ACM Transactions on Design Automation of Electronic Systems (TODAES), 15(2):14, 
2010.  
[58] B. Cope, P.Y.K. Cheung, W. Luk, and L. Howes. Performance comparison of graphics 
processors to reconfigurable logic: A case study. Computers, IEEE Transactions on, 
59(4):433–448, 2010.  
[59] C. S. Bouganis, I. Pournara, and P.Y.K. Cheung. Exploration of heterogeneous FPGAs 
for mapping linear projection designs. Very Large Scale Integration (VLSI) Systems, 
IEEE Transactions on, 18(3):436–449, 2010.  
    11	
[60] T. Becker, W. Luk, and P.Y.K. Cheung. Energy-aware optimisation for run-time 
reconfiguration. In Field-Programmable Custom Computing Machines (FCCM), 2010 
18th IEEE Annual International Symposium on, pages 55–62. IEEE, 2010.  
[61] T. Becker, P. Jamieson, W. Luk, P.Y.K. Cheung, and T. Rissa. Power characterisation 
for fine-grain reconfigurable fabrics. International Journal of Reconfigurable Computing, 
2010:2, 2010.  
[62] J. S. Wong, P.  Sedcole, and P.Y.K. Cheung. Self-measurement of combinatorial circuit 
delays in FPGAs. ACM Transactions on Reconfigurable Technology and Systems 
(TRETS), 2(2):10, 2009.  
[63] L. Wang, T. Mak, P. Sedcole, and P.Y.K. Cheung. Throughput maximization for wave-
pipelined interconnects using cascaded buffers and transistor sizing. In Circuits and 
Systems, 2009. ISCAS 2009. IEEE International Symposium on, pages 1293–1296. 
IEEE, 2009.  
[64] E. Stott, P.  Sedcole, and P.Y.K. Cheung. Modelling degradation in FPGA lookup tables. 
In Field-Programmable Technology, 2009. FPT 2009. International Conference on, pages 
443–446. IEEE, 2009.  
[65] A.M. Smith, G. Constantinides, S. J. Wilton, P.Y.K. Cheung. Concurrently optimizing 
FPGA architecture parameters and transistor sizing: Implications for FPGA design. In 
Field-Programmable Technology, 2009. FPT 2009. International Conference on, pages 
54–61. IEEE, 2009.  
[66] A.M. Smith, G. Constantinides, P.Y.K. Cheung, et al. Area estimation and optimisation 
of FPGA routing fabrics. In Field programmable logic and applications, 2009. FPL 2009. 
International conference on, pages 256–261. IEEE, 2009.  
[67] P.  Sedcole, E. Stott, and P.Y.K. Cheung. Compensating for variability in FPGAs by re-
mapping and re-placement. In Field Programmable Logic and Applications, 2009. FPL 
2009. International Conference on, pages 613–616. IEEE, 2009.  
[68] P. G. Potter, W. Luk, and P.Y.K. Cheung. Partition-based exploration for reconfigurable 
JPEG designs. In Design, Automation & Test in Europe Conference & Exhibition, 
(DATE) 2009, pages 886–889. IEEE, 2009.  
[69] T. Mak, P.Y.K. Cheung, W. Luk, and K. P. Lam. A DP-network for optimal dynamic 
routing in network-on-chip. In Proceedings of the 7th IEEE/ACM international 
conference on Hardware/software codesign and system synthesis, pages 119–128. ACM, 
2009.  
[70] W. Luk, P.Y.K. Cheung, and S. P. Seng. Flexible instruction processor systems and 
methods, June 2 2009. US Patent 7,543,283.  
[71] Y. Liu, C.-S. Bouganis, and P.Y.K. Cheung. Hardware architectures for eigenvalue 
computation of real symmetric matrices. IET Computers & Digital Techniques, 3(1):72–
84, 2009.  
[72] Q. Liu, G. A. Constantinides, K. Masselos, and P.Y.K. Cheung. Data-reuse exploration 
under an on-chip memory constraint for low-power FPGA-based systems. IET computers 
& digital techniques, 3(3):235–246, 2009.  
[73] Q. Liu, G. A. Constantinides, K. Masselos, and P.Y.K. Cheung. Compiling C-like 
languages to FPGA hardware: Some novel approaches targeting data memory 
organization. The Computer Journal, page bxp020, 2009. ???? 
[74] Q. Liu, G. Constantinides, K. Masselos, P.Y.K. Cheung. Combining data reuse with 
data-level parallelization for FPGA-targeted hardware compilation: A Geometric 
Programming framework. Computer-Aided Design of Integrated Circuits and Systems, 
IEEE Transactions on, 28(3):305–315, 2009.  
    12	
[75] A.Kahoul, G. A. Constantinides, A. M. Smith, and P.Y.K. Cheung. Heterogeneous 
architecture exploration: Analysis vs. parameter sweep. In Reconfigurable Computing: 
Architectures, Tools and Applications, pages 133–144. Springer Berlin Heidelberg, 2009.  
[76] P. Jamieson, T. Becker, W. Luk, P.Y.K. Cheung, T. Rissa, and T. Pitkanen. 
Benchmarking reconfigurable architectures in the mobile domain. In Field Programmable 
Custom Computing Machines, 2009. FCCM’09. 17th IEEE Symposium on, pages 131–
138. IEEE, 2009.  
[77] S. A. Fahmy, P.Y.K. Cheung, and W. Luk. High-throughput one-dimensional median 
and weighted median filters on FPGA. IET computers & digital techniques, 3(4):384–
394, 2009.  
[78] J. A. Clarke, G. A. Constantinides, and P.Y.K. Cheung. Word-length selection for power 
minimization via nonlinear optimization. ACM Transactions on Design Automation of 
Electronic Systems (TODAES), 14(3):39, 2009.  
[79] C.-S. Bouganis, S.-B.  Park, G. A. Constantinides, and P.Y.K. Cheung. Synthesis and 
optimization of 2D filter designs for heterogeneous FPGAs. ACM Transactions on 
Reconfigurable Technology and Systems (TRETS), 1(4):24, 2009.  
[80] T. Becker, W. Luk, and P.Y.K. Cheung. Parametric design for reconfigurable software-
designed radio. In Reconfigurable Computing: Architectures, Tools and Applications, 
pages 15–26. Springer Berlin Heidelberg, 2009.  
[81] T. Becker, P.  Jamieson, W. Luk, P.Y.K. Cheung, and T. Rissa. Power characterisation 
for the fabric in fine-grain reconfiguralbe architectures. In Programmable Logic, 2009. 
SPL. 5th Southern Conference on, pages 77–82. IEEE, 2009.  
[82] M. E. Angelopoulou, C.-S. Bouganis, P.Y.K. Cheung, and G. A. Constantinides. Robust 
real-time super-resolution on FPGA and an application to video enhancement. ACM 
Transactions on Reconfigurable Technology and Systems (TRETS), 2(4):22, 2009.  
[83] M. E. Angelopoulou, C.-S. Bouganis, and P.Y.K. Cheung. A sensor-based approach to 
linear blur identification for real-time video enhancement. In Image Processing (ICIP), 
2009 16th IEEE International Conference on, pages 141–144. IEEE, 2009.  
[84] J. S. Wong, P. Sedcole, and P.Y.K. Cheung. A transition probability based delay 
measurement method for arbitrary circuits on FPGAs. . In Field-Programmable 
Technology 2008, FPT 2008. International Conference on, pages 105–112. IEEE, 2008.  
[85] J. S. Wong, P.Y.K. Cheung, and P. Sedcole. Combating process variation on FPGAs 
with a precise at-speed delay measurement method. In Field Programmable Logic and 
Applications, 2008. FPL 2008. International Conference on, pages 703–704. IEEE, 2008.  
[86] K. Turkington, G. Constantinides, K. Masselos, P.Y.K. Cheung. Outer loop pipelining 
for application specific datapaths in FPGAs. Very Large Scale Integration (VLSI) 
Systems, IEEE Transactions on, 16(10):1268– 1280, 2008.  
[87] K. Turkington, G. Constantinides, P.Y.K. Cheung, K. Masselos. Co-optimisation of 
datapath and memory in outer loop pipelining. In Field-Programmable Technology 2008, 
FPT 2008. International Conference on, pages 1–8. IEEE, 2008.  
[88] E. Stott, P. Sedcole, and P.Y.K. Cheung. Fault tolerant methods for relia•bility in 
FPGAs. In Field Programmable Logic and Applications, 2008. FPL 2008. International 
Conference on, pages 415–420. IEEE, 2008.  
[89] A.M. Smith, G. Constantinides, P.Y.K. Cheung. Integrated floor-planning, module-
selection, and architecture generation for reconfigurable devices. Very Large Scale 
Integration (VLSI) Systems, IEEE Transactions on, 16(6):733–744, 2008.  
    13	
[90] P. Sedcole, J. S. Wong, and P.Y.K. Cheung. Modelling and compensating for clock skew 
variability in FPGAs. In Field-Programmable Technology, 2008. FPT 2008. International 
Conference on, pages 217–224. IEEE, 2008.  
[91] P. Sedcole, J. S. Wong, and P.Y.K. Cheung. Measuring and modeling FPGA clock 
variability. In Proceedings of the 16th international ACM/SIGDA symposium on Field 
programmable gate arrays, pages 258–258. ACM, 2008.  
[92] P. Sedcole, J. S. Wong, and P.Y.K. Cheung. Characterisation of FPGA clock variability. 
In Symposium on VLSI, 2008. ISVLSI’08. IEEE Computer Society Annual, pages 322–
328. IEEE, 2008.  
[93] P. Sedcole and P.Y.K. Cheung. Parametric yield modeling and simulations of FPGA 
circuits considering within-die delay variations. ACM Transactions on Reconfigurable 
Technology and Systems (TRETS), 1(2):10, 2008.  
[94] T. Mak, P. Sedcole, P.Y.K. Cheung, and W. Luk.  Wave-pipelined signal•ing for on-
FPGA communication. In Field-Programmable Technology, 2008. FPT 2008. 
International Conference on, pages 9–16. IEEE, 2008.  
[95] T. Mak, P. Sedcole, P.Y.K. Cheung, and W. Luk. Interconnection lengths and delays 
estimation for communication links in FPGAs. In Proceedings of the 2008 international 
workshop on System level interconnect prediction, pages 1–10. ACM, 2008.  
[96] T. Mak, C. D’Alessandro, P. Sedcole, P.Y.K. Cheung, A. Yakovlev, and W. Luk. Global 
interconnections in FPGAs: modeling and performance analysis. In Proceedings of the 
2008 international workshop on System level interconnect prediction, pages 51–58. 
ACM, 2008.  
[97] T. Mak, C. D. Alessandro, P. Sedcole, P.Y.K. Cheung, A. Yakovlev, and W. Luk. 
Implementation of wave-pipelined interconnects in FPGAs. In Networks-on-Chip, 2008. 
NoCS 2008. Second ACM/IEEE International Symposium on, pages 213–214. IEEE, 
2008.  
[98] Q. Liu, G. Constantinides, K. Masselos, P.Y.K. Cheung. Combining data reuse 
exploitationwith data-level parallelization for FPGA targeted hardware compilation: A 
geometric programming framework. In Field Programmable Logic and Applications, 
2008. FPL 2008. International Conference on, pages 179–184. IEEE, 2008.  
[99] B. Cope, P.Y.K. Cheung, and W. Luk. Using reconfigurable logic to optimise GPU 
memory accesses. In Design, Automation and Test in Europe, (DATE) 2008. pages 44–
49. IEEE, 2008.  
[100] B. Cope, P.Y.K. Cheung, and W. Luk. Systematic design space exploration for 
customisable multi-processor architectures. In Embedded Computer Systems: 
Architectures, Modeling, and Simulation, 2008. SAMOS 2008. International Conference 
on, pages 57–64. IEEE, 2008.  
[101] J. Clarke, G. Constantinides, P.Y.K. Cheung, A. M. Smith.  Glitch-aware output 
switching activity from word-level statistics. In Circuits and Systems, 2008. ISCAS 2008. 
IEEE International Symposium on, pages 1792–1795. IEEE, 2008.  
[102] P.Y.K. Cheung and N. P. Sedcole. Random generation of pld configurations to 
compensate for delay variability, Feb. 15 2008. US Patent App. 12/526,927.  
[103] T. Becker, P. Jamieson, W. Luk, P.Y.K. Cheung, and T. Rissa.  Towards benchmarking 
energy efficiency of reconfigurable architectures. In Field Programmable Logic and 
Applications, 2008. FPL 2008. International Conference on, pages 691–694. IEEE, 2008.  
[104] S. Arifin and P.Y.K. Cheung. Affective level video segmentation by utilizing the 
pleasure-arousal-dominance information. Multimedia, IEEE Transactions on, 
10(7):1325–1341, 2008.  
    14	
[105] M. E. Angelopoulou, K. Masselos, P.Y.K. Cheung, and Y. Andreopoulos. 
Implementation and comparison of the 5/3 lifting 2d discrete wavelet transform 
computation schedules on FPGAs. Journal of signal processing systems, 51(1):3–21, 
2008.  
[106] M. E. Angelopoulou, C.-S. Bouganis, P.Y.K. Cheung, and G. A. Constantinides. FPGA-
based real-time super-resolution on an adaptive image sensor. In Reconfigurable 
Computing: Architectures, Tools and Applications, pages 125–136. Springer Berlin 
Heidelberg, 2008.  
[107] M. E. Angelopoulou, C.-S. Bouganis, and P.Y.K. Cheung. Video enhancement on an 
adaptive image sensor. In Image Processing, 2008. ICIP 2008. 15th IEEE International 
Conference on, pages 681–684. IEEE, 2008.  
[108] S.-S. Ang, G. A. Constantinides, W. Luk, and P.Y.K. Cheung. Custom parallel caching 
schemes for hardware-accelerated image compression. Journal of Real-Time Image 
Processing, 3(4):289–302, 2008.  
[109] J. S. Wong, P. Sedcole, and P.Y.K. Cheung. Self-characterization of combinatorial 
circuit delays in FPGAs. In Field-Programmable Technology, 2007. ICFPT 2007. 
International Conference on, pages 17–23. IEEE, 2007.  
[110] A.M. Smith, G. Constantinides, P.Y.K. Cheung. Fused-arithmetic unit generation for 
reconfigurable devices using common subgraph extraction. In Field-Programmable 
Technology, 2007. ICFPT 2007. International Conference on, pages 105–112. IEEE, 
2007.  
[111] P. Sedcole, P.Y.K. Cheung, G. Constantinides, W. Luk. Run-time integration of 
reconfigurable video processing systems. Very Large Scale Integration (VLSI) Systems, 
IEEE Transactions on, 15(9):1003–1016, 2007.  
[112] P.  Sedcole and P.Y.K. Cheung. Parametric yield in FPGAs due to within-die delay 
variations: a quantitative analysis. In Proceedings of the 2007 ACM/SIGDA 15th 
international symposium on Field programmable gate arrays, pages 178–187. ACM, 
2007.  
[113] G. W. Morris, G. A. Constantinides, and P.Y.K. Cheung. ROM to DSP block transfer for 
resource constrained synthesis. IET Computers & Digital Techniques, 1(1):17–26, 2007.  
[114] T. S. Mak, P. Sedcole, P.Y.K. Cheung, W. Luk, and K.-P. Lam. A hybrid analog-digital 
routing network for noc dynamic routing. In Networks-on-Chip, 2007. NOCS 2007. First 
International Symposium on, pages 173–182. IEEE, 2007.  
[115] T. S. Mak, P. Sedcole, P.Y.K. Cheung, and W. Luk. Average interconnections delay 
prediction for on-FPGA communication links. Nineteenth UK Asynchronous Forum, 
2007.  
[116] T. S. Mak, P. Sedcole, P.Y.K. Cheung, and W. Luk. Average interconnection delay 
estimation for on-FPGA communication links. Electronics letters, 43(17):918–920, 2007.  
[117] Y. Liu, C.-S. Bouganis, and P.Y.K. Cheung. Efficient mapping of a Kalman Filter into 
an FPGA using Taylor expansion. In Field Programmable Logic and Applications, 2007. 
FPL 2007. International Conference on, pages 345–350. IEEE, 2007.  
[118] Q. Liu, G. A. Constantinides, K. Masselos, and P.Y.K. Cheung. Data reuse exploration 
under area constraints for low power reconfigurable systems. Proc. WASP, 2007.  
[119] Q. Liu, G. Constantinides, K. Masselos, P.Y.K. Cheung.  Automatic on-chip memory 
minimization for data reuse. In Field-Programmable Custom Computing Machines, 2007. 
FCCM 2007. 15th Annual IEEE Symposium on, pages 251–260. IEEE, 2007.  
    15	
[120] S. A. Fahmy,  C.-S. Bouganis, P.Y.K. Cheung, and W. Luk. Real-time hardware 
acceleration of the trace transform. Journal of Real-Time Image Processing, 2(4):235–
248, 2007.  
[121] B. Cope, P.Y.K. Cheung, and W. Luk. Bridging the gap between FPGAs and multi-
processor architectures: A video processing perspective. In Application-specific Systems, 
Architectures and Processors, 2007. ASAP. IEEE International Conf. on, pages 308–313. 
IEEE, 2007.  
[122] J. Clarke, G. Constantinides, P.Y.K. Cheung. On the feasibility of early routing 
capacitance estimation for FPGAs. In Field Programmable Logic and Applications, 2007. 
FPL 2007. International Conference on, pages 234–239. IEEE, 2007.  
[123] C. S. Bouganis, I. Pournara, and P.Y.K. Cheung. Efficient mapping of dimensionality 
reduction designs onto heterogeneous FPGAs. In Field-Programmable Custom 
Computing Machines, 2007. FCCM 2007. 15th Annual IEEE Symposium on, pages 141–
150. IEEE, 2007.  
[124] T. Becker, W. Luk, and P.Y.K. Cheung. Enhancing relocatability of partial bitstreams 
for run-time recon•guration. In Field-Programmable Custom Computing Machines, 2007. 
FCCM 2007. 15th Annual IEEE Symposium on, pages 35–44. IEEE, 2007.  
[125] S. Arifin and P.Y.K. Cheung. A novel video parsing algorithm utilizing the pleasure-
arousal-dominance emotional information. In Image Processing, 2007. ICIP 2007. IEEE 
International Conference on, volume 6, pages VI–333. IEEE, 2007.  
[126] S. Arifin and P.Y.K. Cheung.  A novel probabilistic approach to modeling the pleasure-
arousal-dominance content of the video based on ”working memory”. In Semantic 
Computing, 2007. ICSC 2007. International Conference on, pages 147–154. IEEE, 2007.  
[127] S. Arifin and P.Y.K. Cheung. A computation method for video segmentation utilizing 
the pleasure-arousal-dominance emotional information. In Proceedings of the 15th 
international conference on Multimedia, pages 68–77. ACM, 2007.  
[128] S.-S. Ang, G. Constantinides, W. Luk, and P.Y.K. Cheung. A hybrid memory sub-
system for video coding applications. In Field-Programmable Custom Computing 
Machines, 2007. FCCM 2007. 15th Annual IEEE Symposium on, pages 317–318. IEEE, 
2007.  
[129] T. Todman, G. Constantinides, S. Wilton, O. Mencer, W. Luk, and P.Y.K. Cheung. 
Reconfigurable computing: architectures and design methods. System On Chip: Next 
Generation Electronics, pages 452–493, 2006.  
[130] A.M. Smith, G. Constantinides, P.Y.K. Cheung. A novel heuristic and provable bounds 
for reconfigurable architecture design. In Field Programmable Logic and Applications, 
2006. FPL’06. International Conference on, pages 1–6. IEEE, 2006.  
[131] P. Sedcole, P.Y.K. Cheung, G. Constantinides, W. Luk. On-chip communication in run-
time assembled reconfigurable systems. In Embedded Computer Systems: Architectures, 
Modeling and Simulation, 2006. SAMOS 2006. International Conference on, pages 168–
176. IEEE, 2006.  
[132] P. Sedcole and P.Y.K. Cheung. Within-die delay variability in 90nm FPGAs and 
beyond. In Field Programmable Technology, 2006. FPT 2006. IEEE International 
Conference on, pages 97–104. IEEE, 2006.  
[133] T. Rissa, P.Y.K. Cheung, and W. Luk. System level design exploration of jpeg 2000 
with softsonic virtual hardware platform. In Circuits and Systems, 2006. MWSCAS’06. 
49th IEEE International Midwest Symposium on, volume 1, pages 276–280. IEEE, 2006.  
    16	
[134] T. S. Mak, P.  Sedcole, P.Y.K. Cheung, and W. Luk. On-FPGA communication 
architectures and design factors. In Field Programmable Logic and Applications, 2006. 
FPL’06. International Conference on, pages 1–8. IEEE, 2006.  
[135] Y. Liu, C.-S. Bouganis, P.Y.K. Cheung, P. H. Leong, and S. J. Motley. Hardware 
efficient architectures for eigenvalue computation. In Design, Automation and Test in 
Europe, 2006. (DATE) 2006. Proceedings, volume 1, pages 1–6. IEEE, 2006.  
[136] Y. Liu, C.-S. Bouganis, and P.Y.K. Cheung. A spatiotemporal saliency framework. In 
Image Processing, 2006. ICIP 2006. IEEE International Conference on, pages 437–440, 
2006.  
[137] S. Fahmy, C.-S. Bouganis, P.Y.K. Cheung, W. Luk. Efficient realtime FPGA 
implementation of the trace transform. In Field Programmable Logic and Applications, 
2006. FPL’06. International Conference on, pages 1–6. IEEE, 2006.  
[138] J. Clarke, A. A. Gaffar, G. Constantinides, P.Y.K. Cheung. Fast word-level power 
models for synthesis of FPGA-based arithmetic. In Circuits and Systems, 2006. ISCAS 
2006. Proceedings. 2006 IEEE International Symposium on, pages 4–pp. IEEE, 2006.  
[139] N. Campregher, P.Y.K. Cheung, G. A. Constantinides, and M. Vasilko. Yield 
enhancements of design-specific FPGAs. In Proceedings of the 2006 ACM/SIGDA 14th 
international symposium on Field programmable gate arrays, pages 93–100. ACM, 2006.  
[140] N. Campregher, P.Y.K. Cheung, G. Constantinides, M. Vasilko, et al. Re-condiguration 
and fine-grained redundancy for fault tolerance in FPGAs. In Field Programmable Logic 
and Applications, 2006. FPL’06. International Conference on, pages 1–6. IEEE, 2006.  
[141] G. Caffarena, G. Constantinides, P.Y.K. Cheung, C. Carreras, O. Nieto-Taladriz. 
Optimal combined word-length allocation and architectural synthesis of digital signal 
processing circuits. Circuits and Systems II: Express Briefs, IEEE Transactions on, 
53(5):339–343, 2006.  
[142] C.-S. Bouganis, I. Pournara, and P.Y.K. Cheung. A statistical framework for 
dimensionality reduction implementation in FPGAs. In Field Programmable Technology, 
2006. FPT 2006. IEEE International Conference on, pages 365–368. IEEE, 2006.  
[143] C.-S. Bouganis, P.Y.K. Cheung, and L. Zhaoping. FPGA-accelerated pre-attentive 
segmentation in primary visual cortex. In Field Programmable Logic and Applications, 
2006. FPL’06. International Conference on, pages 1–6. IEEE, 2006.  
[144] S. Arifin and P.Y.K. Cheung. User attention based arousal content modeling. In Image 
Processing, 2006 IEEE International Conference on, pages 433–436. IEEE, 2006.  
[145] S. Arifin and P.Y.K. Cheung. Towards affective level video applications: A novel 
FPGA-based video arousal content modeling system. In Field Programmable Logic and 
Applications, 2006. FPL’06. International Conference on, pages 1–4. IEEE, 2006.  
[146] S. Arifin and P.Y.K. Cheung.  A novel FPGA-based implementation of time adaptive 
clustering for logical story unit segmentation. In Proceedings of the conference on 
Design, automation and test in Europe, (DATE) 2006, Designers’ forum, pages 227–232. 
European Design and Automation Association, 2006.  
[147] M. Angelopoulou, K. Masselos, P.Y.K. Cheung, and Y. Andreopoulos. A comparison of 
2-D discrete wavelet transform computation schedules on FPGAs. In Field Programmable 
Technology, 2006. FPT 2006. IEEE International Conference on, pages 181–188. IEEE, 
2006.  
[148] S.-S. Ang, G. Constantinides, W. Luk, and P.Y.K. Cheung. The cost of data dependence 
in motion vector estimation for reconfigurable platforms. In Field Programmable 
Technology, 2006. FPT 2006. IEEE International Conference on, pages 333–336. IEEE, 
2006.  
    17	
[149] S.-S. Ang, G. Constantinides, P.Y.K. Cheung, and W. Luk. A flexible multi-port caching 
scheme for reconfigurable platforms. In Reconfigurable Computing: Architectures and 
Applications, pages 205–216. Springer Berlin Heidelberg, 2006.  
[150] T. Wiangtong, P.Y.K. Cheung, and W. Luk.  A unified codesign environment. In New 
Algorithms, Architectures and Applications for Reconfigurable Computing, pages 81–91. 
Springer US, 2005.  
[151] T. Wiangtong, P.Y.K. Cheung, and W. Luk. Hardware/software codesign: a systematic 
approach targeting data-intensive applications. Signal Processing Magazine, IEEE, 
22(3):14–22, 2005.  
[152] T. J. Todman, G. A. Constantinides, S. J. Wilton, O. Mencer, W. Luk, and P.Y.K. 
Cheung. Reconfigurable computing: architectures and design methods. IEE Proceedings-
Computers and Digital Techniques, 152(2):193– 207, 2005.  
[153] A.M. Smith, G. Constantinides, P. Y.K. Cheung. Generation and exploration of 
reconfigurable architectures using mathematical programming. In Field Programmable 
Logic and Applications, 2005. International Conference on, pages 341–346. IEEE, 2005.  
[154] A.M. Smith. Exploration of heterogeneous reconfigurable architectures. In Field 
Programmable Logic and Applications, 2005. International Conference on, pages 719–
720. IEEE, 2005.  
[155] N. Sidahao, G. Constantinides, P.Y.K. Cheung. Power and area optimization for multiple 
restricted multiplication. In Field Programmable Logic and Applications, 2005. 
International Conference on, pages 112–117. IEEE, 2005.  
[156] N. Sidahao, G. Constantinides, P.Y.K. Cheung. A heuristic approach for multiple 
restricted multiplication. In Circuits and Systems, 2005. ISCAS 2005. IEEE International 
Symposium on, pages 692–695. IEEE, 2005.  
[157] T. Rissa, P.Y.K. Cheung, and W. Luk. Mixed abstraction execution for the softsonic 
virtual hardware platform. In Circuits and Systems, 2005. 48th Midwest Symposium on, 
pages 976–979. IEEE, 2005.  
[158] G. W. Morris, G. Constantimdes, and P.Y.K. Cheung. Using DSP blocks for ROM 
replacement: a novel synthesis flow. In Field Programmable Logic and Applications, 
2005. International Conference on, pages 77–82. IEEE, 2005.  
[159] W. J. Melis, K. Turkington, A. Whitton, W. Luk, P.Y.K. Cheung, and P. Metzgen. Cell 
based motion estimators for reconfigurable platforms. In ERSA, pages 218–224, 2005.  
[160] D.-U. Lee, W. Luk, J. D. Villasenor, and P.Y.K. Cheung. The effects of polynomial 
degrees. In New Algorithms, Architectures and Applications for Reconfigurable 
Computing, pages 301–313. Springer US, 2005.  
[161] L. Hey, P.Y.K. Cheung, M. Gellman. FPGA based router for cognitive packet networks. 
In Field-Programmable Technology, 2005. Proceedings. 2005 IEEE International 
Conference on, pages 331–332. IEEE, 2005.  
[162] S. Fahmy, P.Y.K. Cheung, W. Luk. Novel FPGA-based implementation of median and 
weighted median filters for image processing. In Field Programmable Logic and 
Applications, 2005. International Conference on, pages 142–147. IEEE, 2005.  
[163] S. Fahmy, P.Y.K. Cheung, W. Luk, et al. Hardware acceleration of hidden Markov 
model decoding for person detection. In Design, Automation and Test in Europe, (DATE) 
2005. Proceedings, pages 8–13. IEEE, 2005.  
[164] C. T. Ewe, P.Y.K. Cheung, G. Constantinides. Error modelling of dual fixed-point 
arithmetic and its application in field programmable logic. In Field Programmable Logic 
and Applications, 2005. International Conference on, pages 124–129. IEEE, 2005.  
    18	
[165] B. Cope, P.Y.K. Cheung, W. Luk, and S. Witt. Have GPUs made FPGAs redundant in 
the field of video processing? In Field-Programmable Technology, 2005. Proceedings. 
2005 IEEE International Conference on, pages 111–118. IEEE, 2005.  
[166] G. Constantinides, P.Y.K. Cheung, W. Luk. Optimum and heuristic synthesis of multiple 
word-length architectures. Very Large Scale Integration (VLSI) Systems, IEEE 
Transactions on, 13(1):39–57, 2005.  
[167] R. C. Cheung, N. J.-b. Telle, W. Luk, and P.Y.K. Cheung. Customizable elliptic curve 
cryptosystems. Very Large Scale Integration (VLSI) Systems, IEEE Transactions on, 
13(9):1048–1059, 2005.  
[168] R. C. Cheung, W. Luk, and P.Y.K. Cheung. Reconfigurable elliptic curve cryptosystems 
on a chip. In Proceedings of the conference on Design, Automation and Test in Europe 
(DATE), 2005 -Volume 1, pages 24–29. IEEE Computer Society, 2005.  
[169] R. C. Cheung, D.-U. Lee, O. Mencer, W. Luk, and P.Y.K. Cheung. Automating custom-
precision function evaluation for embedded processors. In Proceedings of the 2005 
international conference on Compilers, architectures and synthesis for embedded systems, 
pages 22–31. ACM, 2005.  
[170] N. Campregher, P.Y.K. Cheung, G. A. Constantinides, and M. Vasilko. Yield modelling 
and yield enhancement for FPGAs using fault tolerance schemes. In Field Programmable 
Logic and Applications, 2005. FPL’05. International Conference on, pages 409–414. 
IEEE, 2005.  
[171] N. Campregher, P.Y.K. Cheung, G. A. Constantinides, and M. Vasilko. Analysis of yield 
loss due to random photolithographic defects in the interconnect structure of FPGAs. In 
Proceedings of the 2005 ACM/SIGDA 13th international symposium on Field-
programmable gate arrays, pages 138–148. ACM, 2005.  
[172] C.-S. Bouganis, G. Constantinides, P.Y.K. Cheung. A novel 2D filter design 
methodology for heterogeneous devices. In Field-Programmable Custom Computing 
Machines, 2005. FCCM 2005. 13th Annual IEEE Symposium on, pages 13–22. IEEE, 
2005.  
[173] C.-S. Bouganis, G. Constantinides, P.Y.K. Cheung. A novel 2D filter design 
methodology. In Circuits and Systems, 2005. ISCAS 2005. IEEE International 
Symposium on, pages 532–535. IEEE, 2005.  
[174] C.-S. Bouganis, P.Y.K. Cheung, G. Constantinides. Heterogeneity exploration for 
multiple 2D filter designs. In Field Programmable Logic and Applications, 2005. 
International Conference on, pages 263–268. IEEE, 2005.  
[175] C. Te Ewe, P.Y.K. Cheung, and G. A. Constantinides. Dual fixed-point: An efficient 
alternative to floating-point computation. In Field Programmable Logic and Application, 
pages 200–208. Springer Berlin Heidelberg, 2004.  
[176] C. Sim, C. Toumazou, and P.Y.K. Cheung. Ratiometric current-mode rational DAC. 
Electronics Letters, 40(7):409–410, 2004.  
[177] N. Sidahao, G. A. Constantinides, and P.Y.K. Cheung. Multiple restricted multiplication. 
In Field Programmable Logic and Application, pages 374–383. Springer Berlin 
Heidelberg, 2004.  
[178] P. Sedcole, P.Y.K. Cheung, G. A. Constantinides, and W. Luk. A structured 
methodology for system-on-an-FPGA design. In Field Programmable Logic and 
Application, pages 1047–1051. Springer Berlin Heidelberg, 2004.  
[179] P. Sedcole, P.Y.K. Cheung, G. Constantinides, and W. Luk. A structured system 
methodology for FPGA based system-on-a-chip design. In Field-Programmable Custom 
    19	
Computing Machines, 2007. FCCM 2004. 12th Annual IEEE Symposium on, pages 271–
272. IEEE, 2004.  
[180] T. Rissa, W. Luk, and P. Y.K. Cheung. Automated combination of simulation and 
hardware prototyping. Proc. Int. Conf. on Engineering of Reconfigurable Systems and 
Algorithms (CSREA Press, 2004), 2004.  
[181] T. Rissa, P.Y.K. Cheung, and W. Luk. Softsonic: A customisable modular platform for 
video applications. In Field Programmable Logic and Application, pages 54–63. Springer 
Berlin Heidelberg, 2004.  
[182] W. J. Melis, P.Y.K. Cheung, and W. Luk. Scalable structured data access by combining 
autonomous memory blocks. In Field-Programmable Technology, 2004. Proceedings. 
2004 IEEE International Conference on, pages 457–460. IEEE, 2004.  
[183] W. J. Melis, P.Y.K. Cheung, and W. Luk. Autonomous memory block for reconfigurable 
computing. In ISCAS (2), pages 581–584, 2004.  
[184] D.-U. Lee, W. Luk, J. D. Villasenor, and P.Y.K. Cheung. A Gaussian noise generator for 
hardware-based simulations. Computers, IEEE Transactions on, 53(12):1523–1534, 2004.  
[185] S. Hettiaratchi and P.Y.K. Cheung.  A novel implementation of tile-based address 
mapping. In Design, Automation and Test in Europe Conference, (DATE) 2004. 
Proceedings, volume 1, pages 306–310. IEEE, 2004.  
[186] J. Gause, P.Y.K. Cheung, and W. Luk. Reconfigurable computing for shape-adaptive 
video processing. IEE Proceedings-Computers and Digital Techniques, 151(5):313–320, 
2004.  
[187] R. C. Cheung, A. Brown, W. Luk, and P.Y.K. Cheung. A scalable hardware architecture 
for prime number validation. In Field-Programmable Technology, 2004. Proceedings. 
2004 IEEE International Conference on, pages 177–184. IEEE, 2004.  
[188] N. Campregher, P.Y.K. Cheung, and M. Vasilko.  BIST based interconnect fault location 
for FPGAs. In Field Programmable Logic and Application, pages 322–332. Springer 
Berlin Heidelberg, 2004.  
[189] C.-S. Bouganis, P.Y.K. Cheung, J. Ng, and A. A. Bharath. A steerable complex wavelet 
construction and its implementation on FPGA. In Field Programmable Logic and 
Application, pages 394–403. Springer Berlin Heidelberg, 2004.  
[190] T. Wiangtong, C. Ewe, and P.Y.K. Cheung.  Sonicmole: A debugging environment for 
the ultrasonic reconfigurable computer. In Circuits and Systems, 2003. ISCAS’03. 
Proceedings of the 2003 International Symposium on, volume 2, pages II–808. IEEE, 
2003.  
[191] T. Wiangtong, P.Y.K. Cheung, and W. Luk.  A unified codesign run-time environment 
for the ultrasonic reconfigurable computer. In Field Programmable Logic and 
Application, pages 396–405. Springer Berlin Heidelberg, 2003.  
[192] T. Wiangtong, P.Y.K. Cheung, and W. Luk. Multitasking in hardware-software codesign 
for reconfigurable computer. In Circuits and Systems, 2003. ISCAS’03. Proceedings of 
the 2003 International Symposium on, volume 5, pages V–621. IEEE, 2003.  
[193] T.  Wiangtong, P.Y.K. Cheung, and W. Luk. Cluster-driven hardware/software 
partitioning and scheduling approach for a reconfigurable computer system. In Field 
Programmable Logic and Application, pages 1071–1074. Springer Berlin Heidelberg, 
2003.  
[194] N. Sidahoao, G. Constantinides, P.Y.K. Cheung. Architectures for function evaluation 
on FPGAs. In Circuits and Systems, 2003. ISCAS’03. Proceedings of the 2003 
International Symposium on, volume 2, pages II–804. IEEE, 2003.  
    20	
[195] N. P.  Sedcole, P.Y.K. Cheung, G. A. Constantinides, and W. Luk. A reconfigurable 
platform for real-time embedded video image processing. In Field Programmable Logic 
and Application, pages 606–615. Springer Berlin Heidelberg, 2003.  
[196] A.Royal and P.Y.K. Cheung. Globally asynchronous locally synchronous FPGA 
architectures. In Field Programmable Logic and Application, pages 355–364. Springer 
Berlin Heidelberg, 2003.  
[197] T. Lee, A. Derbyshire, W. Luk, and P.Y.K. Cheung. High-level language extensions for 
run-time reconfigurable systems. In Field-Programmable Technology (FPT), 2003. 
Proceedings. 2003 IEEE International Conference on, pages 144–151. IEEE, 2003.  
[198] D.-U. Lee, W. Luk, J. Villasenor, and P.Y.K. Cheung. Non-uniform segmentation for 
hardware function evaluation. In Field Programmable Logic and Application, pages 796–
807. Springer Berlin Heidelberg, 2003.  
[199] D.-U. Lee, W. Luk, J. Villasenor, and P.Y.K. Cheung. Hierarchical segmentation 
schemes for function evaluation. In Field-Programmable Technology (FPT), 2003. 
Proceedings. 2003 IEEE International Conference on, pages 92–99. IEEE, 2003.  
[200] D.-U. Lee, W. Luk, J. Villasenor, and P.Y.K. Cheung. A hardware gaussian noise 
generator for channel code evaluation. In Field-Programmable Custom Computing 
Machines, 2003. FCCM 2003. 11th Annual IEEE Symposium on, pages 69–78. IEEE, 
2003.  
[201] S. Hettiaratchi and P.Y.K. Cheung. Mesh partitioning approach to energy efficient data 
layout. In Design, Automation and Test in Europe Conference (DATE), 2003, pages 
1076–1081. IEEE, 2003.  
[202] G. A. Constantinides, P.Y.K. Cheung, and W. Luk. Synthesis of saturation arithmetic 
architectures. ACM Transactions on Design Automation of Electronic Systems 
(TODAES), 8(3):334–354, 2003.  
[203] G. Constantinides, P.Y.K. Cheung, W. Luk. Wordlength optimization for linear digital 
signal processing. Computer-Aided Design of Integrated Circuits and Systems, IEEE 
Transactions on, 22(10):1432–1442, 2003.  
[204] T. Wiangtong, P.Y.K. Cheung, and W. Luk. Tabu search with intensification strategy for 
functional partitioning in hardware-software codesign. In Field-Programmable Custom 
Computing Machines, 2002. Proceedings. 10th Annual IEEE Symposium on, pages 297–
298. IEEE, 2002.  
[205] T. Wiangtong, P.Y.K. Cheung, and W. Luk. Comparing three heuristic search methods 
for functional partitioning in hardware–software codesign. ACM Transaction on Design 
Automation for Embedded Systems (TODAES), 6(4):425–449, 2002.  
[206] S. P. Seng, K. V. Palem, R. M. Rabbah, W.-F. Wong, W. Luk, and P.Y.K. Cheung. PD-
XML: extensible markup language for processor description. In Field-Programmable 
Technology, 2002.(FPT). Proceedings. 2002 IEEE International Conference on, pages 
437–440. IEEE, 2002.  
[207] S. P. Seng, W. Luk, and P.Y.K. Cheung. Customising flexible instruction processors: A 
tutorial introduction. Proc. Second International Workshop on Systems, Architectures, 
Modeling, and Simulation, 2002.  
[208] S. P. Seng, W. Luk, and P.Y.K. Cheung. Run-time adaptive flexible instruction 
processors. In Field-Programmable Logic and Applications: Reconfigurable Computing 
Is Going Mainstream, pages 545–555. Springer Berlin Heidelberg, 2002.  
[209] W. J. Melis, P.Y.K. Cheung, and W. Luk. Image registration of real-time video data 
using the sonic reconfigurable computer platform. In Field-Programmable Custom 
Computing Machines, 2002. Proceedings. 10th Annual IEEE Symposium  
    21	
[210] W. J. Melis, P.Y.K. Cheung, and W. Luk. Image registration of real-time broadcast 
video using the ultrasonic reconfigurable computer. In Field-Programmable Logic and 
Applications: Reconfigurable Computing Is Going Mainstream, pages 1148–1151. 
Springer Berlin Heidelberg, 2002.  
[211] D. Lee, T. Lee, W. Luk, and P.Y.K. Cheung. Incremental programming for 
reconfigurable engines. In Field-Programmable Technology, 2002.(FPT). Proceedings. 
2002 IEEE International Conference on, pages 411–415. IEEE, 2002.  
[212] S. Hettiaratchi, P.Y.K. Cheung, and T. J. Clarke.  Energy efficient address assignment 
through minimized memory row switching. In Computer Aided Design, 2002. ICCAD 
2002. IEEE/ACM International Conference on, pages 577–581. IEEE, 2002.  
[213] S. Hettiaratchi, P.Y.K. Cheung, and T. Clarke.  Performance-area trade-off of address 
generators for address decoder-decoupled memory. In Proceedings of the conference on 
Design, automation and test in Europe, page 902. IEEE Computer Society, 2002.  
[214] J. Gause, P.Y.K. Cheung, and W. Luk. Reconfigurable shape-adaptive template 
matching architectures. In Field-Programmable Custom Computing Machines, 2002. 
Proceedings. 10th Annual IEEE Symposium on, pages 98–107. IEEE, 2002.  
[215] A.A. Gaffar, O. Mencer, W. Luk, P.Y.K. Cheung, and N. Shirazi. Floating-point 
bitwidth analysis via automatic differentiation. In Field-Programmable Technology, 
2002.(FPT). Proceedings. 2002 IEEE International Conference on, pages 158–165. IEEE, 
2002.  
[216] A.A. Gaffar, W. Luk, P.Y.K. Cheung, N. Shirazi, and J. Hwang. Automating 
customisation of floating-point designs. In Field-Programmable Logic and Applications: 
Reconfigurable Computing Is Going Mainstream, pages 523–533. Springer Berlin 
Heidelberg, 2002.  
[217] A.A. Gaffar, W. Luk, P.Y.K. Cheung, and N. Shirazi. Customising floating-point 
designs. In Field-Programmable Custom Computing Machines, 2002. Proceedings. 10th 
Annual IEEE Symposium on, pages 315–317. IEEE, 2002.  
[218] G. Constantinides, P.Y.K. Cheung, W. Luk. Optimum wordlength allocation. In Field-
Programmable Custom Computing Machines, 2002. Proceedings. 10th Annual IEEE 
Symposium on, pages 219–228. IEEE, 2002.  
[219] P.Y.K. Cheung and S. D. Haynes. Reconfigurable multiplier array, Apr. 9 2002. US 
Patent 6,369,610.  
[220] T. Wiangtong, P.Y.K. Cheung, and W. Luk. Comparing three heuristic methods for 
hardware-software partitioning and scheduling. International Symposium on 
Communications and information Technology (ISCIT), pages 243–246, 2001.  
[221] C. Visavakul, P.Y.K.Cheung, and W. Luk. A digit-serial structure for reconfigurable 
multipliers. In Field-Programmable Logic and Applications, pages 565–573. Springer 
Berlin Heidelberg, 2001.  
[222] K. Tiew, A. Payne, and P.Y.K. Cheung. Mash delta-sigma modulators for wideband and 
multi-standard applications. In Circuits and Systems, 2001. ISCAS 2001. The 2001 IEEE 
International Symposium on, volume 4, pages 778–781. IEEE, 2001.  
[223] K. Tiew, A. Payne, and P.Y.K. Cheung. Novel implementation of 6th order mash delta-
sigma modulators for wideband and multi-standard applications. In VLSI Technology, 
Systems, and Applications, 2001. Proceedings of Technical Papers. 2001 International 
Symposium on, pages 129–132. IEEE, 2001.  
[224] N. Shirazi, D. Benyamin, W. Luk, P.Y.K. Cheung, and S. Guo. Quantitative analysis of 
FPGA-based database searching. Journal of VLSI signal processing systems for signal, 
image and video technology, 28(1-2):85–96, 2001.  
    22	
[225] J. Gause, C. Reuter, H. Kropp, P.Y.K. Cheung, and W. Luk. The effect of FPGA 
granularity on video codec implementations. In Field-Programmable Custom Computing 
Machines, 2001. FCCM’01. The 9th Annual IEEE Symposium on, pages 287–288. IEEE, 
2001.  
[226] J. T. de Sousa and P.  Y. Cheung. Structural interconnect diagnosis. Boundary-Scan 
Interconnect Diagnosis, pages 61–90, 2001.  
[227] G. Constantinides, P.Y.K. Cheung, W. Luk, et al. The multiple wordlength paradigm. In 
Field-Programmable Custom Computing Machines, 2001. FCCM’01. The 9th Annual 
IEEE Symposium on, pages 51–60. IEEE, 2001.  
[228] G. Constantinides, P.Y.K. Cheung, and W. Luk. Heuristic datapath alloca•tion for 
multiple wordlength systems. In Proceedings of the conference on Design, automation 
and test in Europe, (DATE) 2001, pages 791–797. IEEE Press, 2001.  
[229] N. Shirazi, W. Luk, and P.Y.K. Cheung. Framework and tools for runtime 
reconfigurable designs. IEE Proceedings-Computers and Digital Techniques, 147(3):147–
152, 2000.  
[230] S. P. Seng, W. Luk, and P.Y.K. Cheung. Flexible instruction processors. International 
Conference on Compilers, Architecture and Synthesis for Embedded Systems: 
Proceedings of the 2000 international conference on Compilers, architecture, and 
synthesis for embedded systems, 17(19):193– 200, 2000.  
[231] M. Scotti, Z. Malik, P.Y.K. Cheung, and J. Nelder. Optimisation of full-custom logic 
cells using response surface methodology. Electronics Letters, 36(1):14–16, 2000.  
[232] S. D. Haynes, J. Stone, P.Y.K. Cheung, and W. Luk. Video image processing with the 
sonic architecture. Computer, 33(4):50–57, 2000.  
[233] J. Gause, P.Y.K. Cheung, and W. Luk. Static and dynamic reconfigurable designs for a 
2D shape-adaptive DCT. In Field-Programmable Logic and Applications: The Roadmap 
to Reconfigurable Computing, pages 96–105. Springer Berlin Heidelberg, 2000.  
[234] G. A. Constantinides, P.Y.K. Cheung, and W. Luk. Roundoff-noise shaping in filter 
design. In Circuits and Systems, 2000. Proceedings. ISCAS 2000 Geneva. The 2000 
IEEE International Symposium on, volume 4, pages 57–60. IEEE, 2000.  
[235] G. A. Constantinides, P.Y.K. Cheung, and W. Luk. Multiple-wordlength resource 
binding. In Field-Programmable Logic and Applications: The Roadmap to 
Reconfigurable Computing, pages 646–655. Springer Berlin Heidelberg, 2000.  
[236] G. Constantinides, P.Y.K. Cheung, W. Luk. Multiple precision for resource 
minimization. In Field-Programmable Custom Computing Machines, (FCCM) 2000 IEEE 
Symposium on, pages 307–308. IEEE, 2000.  
[237] G. Constantinides, P.Y.K. Cheung, and W. Luk. Optimal datapath allocation for 
multiple-wordlength systems. Electronics Letters, 36(17):1508–1509, 2000.  
[238] N. Zhuang, M. Scotti, and P.Y.K. Cheung.  PTM: Technology mapper for pass-transistor 
logic. Computers and Digital Techniques, IEE Proceedings, 146(1):13–19, 1999.  
[239] N. Shirazi, W. Luk, D. Benyamin, and P.Y.K. Cheung. Quantitative analysis of run-time 
reconfigurable database search. In Field Programmable Logic and Applications, pages 
253–263. Springer Berlin Heidelberg, 1999.  
[240] W. Luk, T. Lee, J. Rice, N. Shirazi, and P.Y.K. Cheung. Reconfigurable computing for 
augmented reality. In Field-Programmable Custom Comput•ing Machines, 1999. 
FCCM’99. Proceedings. Seventh Annual IEEE Symposium on, pages 136–145. IEEE, 
1999.  
    23	
[241] S. D. Haynes, A. B. Ferrari, and P.Y.K. Cheung. Flexible reconfigurable multiplier 
blocks suitable for enhancing the architecture of FPGAs. In Custom Integrated Circuits 
(CICC), 1999. Proceedings of the IEEE 1999, pages 191–194. IEEE, 1999.  
[242] S. D. Haynes, P.Y.K. Cheung, W. Luk, and J. Stone. Sonic – a plug-in architecture for 
video processing. In Field Programmable Logic and Applications, pages 21–30. Springer 
Berlin Heidelberg, 1999.  
[243] G. A. Constantinides, P.Y.K.Cheung, and W. Luk. Truncation noise in fixed-point SFGs. 
Electronics Letters, 35(23):2012–2014, 1999.  
[244] G. A. Constantinides, P.Y.K. Cheung, and W. Luk. Synthia: Synthesis of interacting 
automata targeting lut-based fpgas. In Field Programmable Logic and Applications, pages 
323–332. Springer Berlin Heidelberg, 1999.  
[245] N. Zhuang and P.Y.K. Cheung. Logic synthesis for a fine-grain FPGA. IEE Proceedings-
Computers and Digital Techniques, 145(1):47–51, 1998.  
[246] N. Shirazi, W. Luk, and P.Y.K. Cheung. Run-time management of dynamically 
reconfigurable designs. In Field-Programmable Logic and Applications From FPGAs to 
Computing Paradigm, pages 59–68. Springer Berlin Heidelberg, 1998.  
[247] N. Shirazi, W. Luk, and P.Y.K. Cheung. Automating production of runtime 
reconfigurable designs. In FPGAs for Custom Computing Machines, 1998. Proceedings. 
IEEE Symposium on, pages 147–156. IEEE, 1998.  
[248] S. D. Haynes and P.Y.K. Cheung. A reconfigurable multiplier array for video image 
processing tasks, suitable for embedding in an FPGA structure. In FPGAs for Custom 
Computing Machines, 1998. Proceedings. IEEE Symposium on, pages 226–234. IEEE, 
1998.  
[249] S. D. Haynes and P.Y.K. Cheung. Configurable multiplier blocks for embedding in 
FPGAs. Electronics Letters, 34(7):638–639, 1998.  
[250] S. D. Haynes, A. Ferrari, and P.Y.K. Cheung. Algorithms and structures for 
reconfigurable multiplication units. In SBCCI, page 13. IEEE, 1998.  
[251] R. Wyman and P.Y.K. Cheung. Bit-plane differential EZW for the compression of video 
for variable bandwidth channels. In Circuits and Systems, 1997. ISCAS’97., Proceedings 
of 1997 IEEE International Symposium on, volume 2, pages 1125–1128. IEEE, 1997.  
[252] P. A. Molina and P.Y.K. Cheung. A quasi delay-insensitive bus proposal for 
asynchronous systems. In async, page 126. IEEE, 1997.  
[253] P. I. Mackinlay, P.Y.K. Cheung, W. Luk, and R. Sandiford. Riley-2: A flexible platform 
for codesign and dynamic reconfigurable computing research. In Field-Programmable 
Logic and Applications, pages 91–100. Springer Berlin Heidelberg, 1997.  
[254] W. Luk, N. Shirazi, S. Guo, and P.Y.K. Cheung. Pipeline morphing and virtual 
pipelines. In Field-Programmable Logic and Applications, pages 111–120. Springer 
Berlin Heidelberg, 1997.  
[255] W. Luk, N. Shirazi, and P.Y.K. Cheung. Compilation tools for run-time reconfigurable 
designs. In Field-Programmable Custom Computing Machines, 1997. Proceedings., The 
5th Annual IEEE Symposium on, pages 56–65. IEEE, 1997.  
[256] H. Demirel, T. Clarke, and P.Y.K. Cheung. Automatic segmentation of training set for 
facial feature detection. In Information, Communications and Signal Processing, 1997. 
ICICS., Proceedings of 1997 International Conference on, pages 984–988. IEEE, 1997.  
[257] J. T. de Sousa and P.Y.K. Cheung. Improved diagnosis of realistic interconnect shorts. In 
Proceedings of the 1997 European conference on Design and Test, page 501. IEEE 
Computer Society, 1997.  
    24	
[258] J. T. de Sousa and P.Y.K. Cheung. Diagnosis of boards for realistic interconnect shorts. 
Journal of Electronic Testing, 11(2):157–171, 1997.  
[259] P.Y.K. Cheung, M. v. Scotti, J. Blake, B. Brewer, R. Grisenthwaite, D. Hitchcox, and P. 
Shepherd. High speed arithmetic design using CPL and DPL logic. In Solid-State Circuits 
Conference, 1997. ESSCIRC’97. Proceedings of the 23rd European, pages 360–363. 
IEEE, 1997.  
[260] A.Chaudhuri, P.Y.K. Cheung, and W. Luk. A recon•gurable data-localised array for 
morphological algorithms. Field-Programmable Logic and Appli•cations, pages 344–353, 
1997.  
[261] D. S. Bormann and P.Y.K. Cheung. A novel globally asynchronous locally synchronous 
sliding window dft implementation. In VLSI: Integrated Systems on Silicon, pages 239–
250. Springer US, 1997.  
[262] D. S. Bormann and P.Y.K. Cheung. Asynchronous wrapper for heterogeneous systems. 
In Computer Design: VLSI in Computers and Processors, 1997. ICCD’97. Proceedings., 
1997 IEEE International Conference on, pages 307–314. IEEE, 1997.  
[263] G. Barry,  P.Y.K. Cheung, and C. Toumazou. Knowledge engineering for analogue 
design facilitation. In Circuits and Systems, 1997. ISCAS’97, Proceedings of 1997 IEEE 
International Symposium on, volume 3, pages 1496–1499. IEEE, 1997.  
[264] N. Zhuang, M. S. Benten, and P.Y.K. Cheung. Improved variable ordering of BDDs with 
novel genetic algorithm. In Circuits and Systems, 1996. ISCAS’96., Connecting the 
World., 1996 IEEE International Symposium on, volume 3, pages 414–417. IEEE, 1996.  
[265] J. Sousa, T. Shen, and P.Y.K. Cheung.  Realistic fault extraction for boards. In 
Proceedings of the 1996 European conference on Design and Test, page 612. IEEE 
Computer Society, 1996.  
[266] J. Sousa, T. Shen, and P.Y.K. Cheung. On structural diagnosis for interconnects. IEEE 
International Symposium on Circuits and Systems, pages IV–532, 1996.  
[267] P. Mackinlay, P.Y.K. Cheung and W. Luk. Hardware-software cosynthesis for the Riley 
system. IET Conference Proceedings, pages 10–10(1), January 1996.  
[268] D. Ostrowski and P.Y.K. Cheung. A fuzzy logic approach to handwriting recognition. In 
Fuzzy Logic, pages 299–314. Vieweg+ Teubner Verlag, 1996.  
[269] P.  Molina, P.Y.K. Cheung, D. S. Bormann. Quasi delay-insensitive bus for fully 
asynchronous systems. In Circuits and Systems, 1996. ISCAS’96., Connecting the 
World., 1996 IEEE International Symposium on, volume 4, pages 189–192. IEEE, 1996.  
[270] W. Luk, N. Shirazi, and P.Y.K. Cheung. Modelling and optimising run•time 
reconfigurable systems. In FPGAs for Custom Computing Machines, 1996. Proceedings. 
IEEE Symposium on, pages 167–176. IEEE, 1996.  
[271] T. Koskinen and P.Y.K. Cheung. Hierarchical tolerance analysis using statistical 
behavioral models. Computer-Aided Design of Integrated Circuits and Systems, IEEE 
Transactions on, 15(5):506–516, 1996.  
[272] H. Demirel, T. J. Clarke, and P.Y.K. Cheung. Adaptive automatic facial feature 
segmentation. In Automatic Face and Gesture Recognition, 1996. Proceedings of the 
Second International Conference on, pages 277–282. IEEE, 1996.  
[273] O. T. Albaharna, P.Y.K. Cheung, and T. J. Clarke. On the viability of fpga-based 
integrated coprocessors. In FPGAs for Custom Computing Machines, 1996. Proceedings. 
IEEE Symposium on, pages 206–215. IEEE, 1996.  
[274] P.Y.K. Cheung and W. Luk. A framework for developing hardware-software systems. 
IET Conference Proceedings, pages 6–6(1), January 1995.  
    25	
[275] S. Setty, C. Toumazou, P.Y.K. Cheung, and R. Spence. Ideas and concepts for the 
automation of analogue filters using qualitative reasoning. IET Conference Proceedings, 
pages 7–7(1), January 1995.  
[276] A.Sang-In and P.Y.K. Cheung. A method of representative fault selection in digital 
circuits for ATPG. In Circuits and Systems, 1994. ISCAS’94. 1994 IEEE International 
Symposium on, volume 1, pages 73–76. IEEE, 1994.  
[277] F. Mok, A.  Constantinides, and P.Y.K. Cheung. A flexible decimation fllter architecture 
for sigma-delta converters. In Oversampling Techniques and Sigma-Delta Modulation, 
IEE Colloquium on, pages 5–1. IET, 1994.  
[278] P.Y.K. Cheung, F. Mok, A.G. Constantinides. A VLSI decimation filter for sigma-delta 
A/D converters. IET Conference Proceedings, pages 36–41(5), January 1994.  
[279] O. T. Albaharna, P.Y.K. Cheung, and T. J. Clarke. Area and time limitations of FPGA-
based virtual hardware. In Computer Design: VLSI in Computers and Processors, 1994. 
ICCD’94. Proceedings., IEEE International Conference on, pages 184–189. IEEE, 1994.  
[280] O. Albaharna, P.Y.K. Cheung, and T. Clarke. Virtual hardware and the limits of 
computational speed-up. In Circuits and Systems, 1994. ISCAS’94., 1994 IEEE 
International Symposium on, volume 4, pages 159–162. IEEE, 1994.  
[281] S. Ahmed, P.Y.K. Cheung, and P. Collins. A model-based approach to analog fault 
diagnosis using techniques from optimisation. In European Design and Test Conference, 
1994. EDAC, The European Conference on Design Automation. ETC European Test 
Conference. EUROASIC, The European Event in ASIC Design, Proceedings., page 665. 
IEEE, 1994.  
[282] S. Ahmed and P.Y.K. Cheung. Analog fault diagnosis-a practical approach. In Circuits 
and Systems, 1994. ISCAS’94., 1994 IEEE International Symposium on, volume 1, pages 
351–354. IEEE, 1994.  
[283] D. Ostrowski, P.Y.K. Cheung, and K. Roubaud. An outline of the intuitive design of 
fuzzy logic and its efficient implementation. In Fuzzy Systems, 1993., Second IEEE 
International Conference on, pages 184–189. IEEE, 1993.  
[284] F. K. Mok, P.Y.K. Cheung, S. Quinlan, and J. Strange. STRETCH:  Self testing 
reliability evaluation chip. In Custom Integrated Circuits Conference, 1993., Proceedings 
of the IEEE 1993, pages 30–4. IEEE, 1993.  
[285] T. Koskinen and P.Y.K. Cheung. Statistical and behavioural modelling of analogue 
integrated circuits. IEE Proceedings G (Circuits, Devices and Systems), 140(3):171–176, 
1993.  
[286] P.Y.K. Cheung.  A new schematic-driven floorplanning algorithm for analog cell layout. 
In Circuits and Systems, 1993., ISCAS’93, 1993 IEEE International Symposium on, 
pages 1770–1773. IEEE, 1993.  
[287] M. J. Lee, S. Wright, C. Judge, P.Y.K. Cheung, and R. Wyman. High-performance 
CDSE TFT’s as drive circuits in am liquid crystal cells. In SPIE/IS&T 1992 Symposium 
on Electronic Imaging: Science and Technology, pages 290–296. International Society 
for Optics and Photonics, 1992.  
[288] T. Koskinen and P.Y.K. Cheung. Tolerance analysis using behavioral models. In Custom 
Integrated Circuits Conference, 1992. Proceedings of the IEEE 1992, pages 3–4. IEEE, 
1992.  
[289] N. Gohar, P.Y.K. Cheung, and C. K. Pun. RACHANA: An integrated placement and 
routing approach to CMOS analog cells. In Circuits and Sys•tems, 1992. ISCAS’92. 
Proceedings., 1992 IEEE International Symposium on, volume 6, pages 2981–2984. 
IEEE, 1992.  
    26	
[290] P.Y.K. Cheung, A. Ferrari, P. De Wilde, and G. Benyon-Tinker.  Neural network 
processor: a vehicle for teaching system design. IEE Proceedings G (Circuits, Devices 
and Systems), 139(2):244–248, 1992.  
[291] B. Rogel-Favila, A. Wakeling, and P.Y.K. Cheung. Model-based fault diagnosis of 
sequential circuits and its acceleration. In Proceedings of the conference on European 
design automation, pages 224–229. IEEE Computer Society Press, 1991.  
[292] M. Lee, S. Wright, C. Judge, and P.Y.K. Cheung. High mobility cadmium selenide 
transistors. In Display Research Conference, 1991., Conference Record of the 1991 
International, pages 211–214. IEEE, 1991.  
[293] M. Lee, S. Wright, C. Judge, and P.Y.K. Cheung. Display research conference. 
International Conference Record, 211, 1991.  
[294] T. Koskinen and P.Y.K. Cheung. Modelling behaviour and tolerances in analogue cells. 
In Custom Integrated Circuits Conference, 1991., Proceedings of the IEEE 1991, pages 
8–7. IEEE, 1991.  
[295] V. Fuentes-Sanchez and P.Y.K. Cheung. A TAG coprocessor architecture for symbolic 
languages. In Computer Design: VLSI in Computers and Processors, 1991. ICCD’91. 
Proceedings, 1991 IEEE International Conference on, pages 370–373. IEEE, 1991.  
[296] P.Y.K. Cheung and E. S. See. A comparison of decimation filter architectures for sigma-
delta a/d converters. In Circuits and Systems, 1991., IEEE International Sympoisum on 
(ISCAS) , pages 1637–1640. IEEE, 1991.  
[297] P.Y.K. Cheung and V. Fuentes-Sanchez. A TAG coprocessor for RISC architectures. In 
RISC Architectures and Applications, IEE Colloquium on, pages 5–1. IET, 1991.  
[298] C. Makris, C. Berrah, X. Xiao, M. Singha, A. Ilumoka, J. Stone, C. Toumazou, P.Y.K. 
Cheung, and R. Spence. Chipaide: A new approach to analogue integrated circuit design. 
In Analogue VLSI, IEE Colloquium on, pages 1–1. IET, 1990.  
[299] C. Toumazou, C. Makris, C. Berrah, and P.Y.K. Cheung. A methodology for automated 
generation of analogue integrated circuits. In Circuit Theory and Design, 1989., European 
Conference on (ECCTD), pages 624–628. IET, 1989.  
[300] C. Toumazou, F. Lidgey, and P.Y.K. Cheung.  Current-mode analogue signal processing 
circuits-a review of recent developments. In Circuits and Systems, 1989., IEEE 
International Symposium on (ISCAS), pages 1572–1575. IEEE, 1989.  
[301] R. Spence, C. Toumazou, P.Y.K. Cheung, C. Makris, M. Singha, and J. Stone. 
Approaches to analogue IC synthesis. In VLSI Analogue Design, IEE Colloquium on, 
pages 1–1. IET, 1989.  
[302] B. Rogel-Favila and P.Y.K. Cheung. Combinational and sequential circuit fault 
diagnosis using ai techniques. In Test Conference, 1989. Proceedings. Meeting the Tests 
of Time., International, page 950. IEEE, 1989.  
[303] B. Rogel-Favila and P.  Y. K. Cheung. Circuit representation and diagnosis using prolog. 
In Circuits and Systems, 1989., IEEE International Symposium on (ISCAS), pages 957–
960. IEEE, 1989.  
[304] B. Rogel-Favila and P.Y.K. Cheung. Deep reasoning approach to sequential circuit fault 
diagnosis. In Circuit Theory and Design, 1989., European Conference on, pages 665–669. 
IET, 1989. 
    27	
Teaching and Research Supervision 
 
Courses Taught 
1st year course on “Digital Electronics” 
2nd year course on  “Computer architecture and computer engineering” 
2nd year course on “Signals and Linear Systems” 
2nd year course on “Software Engineering: Algorithms and Data Structures” 
2nd year course on “Digital Electronics II” 
3rd year course on “Real-time Digital Signal Processing” 
3rd year course on “Digital System Design” 
3rd year/MSc course on “VHDL and Logic Synthesis” 
4th year /MSc course on Digital Integrated Circuit Design 
 
Post Experience Course on “DSP: Algorithm and Implementation” 
 
 
Current and Past Research Associate Supervision (17) 
 
Name Project Title Source of Support Date 
Dr J Davis Program Grant: PRiME EPSRC May 15 – now 
Dr J Levine Program Grant: PRiME EPSRC Oct 14 - now 
Dr E Stott Degradations in FPGAs EPSRC Apr 11 – Sep 15  
Dr J Wong Variation-Adaptive Design in FPGAs EPSRC Jan 10 – Dec 13 
Dr S. Chaudhuri Process variability in FPGAs EPSRC May 10 – Dec 10 
Dr D. Jones Reverse engineering of human visual systems EPSRC July 09 – June 10 
Dr A. Smith Domain Specific FPGAArchitectural Exploration  EPSRC Oct 06 – Aug 10 
Dr P. Sedcole Process variability in FPGAs EPSRC Feb 06 – Apr 09 
Dr C. Bouganis Reverse engineering human visual systems Research Councils Oct. 03 – Mar. 07 
Paul Mortier FPGA for communications Motorola Sept., 00 – Aug02 
Marcus v Scotti Architecture of video compression LSI Logic Oct., 99 – 
Sept 01 
Chris Smith Reconfigurable computing with DSP processors BlueWave Jul, 98 – Jun 
00 
G. Denkinson Design kit for IC Station Airflow Jun, 98 – 
May 00 
K. Zhang Reconfigurable computing with DSP processors BlueWave May., 97 – Jun 98 
Dr Matthew Woolf Behavioural Modelling Sony Semicond. Apr., 96 – Mar 98 
Guy Barry Qualitative Reasoning for analogue IC design EPSRC Dec., 96 – Oct 97 
Dr Tao Shen Design kit for IC Station Mentor Graphics Apr., 96 – Mar 
98  
 
 
Past PhD Research Students supervised (53) 
 
Name Thesis Title (Year) 
B. Rogel-Favila Model-based fault diagnosis of digital circuits (1991) 
T. Koskinen Tolerance analysis of integrated circuits using statistical behavioural models (1992) 
N. Gohar Schematic-driven analogue layout automation with iterative improvements (1993)  
S. Choomchuay Algorithms and architectures for Reed-Solomon decoding (1994) 
A. Sang-In Novel algorithms for automatic test pattern generation in digital circuits (1994) 
S. Ahmed Analog fault diagnosis using optimisation techniques (1994) 
M. Alwan Planning and navigation of an autonomous wheelchair (1997) 
P. Molina The design of a delay-insensitive bus architecture using handshake circuits (1997) 
N. Zhuang Logic synthesis and technology mapping using genetic algorithms (1998) 
D. Ostrowski Training fuzzy rulebases and recognising handwriting (1998)  
J. de Sousa Diagnosis of interconnect defects in electronic assemblies (1998)  
H. Desmier Training set analysis for image-based facial feature detection (1998) 
N. Kharma An incremental machine learning mechanism for robotic applications (1999) 
R. Wyman Bit-plane differential EZW for the compression of video bit-rate channels (1999)  
    28	
N. Shirazi Methods and tools for implementing run-time reconfigurable FPGA designs (1999)  
S. Haynes Reconfigurable architectures for video image processing (2000) 
R. Sandiford Hardware compilation based on communicating processess (2001) 
G. Constantinides High level synthesis and word length optimization of DSP systems (2002) 
K.T. Tiew Multi-stage noise shaping (MASH) delta-sigma modulators for wideband and  
 multi-standard applications (2002) 
J. Gause Reconfigurable computing for shape-adaptive video processing (2002)  
O. Albaharna Area-time efficiency of FPGA-based computation (2003) 
S. Hettiaratchi Power optimized memory access in high-level synthesis (2003) 
S.  P. Seng Adaptive flexible instruction processors (2004) 
T. Wiangtong Hardware/software partitioning and scheduling for reconfigurable systems (2004) 
C. Bouganis Multiple light source detection with application to face recognition (2004) 
D-U Lee Hardware designs for function evaluation and LDPC coding (2004) 
C. Sim Rational number techniques for high performance data converters (2005)  
W. Melis Memory architectures for reconfigurable systems (2005)  
A. Royal Globally asynchronous locally synchronous interconnect for FPGA (2005) 
N. Sidahao Multiple restricted multiplication for optimized FPGA-based arithmetic (2005)  
A. Gaffer Bit-width optimisation for arithmetic hardware (2005) 
A. Smith Heterogenous reconfigurable architecture design : an optimisation approach (2006) 
G. Morrison Guided synthesis for heterogeneous reconfigurable architectures (2006) 
P. Sedcole Reconfigurable platform-based design in FPGAs for video image processing (2006) 
N. Campregher interconnect yield analysis and fault tolerance for field programmable gate arrays 
(2007)  
T. Rissa Virtual hardware platforms: raising the design abstraction of reconfigurable systems 
(2007) 
C.C.R. Cheung Customisable arithmetic hardware designs (2007) 
B. Cope Video Processing Acceleration using Reconfigurable Logic and Graphic Processors 
(2008) 
S. Fahmy Hardware acceleration of the trace transform for vision applications (2008) 
Y. Liu Temporal unpredictability detection of real-time video sequence (2008)  
S. Arifin Online timing slack measurement and its application in Field-Programmable Gate 
Arrays (2008) 
Q. Liu Data Reuse and Parallelism in Hardware Compilation (2008) 
C.T. Ewe A New Number Representation for Hardware Implementation of DSP Algorithms 
(2008) 
K. Turkington Datapath and Memory Co-optimisation for FPGA-based Computation (2009) 
M. Angelopoulou Multi-resolution video processing with FPGAs (2009)  
T. Mak On-FPGA Communication Architectures (2010)  
T. Becker Low power FPGA (2011) 
J. Wong On-FPGA self-test and self-characterisation (2011)  
E. Stott Degradation Analysis and Mitigation in FPGAs (2011) 
A Powell High-Level Performance Estimation Framework for FPGA-based Soft Processors 
(2013) 
Z. Guan Variation-aware and adaptive timing optimisation methods in programmable devices 
(2013) 
J. Liu Context-based Image Acquisition (2014) 
J. Levine Online timing slack measurement and its application in Field-Programmable Gate 
Arrays (2014) 
 
 
Current PhD Research Students supervised (4) 
 
Name Thesis Title (Year to complete) 
J. Davis Low-overhead Fault-tolerant Logic for FPGAs (2016) 
A. Bean Acceleration of graphs through DMA (2016) 
J. Su Convolutional Neural Networking Acceleration using FPGAs (2017) 
Q. Li Machine Learning algorithms in FPGAs (2018) 
 
 
    29	
Professional, administrative and other activities (Selected) 
Plenary and Keynotes  
• “Process Variability & Degradation: New Frontier for Configurable”, ARC 2010, Bangkok 
Thailand. 
• “Applications and Tools for High Performance Reconfigurable Systems”, FPGA High 
Performance Computer Aliance, 25 May 2005, Edinburgh. 
• "Future Trends in FPGA and Reconfigurable Systems", Norchip, Oslo, Norway, Nov 2004. 
• "The Future of Reconfigurable Computing", DATE 2005 Workshop, March 2005. 
Panel Membership 
• RAE 2006 Hong Kong, EE Panel Member 
• Academy of Finland, Electrical & Electronic Engineering Panel 2006, 2007, 2012 
• Science Foundation Ireland Executive Panel Member 2009 - 2012 
• Science Foundation Ireland Engineering Panel Member 2004 - 2007 
• EPSRC Review College Member, 2003 - now 
Editorship 
• Associate Editor, IEEE Transaction on Circuits and Systems – II, 2008 – 2011 
• Guest Editor, ACM Transactions on Reconfigurable Technologies and Systems, Special Issue on 
“FPGA 2009”, 2009 
• Guest Editor, IEEE Transaction on Computers, Special Issue on "Field Programmable Logic and 
Applications", 2004 
• Associate Editor, ACM Transaction on Design Automation and Electronic Design, 2005-now 
• Associate Editor of IEEE Transaction on Circuits and Systems, 1997-1999 
Conference Committee 
• Technical Chair/General Chair, ACM Symposium on FPGA, 2009 - 2010 
• DATE Friday Workshop Chair and Steering Committee Member, 2007 - 2009 
• Technical Programme Chair, International Conference on Field Programmable Logic and 
Applications (FPL) 2002 - now 
• Steering Committee, International Conference on Field Programmable Logic and Applications 
(FPL)(2004 onwards) 
• Technical Program Committee for DATE 2001-2007 on Architectural Synthesis Track (B5) 
• Technical Program Committee for IEEE Conference on Field Programmable Technology (FPT) 
2002-now 
• Technical Program Committee, Reconfigurable Architecture Workshop, RAW 2003-2007 
• Technical Program Committee, International Conference on Architecture for Reconfigurable 
Computers, ARC 2005-7 
• Co-chair, 7th & 25th International Workshop on Field-Programmable Logic and Applications 
(FPL),1997 and 2015 
• Vice-chair & Technical Programme Committee, IEEE International Symposium on Circuit and 
Systems, 1994 
Patent Litigation Consultancy 
• Expert Consultant for Bird & Bird on “Nokia vs QualComm” patent litigation, 2008 
• Expert Consultant for Freshfields on "Philips Vs Sandisk" patent litigation, 2006 
• Expert Consultant for Jones Day on "TI vs Samsung" patent litigation, 2006 
• Expert Consultant for Freshfields on "Philips vs Salton" patent litigation, 2004 
• Expert consultant for Bristows on "Intel vs VIA" patent litigations, 2000-2003 
• Expert consultant for Bristows on "Rambus vs Hyundai" patent litigation, 2000 
• Expert consultant for Bristows on "TI vs Hyundai" patent litigation, 1999 
    30	
Due Diligence for Venture Capitals 
• Consultant for Celoxica (due diligence) 2001 
• Consultant for Baker Venture NY (due diligence), 2000 
External Examination Duties 
• External Examiner, MEng, BEng, Dept. of CS, University of Manchester, 2015-2018 
• External Examiner, MEng, BEng, Dept. of EEE, University of Leeds, 2013-2016 
• External Examiner, MEng, BEng, Dept. of Electronics, University of York, 2011-2015 
• External Examiner, MEng, BEng, Dept. of ECE, University of Southampton, 2008-2011 
• External Examiner, MEng, BEng, Dept. of EEE, Queen’s University Belfast 2006- 2009 
• External Examiner, MEng, Univ. of Edinburgh, 2005-2008 
• External Examiner, MEng, BEng, Dept. of EE, National University of Ireland, Galway, 2005-
2008 
• External Examiner, MSc, MEng, BEng, Dept. of EEE, Univ. of Newcastle 2002- 2005 
• External Examiner, MSc course in microelectronics, Univ. of Westminster 1999-2001 
• External Examiner, Electronics Sub-board, Royal Holloway and Bedford College, Univ. of 
London, 1996-1999 
• External Examiner, BEng (part-time) and MSc. courses in Electronic Engineering, University of 
Portsmouth 1995-1999 
I have acted as PhD examiners for: University College London, Kings College London, Cambridge 
University, Oxford University, UMIST, University of Manchester, University of Westminister, 
University of South Bank, University of Birmingham, EHT Zurich, Chinese University of Hong Kong 
and University of Southampton. 
 
Major Research Grants Held (Past 15 years) 
 
Source of 
funding 
Title of Project Duration 
(months) 
Start 
Date 
Total value Names of other 
investigators 
EPSRC Programme Grant: PRiME 60 
 
May 12 £5,644,636 Al-Hashimi (PI) 
and many others 
EPSRC Custom Computing 60 Oct 10 £1,259,142 Luk (PI) and 5 
others 
EPSRC 
UKRC 
Variation-Adaptive Design 
Reverse Engineering of 
Human Visual Systems 
42 
60 
Mar 10 
May 07 
£455,737 
£1,000,000 
Constantinides 
Petrou (PI), and 3 
other partners 
EPSRC Reconfigurable Architecture 
Design: An Optimization 
Approach 
42 Oct 06 £450,000 Constantinides 
(PI) 
EPSRC Reconfigurable Architecture 
for Floating Point Apps 
36 Jan 06 £450,000 Luk (PI), Leong, 
Mencer 
EPSRC PLATFORM: Field-
Programmable Logic for 
Custom Comuting 
60 Jun 05 £422,827 Luk (PI), Kelly, 
Mencer, 
Constantinides 
EPSRC Basic Technology – 
Artificial vision systems 
48 Oct 02 £2,000,000 Many partners, PI 
at Surrey. 
Panasonic Architecture for DAB 24 Apr 01 £170,000 Brookes(PI), etc. 
LSI Logic Architecture for Image 
Compression 
36 Oct 99 US$800,000 Brookes, Clarke 
    31	
	
	
	
	
	
	
	
	
List	of	Submitted	Publications	
	
    32	
List	of	Submitted	Publications	
The	 following	 submitted	 publications	 are	 arranged	 in	 three	 groups	 corresponding	 to	
the	author’s	three	aspects	of	research	in	reconfigurable	systems.		The	groups	are:	the	
design	of	 reconfigurable	 systems;	 variability,	 reliability	 and	 yield;	 and	applications	of	
reconfigurable	systems.			
Co-authors	 are	 labelled	 with	 a	 postfix	 representing:	 research	 student	 (RS),	 research	
assistant	(RA)	or	research	collaborator	(RC).	
Group	1:		The	Design	of	Reconfigurable	Systems	
[1] T.	Wiangtong	 (RS),	P.Y.K.	 Cheung,	 and	W.	 Luk	 (RC).	 Comparing	 three	 heuristic	
search	 methods	 for	 functional	 partitioning	 in	 hardware–software	 codesign.	
Design	 Automation	 for	 Embedded	 Systems,	 6(4):425–449.	 Kluwer	 Academic	
Publishers,	2002.		
[2] G.A.	 Constantinides	 (RS),	P.Y.K.	 Cheung,	W.	 Luk	 (RC).	Wordlength	optimization	
for	linear	digital	signal	processing.	Computer-Aided	Design	of	Integrated	Circuits	
and	Systems,	IEEE	Transactions	on,	22(10):1432–1442,	2003.		
[3] G.A.	 Constantinides	 (RS),	 P.Y.K.	 Cheung,	 and	 W.	 Luk	 (RC).	 Synthesis	 and	
optimization	 of	 DSP	 algorithms.	 Kluwer	 Academic	 Publishers,	 pp.	 1-164,	
ISBN:1402079303,	2004.	
[4] G.A.	 Constantinides	 (RS),	 P.Y.K.	 Cheung,	 W.	 Luk	 (RC).	 Optimum	 and	 heuristic	
synthesis	 of	 multiple	 word-length	 architectures.	 Very	 Large	 Scale	 Integration	
(VLSI)	Systems,	IEEE	Transactions	on,	13(1):39–57,	2005.		
[5] K.	 Turkington	 (RS),	 G.A.	 Constantinides	 (RC),	 K.	 Masselos	 (RC),	 P.Y.K.	 Cheung.	
Outer	 loop	 pipelining	 for	 application	 specific	 datapaths	 in	 FPGAs.	 Very	 Large	
Scale	Integration	(VLSI)	Systems,	IEEE	Transactions	on,	16(10):1268–1280,	2008.	
[6] Q.	 Liu	 (RS),	 G.A.	 Constantinides	 (RC),	 K.	 Masselo	 (RC)s,	 P.	 Y.	 K.	 Cheung.	
Combining	data	reuse	with	data-level	parallelization	for	FPGA-targeted	hardware	
compilation:	 A	Geometric	 Programming	 framework.	 Computer-Aided	Design	 of	
Integrated	Circuits	and	Systems,	IEEE	Transactions	on,	28(3):305–315,	2009.	
[7] A.M.	 Smith	 (RA),	 G.A.	 Constantinides	 (RC),	 P.Y.K.	 Cheung.	 FPGA	 architecture	
optimization	 using	 Geometric	 Programming.	 Computer-Aided	 Design	 of	
Integrated	Circuits	and	Systems,	IEEE	Transactions	on,	29(8):1163–1176,	2010.	
Group	2:		Variability,	Reliability	and	Yield	in	Reconfigurable	Systems	
[8] T.	 Koskinen	 (RS)	 and	 P.Y.K.	 Cheung.	 Hierarchical	 tolerance	 analysis	 using	
statistical	behavioral	models.	Computer-Aided	Design	of	 Integrated	Circuits	and	
Systems,	IEEE	Transactions	on,	15(5):506–516,	1996.		
    33	
[9] J.	 de	 Sousa	 (RC)	 and	 P.Y.K.	 Cheung.	 Boundary-scan	 Test	 and	 Diagnosis:	
Interconnect	Defects.	Kluwer	Academic	Publishers.	169	pages,	ISBN	0792373146,	
2001.	
[10] P.	Sedcole	(RA)	and	P.Y.K.	Cheung.	Parametric	yield	modeling	and	simulations	of	
FPGA	 circuits	 considering	 within-die	 delay	 variations.	 ACM	 Transactions	 on	
Reconfigurable	Technology	and	Systems	(TRETS),	1(2):1-10,	2008.	
[11] J.S.	 Wong	 (RS),	 P.	 	 Sedcole	 (RA),	 and	 P.Y.K.	 Cheung.	 Self-measurement	 of	
combinatorial	 circuit	 delays	 in	 FPGAs.	 ACM	 Transactions	 on	 Reconfigurable	
Technology	and	Systems	(TRETS),	2(2):10,	2009.	
[12] J.S.	Wong	 (RS)	 and	P.Y.K.	 Cheung.	 Timing	measurement	 platform	 for	 arbitrary	
black-box	 circuits	 based	 on	 transition	 probability.	 Very	 Large	 Scale	 Integration	
(VLSI)	Systems,	IEEE	Transactions	on,	21(12):2307–2320,	2013.	
[13] E.	Stott	(RS),	Z.	Guan	(RS),	J.	M.	Levine	(RS),	J.	S.	Wong	(RA),	and	P.Y.K.	Cheung.	
Variation	and	reliability	in	FPGAs.	IEEE	Design	&	Test,	6(30):50–59,	2013.	
Group	3:		Applications	of	Reconfigurable	Systems	
[14] S.D.	 Haynes	 (RS),	 J.	 Stone	 (RC),	 P.Y.K.	 Cheung,	 and	W.	 Luk	 (RC).	 Video	 image	
processing	with	the	Sonic	architecture.	IEEE	Computer,	33(4):50–57,	2000.	
[15] D.U.	Lee,	W.	Luk,	J.D.	Villasenor,	and	P.Y.K.	Cheung.	A	Gaussian	noise	generator	
for	hardware-based	simulations.	Computers,	IEEE	Transactions	on,	53(12):1523–
1534,	2004.	
[16] R.C.	Cheung,	N.J.B.	Telle,	W.	Luk,	and	P.	Y.	K.	Cheung.	Customizable	elliptic	curve	
cryptosystems.	Very	Large	Scale	Integration	(VLSI)	Systems,	IEEE	Transactions	on,	
13(9):1048–1059,	2005.	
[17] P.	Sedcole	 (RS),	P.Y.K.	Cheung,	G.A.	Constantinides	 (RC),	W.	Luk	 (RC).	Run-time	
integration	 of	 reconfigurable	 video	 processing	 systems.	 Very	 Large	 Scale	
Integration	(VLSI)	Systems,	IEEE	Transactions	on,	15(9):1003–1016,	2007.	
[18] M.E.	 Angelopoulou	 (RS),	 C.S.	 Bouganis	 (RC),	 P.Y.K.	 Cheung,	 and	 G.	 A.	
Constantinides	 (RC).	 Robust	 real-time	 super-resolution	 on	 FPGA	 and	 an	
application	 to	 video	 enhancement.	 ACM	 Transactions	 on	 Reconfigurable	
Technology	and	Systems	(TRETS),	2(4):22,	2009.	
[19] C.S.	 Bouganis	 (RA),	 I.	 Pournara	 (RC),	 and	 P.Y.K.	 Cheung.	 Exploration	 of	
heterogeneous	 FPGAs	 for	 mapping	 linear	 projection	 designs.	 Very	 Large	 Scale	
Integration	(VLSI)	Systems,	IEEE	Transactions	on,	18(3):436–449,	2010.	
[20] B.	 Cope	 (RS),	 P.Y.K.	 Cheung,	 W.	 Luk	 (RC),	 and	 L.	 Howes	 (RC).	 Performance	
comparison	 of	 graphics	 processors	 to	 reconfigurable	 logic:	 A	 case	 study.	
Computers,	IEEE	Transactions	on,	59(4):433–448,	2010.	
    34	
	
	
	
	
This	page	is	intentionally	left	blank.	
    35	
	
	
	
	
	
	
	
Statement	of	Contribution	
	
    36	
Statement	of	Contribution	
For	 the	 past	 twenty	 years	 the	 focus	 of	 my	 research	 has	 been	 in	 the	 field	 of	
reconfigurable	circuits	and	systems,	and	their	use	in	accelerating	real-time	embedded	
applications,	and	in	particular,	relating	to	the	use	of	Field	Programmable	Gate	Arrays	
(FPGAs).	 	 My	 contributions	 in	 the	 field	 have	 been	 in	 three	 related	 and	 coherent	
themes:	 the	 design	 of	 reconfigurable	 systems,	 the	 use	 of	 such	 systems	 in	 different	
application	domains,	and	the	use	of	configurability	to	mitigate	the	growing	challenges	
of	 process	 variability	 and	 reliability.	 	 Twenty	 publications	 have	 been	 selected	 to	
showcase	these	three	aspects	of	my	research.		They	have	all	received	rigorous	scrutiny	
from	independent	reviewers.		They	were	first	authored	by	either	research	students	or	
research	assistants	working	directly	under	my	guidance.			Throughout	my	career,	I	have	
followed	a	strict	principle	that	my	research	students	and	assistants	should	benefit	from	
first-authorship	 of	 the	 papers	 for	 which	 I	 initiated,	 guided	 and	 substantially	
contributed,	 in	order	to	help	 in	their	career	development.	 	Therefore	all	publications	
selected	here	are	conjoint.		None	of	the	works	have	been	previously	submitted	by	the	
author	for	any	other	degree.	
Group	1:	The	Design	of	Reconfigurable	Systems	
The	 introduction	of	Field	Programmable	Gate	Arrays	(FPGAs)	 in	the	 ‘80s	transformed	
the	way	 that	 digital	 integrated	 circuits	 are	 designed	 and	 used	 in	 electronic	 industry	
today.	 	 The	 configurability	nature	of	 FPGAs	makes	 them	accessible	 to	 a	much	 larger	
community	 of	 electronic	 design	 engineers	 in	 a	 way	 that	 is	 similar	 to	 the	 impact	 of	
programming	in	microprocessors.	Nevertheless,	reconfigurable	systems	also	bring	with	
them	new	challenges.		Unlike	writing	software,	designing	hardware	on	FPGAs	requires	
considerably	 higher	 level	 of	 skills,	 not	 only	 in	 programming,	 but	 also	 expertise	 in	
hardware	 and	 system	 level	 designs.	 	 The	 author’s	 work	 in	 advancing	 the	 design	 of	
reconfigurable	systems	has	been	mostly	in	the	area	of	design	tools	and	algorithms	that	
raise	 the	 level	 of	 abstraction	 a	 designer	 needs	 to	 operate	 while	 designing	 an	 FPGA	
based	circuit.	
    37	
Almost	all	FPGAs	are	used	in	systems	that	contain	one	or	more	microprocessors.		The	
issue	of	how	to	partition	a	problem	into	those	subtasks	that	can	best	be	done	on	the	
FPGA	 hardware	 and	 those	 that	 should	 remain	 as	 software	 algorithms	 on	 the	
microprocessor	 become	 increasingly	 important.	 The	 work	 in	 [1]	 addresses	 this	
“hardware-software	codesign”	problem	by	comparing	and	contrasting	three	heuristic	
algorithms,	 which	 are	 suitable	 to	 perform	 functional	 partitioning	 between	 FPGA	
hardware	 and	 microprocessor	 software.	 The	 three	 heuristic	 algorithms,	 genetic	
algorithm,	 simulated	annealing,	and	 tabu	search,	are	commonly	adopted	 in	 industry.	
This	work,	 for	 the	 first	 time,	 compares	 their	 efficiency	 and	 quality	 of	 solutions	with	
each	other	 in	a	rigorous	manner.	 	 It	concluded	that	tabu	search	 is	clearly	superior	to	
the	other	two	competing	algorithms	in	terms	of	quality	of	solutions	and	computation	
times.	This	work	has	attracted	a	good	level	of	attention	and	has	been	cited	for	more	
than	140	times	(Google	Scholar).	
Unlike	designing	for	microprocessors	where	data	are	represented	in	fixed	word-length	
(32	bits	or	64	bits),	 FPGAs	provide	complete	 flexibility	 to	 the	designer	 to	choose	 the	
optimum	 word-length	 at	 every	 stage	 of	 computation	 in	 the	 design.	 	 The	 research	
reported	 in	 the	 two	 journal	 papers	 [3]	 and	 [5],	 and	 the	 research	 monograph	 [4],	
addresses	 this	 problem	 for	 the	 class	 of	 linear	 digital	 signal	 processor	 (DSP)	 systems	
that	 are	 time-invariant.	 While	 using	 fewer	 number	 of	 bits	 at	 a	 given	 signal	 node	
reduces	the	hardware	resources	used,	leading	to	lower	power	consumption	and	higher	
performance,	it	could	also	introduce	higher	error	manifested	as	additional	noise.		By	a	
combination	of	back-propagating	the	word-length	from	the	output	through	the	system	
and	 a	 mixed	 integer	 linear	 programming	 formulation,	 [3]	 reports	 a	 method	 that	
optimizes	the	word-length	used	in	each	internal	variable	for	a	given	error	at	the	system	
output.	At	the	same	time,	the	resource	utilization	is	reduced	and	system	performance	
improved.		[5]	explores	the	issue	of	architectural	synthesis	when	using	multiple	word-
length	design.		This	work	addresses	the	scheduling,	allocation	and	binding	problem	in	a	
high-level	synthesis	context	where	word-lengths	of	internal	variables	(or	signals	nodes)	
are	also	part	of	 the	design	parameter	 set.	 	 	 [4]	 is	 a	 research	monograph	 that	brings	
together	the	body	of	work	in	multiple	word-length	optimization	under	one	volume.	It	
    38	
focuses	on	 techniques	 for	automating	 the	production	of	area-efficient	design	 from	a	
high-level	description,	while	satisfying	user-specified	constraints.		
This	body	of	work	 is	 important	 in	several	ways.	 	Firstly,	 it	 introduces	a	new	concept,	
which	we	 called	 “lossy	 synthesis”.	Automation	 tools	 for	digital	design	 conventionally	
takes	 a	 specification	 and	 synthesizes	 a	 solution	 that	 that	 is	 exactly	 the	 same	 as	
specified	 in	 the	design	description	 (i.e.	 “exact	 synthesis”).	 	 The	 trade-off	 parameters	
used	in	such	CAD	tools	are	energy	consumption,	performance	and	resource	utilization.		
By	 optimizing	 different	 word-length	 throughout	 the	 system,	 we	 introduce	 an	
additional	dimension	to	this	equation	–	error	or	noise.		By	allowing	the	designer	or	the	
CAD	 tools	 to	 choose	 the	 level	 of	 arithmetic	 errors	 that	 can	be	 tolerated	at	 different	
internal	nodes,	it	is	possible	to	produce	a	more	power-efficient,	faster	and	lower	cost	
design	that	still	meets	the	overall	specification.		This	work	has	also	been	extended	to	
floating-point	representations	and	non-linear	systems.	
The	impact	of	[2],	[3]	and	[4],	and	other	publications	by	the	author	and	his	team	in	the	
area	 of	 word-length	 optimization,	 is	 demonstrated	 by	 the	 high	 number	 of	 citations	
that	this	set	of	nine	journal/conference	papers	attracted	(over	600	citation	on	Google	
Scholar),	 and	 by	 the	 uptake	 of	 some	 of	 the	 ideas	 in	 industry	 (e.g.	 Xilinx’s	 System	
Generator	and	Maxeler’s	system).	
Another	 challenge	 and	 opportunity	 in	 FPGA-based	 designs	 is	 created	 by	 its	 plentiful	
supply	 of	 registers	 for	 pipelining	 purposes,	 and	 the	 highly	 distributed	 memory	
architecture	 found	 in	 modern	 FPGA	 devices.	 The	 work	 in	 [5]	 targets	 a	 different	
approach	to	loop	pipelining.		Whilst	most	conventional	hardware	compiler	tools	target	
the	innermost	loop	only,	this	work	explores	the	benefit	of	applying	loop	pipelining	to	
outer	 loops.	 	Taking	 into	account	memory	 throughput	constraints	and	highly	 custom	
loop	 controlling	 circuits	 possible	 in	 FPGAs,	 three	 to	 seven	 times	 of	 performance	
improvements	over	existing	techniques	has	been	achieved.	This	work	fits	perfectly	into	
the	 recent	 popular	 adoption	 of	 OpenCL	 as	 the	 preferred	 hardware/software	 design	
language	for	FPGA-bases	systems	where	loop	pipelining	is	an	important	enabler.	
    39	
The	main	reason	why	FPGAs	are	useful	for	accelerating	computational	tasks	is	because	
of	 its	 potential	 to	 execute	 the	 algorithm	with	 high	 degree	 of	 parallelism	 in	multiple	
processing	units.	This	 is	often	achieved	by	exploiting	 two	factors:	1)	maximizing	data	
reuse	 with	 on-chip	 memory	 to	 avoid	 having	 to	 go	 off-chip,	 and	 2)	 maximizing	 the	
degree	 of	 data	 parallelism	 available	 within	 the	 algorithm.	 	 The	 former	 is	 achieved	
through	the	use	of	on-chip	memory	as	buffers,	while	the	 latter	 is	conducted	through	
careful	 data-dependency	 analysis,	 scheduling	 and	 resource	 allocation.	 However,	 on-
chip	memory	resources	in	an	FPGA	are	limited	and	this	imposes	a	significant	constraint	
on	the	optimization	process.	[6]	employs	a	Geometric	Programming	approach	to	solve	
the	 two	 optimization	 problems	 simultaneously	 to	 produce	 a	 five-fold	 performance	
benefit	 over	 the	 convention	 approach	 that	 optimizes	 for	 the	 two	 factors	 separately.		
The	algorithm	reported	in	[6]	has	been	used	in	industry	such	as	Honeywell	and	others	
(zlatko.petrov@honeywell.com).	
The	 last	 paper	 [7]	 in	 this	 group	differs	 from	all	 the	 others	 in	 that	 it	 does	 not	 target	
digital	 designs	 using	 FPGAs,	 but	 aims	 to	 help	 designing	 new	 FPGA	 architectures.		
Conventional	approach	to	new	FPGA	architecture	design	is	to	use	intuition	to	come	up	
with	different	new	ideas,	then	run	many	benchmark	circuits	to	see	which	idea	provides	
the	 best	 average	 solution	 for	 a	 given	 specification.	 This	 approach	 also	 implies	 that	
synthesis,	placement	and	 routing	 tools	 should	already	be	 in	place	 in	order	 for	 large-
scale	 benchmark	 evaluation	 to	 take	 place.	 In	 contrast,	 [7]	 uses	 mathematical	
optimization	 techniques	 to	 explore	 the	 FPGA	 architectural	 design	 space	 with	 well-
calibrated	area	and	delay	models.		While	this	work	does	not	replace	the	conventional	
method	 in	 designing	 new	 FPGA	 architecture,	 it	 provides	 a	 fast	 exploration	 method	
particularly	at	the	early	stage	of	the	architectural	design	cycle.	
Group	2:	Variability,	Reliability	and	Yield	in	Reconfigurable	Systems	
Variations	 in	 the	 semiconductor	 fabrication	 process	 results	 in	 differences	 in	
parameters	 between	 transistors	 on	 the	 same	 silicon	 die,	 a	 problem	 exacerbated	 by	
lithographic	scaling.	As	transistors	get	smaller,	they	also	experience	accelerated	ageing	
    40	
effects,	thus	reducing	the	reliability	of	electronic	systems.	In	the	past	ten	years,	one	of	
the	 author’s	 research	 focuses	 has	 been	 to	 seek	 answers	 to	 the	 following	 important	
question:	“How	can	configurability	(and	reconfigurability)	in	FPGA	systems	be	exploited	
to	 mitigate	 the	 problems	 caused	 by	 increasing	 process	 variability	 and	 decreasing	
system	reliability?”		
[8]	 is	 an	 early	 article	 by	 the	 author	 in	 the	 area	 of	 variations,	 not	 for	 reconfigurable	
systems,	but	 for	 analogue	and	mixed	 signal	 circuits.	 This	work	demonstrates	how	 to	
use	 statistical	 behavioural	 modelling	 techniques	 to	 alleviate	 the	 high	 computation	
costs	of	device-level	simulation	in	a	Monte-Carlo	approach	to	yield	estimation.		It	was	
one	of	the	earliest	works	that	provided	both	computational	accuracy	and	efficiency	in	
handling	variations	of	device	parameters	in	a	large	circuit	that	was	applicable	to	both	
analogue	and	digital	circuits.			
A	 related	 area	 of	 work	 during	 this	 period	 was	 in	 the	 fault	 diagnosis	 of	 electronic	
systems.	 	 As	 feature	 size	 continues	 to	 shrink,	 good	 degree	 of	 controllability	 and	
observability	of	 individual	circuit	nodes	become	 increasingly	difficult	 to	achieve.	 	The	
research	monograph	 [9]	 entitled	 “Boundary-Scan	 Interconnect	 Diagnosis”	 introduces	
methods	that	drastically	reduce	the	number	of	test	patterns	required	for	interconnect	
fault	diagnosis.		Although	[8]	and	[9]	are	not	applied	to	reconfigurable	hardware,	they	
provide	a	useful	 springboard	 for	 the	author’s	 later	 research	 in	 the	area	of	 variability	
and	reliability	in	FPGAs.	
[10]	is	an	extended	and	combined	version	of	two	earlier	conference	papers,	and	forms	
part	of	a	series	of	publications	on	characterizing,	modelling	and	mitigating	within-die	
variability	on	FPGAs.		This	is	the	first	of	many	papers	by	the	author	and	his	team	to	use	
reconfigurability	 of	 FPGAs	 to	 avoid	 the	penalty	 of	 large	delay	margins	demanded	by	
the	conventional	worst-case	delay	approach	to	designing	digital	circuits.	This	is	also	the	
first	 work	 in	 the	 FPGA	 community	 where	 the	 theoretical	 bounds	 on	 the	 achievable	
gains	in	parametric	yield	using	reconfiguration	are	derived.		This	paper	opens	the	door	
to	the	possibility	of	a	completely	new	approach	to	the	design	of	digital	systems,	where	
    41	
FPGAs	and	other	digital	 integrated	circuits	are	not	assumed	to	be	 identical,	but	have	
different	delay	characteristics.		Our	novel	approach	is	to	match	the	physical	placement	
and	 routing	 of	 hardware	 resources	 to	 an	 individual	 chip’s	 delay	 characteristics	 by	
exploiting	the	configurable	properties	of	FPGAs.		In	this	way,	one	can	depart	from	the	
need	 to	 design	 for	 the	 worst-case	 delay	 parameters.	 	 This	 new	 paradigm	 in	 digital	
design	has	been	designated	by	 the	 author	 as	 “late	binding”.	 	 In	 conventional	 design	
method,	a	 logical	 circuit	 (in	 terms	of	 its	hardware	 resources	and	 interconnections)	 is	
committed	(“bind”)	to	the	final	physical	layout	through	placement	and	routing	early	in	
the	design	cycle.	The	new	method	performs	such	“binding”	only	after	the	actual	delay	
characteristics	 are	 known	 and	 therefore	 allows	 a	 more	 optimum	 placement	 and	
routing	solution	to	be	found	for	a	given	chip.	This	work	has	resulted	 in	a	granted	US	
patent	(US2010/0180246),	which	has	been	assigned	to	industry.			
The	 proposed	 approach	 in	 [10]	 is	 possible	 predicated	 upon	 the	 ability	 to	 measure	
delays	of	circuit	paths	 in	digital	 ICs	with	good	accuracy	and	efficiency.	 	 [11]	and	 [12]	
provide	 solutions	 to	 the	delay	 characterization	problem.	 	The	 technique	proposed	 in	
[11]	measures	the	propagation	delays	in	combinational	circuits	with	a	resolution	in	the	
order	tens	of	picoseconds.		It	relies	on	the	configurability	of	the	clock	module	(and	its	
frequency)	 in	an	FPGA.	 	By	sweeping	the	clock	frequency	from	a	 lower	value	when	a	
circuit	is	working,	to	a	higher	value	when	the	circuit	is	failing,	the	propagation	delay	of	
the	combinational	paths	in	the	circuit	can	be	derived.	Our	technique	departs	from	the	
conventional	method	of	using	ring	oscillators	to	characterize	delays	and	is	far	superior	
because	characterization	is	performed	on	the	actual	circuit	under	test.		This	technique	
was	 later	modified	 and	 improved,	 not	 only	 to	measure	 propagation	 delay	while	 the	
circuit	 is	 under	 test	 mode	 (i.e.	 offline),	 but	 also	 to	measure	 timing	 slack	 under	 the	
circuit’s	normal	operation	(i.e.	online).		These	techniques	are	now	widely	used	by	other	
research	groups	elsewhere,	e.g.	University	of	Pennsylvania	and	University	of	Sydney.	
In	 [12]	 the	 technique	described	 in	 [11]	 is	extended	 to	cope	with	both	combinational	
and	sequential	circuits.		Instead	of	detecting	and	measuring	failures	in	the	circuit	as	the	
clock	 frequency	 is	 being	 swept,	 this	 new	 technique	 measures	 the	 transition	
    42	
probabilities	 at	 outputs	 of	 a	 “black-box”	 circuit,	 i.e.	 without	 the	 need	 to	 know	 the	
exact	netlist	of	 the	circuit	under	test.	 In	other	words,	 the	method	reported	 in	 [12]	 is	
capable	 of	 measuring	 delay	 performances	 in	 arbitrary	 digital	 circuits	 with	 different	
granularity.			
The	techniques	reported	in	[11]	and	[12],	and	other	extensions	reported	elsewhere	not	
included	in	this	thesis	submission,	form	a	set	of	tools	that	facilitate	a	fresh	approach	to	
the	 design	 of	 future	 digital	 circuits	 and	 systems.	 By	 providing	 measurement	
instruments	within	 the	chip	 (embedded	 instrumentations),	 it	 is	now	possible	 to	 tune	
the	 operation	 of	 a	 digital	 chip	 to	 cope	 with	 the	 changing	 operating	 environment	
through	 voltage	 and	 frequency	 scaling.	 	 Although	 both	 [11]	 and	 [12]	 are	 targeting	
FPGA	 technology,	 these	methods	 can	 easily	 be	 adopted	 to	 ASIC	 designs	 in	 general.		
However	 it	 is	worth	noting	that	FPGAs	offer	one	further	advantage	over	ASICs	 in	this	
respect:	 since	 FPGAs	 are	 fully	 reconfigurable,	 offline	 characterization	 of	 a	 device	
effectively	comes	free.		During	power-on,	an	FPGA	can	be	put	into	a	“test	mode”	and	
be	configured	to	perform	the	self-characterization	function.	 	Once	characterization	 is	
completed,	 the	 device	 can	 be	 reconfigured	 to	 implement	 the	 required	 design,	
effectively	overlaying	the	test	configuration.			
[11]	and	 [12]	have	resulted	 in	 the	granted	patent:	US2011/0095768,	which	has	been	
assigned	 to	 industry.	 	 The	 techniques	 described	 also	 provide	 part	 of	 the	 technical	
underpinning	for	the	EPSRC	programme	grant	PRiME:	Power-efficient,	Reliabile,	Many-
core	Embedded	systems	(EP/K034448/1).	
In	 addition	 to	 the	 variability	 issues	 facing	modern	 digital	 circuits,	 another	 challenge	
looming	 large	 in	 the	background	 is	 that	of	device	ageing.	 	Electronic	circuits	degrade	
over	 time	 and	 such	 degradations	 are	 expected	 to	 increase	 in	 magnitude	 and	 occur	
more	rapidly	in	the	future.	[13]	is	a	journal	paper	in	a	Special	Issue	of	IEEE	Design	and	
Test	 focusing	 entirely	 on	 variability	 and	 ageing.	 This	work	demonstrates	 for	 the	 first	
time	how	variability	aware-design	 together	with	configurability	of	FPGA	can	mitigate	
both	the	variability	and	ageing	problems.	
    43	
Group	3:	Application	of	Reconfigurable	Systems	
One	of	the	most	popular	application	domains	for	FPGA-based	systems	is	in	image	and	
video	processing.		In	this	domain,	the	dataset	is	inherently	parallel	with	low	degree	of	
data	dependency,	and	has	very	high	data	rates.	Integer	arithmetic	is	usually	sufficient	
to	 produce	 good	 results	 without	 having	 to	 resort	 to	 floating	 point	 computations,	
something	 that	 matches	 FPGA	 architecture	 very	 well	 (that	 is	 until	 the	 recent	
introduction	of	floating-point	hard-blocks	by	Altera).		[14]	is	a	journal	paper	reporting	
one	of	 the	earliest	 reconfigurable	 computers	designed	and	built	with	 the	 support	of	
our	industrial	partner	Sony	Broadcast	Europe.		This	work	was	published	in	the	Special	
Issue	 of	 IEEE	 Computer	 on	 Reconfigurable	 Computing.	 	 According	 to	 the	 Editorial,	
twenty	four	submissions	by	research	groups	around	the	world	were	received,	and	only	
four	full	papers	(including	ours)	was	selected	for	publication.		Furthermore,	this	system	
(known	 as	 Sonic	 architecture,	 which	 is	 a	 combination	 of	 SONY	 and	 IC	 for	 Imperial	
College)	 was	 adopted	 by	 Sony	 Broadcast	 Europe	 as	 their	 R&D	 laboratory’s	 main	
prototyping	vehicle	for	a	number	of	years	thereafter.			
The	 work	 of	 [14]	 sparked	 interests	 in	 other	 industrial	 partner	 such	 as	 Xilinx.	 	 They	
subsequently	funded	a	follow-on	project	known	as	“Sonic-on-a-Chip”,	where	the	Sonic	
architecture	was	 integrated	into	a	much	larger,	single,	FPGA	device.	[17]	 is	the	result	
from	this	work	where	two	major	challenges	are	addressed:	1)	how	to	take	a	platform-
based	design	approach	to	allow	run-time	system	assembly	instead	of	full	compilation?,	
2)	how	to	model	and	schedule	on-chip	communication	using	time-division	multiplexing	
in	a	way	that	guarantees	no	data	is	lost?				In	additional	to	the	industrial	impact	of	[14]	
and	 [17],	 they,	 and	 their	 associated	 conference	 publications,	 attracted	 over	 250	
citations	(Google	Scholar).	
The	author’s	collaboration	with	Sony	Broadcast	lasted	for	over	a	decade	including	the	
work	 reported	 in	 [20].	 	 Graphic	 Processor	 Units	 (GPUs)	 have	 emerged	 as	 a	 strong	
contender	to	replace	FPGAs	as	the	better	and	more	efficient	computational	hardware	
alternative.	[20]	studies	this	hypothesis	with	real	industrial	problems	provided	by	Sony	
Broadcast,	 and	 identifies	 the	 classes	 of	 problems	 for	which	 each	 technology	 is	most	
    44	
suitable.		This	work	has	attracted	a	good	level	of	attention	since	its	publication	in	2010	
(92	citations	in	Google	Scholar).	
[18]	 is	 another	 application	 in	 the	 video	 domain	 where	 an	 adaptive	 image	 senor	 is	
integrated	with	an	FPGA	in	order	to	achieve	super-resolution	image	sensing	for	video	
capture	 applications.	 	 By	 integrating	 an	 image	 sensor	 with	 an	 FPGA	 in	 the	 same	
system,	 it	 is	 possible	 to	 provide	 real-time	 trade-off	 between	 spatial	 resolution	 and	
exposure	time.	Using	the	Iterative	Back	Project	algorithm	implemented	on	the	FPGA,	a	
motion-deblurring	system	that	works	 in	the	presence	of	noise	 is	demonstrated.	 	This	
work	has	subsequently	attracted	funding	from	DSTL	to	explore	other	real-time	super-
resolution	techniques	using	FPGAs.	
Apart	 from	video	 and	 image	processing,	many	other	 application	domains	potentially	
can	benefit	 from	the	use	of	FPGAs	and	 reconfigurable	 systems.	 	One	such	domain	 is	
hardware-accelerated	 simulation.	 [15]	 reports	 the	 design	 of	 a	 Gaussian	 noise	
generator	using	FPGA	for	exploring	channel	code	behaviour.		The	main	novelty	in	this	
design	 is	 the	 used	 of	 non-uniform	 piecewise	 linear	 approximations	 in	 computing	
trigonometric	and	logarithmic	functions,	achieving	a	50	times	speed	improvement	over	
a	 software	 implementation.	 The	 significance	 of	 this	 work	 comes	 from	 the	 general	
applicability	of	the	noise	generator	to	other	stochastic	modelling	and	simulation.		For	
example,	 this	 work	 has	 been	 extended	 to	 model	 financial	 systems	 and	 to	 perform	
online	 trading,	 where	 low	 computation	 latency	 is	 of	 paramount	 importance.	 This	
journal	paper	has	attracted	a	good	level	of	attention	with	88	citations	(Google	Scholar).	
Encryption	and	decryption	is	yet	another	domain	where	FPGAs	are	particular	suited	in	
providing	 acceleration.	 This	 is	 partly	 due	 to	 the	 fact	 that	 encryption	 and	 decryption	
usually	requires	a	 large	number	of	bit-level	manipulations,	something	that	FPGAs	are	
particularly	 well-suited.	 Such	 hardware	 systems	 often	 demand	 flexibility,	 high	
throughput	and,	when	used	 in	embedded	application,	power	efficiency.	 	 [16]	reports	
the	design	of	an	elliptic	curve	cryptosystem	that	 is	highly	flexible.	 	This	design	allows	
the	user	 to	perform	trade-offs	between	speed,	size	and	 level	of	security.	 	Over	1000	
    45	
times	speed	improvement	when	compared	to	a	software	implementation	is	achieved.		
Even	more	significantly,	this	performance	improvement	is	accompanied	by	much	lower	
energy	consumption:	our	design	only	runs	at	35MHz	on	a	Xilinx	FPGA	as	compared	to	
the	 software	 solution	 running	 on	 a	 Xeon	 processor	 operating	 at	 2.6GHz.	 	 This	work	
provides	one	of	the	best	solutions	for	Elliptic	Curve	Cryptography	in	the	literature,	and	
has	attracted	nearly	100	citations	(Google	Scholar).	
The	work	 report	 in	 [19]	 demonstrates	 yet	 another	 application	 domain	where	 FPGAs	
could	play	an	important	role.		In	the	general	field	of	“big	data”,	the	key	challenge	is	to	
reduce	the	amount	of	original	data	that	require	processing.	Among	many	techniques	
available,	 one	 common	algorithm	used	 for	 this	 application	 is	 to	 computer	 the	 linear	
projection.	 	 [19]	 reports	 a	 Bayesian	 factor	 analysis	 framework	 for	 dimensionality-
reduction	using	FPGAs.		The	novelty	of	this	work	is	in	coupling	the	data	approximation	
using	 a	 small	 set	 of	 variables,	 FPGA	 resource	 optimization	 and	 the	 exploitation	 of	
heterogeneity	in	modern	FPGAs	under	one	framework.		The	impact	of	this	work	is	in	its	
general	applicability	in	many	related	problems	in	big	data	and	machine	learning.	
Concluding	remarks	
To	summarize,	the	collection	of	20	original	works	selected	for	this	submission	includes	
seminal	 journal	 publications	 and	 research	 monographs,	 providing	 evidence	 of	 my	
international	 standing	 in	 the	 field	 of	 FPGAs	 and	 reconfigurable	 systems.	 The	
significance	and	impact	of	my	work	over	the	past	two	decades	is	also	demonstrated	by	
the	 results	 from	 the	 2014	 Research	 Excellence	 Framework	 (REF)	 assessment.	 	 My	
research	in	this	field	formed	one	of	the	five	submitted	impact	case	studies,	which	was	
given	the	top	score	of	four	stars.	
    46	
	
	
	
	
This	page	is	intentionally	left	blank.	
    47	
	
	
	
	
	
	
	
Statement	for	Conjoint	Work	
	
    48	
Statement	for	Conjoint	Work	
For	works	with	research	collaborators,	e-mail	addresses	are	provided	for	confirmation	
of	 contribution	 if	desired,	but	not	 for	works	with	past	 research	 students	or	 research	
assistants	working	under	my	direct	supervision	and	guidance.	
1. T.	Wiangtong	(Research	Student)	–	As	the	primary	PhD	supervisor	(WL	as	second	
supervisor),	 I	 initiated,	 directed	 and	 contributed	 in	 the	 execution	 of	 the	
work.		
2. G.A.	 Constantinides	 (Research	 Student)	 –	 As	 the	 primary	 PhD	 supervisor	 (WL	 as	
second	 supervisor),	 I	 took	 the	 major	 role	 to	 initiated,	 directed	 and	
contributed	 in	 the	 execution	 of	 the	 work.	 I	 also	 obtained	 the	 industrial	
support	from	HP	Lab	Bristol	who	sponsored	this	project.	
3. G.A.	 Constantinides	 (Research	 Student)	 –	 As	 the	 primary	 PhD	 supervisor	 (WL	 as	
second	 supervisor),	 I	 took	 the	 major	 role	 to	 initiated,	 directed	 and	
contributed	 in	 the	 execution	 of	 the	 work.	 I	 also	 obtained	 the	 industrial	
support	from	HP	Lab	Bristol	who	sponsored	this	project.	
4. G.A.	 Constantinides	 (Research	 Student)	 –	 As	 the	 primary	 PhD	 supervisor	 (WL	 as	
second	 supervisor),	 I	 took	 the	 major	 role	 to	 initiated,	 directed	 and	
contributed	 in	 the	 execution	 of	 the	 work.	 	 I	 also	 obtained	 the	 industrial	
support	from	HP	Lab	Bristol	who	sponsored	this	project.	
5. K.	Turkington	(Research	Student)	–	I	was	the	second	PhD	supervisor	(with	GAC	as	
the	primary	supervisor).		I	assisted	to	initiate,	direct	and	contributed	in	the	
execution	of	the	work.		K.	Masselos	(kmas@hwlab.uop.gr).	
6. Q.	 Liu	 (Research	 Student)	 –	 I	 was	 the	 second	 PhD	 supervisor	 (with	 GAC	 as	 the	
primary	 supervisor).	 	 I	 assisted	 to	 initiate,	 direct	 and	 contributed	 in	 the	
execution	of	the	work.		K.	Masselos	(kmas@hwlab.uop.gr).	
7. A.M.	Smith	(Research	Assistant)	–	a	former	research	student	who	stayed	as	a	RA	
on	an	EPSRC	grant	where	 I	was	as	a	co-investigator.	 	As	co-investigator,	 I	
contributed	to	the	initiation,	direction	and	execution	of	the	work.	
8. T.	 Koskinen	 (Research	 Student)	 –	 As	 the	 primary	 PhD	 supervisor,	 I	 initiated,	
directed	and	contributed	in	the	execution	of	the	work.	
9. J.	 de	 Sousa	 (Research	 Student)	 –	 As	 the	 primary	 PhD	 supervisor,	 I	 initiated,	
directed	and	contributed	in	the	execution	of	the	work.			
10. P.	Sedcole	(Research	Assistant)	–	A	former	research	student	who	stayed	as	a	RA	on	
an	 EPSRC	 grant	 that	 I	 was	 the	 PI.	 	 Therefore	 I	 initiatied,	 directed	 and	
contributed	in	the	execution	of	the	work.	
    49	
11. J.S.	Wong	 (Research	 Student),	 P.	 Sedcole	 (Research	 Assistant)	 –	 As	 the	 primary	
PhD	 supervisor	 of	 one	 and	 the	 line	 manager	 of	 the	 other,	 I	 initiated,	
directed	and	contributed	in	the	execution	of	the	work.		
12. J.S.	Wong	(Research	Student)	–	As	the	primary	PhD	supervisor,	I	initiated,	directed	
and	contributed	in	the	execution	of	the	work.	
13. E.	 Stott,	 Z.	 Guan,	 J.M.	 Levine	 (all	 Research	 students)	 -	 	 As	 the	 primary	 PhD	
supervisor	to	all	three	students,	I	initiated,	directed	and	contributed	in	the	
execution	of	the	work.		J.S.	Wong	was	a	fomer	PhD	student	who	stayed	as	a	
RA	on	an	EPSRC	grant	where	I	was	the	PI.	
14. S.D.	 Haynes	 (Research	 Student)	 –	 As	 the	 primary	 PhD	 supervisor,	 I	 initiated,	
directed	and	 contributed	 in	 the	execution	of	 the	work.	 	 J.	 Stone	was	 the	
industrial	supervisor	from	Sony	Broadcast	(john.stone@eu.sony.com)	who	
sponsored	this	project.	
15. D.U.	 Lee	 (Research	 Student)	 -	 I	was	 the	 second	 PhD	 supervisor	 (with	WL	 as	 the	
primary	 supervisor).	 	 I	 assisted	 to	 initiate,	 direct	 and	 contributed	 in	 the	
execution	of	the	work.	
16. R.C.	Cheung	(Research	Student)	-	I	was	the	second	PhD	supervisor	(with	WL	as	the	
primary	 supervisor).	 	 I	 assisted	 to	 initiate,	 direct	 and	 contributed	 in	 the	
execution	of	the	work.			
17. P.	 Sedcole	 (Research	 Student)	 -	 –	 As	 the	 primary	 PhD	 supervisor	 (co-supervised	
with	WL),	I	initiated,	directed	and	contributed	in	the	execution	of	the	work.	
I	 also	 obtained	 the	 industrial	 support	 from	 Xilinx	 who	 sponsored	 this	
project.	
18. M.E.	 Angelopoulou	 (Research	 Student)	 –	 As	 the	 primary	 PhD	 supervisor	 (co-
supervised	with	CSB),	I	initiated,	directed	and	contributed	in	the	execution	
of	the	work.	
19. C.S.	Bouganis	 	 (Reseach	Assistant)	 -	 	 I	was	his	 line	manager	on	an	EPSRC	 funded	
project	and	CS	Bouganis	became	a	Lecturer	at	the	end	of	the	contract.		As	
his	 line	manager	 and	Co-I	 of	 the	project,	 I	 I	 contributed	 to	 the	 initiation,	
direction	and	execution	of	the	work.	
20. B.	Cope	 (Research	Student)	–	As	 the	primary	PhD	supervisor	 (co-supervised	with	
WL),	 I	 initiated,	 directed	 and	 contributed	 in	 the	 execution	 of	 the	work.	 I	
also	obtained	the	industrial	support	from	Sony	Broadcast,	who	sponsored	
this	project.	
	
    50	
	
	
	
	
	
This	page	is	intentionally	left	blank.	
