






















Research of Memory Computing Processor
Katsumi Inoue
The computers we use every day are von Neumann computers. They are highly versatile and are
used in a wide range of fields. A full-scale big data society and a highly intelligent
information processing society have begun; however, the semiconductor miniaturization described
by Moore's Law will soon reach its limits. Against this background, it is necessary to solve
the biggest problem of the von Neumann computer. Computing of the von Neumann type is based on
sequential processing using two devices: a computing device and a storage device (memory).
Its biggest problem is therefore the bus bottleneck (the von Neumann bus bottleneck) between
the computing device and the memory. To address it, research on improving information
processing has accelerated. Adding a computing function inside the memory has been studied as
in-memory computing, and adding a computing function near the memory has been studied as
near-memory computing. Many of these studies aim to solve the von Neumann bus bottleneck. It
should also be recognized that it is not reasonable to make arithmetic units such as arithmetic
logic units (ALUs) and floating-point units (FPUs), which are the arithmetic units of the
Central Processing Unit (CPU) and the Graphics Processing Unit (GPU), repeatedly execute
processing such as "information detection". This study, the Memory Computing Processor, belongs
to in-memory computing and aims to fundamentally solve the problem of "information detection".
Memory Computing Processors are devices in which an arithmetic device and a memory device are
mounted on the same semiconductor chip in order to eliminate the adverse effects caused by
separating the two. They specialize only in "information detection" processing, which is
difficult for CPUs and GPUs. In a Memory Computing Processor, Group Array Processors (GAPs) are
used in large quantities, so that the computing devices become low-cost, space-saving computing
units, and massively parallel computing within the chip enables high-speed, power-saving
processing. In addition, Memory Computing Processors eliminate the need for indexes and
metadata for "information discovery", which simplifies information processing and reduces the
expertise required. "Information detection" by the Memory Computing Processor is fast, saves
power, and simplifies information processing. In the future, information processing will be
divided mainly between CPUs and GPUs, which are conventional processors, and in-memory
computing processors.
























































1. ???????????????Near Memory Computing?:?????????????
?????????????????????????????????????????
???





























































???????? SRAM?Static Random Access Memory??????? 128???? 256??
??NAND?NOR?XOR???????????????????????????????
????????????????????X-SRAM?????????????????128
?????? 256???????????????ECB????Electronic CodeBook mode??
??????? 74.7%??? 74.6%????????????????????????





??????????????Inception v3????????????????CPU (Xeon E5)
? 7.7?????????GPU?Titan Xp?? 18.3?????????????????Neural

























































































































































































? 2.3: ???????????? 1??????????????


























?? 4??? 2????????????????????????????? 1???????
????????????????????????????????10??? 2??????












???? ????8? ?4? ?2? ?1?
0
?? ?/8?∗?/4?∗?/2?∗?/1?




























































1 1 1 1 ?? N/A
?? N/A
( / : ????;? ∗ : ???;?+ : ???)
















? 25? 2??????? 10??????10????????????????????????






?10???????????????MSB????? 10?????? 13?? 4???????
??????????10???????16????????????????????????
??? 4????????? 15? 16?????????? 14?????????????10?
?????10????16??????????? 10???????16????????????
????????10??????10????????????????????????
16 ? 2? ???????????






























• ???? 1: 1???????????? 1????????????????X ? 4??
???????1???????REG???????????
• ???? 2: ?????????1????????????X? 3????????1??
?????REG?????????????????????????1????????
???????????
• ???? 3: ??????????????X? 8??????




• ???? 5: ???????????????????1????????X? 3?????
???????XOR?????????????1????????????
• ???? 6: ????X? 3???????1????????????????1????
???????????






































































































































? 2.2: BIC32K16????? FPGA?????









Design            Hardware                          Power (W)   Throughput (MB/s)
Ref. [107]        20,000 cores per 834 Intel CPUs   95,900      510
Ref. [108]        1,344 cores per 1 NVIDIA GPU      170         0.45
BIC32K16 (IS2)    Intel Arria V                     18.2        1.44











































































?????? 80?????? 40????? 10?????????????????????1




























































???????? 7????128????? 8????256cm???? 8????256kg???? 1























0?? 0???? 0?? 1?????????????1???????????? 1?? 0???
? 1?? 1?????????????0???????????????????n??m??
??????????????????????????????????????? 0???
??????? 0?? 0???? 1?? 0????????????1???????????? 0?
? 1???? 1?? 1????????????0???????????????????n??
m???????????????????????
??????? 4??????????????????????? 0?????? 3???
???????????????????????????????????????????






















































????FIC?Frequent Item Counter?IP?? (Intellectual Property Core) ?????????
?????????????????????????????? FIC???????????
????????FIC??????????? FIC IP???DMA?Direct Memory Access??









































????? I??? 1????????????????????8? 256??????????
???3???????????????
? 1????????????? 2?????????????????????A0?A¯0?A1?
??? A¯1?? (A0 AND A1), (A0 AND A¯1), (A¯0 AND A1)???? (A¯0 AND A¯1)??????
????????
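The decoder terms listed above, (A0 AND A1), (A0 AND Ā1), (Ā0 AND A1), and (Ā0 AND Ā1), can be illustrated with a minimal Python sketch. It assumes the passage describes a binary decoder whose one-hot outputs are summed to count how often each item occurs; the function names `decode_2to4` and `count_items` are hypothetical and are not taken from the FIC IP.

```python
from typing import List


def decode_2to4(a1: int, a0: int) -> List[int]:
    """Model a 2-to-4 binary decoder as four AND terms of A0, A1 and their complements.

    Output index i is 1 exactly when the 2-bit input (a1, a0) encodes i.
    """
    na0, na1 = 1 - a0, 1 - a1  # complements of A0 and A1
    return [
        na1 & na0,  # (NOT A1) AND (NOT A0) selects item 0
        na1 & a0,   # (NOT A1) AND A0       selects item 1
        a1 & na0,   # A1 AND (NOT A0)       selects item 2
        a1 & a0,    # A1 AND A0             selects item 3
    ]


def count_items(items: List[int]) -> List[int]:
    """Count occurrences of each 2-bit item by summing one-hot decoder outputs.

    Assumption: this software loop stands in for hardware adders attached
    to the decoder outputs; it is an illustration, not the thesis design.
    """
    counts = [0, 0, 0, 0]
    for item in items:
        one_hot = decode_2to4((item >> 1) & 1, item & 1)
        counts = [c + h for c, h in zip(counts, one_hot)]
    return counts


if __name__ == "__main__":
    print(decode_2to4(1, 0))
    print(count_items([0, 3, 3, 1, 3]))
```

The same construction extends to wider inputs: an n-bit decoder has 2^n outputs, each being the AND of n address bits or their complements.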






??????????? 6???????????6???? sum-result? 0?31????????















Design [74] (2010) [75] (2011)
Approach Approximation
Method Pipelined model






of output 32 64 128 64 128 256 512 1,024
results
LUT 8,720 16,887 32,023 16,846 34,108 71,302 135,424 269,285
Register 8,312 6,880 12,033 5,377 10,529 20,804 41,441 82,657
Slide N/A 69,433
FMax
Origin: 80 110 115 105 110 95







Scaled: 185.33 140.28 133.90 121.15











Arria V SoC (28nm)
(65nm)





of output 256 256 64 128 256 512 1,024
results
LUT 62,260 N/A
ALUT N/A 51,094 10,312 19,246 36,692 64,709 83,709




ALM 45,258 7,202 13,436 25,990 47,055 80,975
FMax
Origin: 80











































































































































• E0: the reference element, placed at (x=0, y=0).
• E1: located at (x=−2, y=4) relative to E0.
• E2: located at (x=3, y=2) relative to E0.
• E3: located at (x=−5, y=−4) relative to E0.
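A pattern defined by relative coordinates such as these can be checked with a short Python sketch. It is only an illustration under stated assumptions: the grid layout (`grid[y][x]`, with y increasing downward), the per-element match tests, and the name `find_pattern` are hypothetical, and the nested loops are a sequential stand-in for a check that dedicated hardware could perform in parallel.

```python
from typing import Callable, Dict, List, Tuple

Grid = List[List[int]]
Offset = Tuple[int, int]  # (dx, dy) relative to the base element E0

# Offsets quoted in the text: E0 is the base, E1..E3 are relative to E0.
OFFSETS: Dict[str, Offset] = {
    "E0": (0, 0),
    "E1": (-2, 4),
    "E2": (3, 2),
    "E3": (-5, -4),
}


def find_pattern(
    grid: Grid,
    tests: Dict[str, Callable[[int], bool]],
    offsets: Dict[str, Offset] = OFFSETS,
) -> List[Tuple[int, int]]:
    """Return every (x, y) at which E0 matches and all other elements
    match at their offsets from that position."""
    height, width = len(grid), len(grid[0])
    hits = []
    for y in range(height):
        for x in range(width):
            ok = True
            for name, (dx, dy) in offsets.items():
                px, py = x + dx, y + dy
                inside = 0 <= px < width and 0 <= py < height
                # An element that falls outside the grid cannot match.
                if not (inside and tests[name](grid[py][px])):
                    ok = False
                    break
            if ok:
                hits.append((x, y))
    return hits


if __name__ == "__main__":
    demo = [[0] * 12 for _ in range(12)]
    demo[5][6], demo[9][4], demo[7][9], demo[1][1] = 1, 2, 3, 4
    demo_tests = {
        "E0": lambda v: v == 1,
        "E1": lambda v: v == 2,
        "E2": lambda v: v == 3,
        "E3": lambda v: v == 4,
    }
    print(find_pattern(demo, demo_tests))
```

Keeping the offsets in a table separate from the match tests means the same routine can be reused for a different arrangement of elements by changing only the table.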
































































? (R=0)? (G=0)? (B=0)???????????????????????????????
(R = 0)? (G = 0)? (B = 0)= (R = 0)? (G = 0)? (B = 0)=
(R=1?255)? (G=1?255)? (B=1?255) ???????????????
????(R=100?110)? (G=100?110)? (B=100?110)??????????
(R=0?99? 111?255)? (G=0?99? 111?255)? (B=0?99? 111?255)?????????
?????????????R?????????
(R=0?99)? (G=0?255)? (B=0?255)? (R=111?255)? (G=0?255)? (B=0?255)
G?????????
(R=0?255)? (G=0?99)? (B=0?255)? (R=0?255)? (G=111?255)? (B=0?255)
B?????????
(R=0?255)? (G=0?255)? (B=0?99)? (R=0?255)? (G=0?255)? (B=111?255)
???????RGB???
?(R=0?99)? (G=0?255)? (B=0?255)? (R=111?255)? (G=0?255)? (B=0?255)??
?(R=0?255)? (G=0?99)? (B=0?255)? (R=0?255)? (G=111?255)? (B=0?255)??
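The range expressions above can be read as an application of De Morgan's law: the complement of "R, G, and B all lie in 100 to 110" is a union of range queries in which one channel lies outside that range and the other two are unrestricted (0 to 255). The Python sketch below checks this reading on sample values. It is an assumption-laden illustration, not the thesis implementation; the names `matches`, `complement_queries`, and `not_matches` are hypothetical.

```python
from itertools import product

FULL = (0, 255)  # full 8-bit range of one colour channel


def in_range(value, bounds):
    """True when value lies in the closed interval bounds = (lo, hi)."""
    lo, hi = bounds
    return lo <= value <= hi


def matches(r, g, b, lo=100, hi=110):
    """Target condition: all three channels lie in [lo, hi]."""
    target = (lo, hi)
    return in_range(r, target) and in_range(g, target) and in_range(b, target)


def complement_queries(lo=100, hi=110):
    """Six range queries whose union is the complement of the target.

    Each query restricts one channel to below or above the target range
    and leaves the other two channels unrestricted.
    """
    below, above = (0, lo - 1), (hi + 1, 255)
    return [
        (below, FULL, FULL), (above, FULL, FULL),  # R outside the range
        (FULL, below, FULL), (FULL, above, FULL),  # G outside the range
        (FULL, FULL, below), (FULL, FULL, above),  # B outside the range
    ]


def not_matches(r, g, b, lo=100, hi=110):
    """Complement of matches(), built only from positive range tests."""
    return any(
        in_range(r, qr) and in_range(g, qg) and in_range(b, qb)
        for qr, qg, qb in complement_queries(lo, hi)
    )


if __name__ == "__main__":
    samples = [0, 99, 100, 110, 111, 255]
    agree = all(
        not_matches(r, g, b) == (not matches(r, g, b))
        for r, g, b in product(samples, repeat=3)
    )
    print(agree)
```

Expressing the negation as positive range tests is useful when the detection hardware supports only "value in range" comparisons combined with OR.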










































































52 ? 4? ?????????
? 4.6: ????????????????????
4.3. ??????????????????? 53
? 4.7: ???????????????? 1???









































? 4.8: ???????????????? 2???
? 4.9: ??????????????? 1???
? 4.10: ??????????????? 2???







???? E0?????????????????????????? E1?????? x=0±2


































































































































???? 8???????????????????????????????? 246 × 205?




???? 9??????????????????????????????? 320× 212??
???? 4.16????? 9???????????????? 2??????????????
???????????????????????????????? 455?????????
????????????


















































































































???? 11???????????????????? 225× 225?????? 4.17????





???? 12????????????????????? 218× 240?????? 4.17???


























































































































































































































































































































































































































































































































































































































































































































































































































































































???????????? FG1????? FG1?????????????? FG1??E0?








???????????????? FG2? FG???????????????? AND?OR?
NOT???????????????????????????????????? FG2??








































• ? 1???????1?? 50????????????????????????????
??????????????????? 5????????
• ? 2????????????????????????????????????? 5?
??????????????????? FG??????????? 2????????
??????














? 4.2????????????????????????????? 128 × 64??????
?????????? 32× 32??????????????????????????????




• ??????????????????
? 4.2: FPGA???????????????????????
References              Ref. [109]    Ref. [110]    Our work
Object image [pixels]   128 × 128     128 × 128     128 × 64
Query image [pixels]    36 × 36       64 × 64       32 × 32
Logic resources         32,115        35,586        56,492
Memory [bits]           1,204,985     456,625       2,236,336
DSP blocks              113           113           0
Frequency [MHz]         75            85            50
Device                  Stratix II    Stratix II    Cyclone IV















• ????????:XY ? 2?????????2?4?8???????????
• ??:2?4?8??????














? 4.23?????? Pixel???????????
? 4.23: ????????
?????QVGA?? 320× 240?????? 76,800??????? 1??????????
???????????? RGB??????????????? RGB MIN/MAX?????
RGB?????????????????????? 2??????????????????
?????? FGS?Flag Shifter????????????????????????? FGS?
???????????????????????????????????????????
?????PO?Parallel Operation????????????,???????????????






? 4.25? SOP?????????????????? 4.22???? SOP?????????
???????????????????????????????????? CAD????
??????????????????????? 9mm×9mm??????.








??????????? 1??? 4.26? 5???????????????????????
??????????(E0: X=0, Y=0, E1: X=−16±0, Y=16±0, E2: X=16±0, Y=16±0, E3:
X=−16±0, Y=−16±0, E4: X=16±0, Y=−16±0)
???????????2???4.26?5?????????????????????????
??????(E0: X=0, Y=0, E1: X=−16±8, Y=16±8, E2: X=16±8, Y=16±8, E3: X=−16±8,










??????????? 1 ??????????? 2
??? ?? ??? ??? ??? ?? ??? ???
???? 0 0 0 13 ???? 0 0 0 13
E0??? 6 0 0 2 E0??? 6 0 0 2
E1??? 6 0 4 2 E1??? 6 10 2 2
E2??? 6 0 4 2 E2??? 6 10 2 2
E3??? 6 0 4 2 E3??? 6 10 2 2
E4??? 6 0 4 2 E4??? 6 10 2 2
???? 0 0 0 1 ???? 0 0 0 1
??????????? 70 ??????????? 102





























































?? ?? [104] ?? [105] ???
????CPU Intel Core i7 860 ARM Cortex-A9 Intel Core i5-3317U
?? 4?? 8???? 2?? 2?? 4????
TDP???? 95 W 1.9 ? 17 W
???????? 2.8?GHz 2.0 GHz 1.76 GHz
???? 81.676 m? 2.01 ? 22 m?
???????? Xilinx Virtex-6 FPGA Xilinx Zynq-7000 FPGA ASIC
???????? 280.004 MHz 25 MHz 33 MHz
???? 0.234 m? ? 0.43 ? 2.10 µ?


















































????????? 4??? 128?????? 4???????????? 2K????????
??????????
? 4.29: ??????????









































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































































[1] S. Khoram, et al., “Challenges and Opportunities: From Near-memory Computing to In-
memory Computing,” in Proc. of 2017 ACM International Symposium on Physical Design
(ISPD), pp. 1-4, Mar. 2017.
[2] G. Singh, et al., “Near-Memory Computing: Past, Present, and Future,” Cornell University,
pp. 1-16, Aug. 2019.
[3] Amogh Agrawal, Akhilesh Jaiswal, Chankyu Lee, and Kaushik Roy, “X-SRAM: Enabling
Boolean Computations in CMOS Static Random Access Memories,” IEEE Transactions on
Circuits and Systems I, vol. 65, no. 12, pp. 4219-4232, Dec. 2018.
[4] Charles Eckert, et al., “Neural Cache: Bit-Serial In-Cache Acceleration of Deep Neural
Networks,” in Proceedings of 2018 ACM/IEEE 45th Annual International Symposium on
Computer Architecture (ISCA), pp. 383-396, May 2018.
[5] ?? ???“??????????????????????????????,”????,
2019? 8? 18??
[6] ?? ???“???????????????,”??????? 6516775??2019? 4?
26??
[7] ?????“????????????????????????,”?????, ? 6393852
?, 2018? 8? 31??
[8] ?? ???“??????????????????????????????,”????
?, ? 6229024?, 2017? 10? 20??
[9] ?? ???“?????????????????????????????????,”?
???????, ?? 221454, 2017? 7? 31??
[10] ?? ???“?????????????????????????????????,”?
?????, ? 2790009?, 2017? 1? 17??
[11] ?? ???“??????????????????????????????,”????,
US9627065B2, 2017? 4? 18??
[12] ?? ???“?????????????????????????????,”?????,
? 6014120?, 2016? 9? 30??
[13] ?? ???“?????????????????,???????????????,”?
????, ? 5992073?, 2016? 8? 26??
94 ????
[14] ?? ???“??????????????????????????????,”????
?, ? 5981666?, 2016? 8? 5??
[15] ?? ???“?????????????????????????????????,”?
???, US9275734B2, 2016? 3? 1??
[16] ?? ???“?????????????????????????????????,”?
????, ? 5763616?, 2015? 6? 19??
[17] ?? ???“????????????????????????,??????????,”
?????, ??? 1, 446192?, 2014? 7? 20??
[18] ?? ???“????????????????????????,??????????,”
?????, ? 4588114?, 2010? 9? 17??
[19] Katsumi Inoue, and Cong-Kha Pham, “The Memorism Processor: Towards a Memory-Based
Artificially Intelligence Complementing the von Neumann Architecture,” SICE Journal of
Control, Measurement, and System Integration, vol.10, no.6, pp.544-550, Nov. 2017.
[20] Katsumi Inoue, Trong-Thuc Hoang, and Cong-Kha Pham, “Frequent Items Counter Based
on Binary Decoders,” IEICE Electronics Express, vol. no.20, pp.1-12, Oct. 2018.
[21] ?? ??, ? ??,“??????????????????????,”????????
??????????? 118(10), pp. 17-22, 2018? 4? 19?.
[22] ?? ??, ?? ??, ? ??,“???????????????????????????
?,”???????? SSI2016, 2016? 12? 7?.?
[23] ?? ??, Nguyen Xuan-Thuan, ? ??,“?????????????????????
?????? (DBP),” ????????????????????? 2015? ?????
??? (2), 2015? 8? 25?.
[24] ?? ??, ? ??,“?????????????????????????? (DBP),” ?
?????????????, ???? 114(13), pp. 91-96, 2014? 4? 17?.
[25] ?? ??, ? ?????, ?? ??, ???,“?????????? (SOP),”??????
????????, ???? 113(236), pp. 35-40, 2013? 10? 7?.
[26] Takahiro Hosaka, Trong-Thuc Hoang, Van-Phuc Hoang, Duc-Hung Le, Katsumi Inoue, and
Cong-Kha Pham, “Live Demonstration: Real-Time Auto-Exposure Histogram Equalization
Video-System using Frequent Items Counter,” in Proceedings of 2019 IEEE International
Symposium on Circuits and Systems (ISCAS), pp. 1-2, May 2019.
[27] Xuan-Thuan Nguyen, Trong-Thuc Hoang, Katsumi Inoue, Ngoc-Tu Bui, Van-Phuc Hoang,
and Cong-Kha Pham, “A 1.2-V 90-MHz Bitmap Index Creation Accelerator with 0.27-nW
Standby Power on 65-nm Silicon-On-Thin-Box (SOTB) CMOS,” in Proceedings of 2019
IEEE International Symposium on Circuits and Systems (ISCAS), pp. 1-5, May 2019.
[28] Xuan-Thuan Nguyen, Trong-Thuc Hoang, Hong-Thu Nguyen, Katsumi Inoue, and Cong-
Kha Pham, “An Efficient I/O Architecture for RAM-Based Content-Addressable Memory
on FPGA,” IEEE Transactions on Circuits and Systems II: Express Briefs, vol. 66, no. 3,
2019.
[29] Katsumi Inoue, Trong-Thuc Hoang, Xuan-Thuan Nguyen, Hong-Thu Nguyen, and Cong-
Kha Pham, “VLSI Design of Frequent Items Counting Using Binary Decoders Applied to
8-bit per Item Case-study,” in Proceedings of 2018 14th Conference on Ph.D. Research in
Microelectronics and Electronics (PRIME), pp. 161-165, 2018.
[30] Xuan-Thuan Nguyen, Trong-Thuc Hoang, Hong-Thu Nguyen, Katsumi Inoue, and Cong-
Kha Pham, “A 219µW 1D-to-2D-Based Priority Encoder on 65-nm SOTB CMOS,” in Pro-
ceedings of 2018 IEEE International Symposium on Circuits and Systems (ISCAS), pp. 1-5,
2018.
[31] Xuan-Thuan Nguyen, Trong-Thuc Hoang, Hong-Thu Nguyen, Katsumi Inoue, and Cong-
Kha Pham, “An FPGA-Based Hardware Accelerator for Energy-Efficient Bitmap Index
Creation,” IEEE Access, vol. 6, pp. 16046-16059, Mar. 2018.
[32] Trong-Thuc Hoang, Xuan-Thuan Nguyen, Hong-Thu Nguyen, Nhu-Quynh Truong, Duc-
Hung Le, Katsumi Inoue, and Cong-Kha Pham, “FPGA-based frequent items counting
using matrix of equality comparators,” in Proceedings of 2017 IEEE 60th International
Midwest Symposium on Circuits and Systems (MWSCAS), pp. 285-289, 2017.
[33] Xuan-Thuan Nguyen, Hong-Thu Nguyen, Katsumi Inoue, Osamu Shimojo, and Cong-Kha
Pham, “Highly parallel bitmap-based regular expression matching for text analytics,” in
Proceedings of 2017 IEEE International Symposium on Circuits and Systems (ISCAS), pp.
2667-2671, 2017.
[34] Xuan-Thuan Nguyen, Hong-Thu Nguyen, Trong-Thuc Hoang, Katsumi Inoue, Osamu Shimojo,
Toshio Murayama, Kenji Tominaga, and Cong-Kha Pham, “An efficient FPGA-based
database processor for fast database analytics,” in Proceedings of 2016 IEEE International
Symposium on Circuits and Systems (ISCAS), pp. 1-5, 2016.
[35] Duc-Hung Le, Katsumi Inoue, and Cong-Kha Pham, “Design of a parallel CAM-based multi-
match search system using 0.18-µm CMOS process,” in Proceedings of 2014 IEEE Fifth
International Conference on Communications and Electronics (ICCE), pp. 336-340, 2014.
[36] Duc-Hung Le, Tran-Bao-Thuong Cao, Katsumi Inoue, and Cong-Kha Pham, “A CAM-based
Information Detection Hardware System for fast exact pattern matching,” in Proceedings of
2013 IEEE 56th International Midwest Symposium on Circuits and Systems (MWSCAS),
pp. 848-852, 2013.
[37] Duc-Hung Le, Katsumi Inoue, and Cong-Kha Pham, “Design a fast CAM-based information
detection system on FPGA and 0.18µm ASIC technology,” in Proceedings of 2013 IEEE
International Conference of Electron Devices and Solid-state Circuits Conference Paper, pp.
1-4, 2013.
[38] Duc-Hung Le, Tran-Bao-Thuong Cao, Katsumi Inoue, and Cong-Kha Pham,“A fast CAM-
based Watermarking extraction on FPGA,” in Proceedings of 2013 International Conference
on IC Design & Technology (ICICDT), pp. 203-207, 2013.
[39] Duc-Hung Le, Tran Bao Thuong Cao, Katsumi Inoue, and Cong-Kha Pham,“A fast CAM-
based image matching system on FPGA,” in Proceedings of 2013 IEEE International Sym-
posium on Circuits and Systems (ISCAS2013), pp.1797-1801, 2013.
[40] Duc-Hung Le, Masahiro Sowa, Katsumi Inoue, and Cong-Kha Pham,“A fully-parallel infor-
mation detection hardware system employing Content Addressable Memory,” in Proceedings
of 2012 Fourth International Conference on Communications and Electronics (ICCE), pp.
447-452, 2012.
[41] Duc-Hung Le, Katsumi Inoue, and Cong-Kha Pham,“A novel CAM-based Information De-
tection Hardware System on FPGA,” in Proceedings of PRIME 2012 8th Conference on
Ph.D. Research in Microelectronics & Electronics, pp. 1-4, 2012.
[42] H. Zhang, et al.,“In-memory big data management and processing: A survey,” IEEE Transactions
on Knowledge and Data Engineering, vol.27, no.7, pp.1920-1948, 2015.
[43] X.-T. Nguyen, S.-C. Haw, S. Subramaniam, and C.-K. Pham,“Dynamic Node Labeling
Schemes for XML Updates,” in Proceedings of the 6th International Conference On Computing
& Informatics (ICOCI), pp.505-510, 2017.
[44] H. Wong, et al.,“Bit transposed files,” in Proceedings of the 11th International Conference
on Very Large Data Bases, pp.448-457, 1985.
[45] P. O’Neil, and D. Quass,“Improved query performance with variant indexes,” ACM SIGMOD
Record, vol.26, no.2, pp.38-49, 1997.
[46] R. R. Sinha and M. Winslett,“Multi-resolution bitmap indexes for scientific data,” ACM
Transactions on Database Systems, vol.32, no.3, pp.16-39, 2007.
[47] K. Wu, A. Shoshani, and K. Stockinger,“Analyses of multi-level and multi-component com-
pressed bitmap indexes,” ACM Transactions on Database Systems, vol.35, no.1, pp.1-52,
2010.
[48] K. Stockinger, and K. Wu,“Bitmap Indices for Data Warehouses,” in Data Warehouses and
OLAP: Concepts, Architectures and Solutions, pp.157-178, 2007.
[49] K. Wu, E. J. Otoo, and A. Shoshani,“Optimizing bitmap indices with efficient compression,”
ACM Transactions on Database Systems, vol.31, no.1, pp.1-38, 2006.
[50] R. Mueller, J. Teubner, and G. Alonso,“Data Processing on FPGAs,” in Proceedings of the
VLDB Endowment, vol.2, iss.1, pp.910-912, 2009.
[51] A. Dollas,“Big Data Processing with FPGA Supercomputers: Opportunities and Challenges,”
in Proceedings of IEEE Computer Society Annual Symposium on VLSI, pp.474-479,
2014.
[52] A. Putnam, et al.,“A reconfigurable fabric for accelerating large-scale datacenter services,”
in Proceedings of ACM/IEEE 41st International Symposium on Computer Architecture (ISCA), pp. 10-22, Jun.
2014.
[53] B. Sukhwani, et al.,“A Hardware/Software Approach for Database Query Acceleration with
FPGAs,” International Journal of Parallel Programming, vol.43, no.6, pp.1129-1159, 2015.
[54] I. Kuon, and J. Rose,“Measuring the Gap Between FPGAs and ASICs,” IEEE Transactions on
Computer-Aided Design of Integrated Circuits and Systems, vol.26, no.2, pp.203-215, 2007.
[55] D. Singh, and C. K. Reddy,“A survey on platforms for big data analytics,” Journal of Big
Data, vol.2, no.1, pp.1-20, 2014.
[56] O. Jian, et al.,“SDA : Software-defined accelerator for large-scale DNN systems,” in Pro-
ceedings of IEEE Hot Chips 26 Symposium(HCS), pp.1-23, 2014.
[57] R. Tsuchiya, et al.,“Silicon on thin BOX : a new paradigm of the CMOSFET for low-power
and high-performance application featuring wide-range back-bias control,” in Proceedings
of IEEE International Electron Devices Meeting(IEDM) Technical Digest, pp.631-634, 2014.
[58] T. Ishigaki, et al.,“Ultralow-power LSI Technology with Silicon on Thin Buried Oxide
(SOTB) CMOSFET,” in Solid State Circuits Technologies, Chapter 7, pp.146-156, 2010.
[59] D.-H. Le, et al.,“Design of a Low-power Fixed-point 16-bit Digital Signal Processor Using
65nm SOTB Process,” in Proceedings of The International Conference on Integrated Circuit
Design and Technology(ICICDT), pp.1-4, 2015.
[60] K. Ishibashi, et al.,“A Perpetuum Mobile 32bit CPU on 65nm SOTB CMOS Technology
with Reverse-Body-Bias Assisted Sleep Mode,” IEICE Transactions on Electronics, vol.E98-
C, no.7, pp.536-543, 2015.
[61] Z. Li, et al.,“IR-Tree : An Efficient Index for Geographic Document Search,” IEEE Transactions
on Knowledge and Data Engineering, vol.23, no.4, pp.585-599, 2011.
[62] H. Wang, et al.,“Efficient query processing framework for big data warehouse: an almost
join-free approach,” Frontiers of Computer Science, vol.9, no.2, pp.224-236, 2015.
[63] T. Arici, et al.,“A Histogram Modification Framework and Its Application for Image Con-
trast Enhancement,” IEEE Trans. on Image Processing, pp. 1921-1935, Sep. 2009.
[64] J. E. Duarte-Sanchez, et al.,“Hardware Accelerator for the Multifractal Analysis of DNA
Sequences,” IEEE/ACM Trans. on Computational Biology and Bioinformatics, pp. 1611-
1624, May 2017.
[65] C. Estan, and G. Varghese,“New Directions in Traffic Measurement and Accounting,” in
Proceedings of the 2002 conference on Applications, technologies, architectures, and proto-
cols for computer communications, pp. 323-336, Aug. 2002.
[66] S. Das, et al.,“Thread Cooperation in Multicore Architectures for Frequency Counting Over
Multiple Data Streams,” in Proceedings of Int. Conf. on Very Large Data Bases Endowment
(PVLDB), pp. 217-228, Aug. 2009.
[67] S. Das, et al.,“CAM Conscious Integrated Answering of Frequent Elements and Top-k
Queries Over Data Streams,” in Proceedings of 4th Int. Workshop on Data Management on
New Hardware, pp. 1-10, 2008.
[68] P. Roy, et al.,“Efficient Frequent Item Counting in Multi-core Hardware,” in Proceedings of
18th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining, pp. 1451-1459,
Aug. 2012.
[69] X. Yang, et al.,“A Parallel Frequent Item Counting Algorithm,” in Proceedings of 8th Int.
Conf. on Intelligent Human-Machine Syst. and Cybernetics (IHMSC), pp. 1-4, Aug. 2016.
[70] M. Cafaro, et al.,“On Frequency Estimation and Detection of Frequent Items in Time Faded
Streams,” IEEE Access, vol. 5, pp. 24078-24093, Oct. 2017.
[71] M. Greenwald, and S. Khanna,“Space-Efficient Online Computation of Quantile Sum-
maries,” in Proceedings of ACM SIGMOD Int. Conf. on Management of Data 30, pp. 58-66,
May 2001.
[72] J. Han, et al.,“Efficient Computation of Iceberg Cubes with Complex Measures,” in Pro-
ceedings of ACM SIGMOD Int. Conf. on Management of Data 30, pp. 1-12, May 2001.
[73] B. He, et al.,“Efficient Iceberg Query Evaluation Using Compressed Bitmap Index,” IEEE
Trans. on Knowledge and Data Engineering, vol. 24, no. 9, pp. 1570-1583, Sep. 2012.
[74] J. Teubner, et al.,“FPGA Acceleration for the Frequent Item Problem,” in Proceedings of
IEEE 26th Int. Conf. on Data Engineering (ICDE 2010), pp. 1-4, Mar. 2010.
[75] J. Teubner, et al.,“Frequent Item Computation on a Chip,” IEEE Trans. on Knowledge and
Data Engineering, vol. 23, no. 8, pp. 1169-1181, Aug. 2011.
[76] A. Stillmaker, and B. Baas,“Scaling Equations for the Accurate Prediction of CMOS Device
Performance from 180 nm to 7 nm,” Integration, the VLSI Journal, vol. 58, pp. 74-81, Feb.
2017.
[77] Trong-Thuc Hoang, et al.,“FPGA-based Frequent Items Counting Using Matrix of Equality
Comparators,” in Proceedings of IEEE Int. Midwest Symp. on Circuits and Syst. (MWSCAS),
pp. 285-289, Aug. 2017.
[78] J. E. Rice, J. Schultz, and W. Osborn,“Exploring different methods for 2DR-tree binary
search on FPGA,” in Proceedings of IEEE International Midwest Symposium on Circuits
and Systems (MWSCAS), pp. 646-649, Aug. 2007.
[79] H. C. Lee, and F. Ercal,“RMESH Algorithms for parallel string matching,” in Proceedings
of International Symposium on Parallel Architectures, Algorithms and Networks (I-SPAN),
pp. 223-226, Dec. 1997.
[80] K. McLaughlin, et al.,“Design and analysis of matching circuit architectures for a closest
match lookup,” in Proceedings of 20th International Conference on Parallel and Distributed
Processing (IPDPS), pp. 214-218, Apr. 2006.
[81] Y. Utan, S. Wakabayashi, and S. Nagayama,“An FPGA-based text search engine for approx-
imate regular expression matching,” in Proceedings of International Conference on Field-
Programmable Technology (FPT), pp. 184-191, Dec. 2010.
[82] H. Le, and V. K. Prasanna,“A memory-efficient and modular approach for string matching
on FPGAs,” in Proceedings of 18th IEEE Annual International Symposium on Field-Programmable
Custom Computing Machines (FCCM), pp. 192-200, May 2010.
[83] S. Paul, and S. Bhunia,“Reconfigurable computing using Content Addressable Memory for
improved performance and resource usage,” in Proceedings of 45th annual Design Automa-
tion Conference, pp. 786-791, Jun. 2008.
[84] H. Yamada, M. Hirata, H. Nagai, and K. Takahashi,“A High-Speed String search Engine,”
IEEE J. of Solid-State Circuits, vol. 22, no. 5, pp. 829-834, Oct. 1987.
[85] T. Hanamoto, et al.,“A Flexible Search Managing Circuitry for High-Density Dynamic
CAMs,” IEICE Trans. Electron., vol. E77-C, no. 8, pp. 1377-1384, Aug. 1994.
[86] H. J. Mattausch, T. Gyohten, Y. Soda, and T. Koide,“Compact associative-memory archi-
tecture with fully parallel search capability for the minimum Hamming distance,” IEEE J.
Solid-State Circuits, vol. 37, no. 2, pp. 218-227, 2002.
[87] Y. Yano, T. Koide, and H. J. Mattausch,“Associative memory with fully parallel nearest-
Manhattan-distance search for low-power real-time single-chip applications,” in Proceedings
of Asia and South Pacific Design Automation Conference (ASP-DAC), pp. 543-544, Jan.
2004.
[88] G. Nilsen, J. Torresen, and O. Srasen,“A Variable Word-Width Content Addressable Mem-
ory (CAM) for Fast String Matching,” in Proceedings of 22nd Norchip Conference, pp.
214-217, Nov. 2004.
[89] I. Sourdis, D. Pnevmatikatos, and S. Vassiliadis,“An Evaluation of FPGA-based IDS Pattern
Matching Techniques,” in Proceedings of 16th Annual Workshop on Circuits, Systems and
Signal Processing, pp. 449-453, Nov. 2005.
[90] I. Sourdis, and D. Pnevmatikatos,“Pre-decoded CAMs for Efficient and High-Speed NIDS
Pattern Matching,” in Proceedings of IEEE Symposium on Field-Programmable Custom
Computing Machines, pp. 258-267, Apr. 2004.
[91] Y.-H. E. Yang, and V. K. Prasanna,“Memory-efficient pipelined architecture for large-scale
string matching,” in Proceedings of 17th Annual IEEE Symposium on Field-Programmable
Custom Computing Machines (FCCM), pp. 104-111, Apr. 2009.
[92] J. Van Lunteren,“High-performance pattern-matching for intrusion detection,” in Proceedings
of 25th IEEE International Conference on Computer Communications, pp. 1-13, Apr.
2006.
[93] B. C. Brodie, R. K. Cytron, and D. E. Taylor,“A Scalable Architecture For High-Throughput
Regular-Expression Pattern Matching,” in Proceedings of 33rd International Symposium on
Computer Architecture (ISCA), pp. 192-202, Jun. 2006.
[94] W. Jiang, Y.-H. E. Yang, and V. K. Prasanna,“Scalable multi-pipeline architecture for high
performance multi-pattern string matching,” in Proceedings of 15th International Parallel
and Distributed Processing Symposium (IPDPS), pp. 1-12, May 2010.
[95] J. Huang, Z. Yang, X. Du, and W. Liu,“FPGA based High speed and low area cost pattern
matching,” in Proceedings of IEEE TENCON 2005 Conference, pp. 1-5, Nov. 2005.
[96] C. R. Clark, and D. E. Schimmel,“Scalable Parallel Pattern Matching on High-Speed Net-
works,” in Proceedings of IEEE Symposium on Field-Programmable Custom Computing
Machines, pp. 249-257, Apr. 2004.
[97] G. Papadopoulos, and D. Pnevmatikatos,“Hashing + Memory = Low Cost, Exact Pattern
Matching,” in Proceedings of 15th International Conference on Field-Programmable Logic
and Applications, pp. 39-44, Aug. 2005.
[98] R. Sidhu, and V. K. Prasanna,“Fast Regular Expression Matching using FPGAs,” in Pro-
ceedings of IEEE Symposium on Field-Programmable Custom Computing Machines, pp.
227-238, Apr. 2001.
[99] I. Sourdis, D. Pnevmatikatos, S. Wong, and S. Vassiliadis,“A Reconfigurable Perfect-Hashing
Scheme for Packet Inspection,” in Proceedings of 15th International Conference on Field-
Programmable Logic and Applications, pp. 644-647, Aug. 2005.
[100] K. Pagiamtzis and A. Sheikholeslami,“Content-Addressable Memory (CAM) Circuits and
Architectures : A Tutorial and Survey,” IEEE J. of Solid-State Circuits, vol. 41, no. 3, pp.
712-727, Mar. 2006.
[101] S. A. Guccione, D. Levi, and D. Downs,“A Reconfigurable Content Addressable Memory
(CAM),” in Proceedings of 15th International Parallel and Distributed Processing Symposium
(IPDPS), pp. 882-889, May 2000.
[102] H. Nakahara, T. Sasao, and M. Matsuura,“A CAM Emulator Using Look-Up Table Cas-
cades,” in Proceedings of 21st International Parallel and Distributed Processing Symposium
(IPDPS), pp. 26-30, Mar. 2007.
[103] M. Faezipour, and M. Nourani,“Wire-Speed TCAM-Based Architectures for Multimatch
Packet Classification,” IEEE Trans. Computers, vol. 58, no. 1, pp. 5-17, Jan. 2009.
[104] K. Hashimoto, Y. Ito, and K. Nakano,“Template Matching using DSP slices on the FPGA,”
in Proceedings of 2013 First International Symposium on Computing and Networking, pp.
1-4, Dec. 2013.
[105] Yuri Marchetti Tavares, Nadia Nedjah, and Luiza de Macedo Mourelle,“Hardware Software
Codesign System for Template Matching using Particle Swarm Optimization and Pearson’s
Correlation,” in Proceedings of 2016 IEEE Latin American Conference on Computational
Intelligence (LA-CCI), pp. 1-4, Mar. 2017.
[106] Gagandeep Singh, Lorenzo Chelini, Stefano Corda, Ahsan Javed Awan, Sander Stuijk, Roel
Jordans, Henk Corporaal, and Albert-Jan Boonstra, “Near-Memory Computing: Past, Present,
and Future,” arXiv:1908.02640v1 [cs.AR], 7 Aug. 2019.
[107] H.-T. Chiu, J. Chou, V. Vishwanath, and K. Wu, “In-memory query system for scientific
datasets,” in Proc. IEEE 21st Int. Conf. Parallel Distrib. Syst. (ICPADS), Dec. 2015,
pp. 362-371.
[108] F. Fusco, M. Vlachos, X. Dimitropoulos, and L. Deri, “Indexing million of packets per
second using GPUs,” in Proc. Conf. Internet Meas. Conf. (IMC), 2013, pp. 327-332.
[109] T. Zhang, H. Quan, L. Zhao, F. Yu, “High Efficient Implementation of Image Matching
Algorithm,” Proc. of the 2nd International Congress on Image and Signal Processing, pp.
1-5, 2009.
[110] Z. Tao, Y. F. Ping, Q. Hao-jun, “An Optimized High-Speed High-Accuracy Image Matching
System Based on FPGA,” Proc. IEEE International Conference on Information and
Automation, pp. 1107-1112, Jun. 2010.
[111] Fei Gao, Georgios Tziantzioulis, and David Wentzlaff, “ComputeDRAM: In-Memory Compute
Using Off-the-Shelf DRAMs,” IEEE/ACM International Symposium on Microarchitecture




(1) Katsumi Inoue, and Cong-Kha Pham, “The Memorism Processor: Towards a Memory-Based
Artificially Intelligence Complementing the von Neumann Architecture,” SICE Journal of
Control, Measurement, and System Integration, vol.10, no.6, pp.544-550, Nov. 2017.
(2) Katsumi Inoue, Trong-Thuc Hoang, and Cong-Kha Pham,“Frequent Items Counter Based
on Binary Decoders,” IEICE Electronics Express, vol. no.20, pp.1-12, Oct. 2018.
(3) Xuan-Thuan Nguyen, Trong-Thuc Hoang, Hong-Thu Nguyen, Katsumi Inoue, and Cong-
Kha Pham, “An FPGA-Based Hardware Accelerator for Energy-Efficient Bitmap Index Creation,”
IEEE Access, vol. 6, pp. 16046-16059, Mar. 2018.
(4) Xuan-Thuan Nguyen, Trong-Thuc Hoang, Hong-Thu Nguyen, Katsumi Inoue, and Cong-Kha
Pham, “An Efficient I/O Architecture for RAM-Based Content-Addressable Memory on
FPGA,” IEEE Transactions on Circuits and Systems II: Express Briefs, vol. 66, no. 3, 2019.
International Conference Papers
(1) Takahiro Hosaka, Trong-Thuc Hoang, Van-Phuc Hoang, Duc-Hung Le, Katsumi Inoue, and
Cong-Kha Pham, “Live Demonstration: Real-Time Auto-Exposure Histogram Equalization
Video-System using Frequent Items Counter,” in Proceedings of 2019 IEEE International
Symposium on Circuits and Systems (ISCAS), pp. 1-2, May 2019.
(2) Xuan-Thuan Nguyen, Trong-Thuc Hoang, Katsumi Inoue, Ngoc-Tu Bui, Van-Phuc Hoang,
and Cong-Kha Pham, “A 1.2-V 90-MHz Bitmap Index Creation Accelerator with 0.27-nW
Standby Power on 65-nm Silicon-On-Thin-Box (SOTB) CMOS,” in Proceedings of 2019
IEEE International Symposium on Circuits and Systems (ISCAS), pp. 1-5, May 2019.
(3) Katsumi Inoue, Trong-Thuc Hoang, Xuan-Thuan Nguyen, Hong-Thu Nguyen, and Cong-
Kha Pham, “VLSI Design of Frequent Items Counting Using Binary Decoders Applied to
8-bit per Item Case-study,” in Proceedings of 2018 14th Conference on Ph.D. Research in
Microelectronics and Electronics (PRIME), pp. 161-165, 2018.
(4) Xuan-Thuan Nguyen, Trong-Thuc Hoang, Hong-Thu Nguyen, Katsumi Inoue, and Cong-
Kha Pham, “A 219µW 1D-to-2D-Based Priority Encoder on 65-nm SOTB CMOS,” in Pro-
ceedings of 2018 IEEE International Symposium on Circuits and Systems (ISCAS), pp. 1-5,
2018.
(5) Trong-Thuc Hoang, Xuan-Thuan Nguyen, Hong-Thu Nguyen, Nhu-Quynh Truong, Duc-
Hung Le, Katsumi Inoue, and Cong-Kha Pham, “FPGA-based frequent items counting using
matrix of equality comparators,” in Proceedings of 2017 IEEE 60th International Midwest
Symposium on Circuits and Systems (MWSCAS), pp. 285-289, 2017.
(6) Xuan-Thuan Nguyen, Hong-Thu Nguyen, Katsumi Inoue, Osamu Shimojo, and Cong-Kha
Pham, “Highly parallel bitmap-based regular expression matching for text analytics,” in
Proceedings of 2017 IEEE International Symposium on Circuits and Systems (ISCAS), pp.
2667-2671, 2017.
(7) Xuan-Thuan Nguyen, Hong-Thu Nguyen, Trong-Thuc Hoang, Katsumi Inoue, Osamu Shi-
mojo, Toshio Murayama, Kenji Tominaga, and Cong-Kha Pham,“An efficient FPGA-based
database processor for fast database analytics,” in Proceedings of 2016 IEEE International
Symposium on Circuits and Systems (ISCAS), pp. 1-5, 2016.
(8) Duc-Hung Le, Katsumi Inoue, and Cong-Kha Pham,“Design of a parallel CAM-based multi-match
search system using 0.18-µm CMOS process,” in Proceedings of 2014 IEEE Fifth
International Conference on Communications and Electronics (ICCE), pp. 336-340, 2014.
(9) Duc-Hung Le, Tran-Bao-Thuong Cao, Katsumi Inoue, and Cong-Kha Pham, “A CAM-based
Information Detection Hardware System for fast exact pattern matching,” in Proceedings of
2013 IEEE 56th International Midwest Symposium on Circuits and Systems (MWSCAS),
pp. 848-852, 2013.
(10) Duc-Hung Le, Katsumi Inoue, and Cong-Kha Pham,“Design a fast CAM-based information
detection system on FPGA and 0.18µm ASIC technology,” in Proceedings of 2013 IEEE
International Conference of Electron Devices and Solid-state Circuits Conference Paper, pp.
1-4, 2013.
(11) Duc-Hung Le, Tran-Bao-Thuong Cao, Katsumi Inoue, and Cong-Kha Pham,“A fast CAM-based
Watermarking extraction on FPGA,” in Proceedings of 2013 International Conference
on IC Design & Technology (ICICDT), pp. 203-207, 2013.
(12) Duc-Hung Le, Tran Bao Thuong Cao, Katsumi Inoue, and Cong-Kha Pham,“A fast CAM-based
image matching system on FPGA,” in Proceedings of 2013 IEEE International Symposium
on Circuits and Systems (ISCAS2013), pp.1797-1801, 2013.
(13) Duc-Hung Le, Masahiro Sowa, Katsumi Inoue, and Cong-Kha Pham,“A fully-parallel information
detection hardware system employing Content Addressable Memory,” in Proceedings
of 2012 Fourth International Conference on Communications and Electronics (ICCE),
pp. 447-452, 2012.
(14) Duc-Hung Le, Katsumi Inoue, and Cong-Kha Pham,“A novel CAM-based Information Detection
Hardware System on FPGA,” in Proceedings of PRIME 2012 8th Conference on Ph.D.
Research in Microelectronics & Electronics, pp. 1-4, 2012.
???????
(1) ?? ??, ? ??,“??????????????????????,”?????????
?????????? 118(10), pp. 17-22, 2018? 4? 19?.
(2) ?? ??, ?? ??, ? ??,“???????????????????????????
?,”???????? SSI2016, 2016? 12? 7?.?
(3) ?? ??, Nguyen Xuan-Thuan, ? ??,“??????????????????????
????? (DBP),” ????????????????????? 2015? ???????
? (2), 2015? 8? 25?.
(4) ?? ??, ? ??,“?????????????????????????? (DBP),” ?
?????????????, ???? 114(13), pp. 91-96, 2014? 4? 17?.
(5) ?? ??, ? ?????, ?? ??, ???,”?????????? (SOP),“??????




(2) ?? ???“???????????????,”??????? 6516775??2019? 4?
26??
(3) ?? ???“????????????????????????,”?????, ? 6393852
?, 2018? 8? 31??
(4) ?? ???“??????????????????????????????,”????
?, ? 6229024?, 2017? 10? 20??
(5) ?? ???“?????????????????????????????????,”?
???????, ?? 221454, 2017? 7? 31??
(6) ?? ???“?????????????????????????????????,”?
?????, ? 2790009?, 2017? 1? 17??
(7) ?? ???“??????????????????????????????,”????,
US9627065B2, 2017? 4? 18??
(8) ?? ???“?????????????????????????????,”?????,
? 6014120?, 2016? 9? 30??
(9) ?? ???“?????????????????,???????????????,”??
???, ? 5992073?, 2016? 8? 26??
(10) ?? ???“??????????????????????????????,”????
?, ? 5981666?, 2016? 8? 5??
(11) ?? ???“?????????????????????????????????,”?
???, US9275734B2, 2016? 3? 1??
(12) ?? ???“?????????????????????????????????,”?
????, ? 5763616?, 2015? 6? 19??
(13) ?? ???“????????????????????????,??????????,”
?????, ??? 1, 446192?, 2014? 7? 20??
(14) ?? ???“????????????????????????,??????????,”












??????????????????????? Duc-Hung Le san?Xuan-Thuan Nguyen
san?Trong-Thuc Hoang san??????????
??????????????????????????????????????????
???????????????????
??????????????????????????????????????????
????????
????
?????????????
??? 44? 3?????????????????
??? 30? 4???????????????????????????????
???????????
??? 2? 3???????????????????????????????
? ???????????
