Core E4
Core E4
5, AUGUST 2016
1301205
AbstractWe present a design of an 8-bit bit-serial rapid singleflux-quantum (RSFQ) microprocessor, which is called CORE e4,
and its high-speed functionality test results. The CORE e4 is
equipped with four general-purpose registers and can execute
20 different instructions. The first version of the CORE e4, which
is called CORE e4v1, has been implemented using the National
Institute of Advanced Industrial Science and Technology (AIST)
10-kA/cm2 advanced process (ADP2) and the CONNECT cell
library. The CORE e4v1 consists of approximately 7000 Josephson
junctions and occupies a circuit area of 3.00 mm 1.98 mm. The
estimated power consumption and performance are 2.03 mW and
333 million instructions per second, respectively. CORE e4 is one
of the highest performing RSFQ microprocessors.
Index TermsArithmetic logic unit (ALU), microprocessor,
rapid single flux quantum (RSFQ), superconducting integrated
circuits.
I. I NTRODUCTION
1051-8223 2016 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
1301205
TABLE I
I NSTRUCTION S ET OF THE CORE e4
The instruction is decoded at the controller, and the behavior of each component is determined. At ALU instructions, the registers to be read are selected by RegA_num
and RegB_num. Instruction HLT is executed in this phase
(hlt_trg). At instruction JMP, the lower five bits in the IR
are loaded to the PC by pc_ld_trg.
4) Execution 0 (EX0)
At ALU instructions, the content of the registers is read
out. At instruction LD, the DM is accessed to obtain the
value by DM_trg. At instruction ST, the existing data in
the word to be written in the DM are cleared. If the branch
condition is satisfied, the PC is incremented to skip the next
instruction by pc_inc_trg.
5) Execution 1 (EX1)
At ALU instructions, calculation in the ALU is completed and the registers to be read are deselected by
RegA/B_reset. At instruction LD, this phase is the latency
period for data transfer. At instruction ST, data are transferred from the register to the DM through DS2.
6) Write Back (WB)
At ALU instructions, as well as instructions SET and
LD, the register to be written is selected by Write_num.
At ALU instructions, alu_sw is sent to DS1, and the
calculation result of the ALU is sent to the LDB. The value
of the LDB is then written to the register by buffer_trg.
At instruction LD, ld_sw is sent to DS1, and the value of
the DM is sent to the LDB. The value of the LDB is then
written to the register by buffer_trg. At instruction SET,
the lower five bits of the LDB are written to the register by
set_trg.
III. L OGICAL AND P HYSICAL D ESIGN
A. Controller
C. Component Circuits
Fig. 1 shows the microarchitecture of the CORE e4. The
main circuit components are a 5-bit program counter (PC), an
instruction memory (IM), an instruction register (IR), a register
file with four 8-bit registers, a load buffer (LDB), a data memory (DM), an ALU, and a controller. DS1 and DS2 are switches
to select the input data.
D. Instruction Execution
CORE e4 is a multicycle microprocessor. Pipelining is not
adopted for simplicity, and data hazards do not occur in this
architecture. Execution of instructions in CORE e4 consists of
the following six phases.
1) Instruction Fetch 0 (IF0)
An instruction is read out from the IM according to the
value of the PC. The PC is incremented.
2) Instruction Fetch 1 (IF1)
This phase is reserved as the latency period for instruction propagation delay.
3) Instruction Decode (ID)
ANDO et al.: DESIGN AND DEMONSTRATION OF AN 8-bit BIT-SERIAL RSFQ MICROPROCESSOR: CORE e4
1301205
selected by the op2 and op3 signals. The output of the RegA
register is X and that of the RegB register is Y.
C. ALU
We designed an 8-bit ALU to perform ten different instructions. The first version of our ALU is called ALUv1.
Fig. 3 shows a schematic diagram of ALUv1. The operation
to be executed is determined by 4-bit control signals: 1100
(ADD), 1101 (MV), 1110 (SUB), 1111 (CMP), 1010 (AND),
1011 (OR), 1001 (XOR), 1000 (NOR), 0100 (INC), 0110
(DEC). The bits of the control signal are named alu1, alu2,
alu3, and alu4 from left to right. These signals are determined
according to the following equations.
alu1 = op7;
alu2 = op6 INC DEC;
alu3 = op5;
alu4 = op4.
We implemented the circuits using the National Institute of Advanced Industrial Science and Technology (AIST)
10-kA/cm2 advanced process (ADP2) [20]. First, we demonstrated the operation of ALUv1 using an on-chip high-speed
test [24]. As a result of the measurement, we successfully
obtained the correct operation of ten ALU instructions. The estimated maximum operating frequency based on measurement
results is 80 GHz.
1301205
addr_out and outz represent an address and a calculation result of the ALU, respectively. The output line labeled psw_nz
and psw_neg represent the NZ and NEG flags, respectively.
This result shows the correct operation of 00010101(21) +
00011010(26) = 00101111(47), including the NZ flag which
is correctly set.
Thirteen of the 20 instructions (ADD, MV, SUB, CMP, AND,
XOR, OR, NOR, SET, LD, ST, NOP, and HLT) were successfully demonstrated in our measurement. However, the output
lines labeled addr_out and the operation of INC/DEC instructions were not successfully demonstrated because of errors in
the clock wiring.
V. C ONCLUSION
We have designed, fabricated, and tested the first version
of an 8-bit bit-serial RSFQ microprocessor, which is called
CORE e4. The functionality of some instruction sequences was
demonstrated in the RSFQ microprocessor. A 256-bit (8 bits
32 words) memory component was demonstrated on another
chip [25]. Integrating the memory with CORE e4 will result in
a complete microprocessor system.
ACKNOWLEDGMENT
The circuits were fabricated in the clean room for analog
digital superconductivity (CRAVITY) of the National Institute
of Advanced Industrial Science and Technology (AIST) with
the Advanced Process (ADP2).
R EFERENCES
[1] K. K. Likharev and V. K. Semenov, RSFQ logic/memory family: A new
Josephson-junction technology for sub-terahertz-clock-frequency digital systems, IEEE Trans. Appl. Supercond., vol. 1, no. 1, pp. 328,
Mar. 1991.
[2] A. V. Rylyakov, New design of single-bit all-digital RSFQ autocorrelator, IEEE Trans. Appl. Supercond., vol. 7, no. 2, pp. 27092712,
Jun. 1997.
[3] N. Yoshikawa and Y. Kato, Reduction of power consumption of RSFQ
circuits by inductance-load biasing, Supercond. Sci. Technol., vol. 12,
pp. 918920, 1999.
ANDO et al.: DESIGN AND DEMONSTRATION OF AN 8-bit BIT-SERIAL RSFQ MICROPROCESSOR: CORE e4
Yuki Ando received the B.E. degree from Ritsumeikan University, Shiga,
Japan, in 2014 and the M.E. degree in informatics from Kyoto University,
Kyoto, Japan, in 2016.
She was with the Graduate School of Informatics, Kyoto University. She has
been involved in the research of architectural and logical design of superconducting circuits.
1301205
Ryo Sato received the B.E. degree in electronics from Nagoya University,
Nagoya, Japan, in 2015.
He is currently with the Department of Quantum Engineering, Nagoya
University, studying high-performance microprocessors based on rapid singleflux-quantum circuits.
Masamitsu Tanaka (M06) received the M.E. and Ph.D. degrees in electronics
and information electronics from Nagoya University, Nagoya, Japan, in 2003
and 2006, respectively.
From 2005 to 2007, he was a Research Fellow of the Japan Society for
the Promotion of Science. In 2007, he joined the Department of Information
Engineering, Nagoya University, and moved to the Department of Quantum
Engineering, where he has been a Designated Lecturer since 2010. In 2011, he
was a Research Scholar with the University of California at Berkeley, Berkeley,
CA, USA. Since 2013, he has also been a Research Associate with the VLSI
Design and Education Center, University of Tokyo, Tokyo, Japan. His research
interests include ultrafast/energy-efficient computing using rapid single-fluxquantum circuits and logic design methodologies.
He is a member of the Institute of Electronics, Information and Communication Engineers, the Japan Society of Applied Physics, the Cryogenics and
Superconductivity Society of Japan, and the Institute of Electrical Engineers of
Japan.
Kazuyoshi Takagi (M04) received the B.E., M.E., and Dr.Eng. degrees from
Kyoto University, Kyoto, Japan, in 1991, 1993, and 1999, respectively, all in
information science.
From 1995 to 1999, he was a Research Associate with the Nara Institute
of Science and Technology, Ikoma, Japan. In 1999, he was an Assistant
Professor with the Department of Information Engineering, Nagoya University,
Nagoya, Japan, where he was promoted to an Associate Professor in 2006. In
2011, he joined the Department of Communications and Computer Engineering, Kyoto University. His current interests include large-scale integration
design and design algorithms.
Naofumi Takagi (SM03) received the B.E., M.E., and Ph.D. degrees from
Kyoto University, Kyoto, Japan, in 1981, 1983, and 1988, respectively, all in
information science.
In 1984, he joined Kyoto University as an Instructor and was promoted to
an Associate Professor in 1991. In 1994, he joined Nagoya University, Nagoya,
Japan, where he was promoted to a Professor in 1998. He returned to Kyoto
University in 2010. His current research interests include computer arithmetic,
hardware algorithms, and logic design.
Dr. Takagi was a recipient of the Japan IBM Science Award and the Sakai
Memorial Award of the Information Processing Society of Japan in 1995 and
the Commendation for Science and Technology by the Minister of Education,
Culture, Sports, Science and Technology of Japan in 2005.
Akira Fujimaki (M10) received the B.E., M.E., and Dr.Eng. degrees from
Tohoku University, Sendai, Japan, in 1982, 1984, and 1987, respectively.
In 1987, he was a Visiting Assistant Research Engineer with the University
of California at Berkeley, Berkeley, CA, USA. Since 1988, he has been working
on superconductor devices and circuits with the School of Engineering, Nagoya
University, Nagoya, Japan, where he is currently a Professor. His current
research interests include single-flux-quantum circuits and their applications
based on low- and high-temperature superconductors.