# **Design of a Simple Pipeline** ## **Objective** To design and implement a simple pipelined system (other than CPU). It is important to obtain a deep understanding of the basic concepts of pipelining such as data-stationary control, forwarding, and stalling. Since the textbook has presented a completed design of the pipelined CPU, it does not provide an opportunity for students to arrive at the basic design of a new pipelined system by themselves. It is hoped that this lab provides such an opportunity. #### Introduction In this lab the operations to be performed (the instructions to be executed) are very simple. Using a pipelined system, a series of such simple instructions are to be executed very much like in a CPU. We need to take care of data dependencies by designing appropriate forwarding unit (FU) and hazard detection and stalling unit (HDU). The design has 5 pipeline stages: **Instruction Fetch (IF), Instruction Decode (ID), Execution Stage 1 (EX1), Execution Stage 2 (EX2),** and **Write Back Stage (WB)**. In EX1 we have a subtractor to subtract a constant of 3 (given A, it produces A - 3). In EX2, we have an adder to add a constant of 4 (given A, it produces A + 4). Using these resources, we can perform a MOV or a SUB3 or a ADD4 or even a ADD1. The ADD1 operation requires performing both operations, SUB3 and ADD4. The MOV operation requires neither operation. ## **Datapath** As stated in the introduction, we can perform four operations (besides a NOP) namely MOV, SUB3, ADD4, and ADD1. Figure 1 serves as a block diagram for this lab. The subtract\_3 operation in EX1 is implemented by actually tying a MINUS\_THREE constant (1111\_1111\_1111\_1101) to one of the two inputs of a regular adder. Similarly the add\_4 operation in EX2 is implemented by actually tying a PLUS\_FOUR constant (0000\_0000\_0000\_0100) to one of the two inputs of a regular adder. The ADD1 operation is performed by subtracting 3 in EX1 and then adding 4 in EX2. The MOV operation is performed by neither subtracting 3 nor adding 4 to the given data. Since the SUB3 instruction does not require any processing by the EX2 stage and similarly the ADD4 instruction does not require any processing by the EX1 stage, (and further since the MOV instruction does not require any processing by either EX1 or EX2 stages), we have two **result-selection muxes**, (R1-Mux and R2-Mux) at the output of the adders in EX1 and EX2 stages to **bypass** the subtract\_3 and/or add\_4 operations, as needed. Here the register file has one read port and one separate write port. It is an *internally-forwarding* register file. Actually we are providing the register file from lab 7 Part 1 for this part also. It has three read ports and one write port. You can ignore two out of the three read ports. **Instruction format**: The most significant 4 bits [31:28] are used for opcode which is **one-hot coded**. The result register address (RA) is specified by the 4 bits [7:4] and the source register address (XA) is specified by the 4 bits [3:0]. Rest of the 20 bits [27:8] are all zeros. | Instruction | Operation | | Opcode | | | MSD | 32-bit instruction in hex | |----------------|-----------------------|-----|--------|------|------|-----|-----------------------------------------| | | | MOV | SUB3 | ADD4 | ADD1 | | <b>D</b> =Destination, <b>S</b> =Source | | NOP | | 0 | 0 | 0 | 0 | 0 | 000000 <mark>DS</mark> | | MOV \$R, \$X; | $(R) \le (X)$ | 1 | 0 | 0 | 0 | 8 | <b>8</b> 00000 <b>DS</b> | | SUB3 \$R, \$X; | $(\$R) \le (\$X) - 3$ | 0 | 1 | 0 | 0 | 4 | 400000 <b>DS</b> | | ADD4 \$R, \$X; | $(\$R) \le (\$X) + 4$ | 0 | 0 | 1 | 0 | 2 | 200000 <b>DS</b> | | ADD1 \$R, \$X; | $(\$R) \le (\$X) + 1$ | 0 | 0 | 0 | 1 | 1 | 100000 <b>DS</b> | LAB 7 Part 3 Block Diagram EX2\_RA ID XA EX1 RA X 0 ID XMEX2 ID\_XMEX1 revised 7/18/2010 #### **Initial contents of the Register File** ``` Register id id in hex Content in hex Register 00 0 0001h <= Unlike MIPs, the register 0 is like any other register here Register 01 1 0002h Register 02 2 0004h Register 03 3 0008h Register 04 4 0010h Register 05 5 0020h Register 06 6 0040h 7 Register 07 0080h Register 08 8 0100h Register 09 9 0200h Register 10 A 0400h Register 11 B 0800h Register 12 C 1000h Register 13 D 2000h Register 14 E FFF8h Register 15 F FFFFh ``` #### **Control:** Please consider the following questions **before** designing and implementing your control. You need to think who can wait for forwarding data latest until when and who can provide forwarding data earliest by when. May be MOV is in some aspects like SUB3 because it is capable of helping subsequent instruction(s) when it is in EX2. May be MOV is in some aspects like ADD4 instruction because it can postpone receiving help until it reaches EX2. **Note:** Separate pages are provided at the end of this assignment for submitting your answers to these questions. Question 1 Can an instruction postpone receiving forwarding data until it reaches EX2 stage? If an instruction can postpone receiving forwarding help until reaching EX2, would it still try to receive help while it is in EX1 (may be because the donor instruction can not wait)? Question 2 Are there occasions where you end up stalling an instruction because you could not provide the needed forwarding data to it in EX1? Question 3 Can an instruction in EX2 provide forwarding help to an instruction in EX1? Or is it too early for any instruction to start providing forwarding help while it is in EX2? Question 4 Which instructions need to wait until they reach WB stage for them to provide forwarding help? And why they need to wait until then? Question 5 Priority in Forwarding: Recall that, in the MIPS pipelined CPU design, if the instructions in both MEM stage and also in WB stage are willing to provide forwarding help to the instruction in EX stage, we exercise priority and accept help from \_\_\_\_\_(MEM/WB) stage. Do we have such a situation here? If so, explain with an example instruction sequence. Question 6 Normally forwarding is done at the beginning of a clock so that the recipient instruction can process the information during the clock. However, sometimes, it may make sense (as in Lab 7 Part 1) to forward information at the end of the clock, if the receiving instruction is not performing any operation on the data received. Can an instruction such as ADD4 or ADD1 in EX2 provide forwarding help to an ADD4 or MOV instruction in EX1 towards the end of the clock? If yes, did you provide such an arrangement in your design? If not, is it desirable to provide such an arrangement? Does it cost extra? Does it avoid any stalls, thereby improving the pipeline performance? Or is it that the particular help we plan to offer at the end of the clock will anyway be available at the beginning of the next clock and it is just one and the same (one and the same, whether you provide data at the end of the current clock or at the beginning of the next clock)? Explain briefly. Question 7 Also answer question #3 from Spring 2002 Final Exam (slightly modified for this lab) included later in this assignment. Answer this question before you attempt the second part of this lab (ee457 lab7 P3 EX1 EX2 merged). Note: The original Spring 2002 question did not have the MOV instruction. It has only SUB3, ADD4, ADD1, and of course the NOP. Now we modified the question to include the MOV instruction also. # Fill-in the following to understand stalling needs and forwarding opportunities | | | ng forwarding he | Providing forwarding help | | | | |-------------|------------------|------------------|---------------------------|--------------------|--------------------------|--| | Instruction | | | Doesn't mind | | Capable of | | | | receiving in EX1 | receiving in EX1 | receiving in EX2 | providing from EX2 | providing from <b>WB</b> | | | MOV | | | | | | | | SUB3 | | | | | | | | ADD4 | | | | | | | | ADD1 | | | | | | | | | above, if an instruction is dependent on a senior instruction which is just before), there is never a need to stall the dependent instruction. | | |---------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------| | ginning of the the senior inseither producing the when the de | ent instruction is either or, then it needs to clock when it is in EX1 as it needs to process the data using SUB3 struction (donor instruction) is just above it (just before it) in EX2 st or, then it can't help at the beginning of the clock data using the ADD4 in EX2. Hence this dependency hazard show the pendent instruction is in the ID stage and should be stalled. To (just 1 clock, minimum for 1 clock). | in EX1. And if age, and if it is ock, as it is still ald be detected | | source registe | (Unlike/Like) the MIPS 5 stage pipeline, where the (have only one source register / can have the instructions (is / is ndent instruction in EX1 stage instead of the ID stage. | two source reg- | | Draw logic to | go into HDU and FU2 | | | | HDU | | | | | STALL | | | | | | | FU2 | | | | | FORW2 | ## Draw logic to go into FU1 as per the diagram on page 2 Redesign the logic to go into FU1 if the arrangement of the forwarding muxes is changed as shown Any advantage of the NEW\_FU1 over the original FU? Note: If logic is reduced, then it is cheaper and faster! #### Instruction Stream in the TestBench It is important to go through and understand the instruction stream (given in the testbench) to be run on your CPU design before you start debugging. You need to know what behavior you are expecting. Please go through ee457\_lab7\_P3\_tb.v after you download files for the first subpart of this lab. The instructions loaded in the instruction memory are specified there and they are fully explained. The instruction stream is common to all four subparts. ### What you have to do 1. There are four subparts to this Lab#7 Part3. The fours subparts are: Subpart File Name Prefix Lab 7 Part 3 subpart#1 ee457\_lab7\_P3 Lab 7 Part 3 subpart#2 ee457\_lab7\_P3\_with\_EX1\_EX2\_merged Lab 7 Part 3 subpart#3 ee457\_lab7\_P3\_RTL\_Coding\_Style Lab 7 Part 3 subpart#4 ee457\_lab7\_P3\_RTL\_Coding\_Style with EX1\_EX2\_merged #### 2. First we do Lab7 Part 3 subpart#1. - 3. Create a folder C:\ModelSim\_projects\ee457\_lab7\_P3 (under your C:\ModelSim\_projects). Download the .zip file provided (ee457\_lab7\_P3.zip) and extract the .v and .do files and place them in the above directory. - 4. Create a modelsim project with the project name ee457\_lab7\_P3. Choose ee457\_lab7\_P3 for the project directory. - 5. Add all verilog files to the project. - 6. Edit (in Notepad++) and complete the **ee457\_lab7\_P3.v**. - 7. Go through ee457\_lab7\_P3\_tb.v and understand the instruction stream used for testing. Compile all Verilog files. - 8. Start simulation by selecting ee457 lab7 P3 tb. Unselect "Enable optimization". - 9. Use the given .do file "ee457\_lab7\_P3\_wave.do" to set up the waveform (command: do ee457\_lab7\_P3\_wave.do). - 10. Select the Memories tab in the workspace and double click on the reg file to display its contents in the right pane. 11. Initially the data content of the memory is displayed as xxxx. You can simulate for a very short time (say 1ns) (run 1ns) to display the actual initial contents. Steps 7-11 can be performed by using the ee457\_lab7\_P3.do file (do ee457\_lab7\_P3.do). - 12. Run the simulation for 1199ns more (run 1199ns) (total 1200ns). - 13. Verify the final register contents. Look at the waveform to see if any signals are misbehaving. Look at the <code>TimeSpace.txt</code> file produced and placed in the project directory. Use Notepad++ to look at this file as wordpad refuses to open this file as it is still being controlled/updated by ModelSim. (On Windows-XP, I could open the <code>TimeSpace.txt</code> file in Notepad++ while simulation is not yet done. If Windows-7 does not allow you to open the file while simulation is going on, please end the simulation, inspect the file and restart the simulation again.) Perform incremental simulation to find errors. Use <code>restart -f</code> to start the simulation again from Ons. Now run for short lengths of time examining the register file contents, the waveforms, and the <code>TimeSpace.txt</code>. Optimization Enable optimization - 14. After finishing all debugging, compile the revised .v file, again restart (restart -f), run for 1ns, examine register contents using the command-line command at VSIM> prompt: "examine -radix hex UUT/REG\_FILE/reg\_file". Copy the response in the transcript window into "RF\_Content\_Lab7\_P3.txt". Further run simulation for 1199ns more (run 1199ns) and again examine register contents using command "examine -radix hex UUT/REG\_FILE/reg\_file". Copy the response in the transcript window again into "RF\_Content\_Lab7\_P3.txt". - 15. Now we do Lab7 Part 3 subpart#2 (ee457\_lab7\_P3\_with\_EX1\_EX2\_merged) - 16. Finish answering Question 7.3 at the end of this assignment and complete the diagram on page 14/14. - 17. Download ee457\_lab7\_P3\_with\_EX1\_EX2\_merged.zip, extract files, and place the .v and .do files in C:\ModelSim\_projects\ee457\_lab7\_P3\_with\_EX1\_EX2\_merged. Create a ModelSim project with the name ee457\_lab7\_P3\_with\_EX1\_EX2\_merged. Add the verilog files to it. Note: The file, ee457\_lab7\_components.v is identical to the file with the same name provided in previous subpart. And the file ee457\_lab7\_P3\_tb.v is nearly identical to the testbench file of the previous subpart. - 18. Complete ee457 lab7 P3 with EX1 EX2 merged.v file. - 19. Compile all verilog files. - 20. Start simulation of the module ee457 lab7 P3 tb. Unselect "Enable optimization". - 21. Run the do file ee457\_lab7\_P3\_with\_EX1\_EX2\_merged\_wave.do to setup the waveform. - 22. Run for 1ns, examine register contents using command "examine -radix hex UUT/REG\_FILE/reg\_file" Steps 19-22 can be performed by using the ee457\_lab7\_P3\_with\_EX1\_EX2\_merged.do file (do ee457\_lab7\_P3\_with\_EX1\_EX2\_merged.do). - 23. Now run for 1199ns more and again examine the register contents and the waveforms. Debug. - 24. Copy the two sets of register file contents into file "RF\_Content\_Lab7\_P3\_EX1\_EX2\_merged.txt" - 25. For Lab7 Part 3 subpart#3 (ee457\_lab7\_P3\_RTL\_Coding\_Style) and Lab7 Part 3 subpart#4 (ee457\_lab7\_P3\_RTL\_Coding\_Style with EX1\_EX2\_merged), please see a separate handout. Sub part 3 is a demonstration of the sub part 1 in **RTL coding style**. Sub part 3 is an exercise asking you to recode the earlier sub part #2 in RTL coding style. The coding in sub parts 1 and 2 is structural coding using too many small components. This is not the desirable way of coding. Besides showing RTL coding style, these two sub parts try to place emphasis on appropriate use of Blocking and Non-Blocking assignments in RTL coding, paying attention to the fact that you need to produce an intermediate variable before you attempt to consume it, and that the combinational paths in a pipeline may go across stages. These sub parts also show that an intermediate signal (in the NSL combinational logic) generated in a clocked procedural block should not be accessed outside that procedural block either for viewing in waveform or for actual logic coding. #### 26. General Guidelines - 27. Start early and seek help early if needed. - 28. Finally submit online (through your unix account) (submission commands specified separately) one set of files for a team of two students: - 29. You need to use the file names exactly as stated and follow the submission procedure exactly as specified. We use unix script files to automate grading. - 30. Non-working lab submission: In simulation, it will be evident if your lab is not working. We discourage you from submitting a non-working lab. If you want to submit a non-working lab, each member of your team needs to send an email to all lab graders (with a copy to all TAs) stating in the subject line, "EE457 Non-working lab submission request" and obtain an approval from one of them. Submitting a non-working lab or partial lab without such approval is interpreted as an intention to cheat. Sorry to say all this, but this makes sure that the system works well. #### What you have to turn-in #### On-line (one submission for a team of 2 students) Please turn in the following (three separate submissions): - 1. submit -user ee457lab -tag puvvada\_lab7\_p3 ee457\_lab7\_P3.v RF\_Content\_Lab7\_P3.txt TimeSpace.txt names.txt - 2. submit -user ee457lab -tag puvvada\_lab7\_p3\_merged ee457\_lab7\_P3\_with\_EX1\_EX2\_merged.v RF Content Lab7 P3 EX1 EX2 merged.txt TimeSpace.txt names.txt #### Paper submission (individual effort, each student separately) Please complete, staple together, and submit pages2/14, 4/14, 5/14 and also pages 8/14 to 14/14. ee457\_lab7\_P3\_simple\_pipeline.fm 3/5/11 7 <sub>/ Lab 7P3</sub>) Questions (individual effort, paper submission, submit pages 2/14, 4/14, 5/14 and also pages 8/14 to 14/14) Please consider the following questions before implementing and designing your control. You need to think who can wait for forwarding data latest until when and who can provide forwarding data earliest by when. | Q 1 Can an instruction postpone receiving forwarding data until it reaches EX2 stage? If an instruction can postpone receiving forwarding help until reaching EX2, would it still try to receive help while it is in EX1 (may be because the donor instruction can not wait)? | |-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | | | | | | | | | | | <b>Q 2</b> Are there occasions where you end up stalling an instruction because you could not provide the needed forwarding data to it in EX1? | | | | | | | | | | | | Q 3 Can an instruction in EX2 provide forwarding help to an instruction behind it? Or is it too early for any instruction to start pro- | | viding forwarding help while it is in EX2? | | | | | | | | Q 4 Which instructions need to wait until they reach WB stage for them to provide forwarding help? And why they need to wait until then? | | | | Q 5 Priority in Forwarding: Recall that, in the MIPS pipelined CPU design, if the instructions in both MEM stage and also i stage are willing to provide forwarding help to the instruction in EX stage, we exercise priority and accept help(MEM/WB) stage. Do we have such a situation here? If so, explain with an example instruction sequence. | | |------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------| | | | | | | | Normally forwarding is done at the beginning of a clock so that the recipient instruction can process the information duri clock. However, sometimes it may make sense (as in Lab 7 Part 1) to forward information at the end of the clock. Can struction such as ADD4 or ADD1 in EX2 provide forwarding help to an ADD4 or MOV instruction in EX1 towards the clock? If yes, did you provide such an arrangement in your design? If not, is it desirable to provide such an arrange Does it cost extra? Does it avoid any stalls, thereby improving the pipeline performance? Or is it that the particular help we to offer at the end of the clock will <i>anyway</i> be available at the beginning of the next clock and it is just one and the same and the same, whether you provide data at the end of the current clock or at the beginning of the next clock)? Explain broads and the same, whether you provide data at the end of the current clock or at the beginning of the next clock)? Explain broads are the same and the same. | an in-<br>end of<br>ement?<br>we plan<br>ne (one | | | | | | | | | ••••• | • • • • • • • • • • • • • • • • • • • • | ••••• | ••••• | |--------------|----------------------|-----------------------------------------|---------------------|-----------------------------------------| | | | | | | | | | ••••• | ••••• | • • • • • • • • • • • • • • • • • • • • | | | | | | | | | | | | | | | | | | | | | | | | | | O = TEL C 11 | 1: -1.41 1: C - 1: - | C.1 O.1/2 C | 2002 Einel E Dlesse | | **Q** 7 The following is a slightly modified version of the Q#3 from Spring 2002 Final Exam. Please answer this as part of this lab questions. - Suppose the current design is working at 500 MHz (clock period = 2 ns). Due to VLSI technology improvements, you can either (1) double the clock rate to 1 GHz (clock period = 1 ns) or (2) keep the clock rate at the same level (500 MHz) and combine IF and ID stages into one stage called IFID and also the EX1 and EX2 stages into one stage called EX12 stage. Circle your choice and explain. - a) **Both** options are equally good (b) Option 1 is better than option 2 (c) Option 2 is better than option 1 - 7.2 Given below are **four** flip-flop hook-ups and **five** statements describing their operation. You need to find a matching statement for each of the hook-ups. - (a) Once SET, it remains set. - (b) Once RESET, it remains reset. - (c) If it is currently SET, it will RESET on the next clock. - (d) If it is currently RESET, it will SET on the next clock. - (e) none of the above | 7.3 | Let us go back to the original (slow) VLSI technology. Let us still combine the two stages EX1 and EX2 | |-----|----------------------------------------------------------------------------------------------------------| | | into one stage EX12 as shown in the incomplete design on page 14/14. The SUB3 and ADD4 instructions | | | require only one of two resources in EX12 and take only one clock to pass through EX12. MOV and NOP | | | do not need any computation. Only the ADD1 instruction requires both subtract_three and add_four | | | operations and takes two-clocks through EX12. At that time we need to stall the entire pipe including WB | | | stage for one clock so that the ADD1 completes using EX12 stage. | | 7.3.1 | Explain why do we need to stall the WB stage also and why cannot we send a bubble into the WB stage. | |-------|------------------------------------------------------------------------------------------------------| | | Use the sequence on this page (where the entire pipeline is stalled during clock 2) to explain. | | ADD4 | \$5, | \$6 | ; | (\$5) | <= | (\$6) | + | 4 | | |------|--------------|-----|---|-------|----|-------|---|---|--| | ADD1 | \$4, | \$5 | ; | (\$4) | <= | (\$5) | + | 1 | | | SUB3 | \$3, | \$5 | ; | (\$3) | <= | (\$5) | - | 3 | | | ADD4 | <b>\$1</b> . | \$2 | • | (\$1) | <= | (\$2) | + | 4 | | | | | IF | ID | EX12 | WB | |-----------------|---------|------|------|------|------| | | Clock 1 | ADD4 | SUB3 | ADD1 | ADD4 | | Pipe<br>Stalled | Clock 2 | ADD4 | SUB3 | ADD1 | ADD4 | | ' | Clock 3 | | ADD4 | SUB3 | ADD1 | | | Clock 4 | | | ADD4 | SUB3 | 7.3.2 Notice that in the *incomplete* design, we have provided a flip-flop in the EX12 stage to help you stall the entire pipe for one clock (no more no less) when ADD1 is passing through the EX12 stage. Complete the design after answering the following questions. • Do you need to stall the pipe to resolve any dependency problem? Yes / No Explain. Notice that we have removed the hazard detection unit. • Notice that - -- we removed one of the two comparators - -- we removed the forwarding mux X2 Mux and FU2 - -- we removed the prioritization mux. Explain why it is appropriate to remove these. | 7 F<br>the | art 3 sub<br>code a | part 2). B<br>ny one o | Both are r | unning ould per | original L<br>at 500mhz<br>form bett | z. Does<br>ter? Dec | one of the | em perf | orm alwa | ays bette | r? Or de | pendir | |---------------------|---------------------|------------------------|-------------------------------|-----------------|--------------------------------------|---------------------|-------------------------------|------------------|-------------|----------------------|--------------------|--------| | | | | | | grams on DD4 's | | seque | | e inda | | | 11 / 6 | | D4 \$5,<br>IB3 \$4, | \$6 ;<br>\$5 ; | (\$5) < (\$4) < | = (\$6)<br>= (\$5)<br>= (\$4) | + 4; | ADD4#1<br>SUB3#2 | ADD1 | \$15, ;<br>\$13, ;<br>\$11, ; | \$10 ;<br>\$14 ; | (\$15) < | <= (\$10<br><= (\$14 | )) + 1;<br>l) + 1; | ADD: | | Code A r | unning o | n Lab #7 l | Part 3 | | | | Code B r | unning or | Lab #7 F | Part 3 | | | | | IF | ID | EX1 | EX2 | WB | | | IF | ID | EX1 | EX2 | WB | | Clock 1 | ADD4#3 | SUB3#2 | ADD4#1 | | | | Clock 1 | ADD1#3 | ADD1#2 | ADD1#1 | | | | Clock 2 | | | | | | | Clock 2 | | | | | | | Clock 3 | | | | | | | Clock 3 | | | | | | | Clock 4 | | | | | | | Clock 4 | | | | | | | Code A r | unning or | ı design ir | section 7 | .3 above | 1 1 | | Code B r | unning o | ı design iı | section 7 | 7.3 above | | | | IF | ID | EX12 | WB | ] | | | IF | ID | EX12 | WB | | | Clock 1 | ADD4#3 | SUB3#2 | ADD4#1 | | - | | Clock 1 | ADD1#3 | ADD1#2 | ADD1#1 | | | | Clock 2 | | | | | - | | Clock 2 | | | | | | | Clock 3 | | | | | 1 | | Clock 3 | | | | | | | Clock 4 | | | | | 1 | | Clock 4 | | | | | | | One of t | he two ( | lesions i | s alwave | hetter | TRUE / | FAT.C | E Fynl | ain | l | <u> </u> | <u> </u> | L | | One or t | 110 1000 | 1001E110 1 | s aiways | octici. | INOE / | T ALIS | <b>-</b> | u111. | | | | | - 7.5 Multicycle implementation: - 7.5.1 The Datapath below is complete. Complete the state diagram and produce the two outputs, R\_Write and PC EN (draw combinational logic necessary to produce R Write and PC EN). - 7.5.2 The design below corresponds to - (i) the Lab 7 P3 Subpart #1 design (with separate EX1 and EX2 stages) only - (ii) the design in Lab 7 P3 Subpart #2 design (with EX1 and EX2 merged) only - (iii) both the above two designs. - 7.5.3 If an IR (Instruction register) is available, PC can be incremented early. TRUE / FALSE Performance improves if we add IR. TRUE / FALSE Figure for Question 7.5 13<sub>/ Lab 7P3</sub>