# Am386 Microprocessors for Personal Computers Data Book Advanced Micro Devices # **Am386™ Microprocessors** for Personal Computers **Data Book** © 1992 Advanced Micro Devices, Inc. Advanced Micro Devices reserves the right to make changes in its products without notice in order to improve design or performance characteristics. This publication neither states nor implies any warranty of any kind, including but not limited to implied warrants of merchantability or fitness for a particular application. AMD® assumes no responsibility for the use of any circuitry other than the circuitry in an AMD product. The information in this publication is believed to be accurate in all respects at the time of publication, but is subject to change without notice. AMD assumes no responsibility for any errors or omissions, and disclaims responsibility for any consequences resulting from the use of the information included herein. Additionally, AMD assumes no responsibility for the functioning of undescribed features or parameters. #### Trademarks AMD is a registered trademark and Am386 is a trademark of Advanced Micro Devices, Inc. Product names used in this publication are for identification purposes only and may be trademarks of their respective companies. #### Am386 MICROPROCESSORS FOR **PERSONAL COMPUTERS DATA BOOK** | Am386DX/DXL Microprocessor Data Sheet | | . 1 | |---------------------------------------|------|-----| | Am386SX/SXL Microprocessor Data Sheet | . 14 | 41 | Advanced Micro Devices' microprocessors are breaking performance barriers. The cost effective Am386 microprocessors are ideal for both desktop and battery powered portable computers. This data book describes the 32-bit Am386 Microprocessor Family with data sheets on Am386DX/DXL and Am386SX/SXL microprocessors. Am386 microprocessors are the fastest available with 40 MHz on the Am386DXL microprocessor, a 21% performance increase over the 33-MHz version; and 25 MHz on the Am386SXL microprocessor, a 25% performance increase over the 20-MHz version. For portable systems the Am386DXL/SXL microprocessors' true static design allows longer battery life by offering low operating power consumption and a standby mode. We also offer 3-V Am386 microprocessors optimized for notebook applications. Information on these can be found in the AMD® *3-Volt System Logic for Personal Computers Data Book*, publication #17028. Remember, our partnership helps you gain and keep the competitive edge. We are not your competition. Robert G. McConnell Vice President Personal Computer Products Division ## J ## Am386™DX/DXL High-Performance, Low-Power, 32-Bit Microprocessor Advanced Micro Devices #### DISTINCTIVE CHARACTERISTICS #### ■ Ideal for portable PCs - —True static design for long battery life (Am386DXL microprocessor) - —Typical standby Icc < 20 μA at DC (0 MHz) (Am386DXL microprocessor) - --- Typical operating Icc = 210 mA at 33 MHz - —Lower power consumption than Intel i386DX or Intel i386SX - -Small footprint 132-pin PQFP package - Wide range of chip sets and BIOS available to support standby mode capabilities -Performance on demand (0 to 40 MHz) #### ■ Ideal for desktop PCs - -40-, 33-, 25-, and 20-MHz operating speeds - Lower heat dissipation facilitates fan reduction or elimination for cost savings and noise reduction - —Pin-for-pin replacement for Intel i386DX - Compatible with 386DX systems and software - Supports 387DX-compatible math coprocessors - AMD advanced 0.8 micron CMOS technology #### **GENERAL DESCRIPTION** The Am386DX/DXL microprocessor is a high-speed, true static implementation of the Intel i386DX microprocessor. It is ideal for both desktop and battery-powered portable personal computers. For desktop PCs, the Am386DXL microprocessor offers a 21% increase in the maximum operating speed from 33 to 40 MHz. Also, his device offers lower heat dissipation, allowing system designers to remove or reduce the size and cost of the system cooling fan. For portables, the Am386DXL microprocessor's true static design offers longer battery life with low operating power consumption and standby mode. At 33 MHz, this device has 40% lower operating lcc than the Intel i386DX. Standby mode allows the Am386DXL microprocessor to be clocked down to 0 MHz (DC) and retain full register contents. In standby mode, typical current draw is less than 20 $\mu\text{A}$ , nearly a 1000× reduction in power consumption versus the Intel i386DX or Intel i386SX. Additionally, the Am386DXL microprocessor is available in a small footprint 132-pin plastic quad flat pack (PQFP) package. This surface-mount package is 40% smaller than PGA, allowing smaller, lower-cost board designs without the need for a socket. - o Intel i386DX @ 5.0 V - ▲ Am386DX/DXL CPU @ 5.0 V - Am386DXLV CPU @ 3.3 V Note: Inputs at Vcc or Vss. Typical Power Consumption #### **BLOCK DIAGRAM** #### **FUNCTIONAL DESCRIPTION** #### True Static Operation (Am386DXL CPU) The Am386DXL microprocessor incorporates a true static design. Unlike dynamic circuit design, the Am386DXL device eliminates the minimum operating frequency restriction. It may be clocked from its maximum speed of 40 MHz all the way down to 0 MHz (DC). System designers can use this feature to design true 32-bit battery-powered portable PCs with long battery life. #### Standby Mode (Am386DXL CPU) This true static design allows for a standby mode. At any of its operating speeds (40 MHz to DC), the Am386DXL microprocessor will retain its state (i.e., the contents of all of its registers). By shutting off the clock completely, the device enters standby mode. Since power consumption is a function of clock frequency, operating power consumption is reduced as the frequency is lowered. In standby mode, typical current draw is reduced to less than 20 µA at DC. Not only does this feature save battery life, but it also simplifies the design of power-conscious notebook computers in the following ways: - Eliminates the need for software in BIOS to save and restore the contents of registers. - Allows simpler circuitry to control stopping of the clock since the system does not need to know the processor state. #### Lower Operating Icc True static design also allows lower operating $l_{CC}$ when operating at any speed. See the following graph for typical current at operating speeds. #### Performance On Demand The Am386DXL microprocessor retains its state at any speed from 0 MHz (DC) to its maximum operating speed (20, 25, 33, or 40 MHz). With this feature, system designers may vary the operating speed of the system to extend the battery life in portable systems. For example, the system could operate at low speeds during inactivity or polling operations. However, upon interrupt, the system clock can be increased up to its maximum speed. After a user-defined time-out period, the system can be returned to a low (or 0 MHz) operating speed without losing its state. This design maximizes life while achieving optimal performance. ## CONNECTION DIAGRAMS 132-Lead Ceramic Pin Grid Array (PGA) Package — Top Side View | | Α | В | С | D | Е | F | G | Н | J | Κ | L | М | N | Р | _ | |----|----------------------|----------------------|---------------------------|----------------|-----------------|-----------------------------------------|-----------------------------------|-----------------|-----------------------------------------|----------------|-----------------|---------------------|-----------------------------|---------------------|----| | 14 | V∞<br><b>O</b> | V <sub>ss</sub> | BS16 | HOLD | ADS<br>O | V <sub>ss</sub> | V <sub>cc</sub> | D2<br>O | D3 | D4<br>O | D6<br>O | HLDA<br>O | D9<br>O | V <sub>ss</sub> | 14 | | 13 | BE3 | BE2<br>O | BE1 | NA<br>O | NC<br>O | NC F | READY | 7 D1 | $\overset{V_{SS}}{O}$ | D5<br>O | O<br>D8 | V <sub>∞</sub> | D11<br><b>O</b> | D13 | 13 | | 12 | M/IO<br>O | O <sub>O</sub> | V <sub>cc</sub> | V∞<br>O | BE0<br>O | CLK2<br>O | V <sub>∞</sub> | O <sub>O</sub> | V <sub>ss</sub> | D7<br><b>O</b> | V <sub>cc</sub> | D10<br><b>O</b> | D12 | D14<br><b>O</b> | 12 | | 11 | D/C<br>O | V <sub>ss</sub> | V <sub>ss</sub> | | | | | | | | | D15 | D16 | D18 | 11 | | 10 | V <sub>cc</sub> | W/R | O | | | | | | | | | V <sub>ss</sub> | D17 | D19 | 10 | | 9 | Õ | 0 | RESE | | | | | | | | | D20<br><b>O</b> | D21 | D22 | 9 | | 8 | O | 0 | PERE | Q | | | | | | | | V <sub>ss</sub> | D23 | V <sub>∞</sub><br>O | 8 | | 7 | V<br>O | O<br>NIC | NC<br>O | | | | | | | | | V <sub>∞</sub><br>O | V <sub>∞</sub><br>O | D24<br>O | 7 | | 6 | V <sub>ss</sub> | NC<br>O | NC<br>O | | | | | | | | | D28<br>O | D25 | V <sub>ss</sub> | 6 | | 5 | V | V <sub>ss</sub> | V<br>O | | | | | | | | | D31 | D27 | D26<br>O | 5 | | 4 | O<br>A3 | NC<br>O<br>A4 | A2<br>O<br>A6 | 40 | 410 | V | V | 410 | V | A25 | A 0.0 | V <sub>ss</sub> | V <sub>∞</sub><br>O | D29<br>O<br>D30 | 4 | | 3 | O<br>Vss | O<br>A5 | A <sup>0</sup><br>O<br>A7 | A9<br>O<br>A10 | A12<br>O<br>A13 | V <sub>ss</sub><br>O<br>V <sub>ss</sub> | V <sub>cc</sub> O V <sub>cc</sub> | A19<br>O<br>A18 | V <sub>ss</sub><br>O<br>V <sub>ss</sub> | O<br>A22 | A28<br>O<br>A24 | V∞<br>O<br>A29 | V <sub>ss</sub><br>O<br>A31 | O<br>V | 3 | | 2 | O<br>V <sub>ss</sub> | O<br>V <sub>ss</sub> | O<br>A8 | O<br>A11 | O<br>A14 | O<br>A15 | O<br>A16 | O<br>A17 | O<br>A20 | O<br>A21 | O<br>A23 | O<br>A26 | O<br>A27 | O<br>A30 | 2 | | 1 | \ <u>Ö</u> | Ö | õ | Ö | Ö | Õ | õ | Ö | Õ | Ö | Õ | Õ | Ö | <u> </u> | 1 | | | Α | В | С | D | Е | F | G | Н | J | κ | L | М | N | Р | | Note: NC = Not connected; connection of any NC pin may cause a malfunction or incompatability with future shippings of the Am386DX/DXL microprocessor. ## **CONNECTION DIAGRAMS (continued)** 132-Lead Ceramic Pin Grid Array (PGA) Package — Pin Side View Note: NC = Not connected; connection of any NC pin may cause a malfunction or incompatability with future shippings of the Am386DX/DXL microprocessor. ## **PGA Pin Designations (sorted by Functional Grouping)** | Addı | ress | Da | ta | Con | trol | NC | Vcc | V <sub>ss</sub> | |-------------|------------|-------------|------------|-------------|------------|------------|------------|-----------------| | Pin<br>Name | Pin<br>No. | Pin<br>Name | Pin<br>No. | Pin<br>Name | Pin<br>No. | Pin<br>No. | Pin<br>No. | Pin<br>No. | | A2 | C4 | DO | H12 | ADS | E14 | A4 | A1 | <b>A</b> 2 | | A3 | <b>A</b> 3 | D1 | H13 | BE0 | E12 | B4 | A5 | <b>A</b> 6 | | A4 | B3 | D2 | H14 | BE1 | C13 | B6 | A7 | A9 | | A5 | B2 | D3 | J14 | BE2 | B13 | B12 | A10 | B1 | | A6 | C3 | D4 | K14 | BE3 | A13 | C6 | A14 | B5 | | A7 | C2 | D5 | K13 | BS16 | C14 | C7 | C5 | B11 | | A8 | C1 | D6 | L14 | BUSY | B9 | E13 | C12 | B14 | | A9 | D3 | D7 | K12 | CLK2 | F12 | F13 | D12 | C11 | | A10 | D2 | D8 | L13 | D/C | A11 | | G2 | F2 | | A11 | D1 | D9 | N14 | ERROR | A8 | | G3 | F3 | | A12 | E3 | D10 | M12 | HLDA | M14 | | G12 | F14 | | A13 | E2 | D11 | N13 | HOLD | D14 | | G14 | J2 | | A14 | E1 | D12 | N12 | INTR | B7 | | L12 | J3 | | A15 | F1 | D13 | P13 | LOCK | C10 | | M3 | J12 | | A16 | G1 | D14 | P12 | M/IO | A12 | | M7 | J13 | | A17 | H1 | D15 | M11 | NA | D13 | | M13 | M4 | | A18 | H2 | D16 | N11 | NMI | B8 | | N4 | M8 | | A19 | H3 | D17 | N10 | PEREQ | C8 | | N7 | M10 | | A20 | J1 | D18 | P11 | READY | G13 | | P2 | N3 | | A21 | K1 | D19 | P10 | RESET | C9 | | P8 | P6 | | A22 | K2 | D20 | M9 | W/R | B10 | | | P14 | | A23 | L1 | D21 | N9 | | | | | | | A24 | L2 | D22 | P9 | | | | 1 | | | A25 | КЗ | D23 | N8 | | | | | 1 | | A26 | M1 | D24 | P7 | | | | | | | A27 | N1 | D25 | N6 | | | | | | | A28 | L3 | D26 | P5 | | | | | | | A29 | M2 | D27 | N5 | | | | | i | | A30 | P1 | D28 | M6 | | | | | | | A31 | N2 | D29 | P4 | | | | | ] | | | | D30 | P3 | | | | | | | | | D31 | M5 | | | | | | ## **CONNECTION DIAGRAMS (continued) PGA Pin Designations (sorted by Pin Number)** | Pin<br>No. | Pin<br>Name | Pin<br>No. | Pin<br>Name | Pin<br>No. | Pin<br>Name | Pin<br>No. | Pin<br>Name | Pin<br>No. | Pin<br>Name | Pin<br>No. | Pin<br>Name | |-------------|-------------|------------|-------------|------------|-------------|------------|-------------|------------|-------------|------------|-------------| | <b>A</b> 1 | Vcc | B9 | BUSY | D3 | <b>A</b> 9 | H1 | A17 | L13 | D8 | N7 | Vcc | | <b>A</b> 2 | Vss | B10 | W/R | D12 | Vcc | H2 | A18 | L14 | D6 | N8 | D23 | | АЗ | <b>A</b> 3 | B11 | Vss | D13 | NA | НЗ | A19 | M1 | <b>A</b> 26 | N9 | D21 | | A4 | NC | B12 | NC | D14 | HOLD | H12 | D0 | M2 | <b>A</b> 29 | N10 | D17 | | <b>A</b> 5 | Vcc | B13 | BE2 | E1 | A14 | H13 | D1 | МЗ | Vcc | N11 | D16 | | <b>A</b> 6 | Vss | B14 | Vss | E2 | A13 | H14 | D2 | M4 | Vss | N12 | D12 | | <b>A</b> 7 | Vcc | C1 | <b>A8</b> | E3 | A12 | J1 | A20 | M5 | D31 | N13 | D11 | | <b>A</b> 8 | ERROR | C2 | <b>A</b> 7 | E12 | BE0 | J2 | Vss | M6 | D28 | N14 | D9 | | <b>A</b> 9 | Vss | СЗ | <b>A</b> 6 | E13 | NC | J3 | Vss | M7 | Vcc | P1 | <b>A</b> 30 | | <b>A</b> 10 | Vcc | C4 | <b>A</b> 2 | E14 | ADS | J12 | Vss | M8 | Vss | P2 | Vcc | | A11 | D/C | C5 | Vcc | F1 | A15 | J13 | Vss | M9 | D20 | Р3 | D30 | | A12 | M/IO | C6 | NC | F2 | Vss | J14 | D3 | M10 | $V_{ss}$ | P4 | D29 | | A13 | BE3 | C7 | NC | F3 | Vss | K1 | A21 | M11 | D15 | P5 | D26 | | A14 | Vcc | C8 | PEREQ | F12 | CLK2 | K2 | A22 | M12 | D10 | P6 | $V_{ss}$ | | B1 | Vss | C9 | RESET | F13 | NC | КЗ | A25 | M13 | Vcc | P7 | D24 | | B2 | <b>A</b> 5 | C10 | LOCK | F14 | Vss | K12 | D7 | M14 | HLDA | P8 | Vcc | | ВЗ | A4 | C11 | Vss | G1 | A16 | K13 | D5 | N1 | A27 | P9 | D22 | | B4 | NC | C12 | Vcc | G2 | Vcc | K14 | D4 | N2 | A31 | P10 | D19 | | B5 | Vss | C13 | BE1 | G3 | Vcc | L1 | A23 | N3 | Vss | P11 | D18 | | B6 | NC | C14 | BS16 | G12 | Vcc | L2 | A24 | N4 | Vcc | P12 | D14 | | В7 | INTR | D1 | A11 | G13 | READY | L3 | A28 | N5 | D27 | P13 | D13 | | В8 | NMI | D2 | A10 | G14 | Vcc | L12 | Vcc | N6 | D25 | P14 | Vss | #### 132-Lead Plastic Quad Flat Pack (PQFP) Package — Top Side View Notes: Pin 1 is marked for orientation. NC = Not connected; connection of an NC pin may cause a malfunction or incompatibility with future shippings of the Am386DX/DXL microprocessor. #### 132-Lead Plastic Quad Flat Pack (PQFP) Package — Pin Side View Notes: Pin 1 is marked for orientation. NC = Not connected; connection of an NC pin may cause a malfunction or incompatibility with future shippings of the Am386DX/DXL microprocessor. ## **PQFP Pin Designations (sorted by Functional Grouping)** | Addı | ess | Da | ta | Con | trol | NC | Vcc | Vss | |-------------|------------|-------------|------------|-------------|------------|------------|------------|------------| | Pin<br>Name | Pin<br>No. | Pin<br>Name | Pin<br>No. | Pin<br>Name | Pin<br>No. | Pin<br>No. | Pin<br>No. | Pin<br>No. | | A2 | 67 | D0 | 20 | ADS | 27 | 36 | 2 | 1 | | A3 | 68 | D1 | 19 | BE0 | 31 | 37 | 16 | 10 | | A4 | 69 | D2 | 18 | BE1 | 32 | 39 | 22 | 11 | | A5 | 70 | D3 | 17 | BE2 | 33 | 59 | 34 | 21 | | A6 | 71 | D4 | 15 | BE3 | 38 | 60 | 49 | 23 | | A7 | 72 | D5 | 14 | BS16 | 29 | 61 | 56 | 25 | | A8 | 74 | D6 | 13 | BUSY | 46 | 62 | 58 | 35 | | A9 | 75 | D7 | 12 | CLK2 | 24 | 63 | 73 | 44 | | A10 | 76 | D8 | 9 | D/C | 41 | | 85 | 48 | | A11 | 77 | D9 | 7 | ERROR | 47 | | 99 | 51 | | A12 | 78 | D10 | 6 | FLT | 54 | | 106 | 55 | | A13 | 79 | D11 | 5 | HLDA | 8 | | 110 | 57 | | A14 | 81 | D12 | 4 | HOLD | 28 | ĺ | 117 | 64 | | A15 | 82 | D13 | 3 | INTR | 53 | | 123 | 65 | | A16 | 84 | D14 | 131 | LOCK | 42 | ì | 127 | 66 | | A17 | 86 | D15 | 130 | M/ĪŌ | 40 | i e | | 80 | | A18 | 87 | D16 | 129 | NA | 30 | l | | 83 | | A19 | 88 | D17 | 128 | NMI | 52 | | | 90 | | A20 | 89 | D18 | 126 | PEREQ | 50 | | | 91 | | A21 | 93 | D19 | 125 | READY | 26 | | | 92 | | A22 | 94 | D20 | 124 | RESET | 45 | | | 105 | | A23 | 95 | D21 | 121 | W/R | 43 | | | 111 | | A24 | 96 | D22 | 120 | | | | | 114 | | A25 | 97 | D23 | 119 | | | | | 122 | | A26 | 98 | D24 | 118 | | | | | 132 | | A27 | 100 | D25 | 116 | | | | | | | A28 | 101 | D26 | 115 | | | | | | | A29 | 102 | D27 | 113 | | | | ĺ | | | A30 | 103 | D28 | 112 | | | | | l | | A31 | 104 | D29 | 109 | | | | | l | | | | D30 | 108 | | | | | Ì | | | | D31 | 107 | | | | | | ## **CONNECTION DIAGRAMS (continued) PQFP Pin Designations (sorted by Pin Number)** | Pin<br>No. | Pin<br>Name | Pin<br>No. | Pin<br>Name | Pin<br>No. | Pin<br>Name | Pin<br>No. | Pin<br>Name | Pin<br>No. | Pin<br>Name | Pin<br>No. | Pin<br>Name | |------------|-------------|------------|-------------|------------|-------------|------------|-----------------|------------|-----------------|------------|-------------| | 1 | Vss | 23 | Vss | 45 | RESET | 67 | A2 | 89 | <b>A</b> 20 | 111 | Vss | | 2 | Vcc | 24 | CLK2 | 46 | BUSY | 68 | <b>A</b> 3 | 90 | Vss | 112 | D28 | | 3 | D13 | 25 | Vss | 47 | ERROR | 69 | A4 | 91 | Vss | 113 | D27 | | 4 | D12 | 26 | READY | 48 | Vss | 70 | <b>A</b> 5 | 92 | Vss | 114 | Vss | | 5 | D11 | 27 | ADS | 49 | Vcc | 71 | <b>A</b> 6 | 93 | A21 | 115 | D26 | | 6 | D10 | 28 | HOLD | 50 | PEREQ | 72 | <b>A</b> 7 | 94 | A22 | 116 | D25 | | 7 | D9 | 29 | BS16 | 51 | Vss | 73 | Vcc | 95 | A23 | 117 | Vcc | | 8 | HLDA | 30 | NA | 52 | NMI | 74 | <b>A8</b> | 96 | A24 | 118 | D24 | | 9 | D8 | 31 | BE0 | 53 | INTR | 75 | <b>A</b> 9 | 97 | A25 | 119 | D23 | | 10 | Vss | 32 | BE1 | 54 | FLT | 76 | A10 | 98 | A26 | 120 | D22 | | 11 | Vss | 33 | BE2 | 55 | Vss | 77 | A11 | 99 | Vcc | 121 | D21 | | 12 | D7 | 34 | Vcc | 56 | Vcc | 78 | A12 | 100 | <b>A</b> 27 | 122 | Vss | | 13 | D6 | 35 | Vss | 57 | Vss | 79 | A13 | 101 | A28 | 123 | Vcc | | 14 | D5 | 36 | NC | 58 | Vcc | 80 | $V_{\text{ss}}$ | 102 | A29 | 124 | D20 | | 15 | D4 | 37 | NC | 59 | NC | 81 | A14 | 102 | <b>A</b> 30 | 125 | D19 | | 16 | Vcc | 38 | BE3 | 60 | NC | 82 | A15 | 104 | <b>A</b> 31 | 126 | D18 | | 17 | D3 | 39 | NC | 61 | NC | 83 | $V_{\text{ss}}$ | 105 | $V_{\text{ss}}$ | 127 | Vcc | | 18 | D2 | 40 | M/ĪŌ | 62 | NC | 84 | A16 | 106 | Vcc | 128 | D17 | | 19 | D1 | 41 | D/C | 63 | NC | 85 | Vcc | 107 | D31 | 129 | D16 | | 20 | D0 | 42 | LOCK | 64 | Vss | 86 | A17 | 108 | D30 | 130 | D15 | | 21 | Vss | 43 | W/R | 65 | Vss | 87 | A18 | 109 | D29 | 131 | D14 | | 22 | Vcc | 44 | Vss | 66 | Vss | 88 | A19 | 110 | Vcc | 132 | Vss | #### **LOGIC SYMBOL** #### ORDERING INFORMATION #### Standard Products AMD standard products are available in several packages and operating ranges. The order number (Valid Combination) is formed by a combination of the elements below. | Valid Combinations | | | | | | |--------------------|--------------------------|--|--|--|--| | A80386DX/DXL | -40<br>-33<br>-25<br>-20 | | | | | | NG80386DX | -40, -40/F | | | | | | NG80386DX/DXL | –33, –33/F<br>–25, –25/F | | | | | #### Valid Combinations Valid Combinations lists configurations planned to be supported in volume for this device. All speeds may not be available in all package combinations. Consult the local AMD sales office to confirm availability of specific valid combinations and to check on newly released combinations. #### PIN DESCRIPTION #### A31-A2 #### Address Bus (Outputs) Outputs physical memory or port I/O addresses. #### ADS #### Address Status (Active Low; Output) Indicates that a valid bus cycle definition and address (W/R, D/C, M/IO, BEO, BE1, BE2, BE3, and A31–A2) are being driven at the Am386DX/DXL microprocessor pins. #### BE3-BE0 #### **Byte Enables (Active Low: Outputs)** Indicate which data bytes of the data bus take part in a bus cycle. #### **BS16** #### **Bus Size 16 (Active Low; Input)** Allows direct connection of 32-bit and 16-bit data buses. #### BUSY #### **Busy (Active Low; Input)** Signals a busy condition from a processor extension. #### CLK<sub>2</sub> #### Clock (Input) Provides the fundamental timing for the Am386DX/DXL microprocessor. #### D31-D0 #### Data Bus (Inputs/Outputs) Inputs data during memory, I/O, and interrupt acknowledge read cycles and outputs data during memory and I/O write cycles. #### D/C #### Data/Control (Output) A bus cycle definition pin that distinguishes data cycles, either memory or I/O, from control cycles which are: interrupt acknowledge, halt, and instruction fetching. #### **ERROR** #### **Error (Active Low; Input)** Signals an error condition from a processor extension. #### FLT #### Float (Active Low: Input) An input signal which forces all bidirectional and output signals, including HLDA, to the three-state condition. $\overline{\text{FLT}}$ has an internal pull-up resistor, and if it is not used it should be unconnected. #### HLDA #### **Bus Hold Acknowledge (Active High; Output)** Indicates that the Am386DX/DXL microprocessor has surrendered control of its local bus to another bus master. #### HOLD #### **Bus Hold Request (Active High: Input)** Allows another bus master to request control of the local bus. #### INTR #### Interrupt Request (Active High; Input) A maskable input that signals the Am386DX/DXL microprocessor to suspend execution of the current program and execute an interrupt acknowledge function. #### LOCK #### **Bus Lock (Active Low; Output)** A bus cycle definition pin that indicates that other system bus masters are denied access to the system bus while it is active. #### M/IO #### Memory I/O (Output) A bus cycle definition pin that distinguishes memory cycles from input/output cycles. #### $\Delta N$ #### **Next Address (Active Low; Input)** Used to request address pipelining. #### NC #### No Connect Should always remain unconnected. Connection of a NC pin may cause the processor to malfunction or be incompatible with future steppings of the Am386DX/DXL microprocessor. #### NM ## Non-Maskable Interrupt Request #### (Active High; Input) A non-maskable input that signals the Am386DX/DXL microprocessor to suspend execution of the current program and execute an interrupt acknowledge function. #### PERFC #### Processor Extension Request (Active High; Input) Indicates that the processor extension has data to be transferred by the Am386DX/DXL microprocessor. #### READY #### **Bus Ready (Active Low; Input)** Terminates the bus cycle. #### RESET #### Reset (Active High; Input) Suspends any operation in progress and places the Am386DX/DXL microprocessor in a known reset state. #### Vcc #### System Power (Input) Provides the +5-V nominal DC supply input. #### Vss #### System Ground (Input) Provides 0-V connection from which all inputs and outputs are measured. #### W/R #### Write/Read (Output) A bus cycle definition pin that distinguishes write cycles from read cycles. #### **BASE ARCHITECTURE** #### Introduction The Am386DX/DXL microprocessor consists of a central processing unit, a memory management unit, and a bus interface. The central processing unit consists of the execution unit and instruction unit. The execution unit contains the eight 32-bit general purpose registers that are used for both address calculation, data operations, and a 64-bit barrel shifter used to speed shift, rotate, multiply, and divide operations. The multiply and divide logic uses a 1-bit per cycle algorithm. The multiply algorithm stops the iteration when the most significant bits of the multiplier are all zero. This allows typical 32-bit multiplies to be executed in under 1 ms. The instruction unit decodes the instruction op-codes and stores them in the decoded instruction queue for immediate use by the execution unit. The Memory Management Unit (MMU) consists of a segmentation unit and a paging unit. Segmentation allows the managing of the logical address space by providing an extra addressing component, one that allows easy code and data relocatability and efficient sharing. The paging mechanism operates beneath and is transparent to the segmentation process to allow management of the physical address space. Each segment is divided into one or more 4-Kb pages. To implement a virtual memory system, the Am386DX/DXL microprocessor supports full restartability for all page and segment faults. Memory is organized into one or more variable length segments, each up to 4 Gb in size. A given region of the linear address space, a segment, can have attributes associated with it. These attributes include its location, size, type (i.e., stack, code, or data), and protection characteristics. Each task on an Am386DX/DXL microprocessor can have a maximum of 16,381 segments of up to 4 Gb each, thus providing 64 tb (trillion bytes) or virtual memory to each task. The segmentation unit provides four levels of protection for isolating and protecting applications and the operating system from each other. The hardware enforced protection allows the design of system with a high degree of integrity. The Am386DX/DXL microprocessor has two modes of operation: Real Address Mode (Real Mode) and Protected Virtual Address Mode (Protected Mode). In Real Mode, the Am386DX/DXL device operates as a very fast 8086 but with 32-bit extensions, if desired. Real Mode is required primarily to setup the processor for Protected Mode operation. Protected Mode provides address to the sophisticated memory management, paging, and privilege capabilities of the processor. Within Protected Mode, software can perform a task switch to enter into tasks designated as Virtual 8086 Mode tasks. Each such task behaves with 8086 semantics, thus allowing 8086 software (an application program or an entire operating system) to execute. The Virtual 8086 tasks can be isolated and protected from one another and the host Am386DX/DXL microprocessor operating system by the use of paging and the I/O Permission Bitmap. Finally, to facilitate high-performance system hardware designs, the Am386DX/DXL microprocessor bus interface offers address pipelining, dynamic data bus sizing, and direct Byte Enable signals for each byte of the data bus. These hardware features are described fully beginning in the Functional Data section. #### **Register Overview** The Am386DX/DXL microprocessor has 32 register resources in the following categories. - General Purpose Registers - Segment Registers - Instruction Pointer and Flags - Control Registers - System Address Registers - · Debug Registers - · Test Registers The registers are a superset of the 8086, 80186, and 80286 registers, so all 16-bit 80186 and 80286 registers are contained within the 32-bit Am386DX/DXL microprocessor. Figure 1 shows all the Am386DX/DXL microprocessor base architecture registers that include the general address and data registers, the instruction pointer, and the flags register. The contents of these registers are task-specific, so these registers are automatically loaded with a new context upon a task switch operation. The base architecture also includes six directly accessible segments, each up to 4 Gb in size. The segments are indicated by the selector values placed in Am386DX/DXL CPU segment registers of Figure 1. Various selector values can be loaded as a program executes, if desired. The selectors are also task specific, so the segment registers are automatically loaded with new context upon a task switch operation. The other types of registers Control, System Address, Debug, and Test are primarily used by system software. Figure 1. Base Architecture Registers ## **Register Descriptions** #### **General-Purpose Registers** The eight general-purpose registers of 32 bits hold data or address quantities. The general registers, Figure 2, support data operands of 1, 8, 16, 32, and 64 bits and bit fields of 1 to 32 bits. They support address operands of 16 and 32 bits. The 32-bit registers are named EAX, EBX, ECX, EDX, ESI, EDI, EBP, and ESP. The least significant 16 bits of the registers can be accessed separately. This is done by using the 16-bit names of the registers AX, BX, CX, DX, SI, DI, BP, and SP. When accessed as a 16-bit operand, the upper 16 bits of the register are neither used nor changed. Finally, 8-bit operations can individually access the lower byte (bits 7–0) and the higher byte (bits 15–8) of general purpose registers AX, BX, CX, and DX. The lower bytes are named AL, BL, CL, and DL, respectively. The higher bytes are named AH, BH, CH, and DH, respectively. The individual byte accessibility offers additional flexibility for data operations, but is not used for effective address calculation. Figure 2. General Registers and Instruction Pointer #### Instruction Pointer The instruction pointer, Figure 2, is a 32-bit register named EIP. EIP holds the offset of the next instruction to be executed. The offset is always relative to the base of the code segment (CS). The lower 16 bits (bits 15–0) of EIP contain the 16-bit instruction pointer named IP, which is used by 16-bit addressing. #### Flags Register The Flags Register is a 32-bit register named EFLAGS. The defined bits and bit fields within EFLAGS, shown in Figure 3, control certain operations and indicate status of the Am386DX/DXL microprocessor. The lower 16 bits (bits 15–0) of EFLAGS contain the 16-bit flag register named FLAGS, which is most useful when executing 8086 and 80286 code. Note in the following descriptions, set means set to 1 and reset means reset to 0. #### VM (Virtual 8086 Mode, bit 17) The VM bit provides Virtual 8086 Mode within Protected Mode. If set while the Am386DX/DXL microprocessor is in Protected Mode, the Am386DX/DXL microprocessor will switch to Virtual 8086 operation, handling segment loads as the 8086 does, but generating Exception 13 faults on privileged op-codes. The VM bit can be set only in Protected Mode by the IRET instruction (if current privilege level = 0) and by task switches at any privilege level. The VM bit is unaffected by POPF. PUSHF always pushes a 0 in this bit, even if executing in Virtual 8086 Mode. The EFLAGS image pushed during interrupt processing or saved during task switches will contain a 1 in this bit if the interrupted code was executing as a Virtual 8086 task. The RF flag is used in conjunction with the debug register breakpoints. It is checked at instruction boundaries before breakpoint processing. When RF is set, it causes any debug fault to be ignored on the next instruction. RF is then automatically reset at the successful completion of every instruction (no faults are signaled) except the IRET instruction and the POPF instruction. (JMP, CALL, and INT instructions causing a task switch.) These instructions set RF to the value specified by the memory image. For example, at the end of the breakpoint service routine, the IRET instruction can pop an EFLAGS image having the RF bit set and resume the program's execution at the breakpoint address without generating another breakpoint fault on the same location. #### NT (Nested Task, bit 14) This flag applies to Protected Mode. NT is set to indicate that the execution of this task is nested within another task. If set, it indicates that the current nested task's Task State Segment (TSS) has a valid back link to the previous task's TSS. This bit is set or reset by control transfers to other tasks. The value of NT in EFLAGS is tested by the IRET instruction to determine whether to do an inter-task return or an intra-task return. A POPF or an IRET instruction will affect the setting of this bit according to the image popped at any privilege level. #### IOPL (Input/Output Privilege Level, bits 12-13) This two-bit field applies to Protected Mode. IOPL indicates the numerically maximum CPL (current privilege level) value permitted to execute I/O instructions without generating an Exception 13 fault or consulting the I/O Permission Bitmap. It also indicates the maximum CPL value allowing alteration of the IF (INTR Enable Flag) bit when new values are popped into the EFLAGS register. POPF and IRET instruction can alter the IOPL field when executed at CPL=0. Task switches can always alter the IOPL field when the new flag image is loaded from the incoming task's TSS. #### OF (Overflow Flag, bit 11) OD is set if the operation resulted in a signed overflow. Signed overflow occurs when the operation resulted in carry/borrow into the sign bit (highorder bit) of the result but did not result in a carry/ borrow out of the high-order bit or vice-versa. For 8-, 16-, and 32-bit operations, OF is set according to overflow at bits 7, 15, and 31, respectively. #### DF (Direction Flag, bit 10) DF defines whether ESI and/or EDI registers postdecrement or postincrement during the sg instructions. Postincrement occurs if DF is reset. Postdecrement occurs if DF is set. #### IF (INTR Enable Flag, bit 9) The IF flag, when set, allows recognition of external interrupts signaled on the INTR pin. When IF is reset, external interrupts signaled on the INTR are not recognized. IOPL indicates the maximum CPL value allowing alteration of the IF bit when new values are popped into EFLAGS or FLAGS. #### TF (Trap Enable Flag, bit 8) TF controls the generation of Exception 1 trap when single-stepping through code. When TF is set, the Am386DX/DXL microprocessor generates an Exception 1 trap after the next instruction is executed. When TF is reset, Exception 1 traps occur only as a function of the breakpoint addresses loaded into debug register DR3–DR0. #### SF (Sign Flag, bit 7) SF is set if the high-order bit of the result is set; it is reset otherwise. For 8-, 16-, and 32-bit operations, SF reflects the state of bits 7, 15, and 31, respectively. #### ZF (Zero Flag, bit 6) ZF is set if all bits of the result are 0. Otherwise it is reset. #### AF (Auxiliary Carry Flag, bit 4) The Auxiliary Flag is used to simplify the addition and subtraction of packed BCD quantities. AF is set if the operation resulted in a carry out of bit 3 (addition) or a borrow into bit 3 (subtraction). Otherwise, AF is reset. AF is affected by carry out of, or borrow into, bit 3 only; regardless of overall operand length: 8, 16, or 32 bits. #### PF (Parity flags, bit 2) PF is set if the low-order 8 bits of the operation contain an even number of 1s (even parity). PF is reset if the low-order 8 bits have odd parity. PF is a function of only the low-order 8 bits, regardless of operand size. #### CF (Carry Flag, bit 0) CF is set if the operation resulted in a carry out of (addition) or a borrow into (subtraction) the high-order bit. Otherwise, CF is reset. For 8-, 16-, or 32-bit operations, CF is set according to carry/borrow at bits 7, 15, or 31, respectively. Note: 0 indicates "Reserved for Future Use." Do not define; see Section Compatibility. 15021B-006 Figure 3. Flags Registers 15021B-007 Figure 4. Segment Registers and Associated Descriptor Registers #### **Seament Registers** Six 16-bit segment registers hold segment selector values identifying the currently addressable memory segments. Segment registers are shown in Figure 4. In Protected Mode, each segment may range in size from one byte up to the entire linear and physical space of the machine, 4 Gb (2<sup>32</sup>bytes). If a maximum sized segment is used (limit = FFFFFFFH), it should be Dword aligned (i.e., the least two significant bits of the segment base should be zero). This alignment will avoid a segment limit violation (Exception 13) caused by the wrap around. In Real Address Mode, the maximum segment size is fixed at 64 Kb (2<sup>16</sup> bytes). The six segments addressable at any given moment are defined by the segment registers: CS, SS, DS, ES, FS, and GS. The selector in SS indicates the current stack segment; the selectors in DS, ES, FS, and GS indicate the current data segments. #### Segment Descriptor Registers The segment descriptor registers are not programmer visible, yet it is very useful to understand their content. Inside the Am386DX/DXL microprocessor, a descriptor register (programmer invisible) is associated with each programmer-visible segment register, as shown by Figure 4. Each descriptor register holds a 32-bit segment base address, a 32-bit segment limit, and the other necessary segment attributes. When a selector value is loaded into a segment register, the associated descriptor register is automatically updated with the correct information. In Real Address Mode, only the base address is updated directly (by J shifting the selector value four bits to the left), since the segment maximum limit and attributes are fixed in Real Mode. In Protected Mode, the base address, the limit, and the attributes are all updated per the contents of the segment descriptor indexed by the selector. Whenever a memory reference occurs, the segment descriptor register associated with the segment being used is automatically involved with the memory reference. The 32-bit segment base address becomes a component of the linear address calculation, the 32-bit limit is used for the limit-check operation, and the attributes are checked against the type of memory reference requested. #### **Control Registers** The Am386DX/DXL microprocessor has three control registers of 32 bits: CR0, CR2, and CR3 to hold machine state of a global nature (not specific to an individual task). These registers, along with System Address Registers described in the next section, hold machine state that affects all tasks in the system. To access the Control Registers, load and store instructions are defined. ## CRO: Machine Control Register (Includes 80286 Machine Status Word) CR0, shown in Figure 5, contains six defined bits for control and status purposes. The low-order 16 bits of CR0 are also known as the Machine Status Word (MSW) for compatibility with 80286 Protected Mode. LMSW and SMSW instructions are taken as special aliases of the load and store CR0 operations, where only the low-order 16 bits of CR0 are involved. For compatibility with 80286 operating systems, the Am386DX/DXL microprocessor LMSW instructions work in an identical fashion to the LMSW instruction on the 80286 (i.e., it only operates on the low-order 16 bits of CR0 and it ignores the new bits in CR0). New Am386DX/DXL microprocessor operating systems should use the MOV CR0, Reg instruction. The defined CR0 bits are described below. #### PG (Paging Enable, bit 31) The PG bit is set to enable the on-chip paging unit. It is reset to disable the on-chip paging unit. R (Reserved, bit 4) This bit is Reserved for Future Use. When loading CR0 care should be taken to not alter the value of this bit. #### TS (Task Switched, bit 3) TS is automatically set whenever a task switch operation is performed. If TS is set, a coprocessor ESCape op-code will cause a Coprocessor Not Available trap (Exception 7). The trap handler typically saves a 387DX math coprocessor context belonging to a previous task, loads a 387DX math coprocessor state belonging to the current task, and clears the TS bit before returning to the faulting coprocessor op-code. #### EM (Emulate Coprocessor, bit 2) The Emulate coprocessor bit is set to cause all coprocessor op-codes to generate a Coprocessor Not Available fault (Exception 7). It is reset to allow coprocessor op-codes to be executed on an actual 387DX math coprocessor (this is the default case after reset). Note that the WAIT op-code is not affected by the EM bit setting. #### MP (Monitor Coprocessor, bit 1) The MP bit is used in conjunction with the TS bit to determine if the WAIT op-code will generate a Coprocessor Not Available fault (Exception 7) when TS=1. When both MP=1 and TS=1, the WAIT op-code generates a trap. Otherwise, the WAIT op-code does not generate a trap. Note that TS is automatically set whenever a task switch operation is performed. #### PE (Protection Enable, bit 0) The PE bit is set to enable the Protected Mode. If PE is reset, the processor operates again in Real Mode. PE may be set by loading MSW or CR0. PE can be reset only by a load into CR0. Resetting the PE bit is typically part of a longer instruction sequence needed for proper transition from Protected Mode to Real Mode. Note that for strict 80286 compatibility, PE cannot be reset by the LMSW instruction. Note: 0 indicates "Reserved for Future Use." Do not define; see Section Compatibility. Figure 5. Control Register 0 CR1: Reserved CR1 is reserved for future processors. #### CR2: Page Fault Linear Address CR2, shown in Figure 6, holds the 32-bit linear address that caused the last page fault detected. The error code pushed onto the page fault handler's stack when it is invoked provides additional status information on this page fault. #### CR3: Page Directory Base Address CR3, shown in Figure 6, contains the physical base address of the page directory table. The Am386DX/DXL microprocessor page directory table is always page-aligned (4-Kb aligned). Therefore, the lowest 12 bits of CR3 are ignored when written and they store as undefined. A task switch through a TSS that changes the value in CR3, or an explicit load into CR3 with any value, will invalidate all cached page table entries in the paging unit cache. Note that if the value in CR3 does not change during the task switch, the cached page table entries are not flushed. #### **System Address Registers** Four special registers are defined to reference the tables or segments supported by the 80286 CPU and Am386DX/DXL microprocessor protection model. These tables or segments are: GDT (Global Descriptor Table) IDT (Interrupt Descriptor Table) LDT (Local Descriptor Table) TSS (Task State Segment) The addresses of these tables and segments are stored in special registers, the System Address and System Segment Registers illustrated in Figure 7. These registers are named GDTR, IDTR, LDTR, and TR, respectively. The Protected Mode Architecture section describes the use of these registers. #### GDTR and IDTR These registers hold the 32-bit linear base address and 16-bit limit of the GDT and IDT, respectively. The GDT and IDT segments, since they are global to all tasks in the system, are defined by 32-bit linear addresses (subject to page translation if paging is enabled) and 16-bit limit values. #### LDTR and TR These registers hold the 16-bit selector for the LDT descriptor and the TSS descriptor, respectively. The LDT and TSS segments, since they are task-specific segments, are defined by selector values stored in the system segment registers. Note that a segment descriptor register (programmer-invisible) is associated with each system segment register. Note: 0 indicates "Reserved for Future Use." Do not define; see Section Compatibility. Figure 6. Control Registers 2 and 3 15021B-009 Figure 7. System Address and System Segment Registers #### **Debug and Test Registers** **Debug Registers**: The six programmer accessible debug registers provide on-chip support for debugging. Debug Registers DR3–DR0 specify the four linear breakpoints. The Debug Control Register DR7 is used to set the breakpoints, and the Debug Status Register DR6 displays the current state of the breakpoints. The use of the debug registers is described in the Debugging Support section. Test Registers: Two registers are used to control the testing of the RAM/CAM (Content Addressable Memories) in the Translation Look-Aside Buffer portion of the Am386DX/DXL microprocessor. TR6 is the command test register, and TR7 is the data register that contains the data of the Translation Look-Aside buffer test. Their use is discussed in the Testability section. Figure 8 shows the Debug and Test registers. | Debug Registers | • | |------------------------------------|-------------| | 31 | | | Linear Breakpoint Address ( | DR0 | | Linear Breakpoint Address 1 | DR1 | | Linear Breakpoint Address 2 | DR2 | | Linear Breakpoint Address 3 | DR3 | | Reserved for Future use. Do not of | define. DR4 | | Reserved for Future use. Do not of | define. DR5 | | Breakpoint Status | DR6 | | Breakpoint Control | DR7 | | | | | 31 | Test Registers (For Page Cache) | 1 | |----|---------------------------------|-----| | | Test Control | TR6 | | | Test Status | TR7 | 15021B-011 Figure 8. Debug and Test Registers #### **Register Accessibility** There are a few differences regarding the accessibility of the registers in Real and Protected Mode. Table 1 summarizes these differences. See the Protected Mode Architecture section for further details. #### Compatibility ## VERY IMPORTANT NOTE: COMPATIBILITY WITH FUTURE PROCESSORS In the preceding register descriptions, note certain Am386DX/DXL microprocessor register bits are Reserved for Future Use. When reserved bits are called out, treat them as fully undefined. This is essential for software compatibility with future processors! Follow the guidelines below: - Do not depend on the state of any undefined bits when testing the values of defined register bits. Mask them out when testing. - 2. Do not depend on the state of any undefined bits when storing them to memory or another register. - 3. Do not depend on the ability to retain information written into any undefined bits. - 4. When loading registers, always load the undefined bits as zeros. - 5. However, registers that have been previously stored may be reloaded without masking. Depending upon the values of undefined register bits will make your software dependent upon the unspecified Am386DX/DXL microprocessor handling of these bits. Depending on undefined values risks making software incompatible with future processors that define usages for the Am386DX/DXL CPU undefined bits. Avoid any software dependence upon the state of undefined Am386DX/DXL CPU register bits. Table 1. Register Usage | | | e In<br>Mode | Use<br>Protecte | e In<br>ed Mode | Use In<br>Virtual 8086 Mode | | |-------------------|------|--------------|-----------------|-----------------|-----------------------------|-------| | Register | Load | Store | Load | Store | Load | Store | | General Registers | Yes | Yes | Yes | Yes | Yes | Yes | | Segment Registers | Yes | Yes | Yes | Yes | Yes | Yes | | Flag Registers | Yes | Yes | Yes | Yes | IOPL | IOPL | | Control Registers | Yes | Yes | PL=0 | PL = 0 | No | Yes | | GDTR | Yes | Yes | PL=0 | Yes | No | Yes | | IDTR | Yes | Yes | PL=0 | Yes | No | Yes | | LDTR | No | No | PL=0 | Yes | No | No | | TR | No | No | PL=0 | Yes | No | No | | Debug Control | Yes | Yes | PL=0 | PL = 0 | No | No | | Test Registers | Yes | Yes | PL=0 | PL = 0 | No | No | Notes: PL = 0: The registers can be accessed only when the current privilege level is zero. IOPL: The PUSHF and POPF instructions are made I/O Privilege Level sensitive in Virtual 8086 Mode. #### Instruction Set #### Instruction Set Overview The instruction set is divided into nine categories of operations. Data Transfer Arithmetic Shift/Rotate String Manipulation Bit Manipulation Control Transfer High Level Language Support Operating System Support Processor Control These Am386DX/DXL microprocessor instructions are listed in Table 2. All Am386DX/DXL microprocessor instructions operate on either 0, 1, 2, or 3 operands where an operand resides in a register in the instruction itself or in memory. Most zero operand instructions (e.g., CLI, STI) take only one byte. One operand instructions generally are two bytes long. The average instruction is 3.2-bytes long. Since the Am386DX/DXL device has a 16-byte instruction queue, an average of 5 instructions will be prefetched. The use of two operands permits the following types of common instructions. Register to Register Memory to Register Immediate to Register Register to Memory Immediate to Memory The operands can be either 8-, 16-, or 32-bits long. As a general rule, when executing code written for the Am386DX/DXL microprocessor (32-bit code), operands are 8 or 32 bits; when executing existing 80286 or 8086 code (16-bit code), operands are 8 or 16 bits. Prefixes can be added to instructions that override the default length of the operands (i.e., use 32-bit operands for 16-bit code or 16-bit operands for 32-bit code). #### **Addressing Modes** #### **Addressing Modes Overview** The Am386DX/DXL microprocessor provides a total of 11 addressing modes for instructions to specify operands. The addressing modes are optimized to allow the efficient execution of high-level languages such as C and FORTRAN, and they cover the vast majority of data references needed by high-level languages. #### **Register and Immediate Modes** Two of the addressing modes provide for instructions that operate on register or immediate operands: Register Operand Mode: The operand is located in one of the 8-, 16-, or 32-bit general registers. Immediate Operand Mode: The operand is included in the instruction as part of the op-code. #### 32-Bit Memory Addressing Modes The remaining nine modes provide a mechanism for specifying the effective address of an operand. The linear address consists of two components: the segment base address and an effective address. The effective address is calculated by using combinations of the following four address elements. **Displacement:** An 8- or 32-bit immediate value following the instruction. Base: The contents of any general-purpose register. The Base registers are generally used by compilers to point to the start of the local variable area. Index: The contents of any general-purpose register except for ESP. The Index registers are used to access the elements of an array, or a string of characters. **Scale**: The index register's value can be multiplied by a scale factor of either 1, 2, 4, or 8. Scaled index mode is especially useful for accessing arrays or structures. Combinations of these four components make up the nine additional addressing modes. There is no performance penalty for using any of these addressing combinations, since the effective address calculation is pipelined with the execution of other instructions. The one exception is the simultaneous use of Base and Index components that requires one additional clock. As shown in Figure 9, the effective address (EA) of an operand is calculated according to the following formula. $\mathsf{EA} = \mathsf{Base} \; \mathsf{Reg} + (\mathsf{Index} \; \mathsf{Reg} \cdot \mathsf{Scaling}) + \mathsf{Displacement}$ Direct Mode: The operand's offset is contained as part of the instruction as an 8-, 16-, or 32-bit displacement. #### Example: INC Word PTR [500] Register Indirect Mode: A Base register contains the address of the operand. #### Example: MOV [ECX], EDX Based Mode: A Base register's contents is added to a Displacement to form the operands offset. Example: MOV ECX, [EAX + 24] #### Table 2. Am386DX/DXL Microprocessor Instructions #### Table 2a. Data Transfer | | General Purpose | | | | | | |-------|----------------------------------------------|--|--|--|--|--| | MOV | Move operand | | | | | | | PUSH | Push operand onto stack | | | | | | | POP | Pop operand off stack | | | | | | | PUSHA | Push all registers on stack | | | | | | | POPA | Pop all registers off stack | | | | | | | XCHG | Exchange operand register | | | | | | | XLAT | Translate | | | | | | | | Conversion | | | | | | | MOVZX | Move byte or Word, Dword with zero extension | | | | | | | MOVSX | Move byte or Word, Dword, sign extended | | | | | | | CBW | Convert byte to Word, or Word to Dword | | | | | | | CWD | Convert Word to Dword | | | | | | | CWDE | Convert Word to Dword extended | | | | | | | CDQ | Convert Dword to Qword | | | | | | | | Input/Output | | | | | | | IN | Input operand from I/O space | | | | | | | OUT | Output operand to I/O space | | | | | | | | Address Object | | | | | | | LEA | Load effective address | | | | | | | LDS | Load pointer into D segment register | | | | | | | LES | oad pointer into E segment register | | | | | | | LFS | Load pointer into F segment register | | | | | | | LGS | Load pointer into G segment register | | | | | | | LSS | Load pointer into S (Stack) segment register | | | | | | | | Flag Manipulation | | | | | | | LAHF | Load A register from Flags | | | | | | | SAHF | Store A register in Flags | | | | | | | PUSHF | Push flags onto stack | | | | | | | POPF | Pop flags off stack | | | | | | | PUSHF | Push EFLAGS onto stack | | | | | | | POPFD | Pop EFLAGS off stack | | | | | | | CLC | Clear Carry Flag | | | | | | | CLD | Clear Direction Flag | | | | | | | СМС | Complement Carry Flag | | | | | | | STC | Set Carry Flag | | | | | | | STD | Set Direction Flag | | | | | | #### **Table 2b. Arithmetic Instructions** | | Addition | | | | | |------|----------------------------------|--|--|--|--| | ADD | Add operands | | | | | | ADC | Add with carry | | | | | | INC | Increment operand by 1 | | | | | | AAA | ASCII adjust for addition | | | | | | DAA | Decimal adjust for addition | | | | | | | Subtraction | | | | | | SUB | Subtract operands | | | | | | SBB | Subtract with borrow | | | | | | DEC | Decrement operand by 1 | | | | | | NEG | Negate operand | | | | | | CMP | Compare operands | | | | | | DAS | Decimal adjust for subtraction | | | | | | AAS | ASCII adjust for subtraction | | | | | | | Multiplication | | | | | | MUL | Multiply Double/Single Precision | | | | | | IMUL | Integer multiply | | | | | | AAM | ASCII adjust after multiply | | | | | | | Division | | | | | | DIV | Divide unsigned | | | | | | IDIV | Integer divide | | | | | | AAD | ASCII adjust before division | | | | | | | | | | | | #### **Table 2c. String Instructions** | Move byte or Word, Dword string | |------------------------------------| | Input string from I/O space | | Output string to I/O space | | Compare byte or Word, Dword string | | Scan Byte or Word, Dword string | | Load byte or Word, Dword string | | Store byte or Word, Dword string | | Repeat | | Repeat while equal/zero | | Repeat while not equal/not zero | | | #### Table 2. Am386DX/DXL Microprocessor Instructions (continued) #### **Table 2d. Logical Instructions** | Logicals | | | | | |---------------|---------------------------------|--|--|--| | NOT | "NOT" operand | | | | | AND | "AND" operands | | | | | OR | "Inclusive OR" operands | | | | | XOR | "Exclusive OR" operands | | | | | TEST | "Test" operands | | | | | | Shifts | | | | | SHL/SHR | Shift logical left or right | | | | | SAL/SAR | Shift arithmetic left or right | | | | | SHLD/<br>SHRD | Double shift left or right | | | | | Rotates | | | | | | ROL/ROR | Rotate left/right | | | | | RCL/RCR | Rotate through carry left/right | | | | #### **Table 2e. Bit Manipulation Instructions** | Single Bit Instructions | | | | |-------------------------|-------------------------|--|--| | ВТ | Bit Test | | | | BTS | Bit Test and Set | | | | BTR | Bit Test and Reset | | | | BTC | Bit Test and Complement | | | | BSF | Bit Scan Forward | | | | BSR | Bit Scan Reverse | | | #### **Table 2f. Program Control Instructions** | Conditional Transfers | | | | |-----------------------|------------------------------------|--|--| | SETCC | Set byte equal to condition code | | | | JA/JNBE | Jump if above/not below nor equal | | | | JAE/JNB | Jump if above or equal/not below | | | | JB/JNAE | Jump if below/not above nor equal | | | | JBE/JNA | Jump if below or equal/not above | | | | JC | Jump if carry | | | | JE/JZ | Jump if equal/zero | | | | JG/JNLE | Jump if greater/not less nor equal | | | | JGE/JNL | Jump if greater or equal/not less | | | | JL/JNGE | Jump if less/not greater nor equal | | | | JLE/JNG | Jump if less or equal/not greater | | | | JNC | Jump if not carry | | | | JNE/JNZ | Jump if not equal/not zero | | | | JNO | Jump if not overflow | | | | JNP/JPO | Jump if not parity/parity odd | | | | JNS | Jump if not sign | | | | JO | Jump if overflow | | | | JP/JPE | Jump if parity/parity even | | | | JS | Jump if sign | | | ## Table 2f. Program Control Instructions (continued) | Unconditional Transfers | | | | |----------------------------|--|--|--| | Call procedure/task | | | | | Return from procedure | | | | | Jump | | | | | Iteration Controls | | | | | Loop | | | | | Loop if equal/zero | | | | | Loop if not equal/not zero | | | | | JUMP if register CX = 0 | | | | | Interrupts | | | | | Interrupt | | | | | Interrupt if overflow | | | | | Return from interrupt/task | | | | | Clear interrupt enable | | | | | Set interrupt enable | | | | | | | | | #### Table 2g. High Level Language Instructions | BOUND | Check array bounds | |-------|----------------------------------------------| | ENTER | Setup parameter block for entering procedure | | LEAVE | Leave procedure | #### Table 2h. Protection Model | SGDT | Store global descriptor table | |---------------|-------------------------------------------------| | SIDT | Store interrupt descriptor table | | STR | Store task register | | SLDT | Store local descriptor table | | LGDT | Load global descriptor table | | LIDT | Load interrupt descriptor table | | LTR | Load task register | | LLDT | Load local descriptor table | | ARPL | Adjust requested privilege level | | LAR | Load access rights | | LSL | Load segment limit | | VERR/<br>VERW | Verify segment for reading or writing | | LMSW | Load machine status word (lower 16 bits of CR0) | | SMSW | Store machine status word | | | | #### **Table 2i. Processor Control Instructions** | HLT | Halt | |------|-------------------------| | WAIT | Wait until BUSY negated | | ESC | Escape | | LOCK | Lock Bus | Index Mode: An Index register's contents is added to a Displacement to form the operands offset. #### Example: ADD EAX, TABLE [ESI] Scaled Index Mode: An Index register's contents is multiplied by a scaling factor that is added to a Displacement to form the operands offset. #### Example: IMUL EBX, TABLE [ESI • 4], 7 Based Index Mode: The contents of a Base register is added to the contents of an Index register to form the effective address of an operand. #### Example: MOV EAX, [ESI] [EBX] Based Scaled Index Mode: The contents of an Index register is multiplied by a Scaling factor and the result is added to the contents of a Base register to obtain the operands offset. Example: MOV ECX, [EDX • 8] [EAX] Based Index Mode with Displacement: The contents of an Index Register and a Base register's contents and a Displacement are all summed together to form the operand offset. #### Example: ADD EDX, [ESI] [EBP+00FFFF0H] Based Scaled Index Mode with Displacement: The contents of an Index register are multiplied by a Scaling factor; the result is added to the contents of a Base register and a Displacement to form the operand's offset. Example: MOV EAX, LOCALTABLE[EDI • 4] [EBP + 80] Figure 9. Addressing Mode Calculations #### Differences Between 16- and 32-Bit Addresses In order to provide software compatibility with the 80286 and the 8086, the Am386DX/DXL microprocessor can execute 16-bit instructions in Real and Protected Modes. The processor determines the size of the instructions it is executing by examining the D bit in the CS segment descriptor. If the D bit is 0 then all operand lengths and effective addresses are assumed to be 16 bits long. If the D bit is 1 then the default length for operands and addresses is 32 bits. In Real Mode, the default size for operands and addresses is 16 bits. Regardless of the default precision of the operands or addresses, the Am386DX/DXL microprocessor is able to execute either 16- or 32-bit instructions. This is specified via the use of override prefixes. Two prefixes, the Operand Size Prefix and the Address Length Prefix, override the value of the D bit on an individual instruction basis. **Example:** The processor is executing in Real Mode and the programmer needs to access the EAX registers. The assembler code for this might be MOV EAX, 32-bit MEMORYOP. An assembler automatically determines that an Operand Size Prefix is needed and generates it. Example: The D bit is 0 and the programmer wishes to use Scaled Index addressing mode to access an array. The Address Length Prefix allows the use of MOV DX, TABLE [ESI • 2]. The assembler uses an Address Length Prefix, since with D = 0, the default addressing mode is 16 bits. **Example**: The D bit is 1 and the program wants to store a 16-bit quantity. The Operand Length Prefix is used to specify only a 16-bit value: MOV MEM16, DX. The Operand Length and Address Length prefixes can be applied separately or in combination to any instruction. The Address Length Prefix does not allow addresses over 64 Kb to be accessed in Real Mode. A memory address exceeding FFFFH will result in a General Protection Fault. An Address Length Prefix only allows the use of the additional Am386DX/DXL microprocessor addressing modes. When executing 32-bit code, the Am386DX/DXL micro-processor uses either 8- or 32-bit displacements, and any register can be used as base or index registers. When executing 16-bit code, the displacements are either 8 or 16 bits, and the base and index register conform to the 80286 model. Table 3 illustrates the differences. #### **Data Types** The Am386DX/DXL microprocessor supports all data types commonly used in high-level languages. Bit: A single bit quantity. Bit Field: A group of up to 32 contiguous bits that spans a maximum of four bytes. Bit String: A set of contiguous bits on the Am386DX/DXL microprocessor bit strings can be up to 4 Gb long. Byte: A signed 8-bit quantity. Unsigned Byte: An unsigned 8-bit quantity. Integer (Word): A signed 16-bit quantity. Long Integer (Double Word): A signed 32-bit quantity. All operations assume a 2's complement representation. Unsigned Integer (Word): An unsigned 16-bit quantity. Unsigned Long Integer (Double Word): An unsigned 32-bit quantity. Signed Quad Word: A signed 64-bit quantity. Unsigned Quad Word: An unsigned 64-bit quantity. Offset: A 16- or 32-bit offset only quantity that indirectly references another memory location. Pointer: A full pointer which consists of a 16-bit segment selector and either a 16- or 32-bit offset. Char: A byte representation of an ASCII alphanumeric or control character. String: A contiguous sequence of bytes, words, or Dwords. A string may contain between 1 byte and 4 Gb. BCD: A byte (unpacked) representation of decimal digits 0-9. Packed BCD: A byte (packed) representation of two decimal digits 0–9 storing one digit in each nibble. When the Am386DX/DXL microprocessor is coupled with a 387DX math coprocessor then the following common floating point types are supported. Floating Point: A signed 32-, 64-, or 80-bit real number representation. Floating point numbers are supported by a 387DX compatible math coprocessor. Figure 10 illustrates the data types supported by the Am386DX/DXL microprocessor and a 387DX compatible math coprocessor. Table 3. Base and Index Registers for 16- and 32-Bit Addresses | | 16-Bit Addressing | 32-Bit Addressing | |------------------------------|-----------------------|-----------------------------------| | Base Register | BX, BP | Any 32-bit GP Register | | Index Register | SI, DI | Any 32-bit GP Register Except ESP | | Scale Factor<br>Displacement | None<br>0, 8, 16 bits | 1, 2, 4, 8<br>0, 8, 32 bits | Figure 10. Supported Data Types <sup>\*</sup>Supported by 387DX-compatible math coprocessor. Figure 10. Supported Data Types (continued) #### **Memory Organization** #### Introduction Memory on the Am386DX/DXL microprocessor is divided up into 8-bit quantities (Bytes), 16-bit quantities (Words), and 32-bit quantities (Dword). Words are stored in two consecutive bytes in memory with the low-order byte at the lowest address, the high-order byte at the highest address. Dwords are stored in four consecutive bytes in memory with the low-order byte at the lowest address, the high-order byte at the highest address. The address of a word or Dword is the byte address of the low-order byte. In addition to these basic data types, the Am386DX/DXL microprocessor supports two larger units of memory: pages and segments. Memory can be divided up into one or more variable length segments, which can be swapped to disk or shared between programs. Memory can also be organized into one or more 4-Kb pages. Finally, both segmentation and paging can be combined, gaining the advantages of both systems. The Am386DX/DXL microprocessor supports both pages and segments in order to provide maximum flexibility to the system designer. Segmentation and paging are complementary. Segmentation is useful for organizing memory in logical modules, and as such is a tool for the application programmer, while pages are useful for the system programmer for managing the physical memory of a system. #### **Address Spaces** The Am386DX/DXL microprocessor has three distinct address spaces: logical, linear, and physical. A logical address (also known as a virtual address) consists of a selector and an offset. A selector is the contents of a segment register. An offset is formed by summing all of the addressing components (Base, Index, Displacement) discussed in Section Memory Address Modes into an effective address. Since each task on Am386DX/DXL CPU has a maximum of 16K (2<sup>14</sup> –1) selectors, and offsets can be 4 Gb (2<sup>32</sup> bits), this gives a total of 2<sup>46</sup> bits or 64 tb of logical address space per task. The programmer sees this virtual address space. The segmentation unit translates the logical address space into a 32-bit linear address space. If the paging unit is not enabled then the 32-bit linear address corresponds to the physical address. The paging unit translates the linear address space into the physical address space. The physical address is what appears on the address pins. The primary difference between Real Mode and Protected Mode is how the segmentation unit performs the translation of the logical address into the linear address. In Real Mode, the segmentation unit shifts the selector left four bits and adds the result to the offset to form the linear address. While in Protected Mode, every selector has a linear base address associated with it. The linear base address is stored in one of two operating system tables (i.e., the Local Descriptor Table or Global Descriptor Table). The selector's linear base address is added to the offset to form the final linear address. Figure 11 shows the relationship between the various address spaces. Figure 11. Address Translation #### Segment Register Usage The main data structure used to organize memory is the segment. On the Am386DX/DXL microprocessor, segments are variable sized blocks of linear addresses that have certain attributes associated with them. There are two main types of segments: code and data. The segments are of variable size and can be as small as 1 byte or as large as 4 Gb (2<sup>32</sup> bytes). In order to provide compact instruction encoding and increase processor performance, instructions do not need to explicitly specify which segment register is used. A default segment register is automatically chosen according to the rules of Table 4 (Segment Register Selection Rules). In general, data references use the selector contained in the DS register; Stack references use the SS register; and Instruction fetches use the CS register. The contents of the Instruction Pointer provides the offset. Special segment override prefixes allow the explicit use of a given segment register, and override the implicit rules listed in Table 4. The override prefixes also allow the use of the ES, FS, and GS segment registers. There are no restrictions regarding the overlapping of the base addresses of any segments. Thus, all 6 segments could have the base address set to zero and create a system with a 4-Gb linear address space. This creates a system where the virtual address space is the same as the linear address space. Further details of segmentation are discussed in Section Protected Mode Architecture. #### I/O Space The Am386DX/DXL microprocessor has two distinct physical address spaces: Memory and I/O. Generally, peripherals are placed in I/O space although the Am386DX/DXL CPU also supports memory-mapped peripherals. The I/O space consists of 64 Kb and can be divided into 64K 8-bit ports, 32K 16-bit ports, or 16K 32-bit ports, or any combination of ports that add up to less than 64 Kb. The 64Kb I/O address space refers to physical memory rather than linear address since I/O instructions do not go through the segmentation or paging hardware. The M/TO pin acts as an additional address line, thus allowing the system designer to easily determine which address space the processor is accessing. The I/O ports are accessed via the IN and OUT I/O instructions, with the port address supplied as an immediate 8-bit constant in the instruction or in the DX register. All 8- and 16-bit port addresses are zero extended on the upper address lines. The I/O instructions cause the $M/\overline{IO}$ pin to be driven Low. I/O port addresses 00F8H through 00FFH are reserved. #### Interrupts #### Interrupts and Exceptions Interrupts and exceptions alter the normal program flow, in order to handle external events, to report errors or exceptional conditions. The difference between interrupts and exceptions is that interrupts are used to **Table 4. Segment Register Selection Rules** | Type of Memory Reference | Implied (Default) Segment Use | Segment Override Prefixes Possible | |----------------------------------------------------------------------------------------|----------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | Code Fetch | CS | None | | Destination of PUSH, PUSHF, INT, CALL, PUSHA Instructions | ss | None | | Source of POP, POPA, POPF, IRET, RET Instructions | SS | None | | Destination of STOS, MOVS, REP<br>STOS, REP MOVS Instructions<br>(DI is Base Register) | ES | None | | Other Data References with<br>Effective Address Using Base<br>Register of: | | | | [EAX] [EBX] [ECX] [EDX] [ESI] [ESI] [EBP] [ESP] | DS<br>DS<br>DS<br>DS<br>DS<br>DS<br>SS | CS, SS, ES, FS, GS | handle asynchronous external events while exceptions handle instruction faults. Although a program can generate a software interrupt via an INT n instruction, the processor treats software interrupts as exceptions. Hardware interrupts occur as the result of an external event and are classified into two types: maskable or non-maskable. Interrupts are serviced after the execution of the current instruction. After the interrupt handler is finished servicing the interrupt, execution proceeds with the instruction immediately after the interrupted instruction. The differences between the interrupts are discussed in the Maskable Interrupt and Non-Maskable Interrupt sections. Exceptions are classified as faults, traps, or aborts depending on the way they are reported, and whether or not restart of the instruction causing the exception is supported. Faults are exceptions that are detected and serviced before the execution of the faulting instruction. A fault occurs in a virtual memory system when the processor references a page or a segment that is not present. The operating system fetches the page or segment from disk, and then the Am386DX/DXL microprocessor restarts the instruction. Traps are exceptions that are reported immediately after the execution of the instruction that caused the problem. User defined interrupts are examples of traps. Aborts are exceptions that do not permit the precise location of the instruction causing the exception to be determined. Aborts are used to report severe errors, such as a hardware error or illegal values in system tables. Thus, when an interrupt service routine has been completed, execution proceeds from the instruction immediately following the interrupted instruction. On the other hand, the return address from an exception fault routine will always point at the instruction causing the exception and include any leading instruction prefixes. Table 5 summarizes the possible interrupts for the Am386DX/DXL microprocessor and shows where the return address points. The Am386DX/DXL microprocessor has the ability to handle up to 256 different interrupts/exceptions. In order to service the interrupts, a table with up to 256 interrupt vectors must be defined. The interrupt vectors are simply pointers to the appropriate interrupt service routine. In Real Mode (see Section Real Mode Introduction), the vectors are 4 byte quantities, a Code Segment plus a 16-bit offset; in Protected Mode, the interrupt vectors are 8 byte quantities that are put in an Interrupt Descriptor Table (see Section Introduction). Of the 256 possible interrupts, 32 are Reserved for Future Use, the remaining 224 are free to be used by the system designer. **Table 5. Interrupt Vector Assignments** | Function | Interrupt<br>Number | Instructions Which Can Cause Exceptions | Return Address<br>Points to<br>Faulting<br>Instruction | Type | |-----------------------------|---------------------|------------------------------------------------|--------------------------------------------------------|-------| | Divide Error | 0 | DIV, IDIV | Yes | FAULT | | Debug Exception | 1 | Any instruction | Yes | TRAP* | | NMI Interrupt | 2 | INT 2 or NMI | No | NMI | | One Byte Interrupt | 3 | INT | No | TRAP | | Interrupt on Overflow | 4 | INTO | No | TRAP | | Array Bounds Check | 5 | BOUND | Yes | FAULT | | Invalid Op-Code | 6 | Any illegal instruction | Yes | FAULT | | Device Not Available | 7 | ESC, WAIT | Yes | FAULT | | Double Fault | 8 | Any instruction that can generate an Exception | | ABORT | | Coprocessor Segment Overrun | 9 | ESC | No | ABORT | | Invalid TSS | 10 | JMP, CALL, IRET, INT | Yes | FAULT | | Segment Not Present | 11 | Segment register instructions | Yes | FAULT | | Stack Fault | 12 | Stack references | Yes | FAULT | | General Protection Fault | 13 | Any memory reference | Yes | FAULT | | Page Fault | 14 | Any memory access or code fetch | Yes | FAULT | | Reserved for Future Use | 15 | | | | | Coprocessor Error | 16 | ESC, WAIT | Yes | FAULT | | Reserved for Future Use | 17–31 | | | | | Two Byte Interrupt | 0–255 | INT n | No | TRAP | <sup>\*</sup>Some debug exceptions may report both traps on the previous instruction and faults on the next instruction. ## AMD ## Interrupt Processing When an interrupt occurs the following actions happen. - First, the current program address and the Flags are saved on the stack to allow resumption of the interrupted program. - Next, an 8-bit vector is supplied to the Am386DX/ DXL microprocessor that identifies the appropriate entry in the interrupt table. The table contains the starting address of the interrupt service routine. Then, the user supplied interrupt service routine is executed. - Finally, when an IRET instruction is executed the old processor state is restored and program execution resumes at the appropriate instruction. The 8-bit interrupt vector is supplied to the Am386DX/DXL microprocessor in several different ways: exceptions supply the interrupt vector internally; software INT instructions contain or imply the vector; maskable hardware interrupts supply the 8-bit vector via the interrupt acknowledge bus sequence. Non-Maskable hardware interrupts are assigned to interrupt vector 2. ## Maskable Interrupt Maskable interrupts are the most common way the Am386DX/DXL microprocessor responds to asynchronous external hardware events. A hardware interrupt occurs when the INTR is pulled High and the Interrupt Flag bit (IF) is enabled. The processor only responds to interrupts between instructions (REPeat String instructions have an interrupt window between memory moves, which allows interrupts during long string moves). When an interrupt cocurs, the processor reads an 8-bit vector supplied by the hardware that identifies the source of the interrupt (one of 224 user defined interrupts). The exact nature of the interrupt sequence is discussed in Section Functional Data. The IF bit in the EFLAGS register is reset when an interrupt is being serviced. This effectively disables servicing additional interrupts during an interrupt service routine. However, the IF bit may be set explicitly by the interrupt handler to allow the nesting of interrupts. When an IRET instruction is executed the original state of the IF bit is restored. ## Non-Maskable Interrupt Non-maskable interrupts provide a method of servicing very high priority interrupts. A common example of the use of a non-maskable interrupt (NMI) would be to activate a power failure routine. When the NMI input is pulled High it causes an interrupt with an internally supplied vector value of 2. Unlike a normal hardware interrupt, no interrupt acknowledgment sequence is performed for NMI. While executing the NMI servicing procedure, the Am386DX/DXL microprocessor will not service further NMI requests until an interrupt return (IRET) instruction is executed or the processor is reset. If NMI occurs while currently servicing an NMI, its presence will be saved for servicing after executing the first IRET instruction. The IF bit is cleared at the beginning of an NMI interrupt to inhibit further INTR interrupts. #### Software Interrupts A third type of interrupt/exception for the Am386DX/DXL microprocessor is the software interrupt. An INT n instruction causes the processor to execute the interrupt service routine pointed to by the nth vector in the interrupt table. A special case of the two byte software interrupt INT n is the one byte INT 3 or breakpoint interrupt. By inserting this one byte instruction in a program, the user can set breakpoints in the program as a debugging tool. A final type of software interrupt is the single step interrupt. It is discussed in the Debugging Support section. ## **Interrupt and Exception Priorities** Interrupts are externally-generated events. Maskable Interrupts (on the INTR input) and Non-Maskable Interrupts (on the NMI input) are recognized at instruction boundaries. When NMI and maskable INTR are both recognized at the same instruction boundary, the Am386DX/DXL microprocessor invokes the NMI service routine first. If after the NMI service routine has been invoked, maskable interrupts are still enabled, then the Am386DX/DXL CPU invokes the appropriate interrupt service routine. ## Table 6a. Am386DX/DXL Microprocessor Priority for Invoking Service Routines in Case of Simultaneous External Interrupts - 1. NMI - 2. INTR Exceptions are internally-generated events. Exceptions are detected by the Am386DX/DXL microprocessor if in the course of executing an instruction, the Am386DX/DXL CPU detects a problematic condition. The Am386DX/DXL microprocessor then immediately invokes the appropriate exception service routine. The state of the Am386DX/DXL CPU is such that the instruction causing the exception can be restarted. If the exception service routine has taken care of the problematic condition, the instruction will execute without causing the same exception. It is possible for a single instruction to generate several exceptions (for example, transferring a single operand could generate two page faults if the operand location spans two not present pages). However, only one exception is generated upon each attempt to execute the instruction. Each exception service routine should correct its corresponding exception, and restart the instruction. In this manner, exceptions are serviced until the instruction executes successfully. As the Am386DX/DXL microprocessor executes instructions, it follows a consistent cycle in checking for exceptions, as shown in Table 6b. This cycle is repeated as each instruction is executed and occurs in parallel with instruction decoding and execution. #### Instruction Restart The Am386DX/DXL microprocessor fully supports restarting all instructions after faults. If an exception is detected in the instruction to be executed (Exception Categories 4 through 10 in Table 6b), the Am386DX/ DXL device invokes the appropriate exception service routine. The Am386DX/DXL microprocessor is in a state that permits restart of the instruction, for all cases but those in Table 6c. Note that all such cases are easily avoided by proper design of the operating system. ## Table 6b. Sequence of Exception Checking Consider the case of the Am386DX/DXL microprocessor having just completed an instruction. It then performs the following checks before reaching the point where the next instruction is completed: - 1. Check for Exception 1 Traps from the instruction just completed (single-step via Trap Flag or Data Breakpoints set in the Debug Registers). - Check for Exception 1 Faults in the next instruction (Instruction Execution Breakpoint set in the Debug Registers for the next instruction). - Check for external NMI and INTR. - Check for Segmentation Faults that prevented fetching the entire next instruction (Exceptions 11 and 13). - Check for Paging Faults that prevented fetching the entire next instruction (Exception 14). - Check for Faults decoding the next instruction [Exception 6 if illegal op-code; Exception 6 if in Real Mode or in Virtual 8086 Mode and attempting to execute an instruction for Protected Mode only (see Section Protection and I/O Permission Bitmap); or Exception 13 if instruction is longer than 15 bytes, or privilege violation in Protected Mode (i.e., not at IOPL or at CPL = 0)]. - If WAIT op-code, check if TS = 1 and MP = 1 (Exception 7) if both are 1). - 8. If ESCAPE op-code for numeric coprocessor, check if EM = 1 or TS = 1 (Exception 7 if either are 1). - If WAIT op-code or ESCAPE op-code for numeric coprocessor, check ERROR input signal (Exception 16 if ERROR input is asserted). - 10. Check in the following order for each memory reference required by the instruction. - Check for Segmentation Faults that prevent transferring the entire memory quantity (Exceptions 11, 12, 13). - Check for Page Faults that prevent transferring the entire memory quantity (Exception 14). Note that the order stated supports the concept of the paging mechanism being underneath the segmentation mechanism. Therefore, for any given code or data reference in memory, segmentation exceptions are generated before paging exceptions are generated. ## Table 6c. Conditions Preventing Instruction Restart - 1. An instruction causes a task switch to a task whose Task State Segment (TSS) is partially not present. (An entire not present TSS is restartable.) Partially present TSS's can be avoided either by keeping the TSS's of such tasks present in memory or by aligning TSS segments to reside entirely within a single 4K page (for TSS segments of 4 Kb or less). - A coprocessor operand wraps around the top of a 64-Kb segment or a 4-Gb segment and spans three pages; and the page holding the middle portion of the operand is not present. This condition can be avoided by starting any segments containing coprocessor operands at a page boundary if the segments are approximately 64-200 Kb or larger (i.e., large enough for wraparound of the coprocessor operand to possibly occur). Note that these conditions are avoided by using the operating system designs mentioned in this table. #### **Double Fault** A Double Fault (Exception 8) results when the processor attempts to invoke an exception service routine for the segment exceptions (10, 11, 12, or 13), but in the process of doing so, detects an exception other than a Page Fault (Exception 14). A Double Fault (Exception 8) will also be generated when the processor attempts to invoke the Page Fault (Exception 14) service routine, and detects an exception other than a second Page Fault. In any functional system, the entire Page Fault service routine must remain present in memory. Double Page faults however do not raise the Double Fault exception. If a second Page Fault occurs while the processor is attempting to enter the service routine for the first time, then the processor will invoke the Page Fault (Exception 14) handler a second time rather than the Double Fault (Exception 8) handler. A subsequent fault, though, will lead to shutdown. When a Double Fault occurs, the Am386DX/DXL microprocessor invokes the exception service routine for Exception 8. ## Reset and Initialization When the processor is initialized or Reset, the registers have the values shown in Table 7. The Am386DX/DXL microprocessor will then start executing instructions near the top of physical memory, at location FFFFFF0H. When the first Inter-Segment Jump or Call is executed, address lines A31-A20 will drop Low for CS-relative memory cycles, and the Am386DX/DXL microprocessor will only execute instructions in the lower 1 Mb of physical memory. This allows the system designer to use a ROM at the top of physical memory to initialize the system and take care of Resets. RESET forces the Am386DX/DXL microprocessor to terminate all execution and local bus activity. No instruction execution or bus activity will occur as long as Reset is active. Between 350- and 450-CLK2 periods after Reset becomes inactive, the Am386DX/DXL device will start executing instructions at the top of physical memory. **Table 7. Register Values after Reset** | Flag Word | UUUU0002H | Note 1 | |---------------------------|------------------------------|--------| | Machine Status Word (CR0) | HOUUUUUUU ( | Note 2 | | Instruction Pointer | 0000FFF0H | | | Code Segment | F000H | Note 3 | | Data Segment | 0000H | | | Stack Segment | 0000H | | | Extra Segment (ES) | 0000H | | | Extra Segment (FS) | 0000H | | | Extra Segment (GS) | 0000H | | | DX Register | Component and<br>Stepping ID | Note 5 | | All Other Registers | Undefined | Note 4 | #### Notes: - EFLAGS Register. The upper 14 bits of the EFLAGS register are undefined, VM (Bit 17) and RF (Bit 16) and 0 (Bit 15) are all other defined flag bits. - CR0: (Machine Status Word). All of the defined fields in the CR0 are 0 (PG Bit 31, TS Bit 3, EM Bit 2, MP Bit 1, and PE Bit 0). - The code Segment Register (CS) will have its Base Address set to FFFF0000H and Limit set to 0FFFFH. - All undefined bits are Reserved for Future Use and should not be used. - DX register always holds component and stepping identifier (see Section Component and Revision Identifiers). EAX register holds self-test signature if self-test was requested (see Section Self-Test Signature). ## **Testability** #### Self-Test The Am386DX/DXL microprocessor has the capability to perform a self-test. The self-test checks the function of all the Control ROM and most of the non-random logic of the part. Approximately one-half of the Am386DX/DXL microprocessor can be tested during self-test. Self-Test is initiated on the Am386DX/DXL microprocessor when the RESET pin transitions from High to Low, and the $\overline{\text{BUSY}}$ pin is Low. The self-test takes about 219 clocks or approximately 26 ms with a 20-MHz Am386DX/DXL device. At the completion of self-test, the processor performs reset and begins normal operation. The part has successfully passed self-test if the contents of the EAX register are zero (0). If the results of EAX are not zero then the self-test has detected a flaw in the part. ### **TLB Testing** The Am386DX/DXL microprocessor provides a mechanism for testing the Translation Look-Aside Buffer (TLB) if desired. This particular mechanism is unique to the Am386DX/DXL CPU and may not be continued in the same way in future processors. When testing the TLB, paging must be turned off (PG = 0 in CR0) to enable the TLB testing hardware and avoid interference with the test data being written to the TLB. There are two TLB testing operations: - 1. Write entries into the TLB; and, - Perform TLB lookups. Two test registers, shown in Figure 12, are provided for the purpose of testing. TR6 is the test command register and TR7 is the test data register. The fields within these registers are defined below. C: This is the command bit. For a write into TR6 to cause an immediate write into the TLB entry, write a 0 to this bit. For a write into TR6 to cause an immediate TLB lookup, write a 1 to this bit. Linear Address: This is the tag field of the TLB. On a TLB write, a TLB entry is allocated to this linear address and the rest of that TLB entry is set per the value of TR7 and the value just written into TR6. On a TLB lookup, the TLB is interrogated per this value and if one and only one TLB entry matches, the rest of the fields of TR6 and TR7 are set from the matching TLB entry. Physical Address: This is the data field of the TLB. On a write to the TLB, the TLB entry allocated to the linear address in TR6 is set to this value. On a TLB lookup, the data field (physical address) from the TLB is read out to here. PL: On a TLB write, PL = 1 causes the REP field of TR7 to select which of four associative blocks of the TLB is to be written, but PL = 0 allows the internal pointer in the paging unit to select which TLB block is written. On a TLB lookup, the PL bit indicated whether the lookup was a hit (PL gets set to 1) or a miss (PL gets reset to 0). V: The valid bit for this TLB entry. All valid bits can also be cleared by writing to CR3. - $\mathbf{D}$ , $\overline{\mathbf{D}}$ : The dirty bit for/from the TLB entry. - U, U: The user bit for/from the TLB entry. - W, W: The writable bit for/from the TLB entry. For D, U, and W, both the attribute and its complement are provided as tag bits to permit the option of a don't care on TLB lookups. The meaning of these pairs of bits is given in the following table. | x | ₹ | Effect During<br>TLB Lookup | Value of Bit<br>X after TLB Write | |---|---|-----------------------------|-----------------------------------| | 0 | 0 | Miss All | Bit X becomes undefined | | 0 | 1 | Match if X = 0 | Bit X becomes 0 | | 1 | 0 | Match if X = 1 | Bit X becomes 1 | | 1 | 1 | Match All | Bit X becomes undefined | For writing a TLB entry: - Write TR7 for the desired physical address, PL, and REP values; and. - 2. Write TR6 with the appropriate linear address, etc., (be sure to write C = 0 for write command). For looking up (reading) a TLB entry: - Write TR6 with the appropriate linear address (be sure to write C = 1 for lookup command); and, - Read TR7 and TR6. If the PL bit in TR7 indicates a hit, then the other values reveal the TLB contents. If PL indicates a miss, then the other values in TR7 and TR6 are indeterminate. ## **Debugging Support** The Am386DX/DXL microprocessor provides several features that simplify the debugging process. The three categories of on-chip debugging aids are: - 1. The code execution breakpoint op-code (0CCH); - 2. The single-step capability provided by the TF bit in the flag register; and, - The code and data breakpoint capability provided by the Debug Registers DR3–DR0, DR6, and DR7. ## **Breakpoint Instruction** A single-byte op-code breakpoint instruction is available for use by software debuggers. The breakpoint opcode is 0CCh and generates an Exception 3 trap when executed. In typical use, a debugger program can plant the breakpoint instruction at all desired code execution breakpoints. The single-byte breakpoint op-code is an alias for the two-byte general software interrupt instruction, INT n, where n = 3. The only difference between INT 3 (0CCh) and INT n is that INT 3 is never IOPL-sensitive; but, INT n is IOPL-sensitive in Protected Mode and Virtual 8086 Mode. Note: 0 indicates "Reserved for Future Use." Do not define; see Section Compatibility. Figure 12. Test Registers ## Single-Step Trap If the single-step flag (TF, bit 8) in the EFLAGS register is found to be set at the end of an instruction, a single-step exception occurs. The single-step exception is auto vectored to Exception 1. Precisely, Exception 1 occurs as a trap after the instruction following the instruction that set TF. In typical practice, a debugger sets the TF bit of a flag register image on the debugger's stack. It then typically transfers control to the user program and loads the flag image with a signal instruction, the IRET instruction. The single-step trap occurs after executing one instruction of the user program. Since the Exception 1 occurs as a trap (that is, it occurs after the instruction has already executed), the CS:EIP pushed onto the debugger's stack points to the next unexecuted instruction of the program being debugged. An Exception 1 handler, merely by ending with an IRET instruction, can therefore efficiently support single-stepping through a user program. ## **Debug Registers** The Debug Registers are an advanced debugging feature of the Am386DX/DXL microprocessor. They allow data access breakpoints as well as code execution breakpoints. Since the breakpoints are indicated by onchip registers, an instruction execution breakpoint can be placed in ROM code or in code shared by several tasks, neither of which can be supported by the INT 3 breakpoint op-code. The Am386DX/DXL microprocessor contains six Debug Registers, providing the ability to specify up to four distinct breakpoint addresses, breakpoint control options, and read breakpoint status. Initially after reset, breakpoints will occur unless the debug registers are programmed. Breakpoints set up in the Debug Registers are auto-vectored to Exception 1. ## Linear Address Breakpoint Registers (DR3-DR0) Up to four breakpoint addresses can be specified by writing into Debug Registers DR3–DR0, shown in Figure 13. The breakpoint addresses specified are 32-bit linear addresses. Am386DX/DXL microprocessor hardware continuously compares the linear breakpoint addresses in DR3–DR0 with the linear addresses generated by executing software (a linear address is the result of computing the effective address and adding the 32-bit segment base address). Note that if paging is not enabled the linear address equals the physical address. If paging is enabled, the linear address is translated to a physical 32-bit address by the on-chip paging unit. Regardless of whether paging is enabled or not, however, the breakpoint registers hold linear addresses. ## Debug Control Register (DR7) A Debug Control Register, DR7, shown in Figure 13, allows several debug control functions, such as enabling the breakpoints and setting up other control options for the breakpoints. The fields within the Debug Control Register, DR7, are as follows. ## **LENi (Breakpoint Length Specification Bits)** A 2-bit LEN field exists for each of the four breakpoints. LEN specifies the length of the associated breakpoint field. The choices for data breakpoints are: 1 byte, 2 bytes, and 4 bytes. Instruction execution breakpoints must have a length of 1 (LENi = 00). Encoding of the LENi field is as follows. | LENi<br>Encoding | Breakpoint<br>Field Width | Usage of Least<br>Significant Bits in<br>Breakpoint Address<br>Register i, (i = 0 – 3) | |------------------|-------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------| | 00 | 1 byte | All 32-bits used to specify a single-byte breakpoint field. | | 01 | 2 bytes | A31–A1 used to specify<br>a two-byte, word-aligned<br>breakpoint field. A0 in<br>Breakpoint Address<br>Register is not used. | | 10 | Undefined—<br>do not use<br>this encoding | | | 11 | 4 bytes | A31–A2 used to specify<br>a four-byte, Dword-aligned<br>breakpoint field. A0 and<br>A1 in Breakpoint Address<br>Register are not used. | Note: 0 indicates "Reserved for Future Use." Do not define: see Section Compatibility. Figure 13. Debug Registers 15021B-016 The LENi field controls the size of breakpoint field i by controlling whether all low-order linear address bits in the breakpoint address register are used to detect the breakpoint event. Therefore, all breakpoint fields are aligned; 2-byte breakpoint fields begin on Word boundaries and 4-byte breakpoint fields begin on Dword boundaries. The following is an example of various size breakpoint fields. Assume the breakpoint linear address in DR2 is 00000005H. In that situation, the following illustration indicates the region of the breakpoint field for lengths of 1, 2, or 4 bytes. | DR2 = 00000 | 0005H; | <b>LEN2 = 00B</b> | | | | | |-------------|--------|-------------------|--|-----------|--|--| | | | bkpt fld2 | | 00000008H | | | | | | | | 00000000Н | | | ## **RWi (Memory Access Qualifier Bits)** A 2-bit RW field exists for each of the four breakpoints. The 2-bit RW field specifies the type of usage that must occur in order to activate the associated breakpoint. | RW<br>Encoding | Usage<br>Causing Breakpoint | |----------------|------------------------------------| | 00 | Instruction execution only | | 01 | Data writes only | | 10 | Undefined—do not use this encoding | | 11 | Data reads and writes only | RW encoding 00 is used to set up an instruction execution breakpoint. RW encodings 01 or 11 are used to set up write-only or read/write data breakpoints. Note that instruction execution breakpoints are taken as faults (i.e., before the instruction executes), but data breakpoints are taken as traps (i.e., after the data transfer takes place). ## Using LENi and RWi to Set Data Breakpoint i A data breakpoint can be set up by writing the linear address into DRi (i = 0–3). For data breakpoints, RWi can = 01 (write only) or 11 (write/read). LEN can = 00, 01, or 11. If a data access falls entirely or partly within the data breakpoint field, the data breakpoint condition has occurred, and if the breakpoint is enabled, an Exception 1 trap will occur. # Using LENi and RWi to Set Instruction Execution Breakpoint i An instruction execution breakpoint can be set up by writing address of the beginning of the instruction (including prefixes if any) into DRi (i=0-3). RWi must = 00 and LEN must = 00 for instruction execution breakpoints. If the instruction beginning at the breakpoint address is about to be executed, the instruction execution breakpoint condition has occurred, and if the breakpoint is enabled, an Exception 1 fault will occur before the instruction is executed. Note that an instruction execution breakpoint address must be equal to the beginning byte address of an instruction (including prefixes) in order for the instruction execution breakpoint to occur. ## **GD (Global Debug Register Access Detect)** The Debug Registers can only be accessed in Real Mode or at privilege level 0 in Protected Mode. The GD bit, when set, provides extra protection against any Debug Register access even in Real Mode or at privilege level 0 in Protected Mode. This additional protection feature is provided to guarantee that a software debugger (or ICE-386) can have full control over the Debug Register resources when required. The GD bit, when set, causes an Exception 1 fault if an instruction attempts to read or write any Debug Register. The GD bit is then automatically cleared when the Exception 1 handler is invoked, allowing the Exception 1 handler free access to the debug registers. # GE and LE (Exact Data Breakpoint Match, Global and Local) If either GE or LE is set, any data breakpoint trap will be reported exactly after completion of the instruction that caused the operand transfer. Exact reporting is provided by forcing the Am386DX/DXL microprocessor execution unit to wait for completion of data operand transfers before beginning execution of the next instruction. If exact data breakpoint match is not selected, data breakpoints may not be reported until several instructions later or may not be reported at all. When enabling a data breakpoint, it is therefore recommended to enable the exact data breakpoint match. When the Am386DX/DXL microprocessor performs a task switch, the LE bit is cleared. Thus, the LE bit supports fast task switching out of tasks that have enabled the exact data breakpoint match for their task-local breakpoints. The LE bit is cleared by the processor during a task switch to avoid having exact data breakpoint match enabled in the new task. Note that exact data breakpoint match must be re-enabled under software control. The Am386DX/DXL microprocessor GE bit is unaffected during a task switch. The GE bit supports exact data breakpoint match that is to remain enabled during all tasks executing in the system. Note that instruction execution breakpoints are always reported exactly, whether or not exact data breakpoint match is selected. ## Gi and Li (Breakpoint Enable, Global and Local) If either Gi or Li is set, then the associated breakpoint (as defined by the linear address in DRi, the length in LENi and the usage criteria in RWi) is enabled. If either Gi or Li is set and the Am386DX/DXL microprocessor detects the breakpoint condition, then the Exception 1 handler is invoked. When the Am386DX/DXL microprocessor performs a task switch to a new Task State Segment (TSS), all Li bits are cleared. Thus, the Li bits support fast task switching out of tasks that use some task-local breakpoint registers. The Li bits are cleared by the processor during a task switch to avoid spurious exceptions in the new task. Note that the breakpoints must be enabled under software control. All Am386DX/DXL microprocessor Gi bits are unaffected during a task switch. The Gi bits support breakpoints that are active in all tasks executing in the system. ## Debug Status Register (DR6) A Debug Status Register, DR6, shown in Figure 13, allows the Exception 1 handler to easily determine why it was invoked. Note the Exception 1 handler can be invoked as a result of one of several events. - 1. DR0 Breakpoint fault/trap. - 2. DR1 Breakpoint fault/trap. - 3. DR2 Breakpoint fault/trap. - 4. DR3 Breakpoint fault/trap. - 5. Single-step (TF) trap. - 6. Task switch trap. - Fault due to attempted debug register access when GD = 1. The Debug Status Register contains single-bit flags for each of the possible events invoking Exception 1. Note below that some of these events are faults (exception taken before the instruction is executed), while other events are traps (exception taken after the debug events occurred). The flags in DR6 are set by the hardware but never cleared by hardware. Exception 1 handler software should clear DR6 before returning to the user program to avoid future confusion in identifying the source of Exception 1. The fields within the Debug Status Register, DR6 are as follows. #### Bi (Debug Fault/Trap Due to Breakpoint 0-3) Four breakpoint indicator flags, B3-B0, correspond one-to-one with the breakpoint registers in DR3-DR0. A flag Bi is set when the condition described by DRi, LENi, and RWi occurs. If Gi or Li is set, and if the breakpoint is detected, the processor will invoke the Exception 1 handler. The exception is handled as a fault if an instruction execution breakpoint occurred or as a trap if a data breakpoint occurred. Important Note: A flag, Bi, is set whenever the hardware detects a match condition on enabled breakpoint i. Whenever a match is detected on at least one enabled breakpoint i, the hardware immediately sets all Bi bits corresponding to breakpoint conditions matching at that instant, whether enabled or not. Therefore, the Exception 1 handler may see that multiple Bi bits are set, but only set Bi bits corresponding to enabled breakpoints (Li or Gi set) are true indications of why the Exception 1 handler was invoked. # BD (Debug Fault Due to Attempted Register Access When GD Bit Set) This bit is set if the Exception 1 handler was invoked due to an instruction attempting to read or write to the debug registers when GD bit was set. If such an event occurs, then the GD bit is automatically cleared when the Exception 1 handler is invoked, allowing handler access to the debug registers. ## BS (Debug Trap Due to Single-Step) This bit is set if the Exception 1 handler was invoked due to the TF bit in the flag register being set (for single-stepping). See Section Single-Step Trap. ## BT (Debug Trap Due to Task Switch) This bit is set if the Exception 1 handler was invoked due to a task switch occurring to a task having an Am386DX/DXL microprocessor TSS with the T-bit set. (See Figure 29.) Note the task switch into the new task occurs normally, but before the first instruction of the task is executed, the Exception 1 handler is invoked. With respect to the task switch operation, the operation is considered to be a trap. ## Use of Resume Flag (RF) in Flag Register The Resume Flag (RF) in the flag word can suppress an instruction execution breakpoint when the Exception 1 handler returns to a user program at a user address that is also an instruction execution breakpoint. See Section Flags Register. # REAL MODE ARCHITECTURE Real Mode Introduction When the processor is reset or powered up, it is initialized in Real Mode. Real Mode has the same base architecture as the 8086, but allows access to the 32-bit register set of the Am386DX/DXL microprocessor. The addressing mechanism, memory size, and interrupt handling are all identical to the Real Mode on the 80286. All of the Am386DX/DXL microprocessor instructions are available in Real Mode (except those instructions listed in Protection and I/O Permission Bitmap). The default operand size in Real Mode is 16 bits, just like the 8086. In order to use the 32-bit registers and addressing modes, override prefixes must be used. In addition, the segment size on the Am386DX/DXL CPU in Real Mode is 64 Kb so 32-bit effective addresses must have a value less than 0000FFFFH. The primary purpose of Real Mode is to set up the processor for Protected Mode Operation. Figure 14. Real Address Mode Addressing ## **LOCK Operation** The LOCK prefix on the Am386DX/DXL microprocessor, even in Real Mode, is more restrictive than on the 80286. This is due to the addition of paging on the Am386DX/DXL CPU in Protected Mode and Virtual 8086 Mode. Paging makes it impossible to guarantee that repeated string instructions can be LOCKed. The Am386DX/DXL CPU cannot require that all pages holding the string be physically present in memory. Hence, a Page Fault (Exception 14) might have to be taken during the repeated string instruction. Therefore the LOCK prefix cannot be supported during repeated string instructions. These are the only instruction forms where the LOCK prefix is legal on the Am386DX/DXL microprocessor. | Opcode | Operands<br>(Dest, Source) | |--------------------------------------|----------------------------| | BIT TEST and<br>SET/RESET/COMPLEMENT | Mem, Reg/immed | | xchg<br>xchg | Reg, Mem<br>Mem, Reg | | ADD, OR, ADC, SBB, AND, SUB, XOR | Mem, Reg/immed | | NOT, NEG, INC, DEC | Mem | An Exception 6 will be generated if a LOCK prefix is placed before any instruction form or op-code not listed above. The LOCK prefix allows indivisible read/modify/write operations on memory operands using the instructions above. For example, even the ADD Reg, Mem is not LOCKable, because the Mem operand is not the destination (and therefore no memory read/modify/operation is being performed). Since, on the Am386DX/DXL microprocessor, repeated string instructions are not LOCKable, it is not possible to LOCK the bus for a long period of time. Therefore, the LOCK prefix is not IOPL-sensitive on the Am386DX/DXL device. The LOCK prefix can be used at any privilege level, but only on the instruction forms listed above. ## **Memory Addressing** In Real Mode, the maximum memory size is limited to 1 Mb. Thus, only address lines A19-A2 are active. Exception, the High address lines A31–A20 are High during CS-relative memory cycles until an intersegment jump or call is executed (see Section Reset and Initialization). Since paging is not allowed in Real Mode, the linear addresses are the same as physical addresses. Physical addresses are formed in Real Mode by adding the contents of the appropriate segment register that is shifted left by 4 bits to an effective address. This addition results in a physical address from 00000000H to 0010FFEFH. This is compatible with 80286 Real Mode. Since segment registers are shifted left by 4 bits, this implies that Real Mode segments always start on 16-byte boundaries. All segments in Real Mode are exactly 64-Kb long and may be read, written, or executed. The Am386DX/DXL microprocessor will generate an Exception 13 if a data operand or instruction fetch occurs past the end of a segment (i.e., if an operand has an offset greater than FFFFH; for example, a word with a low byte at FFFFH and the high byte at 0000H). Segments may be overlapped in Real Mode. Thus, if a particular segment does not use all 64 Kb, another segment c an be overlayed on top of the unused portion of the previous segment. This allows the programmer to minimize the amount of physical memory needed for a program. ## **Reserved Locations** There are two fixed areas in memory that are reserved in Real address mode: system initialization area and the interrupt table area. Locations 00000H through 003FFH are reserved for interrupt vectors. Each one of the 256 possible interrupts has a 4-byte jump vector reserved for it. Locations FFFFFFF0H through FFFFFFFH are reserved for system initialization. ## Interrupts Many of the exceptions shown in Table 5 and discussed in Section Interrupts are not applicable to Real Mode operation; in particular, Exceptions 10, 11, and 14 will not happen in Real Mode. Other exceptions have slightly different meanings in Real Mode. Table 8 identifies these exceptions. **Table 8. Other Exceptions in Real Mode** | Function | Interrupt<br>Number | Related Instructions | Return Address Location | |-------------------------------------------------|---------------------|---------------------------------------------------------------------------------------------------------|-------------------------| | Interrupt table limit too small | 8 | INT Vector is not within table limit. | Before Instruction | | CS, DS, ES, FS, GS<br>Segment overrun exception | 13 | Word memory reference<br>beyond offset = FFFFH.<br>An attempt to execute<br>past the end of CS segment. | Before Instruction | | SS Segment overrun exception | 12 | Stack Reference beyond offset = FFFFH. | Before Instruction | ## Shutdown and Halt The HLT instruction stops program execution and prevents the processor from using the local bus until restarted. Either NMI, FLT, INTR with interrupts enabled (IF=1), or RESET will force the Am386DX/DXL microprocessor out of halt. If interrupted, the saved CS:IP will point to the next instruction after the HLT. Shutdown will occur when a severe error is detected that prevents further processing. In Real Mode, shutdown can occur under two conditions: - An interrupt or an exception occur (Exception 8 or 13) and the interrupt vector is larger than the Interrupt Descriptor Table (i.e., there is not an interrupt handler for the interrupt); - A CALL, INT, or PUSH instruction attempts to wrap around the stack segment when SP is not even (e.g., pushing a value on the stack when SP = 0001 resulting in a stack segment greater than FFFFH). An NMI input can bring the processor out of shutdown if the Interrupt Descriptor Table limit is large enough to contain the NMI interrupt vector (at least 0017H) and the stack has enough room to contain the vector and flag information (i.e., SP is greater than 0005H). Otherwise shutdown can only be exited via the RESET input. # PROTECTED MODE ARCHITECTURE Introduction The complete capabilities of the Am386DX/DXL microprocessor are unlocked when the processor operates in Protected Virtual Address Mode (Protected Mode). Protected Mode vastly increases the linear address space to 4 Gb (2<sup>32</sup> bytes) and allows the running of virtual memory programs of almost unlimited size (64 tb or 2<sup>46</sup> bytes). In addition, Protected Mode allows the Am386DX/DXL CPU to run all of the existing 8086 and 80286 software, while providing a sophisticated memory management and a hardware-assisted protection mechanism. Protected Mode allows the use of additional instructions especially optimized for supporting multitasking operating systems. The base architecture of the Am386DX/DXL CPU remains the same; the registers, instructions, and addressing modes described in the previous sections are retained. The main differences between Protected Mode and Real Mode from a programmer's view is the increased address space and a different addressing mechanism. ## **Addressing Mechanism** Like Real Mode, Protected Mode uses two components to form the logical address: a 16-bit selector is used to determine the linear base address of a segment; the base address is added to a 32-bit effective address to form a 32-bit linear address. The linear address is then either used as the 32-bit physical address or if paging is enabled the paging mechanism maps the 32-bit linear address into a 32-bit physical address. The difference between the two modes lies in calculating the base address. In Protected Mode, the selector is used to specify an index into an operating system defined table (see Figure 15). The table contains the 32-bit base address of a given segment. The physical address is formed by adding the base address obtained from the table to the offset. Paging provides an additional memory management mechanism that operates only in Protected Mode. Paging provides a means of managing the very large segments of the Am386DX/DXL microprocessor. As such, paging operates beneath segmentation. The paging mechanism translates the protected linear address that comes from the segmentation unit into a physical address. Figure 16 shows the complete Am386DX/DXL device addressing mechanism with paging enabled. ## Segmentation ## **Segmentation Introduction** Segmentation is one method of memory management and provides the basis for protection. Segments are used to encapsulate regions of memory that have common attributes. For example, all of the code of a given program could be contained in a segment or an operating system table may reside in a segment. All information about a segment is stored in an 8-byte data structure called a descriptor. All of the descriptors in a system are contained in tables recognized by hardware. 15021B-018 Figure 15. Protected Mode Addressing Figure 16. Paging and Segmentation ## Terminology The following terms are used throughout the discussion of descriptors, privilege levels, and protection: PL: Privilege Level—One of the four hierarchical privilege levels. Level 0 is the most privileged level and level 3 is the least privileged. More privileged levels are numerically smaller than less privileged levels. RPL: Requester Privilege Level—The privilege level of the original supplier of the selector. RPL is determined by the least two significant bits of a selector. **DPL**: Descriptor Privilege Level—This is the least privileged level at which a task may access that descriptor (and the segment associated with that descriptor). Descriptor Privilege Level is determined by bits 6–5 in the Access Right Byte of a descriptor. CPL: Current Privilege Level—The privilege level at which a task is currently executing, which equals the privilege level of the code segment being executed. CPL can also be determined by examining the lowest 2 bits of the CS register, except for conforming code segments. **EPL**: Effective Privilege Level—The effective privilege level is the least privileged of the RPL and DPL. Since small privilege level values indicate greater privilege, EPL is the numerical maximum of RPL and DPL. **Task**: One instance of the execution of a program. Tasks are also referred to as processes. ## **Descriptor Tables** #### **Descriptor Tables Introduction** The descriptor tables define all of the segments which are used in an Am386DX/DXL microprocessor system. There are three types of tables on the Am386DX/DXL microprocessor that hold descriptors: the Global Descriptor Table, Local Descriptor Table, and the Interrupt Descriptor Table. All of the tables are variable length memory arrays. They can range in size between 8 bytes and 64 Kb. Each table can hold up to 8192 eight byte descriptors. The upper 13 bits of a selector are used as an index into the descriptor table. The tables have registers associated with them that hold the 32-bit linear base address, and the 16-bit limit of each table. Each of the tables has a register associated with it: the GDTR, LDTR, and the IDTR (see Figure 17). The LGDT, LLDT, and LIDT instructions load the base and limit of the Global, Local, and Interrupt Descriptor Tables, respectively, into the appropriate register. The SGDT, SLDT, and SIDT instructions store the base and limit values. These tables are manipulated by the operating system. Therefore, the load descriptor table instructions are privileged instructions. ## Global Descriptor Table The Global Descriptor Table (GDT) contains descriptors that are possibly available to all of the tasks in a system. The GDT can contain any type of segment descriptor except for descriptors that are used for servicing interrupts (i.e., interrupt and trap descriptors). Every Am386DX/DXL microprocessor contains a GDT. Generally, the GDT contains code and data segments used by the operating systems and task state segments and descriptors for the LDTs in a system. The first slot of the Global Descriptor Table corresponds to the null selector and is not used. The null selector defines a null pointer value. Figure 17. Descriptor Table Registers ## Local Descriptor Table LDTs contain descriptors that are associated with a given task. Generally, operating systems are designed so that each task has a separate LDT. The LDT may contain only code, data, stack, task gate, and call gate descriptors. LDTs provide a mechanism for isolating a given task's code and data segments from the rest of the operating system, while the GDT contains descriptors for segments that are common to all tasks. A segment cannot be accessed by a task if its segment descriptor does not exist in either the current LDT or the GDT. This provides both isolation and protection for a task's segments, while still allowing global data to be shared among tasks. Unlike the 6-byte GDT or IDT registers that contain a base address and limit, the visible portion of the LDT register contains only a 16-bit selector. This selector refers to a Local Descriptor Table descriptor in the GDT. ## Interrupt Descriptor Table The third table needed for Am386DX/DXL microprocessor systems is the Interrupt Descriptor Table (see Figure 18). The IDT contains the descriptors that point to the location of up to 256 interrupt service routines. The IDT may contain only task gates, interrupt gates, and trap gates. The IDT should be at least 256 bytes in size in order to hold the descriptors for the 32, Reserved for Future Use, interrupts. Every interrupt used by a system must have an entry in the IDT. The IDT entries are referenced via INT instructions, external interrupt vectors, and exceptions. (See Interrupts.) ## **Descriptors** ## Descriptor Attribute Bits The object to which the segment selector points is called a descriptor. Descriptors are 8-byte quantities that contain attributes about a given region of linear address space (i.e., a segment). These attributes include the 32-bit base linear address of the segment, the 20-bit length and granularity of the segment, the protection level, read, write or execute privileges, the default size of the operands (16 bit or 32 bit), and the type of segment. All of the attribute information about a segment is contained in 12 bits in the segment descriptor. Figure 19 shows the general format of a descriptor. All segments on the Am386DX/DXL microprocessor have three attribute fields in common: the P bit, the DPL bit, and the S bit. The Present P bit is 1 if the segment is loaded in physical memory; if P = 0 then any attempt to access this segment causes a not present exception (Exception 11). The Descriptor Privilege Level (DPL) is a 2-bit field that specifies the protection levels 0-3 associated with a segment. Figure 18. Interrupt Descriptor Table Register Use Base Base Address of the segment Limit The length of the segment P Present Bit: 1 = Present, 0 = Not Present DPL Descriptor Privilege Levels 0–3 S Segment Descriptor: 0 = System Segment Descriptor: 0 = System Descriptor, 1 = Code or Data Segment Descriptor Type Type of Segment A Accessed Bit G Granularity Bit: 1 = Segment length is page granular, 0 = Segment length is byte granular Default Operation Size (recognized in code segment descriptors only): 1 = 32-bit segment, 0 = 16-bit segment O Bit must be zero (0) for compatibility with future processors AVL Available field for user or OS Note: In a maximum-size segment (i.e., segment with G = 1 and segment limit 19–0 = FFFFFH), the lowest 12 bits of the segment base should be zero (i.e., segment base 11–000 = 000H). 15021B-022 ## Figure 19. General Format of Segment Descriptors The Am386DX/DXL microprocessor has two main categories of segments: system segments and non-system segments (for code and data). The segment S bit in the segment descriptor determines if a given segment is a system segment or a code or data segment. If the S bit is 1, then the segment is either a code or data segment; if it is 0, then the segment is a system segment. # Am386DX/DXL Microprocessor Code and Data Descriptors (S = 1) Figure 20 shows the general format of a code and data descriptor and Table 9 illustrates how the bits in the Access Rights Byte are interpreted. Code and data segments have several descriptor fields in common. The accessed A bit is set whenever the processor accesses a descriptor. The A bit is used by operating systems to keep usage statistics on a given segment. The G bit, or granularity bit, specifies if a segment length is byte-granular or page-granular. Am386DX/DXL microprocessor segments can be 1 Mb long with byte granularity (G=0) or 4 Gb with page granularity (G=1), (i.e., 220 pages—each page is 4 Kb in length). The granularity is totally unrelated to paging. An Am386DX/DXL CPU system can consist of segments with byte granularity and page granularity, whether or not paging is enabled. The executable E bit tells if a segment is a code or data segment. A code segment (E=1, S=1) may be execute-only or execute/read as determined by the Read R bit. Code segments are execute only if R=0 and execute/read if R=1. Code segments may never be written into. **Note:** Code segments may be modified via aliases. Aliases are writeable data segments that occupy the same range of linear address space as the code segment. The D bit indicates the default length for operands and effective addresses. If D = 1, then 32-bit operands and 32-bit addressing modes are assumed. If D = 0, then 16-bit operands and 16-bit addressing modes are assumed. Therefore all existing 80286 code segments will execute on the Am386DX/DXL microprocessor assuming the D bit is set 0. Another attribute of code segments is determined by the conforming C bit. Conforming segments, C = 1, can be executed and shared by programs at different privilege levels (see Section Protection). | ; | 31 | | | | | | | 0 | | |---|-----------------|----|-----|---|-----|----------------|-----------------------|---------------|----| | | Segment Base 15 | -0 | | | | | Segment Limit 15- | -0 | 0 | | | Base 31-24 | G | D/B | 0 | AVL | Limit<br>19–16 | Access Rights<br>Byte | Base<br>23–16 | +4 | D/B 1 = Default Instructions Attributes are 32 bits 0 = Default Instructions Attributes are 16 bits AVL Available field for user or OS Granularity Bit: 1 = Segment length is page granular, 0 = Segment length is byte granular 0 Bit must be zero (0) for compatibility with future processors Note: In a maximum-size segment (i.e., a segment with G = 1 and segment limit 19–0 = FFFFFH), the lowest 12 bits of the segment base should be zero (i.e., segment base 11–000 = 000H). 15021B-023 Figure 20. Code and Data Segment Descriptors Table 9. Access Rights Byte Definition for Code and Data Descriptions | Bit<br>Position | Name | | Function | | | | | |-----------------|-------------------------------------------------------|---------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|--|--| | 7 | Present (P) | P=1<br>P=0 | P=0 No mapping to physical memory exists, base and limit are | | | | | | 6–5 | Descriptor Privilege<br>Levels (DPL) | | not used.<br>Segment privilege attribute used in privilege tests. | | | | | | 4 | Segment Descriptor (S) | S=1<br>S=0 | Code or Data (includes stacks) segment descriptor.<br>System Segment Descriptor or Gate Descriptor. | | | | | | 3<br>2<br>1 | Executable (E) Expansion Direction (ED) Writeable (W) | E = 0<br>ED = 0<br>ED = 1<br>W = 0<br>W = 1 | Descriptor type is data segment.<br>Expand up segment, offsets must be $\leq$ limit.<br>Expand down segment, offsets must be $>$ limit.<br>Data segment may not be written into.<br>Data segment may be written into. | | | | | | 3 2 | Executable (E)<br>Conforming (C) | E=1<br>C=1 | Descriptor type is code segment. Code segment may only be executed when If Code | | | | | | 1 | Readable (R) | R = 0<br>R = 1 | CPL≥DPL and CPL remains unchanged. Code segment may not be read. Code segment may be read. Segment (S = 1, E = 1) | | | | | | 0 | Accessed (A) | A = 0<br>A = 1 | Segment has not been accessed. Segment selector has been loaded into segment register or used by selector test instructions. | | | | | Segments identified as data segments (E = 0, S = 1) are used for two types of Am386DX/DXL microprocessor segments: stack and data segments. The expansion direction (ED) bit specifies if a segment expands downward (stack) or upward (data). If a segment is a stack segment, all offsets must be greater than the segment limit. On a data segment all offsets must be less than or equal to the limit. In other words, stack segments start at the base linear address plus the maximum segment limit and grow down to the base linear address plus the limit. On the other hand, data segments start at the base linear address and expand to the base linear address plus limit. The write (W) bit controls the ability to write into a segment. Data segments are read-only if W = 0. The stack segment must have W = 1. The B bit controls the size of the stack pointer register. If B=1, then PUSHes, POPs, and CALLs all use the 32-bit ESP register for stack references and assume an upper limit of FFFFFFFH. If B=0, stack instructions all use the 16-bit SP register and assume an upper limit of FFFFH. ## System Descriptor Formats System segments describe information about operating system tables, tasks, and gates. Figure 21 shows the general format of system segment descriptors, and the various types of system segments. The Am386DX/DXL microprocessor system descriptors contain a 32-bit base linear address and a 20-bit segment limit. 80286 system descriptors have a 24-bit base address and a 16-bit segment limit. 80286 system descriptors are identified by the upper 16 bits being all zeros. ## LDT Descriptors (S = 0, Type = 2) LDT descriptors (S = 0, TYPE = 2) contain information about Local Descriptor Tables. LDTs contain a table of segment descriptors, unique to a particular task. Since the instruction to load the LDTR is only available at privilege level 0, the DPL field is ignored. LDT descriptors are only allowed in the Global Descriptor Table (GDT). ## TSS Descriptors (S = 0, Type = 1, 3, 9, B) A Task State Segment (TSS) descriptor contains information about the location, size, and privilege level of a TSS. A TSS in turn is a special fixed format segment that contains all the state information for a task and a linkage field to permit nesting tasks. The Type field is used to indicate whether the task is currently BUSY (i.e., on a chain of active tasks) or the TSS is available. The Type field also indicates if the segment contains a 80286 or an Am386DX/DXL microprocessor TSS. The Task Register (TR) contains the selector that points to the current TSS. ## Gate Descriptors (S = 0, Type = 4-7, C, F) Gates are used to control access to entry points within the target code segment. The various types of gate descriptors are call gates, task gates, interrupt gates, and trap gates. Gates provide a level of indirection between the source and destination of the control transfer. This indirection allows the processor to automatically perform protection checks. It also allows system designers to control entry points to the operating system. Call gates are used to change privilege levels (see Section Protection), task gates are used to perform a task switch, and interrupt and trap gates are used to specify interrupt service routines. Figure 22 shows the format of the four types of gate descriptors. Call gates are primarily used to transfer program control to a more privileged level. The call gate descriptor consists of three fields: the access byte; a long pointer (selector and offset) that points to the start of a routine; and a word count that specifies how many parameters are to be copied from the caller's stack to the stack of the called routine. The word count field is only used by call gates when there is a change in the privilege level, other types of gates ignore the word count field. Interrupt and trap gates use the destination selector and destination offset fields of the gate descriptor as a pointer to the start of the interrupt or trap handler routines. The difference between interrupt gates and trap gates is that the interrupt gate disables interrupts (resets the IF bit) while the trap gate does not. Task gates are used to switch tasks. Task gates may only refer to a task state segment (see Section Task Switching); therefore, only the destination selector portion of a task gate descriptor is used, and the destination offset is ignored. Exception 13 is generated when a destination selector does not refer to a correct descriptor type, i.e., a code segment for an interrupt, trap or call gate, a TSS for a task gate. The access byte format is the same for all gate descriptors. P=1 indicates that the gate contents are valid. P=0 indicates the contents are not valid and causes Exception 11 if referenced. DPL is the descriptor privilege level and specifies when this descriptor may be used by a task (see Section Protection). The S field, bit 4 of the access rights byte, must be 0 to indicate a system control descriptor. The type field specifies the descriptor type as indicated in Figure 22. Note: In a maximum-size segment (i.e., segment with G = 1 and segment limit 19–0 = FFFFFH), the lowest 12 bits of the segment base should be zero (i.e., segment base 11–000 = 000H). Figure 21. System Segments Descriptors ## Gate Descriptors Fields | Name | Value | Description | |------|-------|-----------------------------------------------| | Туре | 4 | 80286 Call Gate | | ,, | 5 | Task Gate (for 80286 or Am386DX/DXL CPU Task) | | | 6 | 80286 Interrupt Gate | | | 7 | 80286 Trap Gate | | | С | Am386DX/DXL CPU Call Gate | | | E | Am386DX/DXL CPU Interrupt Gate | | | F | Am386DX/DXL CPU Trap Gate | | Р | 0 | Descriptor contents are not valid | | | 1 | Descriptor contents are valid | DPL—Least privileged level at which a task may access the gate. WORD COUNT 0–31—The number of parameters to copy from caller's stack to the called procedure's stack. The parameters are 32-bit quantities for Am386DX/DXL CPU gates, and 16-bit quantities for 80286 gates. DESTINATION SELECTOR 16-Bit Selector Selector to the target code segment or Selector to the target state segment for task gate DESTINATION OFFSET Offset 16-bit 80286 32-bit Am386DX/DXL CPU Entry point within the target code segment 15021B-025 Figure 22. Gate Descriptor Formats # Difference Between Am386DX/DXL Microprocessor and 80286 Descriptors In order to provide operating system compatibility between the 80286 and Am386DX/DXL microprocessor, the Am386DX/DXL CPU supports all of the 80286 segment descriptors. Figure 23 shows the general format of an 80286 system segment descriptor. The only differences between 80286 and Am386DX/DXL device descriptor formats are that the values of the type fields and the limit and base address fields have been expanded for the Am386DX/DXL device. The 80286 system segment descriptors contained a 24-bit base address and 16-bit limit, while the Am386DX/DXL microprocessor system segment descriptors have a 32-bit base address, a 20-bit limit field, and a granularity bit. By supporting 80286 system segments, the Am386DX/DXL microprocessor is able to execute 80286 application programs on an Am386DX/DXL CPU operating system. This is possible because the processor automatically understands which descriptors are 80286-style descriptors and which are Am386DX/DXL microprocessor-style descriptors. In particular, if the upper word of a descriptor is zero, then that descriptor is an 80286-style descriptor. The only other differences between 80286-style descriptors and Am386DX/DXL microprocessor descriptors is the interpretation of the word count field of call gates and the B bit. The word count field specifies the number of 16-bit quantities to copy for 80286 call gates and 32-bit quantities for Am386DX/DXL device call gates. The B bit controls the size of PUSHes when using a call gate; if B = 0, then PUSHes are 16 bits, if B = 1, then PUSHes are 32 bits. ### Selector Fields A selector in Protected Mode has three fields: Local or Global Descriptor Table Indicator (TI), Descriptor Entry Index (Index), and Requestor (the selector's) Privilege Level (RPL) as shown in Figure 24. The TI bits select one of two memory-based tables of descriptors (the Global Descriptor Table or the Local Descriptor Table). The Index selects one of 8K descriptors in the appropriate descriptor table. The RPL bits allow high speed testing of the selector's privilege attributes. #### Segment Descriptor Cache In addition to the selector value, every segment register has a segment descriptor cache register associated with it. Whenever a segment register's contents are changed, the 8-byte descriptor associated with that selector is automatically loaded (cached) on the chip. Once loaded, all references to that segment use the cached descriptor information instead of reaccessing the descriptor. The contents of the descriptor cache are not visible to the programmer. Since descriptor caches only change when a segment register is changed, programs that modify the descriptor tables must reload the appropriate segment registers after changing a descriptor's value. Base Limit Base Address of the Segment The length of the Segment P Present Bit: 1 = Present, 0 = Not Present DPL Descriptor Privilege Levels 0–3 S System Descriptor: 0 = System, 1 = User Type of Segment 15021B-026 Figure 23. 80286 Code and Data Segment Descriptors Figure 24. Example Descriptor Selection ## Segment Descriptor Register Settings The contents of the segment descriptor cache vary depending on the operating mode of the Am386DX/DXL microprocessor. When operating in Real Address Mode, the segment base, limit, and other attributes within the segment cache registers are defined as shown in Figure 25. For compatibility with the 8086 architecture, the base is set to 16 times the current selector value, the limit is fixed at 0000FFFFH, and the attributes are fixed to indicate that the segment is present and fully usable. In Real Address Mode, the internal privilege level is always fixed to the highest level, level 0, so I/O and other privileged op-codes may be executed. When operating in Protected Mode, the segment base, limit, and other attributes within the segment cache registers are defined as shown in Figure 26. In Protected Mode, each of these fields are defined according to the contents of the segment descriptor indexed by the selector value loaded into the segment register. ## Segment Descriptor Cache Register Contents Key: Y = Yes N = No Privilege level 0 Privilege level 1 Privilege level 1 Privilege level 2 Privilege level 3 Privilege level 3 Expand down B = Byte granularity P = Page granularity W = Push/pop 16-bit words F = Push/pop 32-bit Dwords U = Expand up Figure 25. Segment Descriptor Caches for Real Address Mode (Segment Limit and Attributes are Fixed) <sup>\*</sup>Except the 32-bit CS base is initialized to FFFFF000H after reset until first intersegment control transfer (e.g., intersegment CALL, or intersegment JMP, or INT). (See Figure 27 example.) ## **Segment Descriptor Cache Register Contents** Kev: Y = Fixed Yes N = Fixed No d = Per segment descriptor p = Per segment descriptor; descriptor must indicate "present" to avoid Exception 11 (Exception 12 in case of SS) r = Per segment descriptor, but descriptor must indicate "readable" to avoid Exception 13 (special case for SS) w = Per segment descriptor, but descriptor must indicate "writeable" to avoid Exception 13 (special case for SS) - = Does not apply to that segment cache register 15021B-029 Figure 26. Segment Descriptor Caches for Protected Mode (Loaded per Descriptor) When operating in a Virtual 8086 Mode within the Protected Mode, the segment base, limit, and other attributes within the segment cache registers are defined as shown in Figure 27. For compatibility with the 8086 architecture, the base is set to 16 times the current selector value, the limit is fixed at 0000FFFFH, and the attributes are fixed so as to indicate the segment is present and fully usable. The virtual program executes at lowest privilege level, level 3, to allow trapping of all IOPL-sensitive instructions and level 0 only instructions. #### Segment Descriptor Cache Register Contents Key: Y = Yes N = No 0 = Privilege level 0 1 = Privilege level 1 2 = Privilege level 2 3 = Privilege level 3 U = Expand up D = Expand down B = Byte granularity P = Page granularity W = Push/pop 16-bit words F = Push/pop 32-bit Dwords - = Does not apply to that segment cache register Figure 27. Segment Caches for Virtual 8086 Mode within Protected Mode (Segment Limit and Attributes are Fixed) ## Protection ## **Protection Concepts** The Am386DX/DXL microprocessor has four levels of protection that are optimized to support the needs of a multitasking operating system to isolate and protect user programs from each other and the operating system. The privilege levels control the use of privileged instructions. I/O instructions, and access to segments and segment descriptors. Unlike traditional microprocessor based systems where this protection is achieved only through the use of complex external hardware and software, the Am386DX/DXL CPU provides the protection on a page basis when paging is enabled (see Section Page Level Protection). The four-level hierarchical privilege system is illustrated in Figure 28. It is an extension of the user/supervisor privilege mode commonly used by minicomputers and, in fact, the user/supervisor mode is fully supported by the Am386DX/DXL microprocessor paging mechanism. The privilege levels (PL) are numbered 0 through 3. Level 0 is the most privileged or trusted level. Figure 28. Four-Level Hierarchical Protection ## Rules of Privilege The Am386DX/DXL microprocessor controls access to both data and procedures between levels of a task, according to the following rules. - Data stored in a segment with privilege level p can be accessed only by code executing at a privilege level at least as privileged as p. - A code segment/procedure with privilege level p can only be called by a task executing at the same or a lesser privilege level than p. ## **Privilege Levels** ## Task Privilege At any point in time, a task on the Am386DX/DXL microprocessor always executes at one of the four privilege levels. The Current Privilege Level (CPL) specifies the task's privilege level. A task's CPL may only be changed by control transfers through gate descriptors to a code segment with a different privilege level (see Section Privilege Level Transfers). Thus, an application program running at PL=3 may call an operating system routine at PL = 1 (via a gate) that would cause the task's CPL to be set to 1 until operating system routine is finished. ## Selector Privilege (RPL) The privilege level of a selector is specified by the RPL field. The RPL is the two least significant bits of the selector. The selector's RPL is only used to establish a less trusted privilege level than the current privilege level for the use of a segment. This level is called the task's effective privilege level (EPL). The EPL is defined as being the least privileged (i.e., numerically larger) level of a task's CPL and a selector's RPL. Thus, if selector's RPL = 0, then the CPL always specifies the privilege level for making an access using the selector. On the other hand if RPL = 3, then a selector can only access segments at level 3 regardless of the task's CPL. The RPL is most commonly used to verify that pointers passed to an operating system procedure do not access data that is of higher privilege than the procedure that originated the pointer. Since the originator of a selector can specify any RPL value, the Adjust RPL (ARPL) instruction is provided to force the RPL bits to the originator's CPL. ## I/O Privilege and I/O Permission Bitmap The I/O privilege level (IOPL, a 2-bit field in the EFLAGS register) defines the least privileged level at which I/O instructions can be unconditionally performed. I/O instructions can be unconditionally performed when CPL≤ IOPL. (The I/O instructions are IN, OUT, INS, OUTS, REP INS, and REP OUTS.) When CPL > IOPL, and the current task is associated with a 286 TSS, attempted I/O instructions cause an Exception 13 fault. When CPL > IOPL, and the current task is associated with an Am386DX/DXL CPU TSS, the I/O Permission Bitmap (part of an Am386DX/DXL microprocessor TSS) is consulted on whether I/O to the port is allowed, or an Exception 13 fault is to be generated instead. For diagrams of the I/O Permission Bitmap, refer to Figures 29a and 29b. For further information on how the I/O Permission Bitmap is used in Protected Mode or in Virtual 8086 Mode, refer to Section Protection and I/O Permission Bitmap. I/O Ports Accessible: $2 \rightarrow 9$ , 12, 13, 15, $20 \rightarrow 24$ , 27, 33, 34, 40, 41, 48, 50, 52, 53, 58 $\rightarrow$ 60, 62, 63, 96 $\rightarrow$ 127 Figure 29b. Sample I/O Permission Bit Map 15021B-032b The I/O privilege level (IOPL) also affects whether several other instructions can be executed or cause an Exception 13 fault instead. These instructions are called IOPL-sensitive instructions and they are CLI and STI. (Note that the LOCK prefix is not IOPL-sensitive on the Am386DX/DXL microprocessor.) The IOPL also affects whether the IF bit (interrupts enable flag) can be changed by loading a value into the EFLAGS register. When CPL≤IOPL, the IF bit can be changed by loading a new value into the EFLAGS register. When CPL > IOPL, the IF bit cannot be changed by a new value POP'ed into (or otherwise loaded into) the EFLAGS register; the IF bit merely remains unchanged and no exception is generated. **Table 10. Pointer Test Instructions** | Instruction | Operands | Function | | | | | |-------------|-----------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|--|--| | ARPL | Selector,<br>Register | Adjust Requested Privilege<br>Level; adjusts the RPL of the<br>selector to the numeric maximum<br>of current selector RPL value and<br>the RPL value in the register. Set<br>zero flag if selector RPL was<br>changed. | | | | | | VERR | Selector | VERify for Read: sets the zero<br>flag if the segment referred to<br>by the selector can be read. | | | | | | VERW | Selector | VERify for Write: sets the zero flag if the segment referred to by the selector can be written. | | | | | | LSL | Register,<br>Selector | Load Segment Limit: reads the segment limit into the register if privilege rules and descriptor type allow. Set zero flag if successful. | | | | | | LAR | Register,<br>Selector | Load Access Rights: reads the descriptor access rights byte into the register if privilege rules allow. Set zero flag if successful. | | | | | ## Privilege Validation The Am386DX/DXL CPU provides several instructions to speed pointer testing and help maintain system integrity by verifying that the selector value refers to an appropriate segment. Table 10 summarizes the selector validation procedures available for the Am386DX/DXL microprocessor. This pointer verification prevents the common problem of an application at PL=3 calling an operating-systems routine at PL=0 and passing the operating-systems routine a bad pointer that corrupts a data structure belonging to the operating system. If the operating-systems routine uses the ARPL instruction to ensure that the RPL of the selector has no greater privilege than that of the caller, then this problem can be avoided. ## Descriptor Access There are basically two types of segment accesses: those involving code segments, such as control transfers; and those involving data accesses. Determining the ability of a task to access a segment involves the type of segment to be accessed, the instruction used, the type of descriptor used, and CPL, RPL, and DPL as described above. Any time an instruction loads data segment registers (DS, ES, FS, GS) the Am386DX/DXL microprocessor makes protection validation checks. Selectors loaded in the DS, ES, FS, GS registers must refer only to data segments or readable code segments. The data access rules are specified in Section Rules of Privilege. The only exception to those rules is readable conforming code segments which can be accessed at any privilege level. Finally, the privilege validation checks are performed. The CPL is compared to the EPL; if the EPL is more privileged than the CPL, an Exception 13 (General Protection fault) is generated. The rules regarding the stack segment are slightly different than those involving data segments. Instructions that load selectors into SS must refer to data segment descriptors for writeable data segments. The DPL and RPL must equal the CPL. All other descriptor types or a privilege level violation will cause Exception 13. A stack not present fault causes Exception 12. Note that an Exception 11 is used for a not-present code or data segment. **Table 11. Descriptor Types Used for Control Transfer** | Control Transfer Types | Operation Types | Descriptor<br>Referenced | Descriptor<br>Table | |-----------------------------------------------------------|-----------------------------------------------------------------|---------------------------|---------------------| | Intersegment within the same privilege level | JMP, CALL, RET, IRET* | Code Segment | GDT/LDT | | Intersegment to the same or higher privilege level | CALL | Call Gate | GDT/LDT | | Interrupt within task may change CPL | Interrupt Instruction,<br>Exception, External Interrupt | Trap or Interrupt<br>Gate | IDT | | Intersegment to a lower privilege level (change task CPL) | RET, IRET* | Code Segment | GDT/LDT | | | CALL, JMP | Task State Segment | GDT | | , | CALL, JMP | Task Gate | GDT/LDT | | Task Switch | IRET**, Interrupt Instruction,<br>Exception, External Interrupt | Task Gate | IDT | <sup>\*</sup>NT (Nested Task bit of flag register) = 0 \*\*NT (Nested Task bit of flag register) = 1 ## Privilege Level Transfers Intersegment control transfers occur when a selector is loaded in the CS register. For a typical system most of these transfers are simply the result of a call or a jump to another routine. There are five types of control transfers, which are summarized in Table 11. Many of these transfers result in a privilege level transfer. Changing privilege levels is done only via control transfers by using gates, task switches, and interrupt or trap gates. Control transfers can only occur if the operation that loaded the selector references the correct descriptor type. Any violation of these descriptor usage rules will cause an Exception 13 (e.g., JMP through a call gate or IRET from a normal subroutine call). In order to provide further system security, all control transfers are also subject to the privilege rules. The privilege rules require that: - Privilege level transitions can only occur via gates. - JMPs can be made to a non-conforming code segment with the same privilege or to a conforming code segment with greater or equal privilege. - CALLs can be made to a non-conforming code segment with the same privilege or via a gate to a more privileged level. - Interrupts handled within the task obey the same privilege rules as CALLs. - Conforming Code segments are accessible by privilege levels that are the same or less privileged than the conforming-code segment's DPL. - Both the requested privilege level (RPL) in the selector pointing to the gate and the task's CPL must be of equal or greater privilege than the gate's DPL. - The code segment selected in the gate must be the same or more privileged than the task's CPL. - Return instructions that do not switch tasks can only return control to a code segment with same or less privilege. - Task switches can be performed by a CALL, JMP, or INT that references either a task gate or task state segment whose DPL is less privileged or the same privilege as the old task's CPL. Any control transfer that changes CPL within a task causes a change of stacks as a result of the privilege level change. The initial values of SS:ESP for privilege levels 0, 1, and 2 are retained in the task state segment (see Section Task Switching). During a JMP or CALL control transfer, the new stack pointer is loaded in the SS and ESP registers and the previous stack pointer is pushed onto the new stack. When returning to the original privilege level, use of the lower-privilege stack is restored as part of the RET or IRET instruction operation. For subroutine calls that pass parameters on the stack and cross privilege levels, a fixed number of words (as specified in the gate's word count field) are copied from the previous stack to the current stack. The intersegment RET instruction with a stack adjustment value will correctly restore the previous stack pointer upon return. ## Call Gates Gates provide protected, indirect CALLs. One of the major uses of gates is to provide a secure method of privilege transfers within a task. Since the operating system defines all the gates in a system, it can ensure that all gates only allow entry into a few trusted procedures (such as those that allocate memory or perform I/O). Gate descriptors follow the data access rules of privilege; that is, gates can be accessed by a task if the EPL is equal to or more privileged than the gate descriptor's DPL. Gates follow the control transfer rules of privilege and therefore may only transfer control to a more privileged level. Call gates are accessed via a CALL instruction and are syntactically identical to calling a normal subroutine. When an interlevel Am386DX/DXL microprocessor call gate is activated, the following actions occur: - 1. Load CS:EIP from gate check for validity; - 2. SS is pushed zero-extended to 32 bits: - 3. ESP is pushed; - Copy word count 32-bit parameters from the old stack to the new stack; - 5. Push return address on stack. The procedure is identical for 80286 Call gates, except that 16-bit parameters are copied and 16-bit registers are pushed. Interrupt gates and Trap gates work in a similar fashion as the call gates, except there is no copying of parameters. The only difference between Trap and Interrupt gates is that control transfers through an Interrupt gate, disable further interrupts (i.e., the IF bit is set to 0), and Trap gates leave the interrupt status unchanged. ## **Task Switching** A very important attribute of any multitasking/multi-user operating system is its ability to rapidly switch between tasks or processes. The Am386DX/DXL microprocessor directly supports this operation by providing a task switch instruction in hardware. The Am386DX/DXL CPU task switch operation saves the entire state of the machine (all of the registers, address space, and a link to the previous task), loads a new execution state, performs protection checks, and commences execution in the new task, in about 17 ms. Like transfer of control via gates, the task switch operation is invoked by executing an intersegment JMP or CALL instruction that refers to a Task State Segment (TSS), or a task gate descriptor in the GDT or LDT. An INT n instruction, exception, trap, or external interrupt may also invoke the task switch operation if there is a task gate descriptor in the associated IDT descriptor slot. The TSS descriptor points to a segment (see Figure 29a) containing the entire Am386DX/DXL microprocessor execution state while a task gate descriptor contains a TSS selector. The Am386DX/DXL CPU supports both 80286 and Am386DX/DXL CPU style TSSs. Figure 30 shows an 80286 TSS. The limit of an Am386DX/DXL microprocessor TSS must be greater than 0064H (002BH for an 80286 TSS) and can be as large as 4 Gb. In the additional TSS space, the operating system is free to store additional information, such as the reason the task is inactive, time the task has spent running, and open files belonging to the task. Each task must have a TSS associated with it. The current TSS is identified by a special register in the Am386DX/DXL microprocessor called the Task State Segment Register (TR). This register contains a selector referring to the task state segment descriptor that defines the current TSS. A hidden base and limit register associated with TR are loaded whenever TR is loaded with a new selector. Returning from a task is accomplished by the IRET instruction. When IRET is executed, control is returned to the task that was interrupted. The current executing task's state is saved in the TSS and the old task state is restored from its TSS. Several bits in the flag register and machine status word (CR0) give information about the state of a task that are useful to the operating system. The Nested Task (NT) (bit 14 in EFLAGS) controls the function of the IRET instruction. If NT = 0, the IRET instruction performs the regular return; when NT = 1, IRET performs a task switch operation back to the previous task. The NT bit is set or reset in the following fashion. When a CALL or INT instruction initiates a task switch, the new TSS will be marked busy and the back link field of the new TSS set to the old TSS selector. The NT bit of the new task is set by CALL or INT initiated task switches. An interrupt that does not cause a task switch will clear NT. (The NT bit will be restored after execution of the interrupt handler.) NT may also be set or cleared by POPF or IRET instructions. The Am386DX/DXL microprocessor Task State Segment is marked busy by changing the descriptor type field from Type 9H to Type BH. An 80286 TSS is marked busy by changing the descriptor type field from Type 1 to Type 3. Use of a selector that references a busy task state segment causes an Exception 13. The Virtual Mode (VM) bit 17 is used to indicate if a task is a virtual 8086 task. If VM=1, then the tasks will use the Real Mode addressing mechanism. The virtual 8086 environment is only entered and exited via a task switch (see Section Virtual Mode). The coprocessor's state is not automatically saved when a task switch occurs, because the incoming task may not use the coprocessor. The Task Switched (TS) Bit (bit 3 in the CR0) helps deal with the coprocessor's state in a multitasking environment. Whenever the Am386DX/DXL microprocessor switches tasks, it sets the TS bit. The Am386DX/DXL CPU detects the first use of a processor extension instruction after a task switch and causes the processor extension not available Exception 7. The exception handler for Exception 7 may then decide whether to save the state of the coprocessor. A processor extension not present Exception 7 will occur when attempting to execute an ESC or WAIT instruction if the Task Switched and Monitor coprocessor extension bits are both set (i.e., TS=1 and MP=1). The T bit in the Am386DX/DXL microprocessor TSS indicates that the processor should generate a debug exception when switching to a task. If T=1, then upon entry to a new task, a debug Exception 1 will be generated. ## Initialization and Transition to Protected Mode Since the Am386DX/DXL microprocessor begins executing in Real Mode immediately after RESET, it is necessary to initialize the system tables and registers with the appropriate values. The GDT and IDT registers must refer to a valid GDT and IDT. The IDT should be at least 256-bytes long, and GDT must contain descriptors for the initial code and data segments. Figure 31 shows the tables and Figure 32 shows the descriptors needed for a simple Protected Mode Am386DX/DXL microprocessor system. It has a single code and single data/stack segment each 4 Gb long and a single privilege level PL = 0. The actual method of enabling Protected Mode is to load CR0 with PE bit set, via the MOV CR0, R/M instruction. This puts the Am386DX/DXL microprocessor in Protected Mode. After enabling Protected Mode, the next instruction should execute an intersegment JMP to load the CS register and flush the instruction decode queue. The final step is to load all of the data segment registers with the initial selector values. An alternate approach to entering Protected Mode that is especially appropriate for multitasking operating systems is to use the built in task-switch to load all of the registers. In this case, the GDT would contain two TSS descriptors in addition to the code and data descriptors needed for the first task. The first JMP instruction in Protected Mode would jump to the TSS causing a task switch and loading all of the registers with the values stored in the TSS. The TR should be initialized to point to a valid TSS descriptor since a task switch saves the state of the current task in a task state segment. ## **Paging** ## **Paging Concepts** Paging is another type of memory management useful for virtual memory multitasking operating systems. Unlike segmentation that modularizes programs and data into variable length segments, paging divides programs into multiple uniform size pages. Pages bear no direct relation to the logical structure of a program. While segment selectors can be considered the logical name of a program module or data structure, a page most likely corresponds to only a portion of a module or data structure. By taking advantage of the locality of reference displayed by most programs, only a small number of pages from each active task need be in memory at any one moment. ## **Paging Organization** ## Page Mechanism The Am386DX/DXL microprocessor uses two levels of tables to translate the linear address (from the segmentation unit) into a physical address. There are three components to the paging mechanism of the Am386DX/DXL CPU: the page directory, the page tables, and the page itself (page frame). All memory-resident elements of the Am386DX/DXL CPU paging mechanism are the same size, namely, 4 Kb. A uniform size for all of the elements simplifies memory allocation and reallocation schemes, since there is no problem with memory fragmentation. Figure 33 shows how the paging mechanism works. 15021B-034 Figure 31. Simple Protected System | Data<br>Descriptor | Segment Base 15-0<br>0118 (H) | | | | Segment Limit 15–0<br>FFFF (H) | | | | | | | | | | | |--------------------|-------------------------------|--------|--------|--------------------------------|--------------------------------|-------------------------|---|---|---|---|---|----|---|---|-------------------------| | | Base 31–24<br>00 (H) | G<br>1 | D<br>1 | 0 | 0 | Limit<br>19–16<br>F (H) | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | Base<br>23–16<br>00 (H) | | Code<br>Descriptor | Segment Base 15–0<br>0118 (H) | | | Segment Limit 15–0<br>FFFF (H) | | | | | | | | | | | | | | Base 31–24<br>00 (H) | G<br>1 | D<br>1 | 0 | 0 | Limit<br>19–16<br>F (H) | 1 | 0 | 0 | 1 | 1 | ۱° | 1 | 0 | Base<br>23-16<br>00 (H) | | | Null | | | | | Descriptor | | | | | | | | | | | | | | | | | | | | | | | | | | | | | 31 | | 24 | | | 16 | 1 | 5 | | | | 8 | | | | 15021B-035 Figure 32. GDT Descriptors for Simple System ## Page Descriptor Base Register CR2 is the Page Fault Linear Address register. It holds the 32-bit linear address that caused the last Page Fault detected. CR3 is the Page Directory Physical Base Address Register. It contains the physical starting address of the Page Directory. The lower 12 bits of CR3 are always zero to ensure that the Page Directory is always page aligned. Loading it via a MOV CR3, reg instruction causes the Page Table entry cache to be flushed, as will a task switch through a TSS that changes the value of CR0. (See Translation Look-Aside Buffer.) ## Page Directory The Page Directory is 4-Kb long and allows up to 1024 Page Directory entries. Each Page Directory entry contains the address of the next level of tables, the Page Tables and information about the page table. The contents of a Page Directory entry are shown in Figure 34. The upper 10 bits of the linear address (A31–A22) are used as an index to select the correct Page Directory entry. Figure 33. Paging Mechanism Figure 34. Page Directory Entry (Points to Page Table) 15021B-037 Figure 35. Page Table Entry (Points to Page) ## Page Tables Each Page Table is 4 Kb and holds up to 1024 Page Table entries. Page Table entries contain the starting address of the page frame and statistical information about the page (see Figure 35). Address bits A21–A12 are used as an index to select one of the 1024 Page Table entries. The 20 upper-bit page frame address is concatenated with the lower 12 bits of the linear address to formthe physical address. Page tables can be shared between tasks and swapped to disks. ## Page Directory/Table Entries The lower 12 bits of the Page Table entries and Page Directory entries contain statistical information about pages and page tables respectively. The P (Present) bit 0 indicates if a Page Directory or Page Table entry can be used in address translation. If P=1, the entry cannot be used for address translation; if P=0, the entry cannot be used for translation. Note that the present bit of the page table entry that points to the page where code is currently being executed should always be set. Code that marks its own page not present should not be written. All of the other bits are available for use by the software. For example the remaining 31 bits could be used to indicate where on the disk the page is stored. The A (Accessed) bit 5 is set by the Am386DX/DXL microprocessor for both types of entries before a read or write access occurs to an address covered by the entry. The D (Dirty) bit 6 is set to 1 before a write to an address covered by that page table entry occurs. The D bit is undefined for Page Directory entries. When the P, A, and D bits are updated by the Am386DX/DXL CPU, the microprocessor generates a Read-Modify-Write cycle that locks the bus and prevents conflicts with other processors or peripherals. Software that modifies these bits should use the LOCK prefix to ensure the integrity of the page tables in multi-master systems. The three bits marked OS Reserved in Figures 34 and 35 (bits 11–9) are software definable. OSs are free to use these bits for whatever purpose they wish. An example use of the OS Reserved bits would be to store information about page aging. By keeping track of how long a page has been in memory since being accessed, an operating system can implement a page replacement algorithm like Least Recently Used. The (User/Supervisor) U/S bit 2 and the (Read/Write) R/W bit 1 are used to provide protection attributes for individual pages. ## Page Level Protection (R/W, U/S Bits) The Am386DX/DXL microprocessor provides a set of protection attributes for paging systems. The paging mechanism distinguishes between two levels of protection: user, which corresponds to level 3 of the segmentation based protection, and supervisor, which encompasses all of the other protection levels (0, 1, 2). Programs executing at level 0, 1, or 2 bypass the page protection, although segmentation based protection is still enforced by the hardware. The U/S and R/W bits are used to provide User/Supervisor and Read/Write protection for individual pages or for all pages covered by a Page Table Directory entry. The U/S and R/W bits in the first level Page Directory Table apply to all pages described by the page table pointed to by that directory entry. The U/S and R/W bits in the second level Page Table entry apply only to the page described by that entry. The U/S and R/W bits for a given page are obtained by taking the most restrictive of the U/S and R/W bits from the Page Directory Table entries and the Page Table entries and using these bits to address the page. Example: If the U/S and R/W bits for the Page Directory entry were 10 and the U/S and R/W bits for the Page Table entry were 01, the access rights for the page would be 01, the numerically smaller of the two. Table 12 shows the effect of the U/S and R/W bits on accessing memory. Table 12. Protection Provided by R/W and U/S | | U/S | R/W | Permitted<br>Level 3 | Permitted Access<br>Levels 0, 1, or 2 | | | | | |---|-----|-----|----------------------|---------------------------------------|--|--|--|--| | - | 0 | 0 | None | Read/Write | | | | | | | 0 | 1 | None | Read/Write | | | | | | | 1 | 0 | Read-Only | Read/Write | | | | | | | 1 | 1 | Read/Write | Read/Write | | | | | However, a given segment can be easily made readonly for level 0, 1, or 2 via the use of segmented protection mechanisms (see Section Protection). ### **Translation Look-Aside Buffer** The Am386DX/DXL microprocessor paging hardware is designed to support demand paged virtual memory systems. However, performance would degrade substantially if the processor was required to access two levels of tables for every memory reference. To solve this problem, the Am386DX/DXL device keeps a cache of the most recently accessed pages, this cache is called the Translation Look-Aside Buffer (TLB). The TLB is a four-way set associative 32-entry page table cache. It automatically keeps the most commonly used Page Table entries in the processor. The 32-entry TLB, coupled with a 4K page size, results in coverage of 128 Kb of memory addresses. For many common multitasking systems, the TLB will have a hit rate of about 98%. This means that the processor will only have to access the two-level page structure on 2% of all memory references. Figure 36 illustrates how the TLB complements the Am386DX/DXL microprocessor's paging mechanism. Figure 36. Translation Look-Aside Buffer ## **Paging Operation** The paging hardware operates in the following fashion: the paging unit hardware receives a 32-bit linear address from the segmentation unit. The upper 20 linear address bits are compared with all 32 entries in the TLB to determine if there is a match. If there is a match (i.e., a TLB hit), then the 32-bit physical address is calculated and will be placed on the address bus. However, if the Page Table entry is not in the TLB, the Am386DX/DXL microprocessor will read the appropriate Page Directory entry. If P=1 on the Page Directory entry indicating that the page table is in memory, then the Am386DX/DXL device will read the appropriate Page Table entry and set the Access bit. If P=1 on the Page Table entry indicating that the page is in memory, the Am386DX/DXL device will update the Access and Dirty bits as needed and fetch the operand. The upper 20 bits of the linear address, read from the page table, will be stored in the TLB for future accesses. However, if P=0 for either the Page Directory entry or the Page Table Entry, then the processor will generate a Page Fault, an Exception 14. The processor will also generate an Exception 14, Page Fault, if the memory reference violated the page protection attributes (i.e., U/S or R/W; trying to write to a read-only page). CR2 will hold the linear address that caused the page fault. If a second page fault occurs while the processor is attempting to enter the service routine for the first, then the processor will invoke the Page Fault (Exception 14) handler a second time, rather than the Double Fault (Exception 8) handler. Since Exception 14 is classified as a fault, CS:EIP will point to the instruction causing the page fault. The 16-bit error code pushed as part of the page fault handler will contain status bits which indicate the cause of the Page Fault. The 16-bit error code is used by the operating system to determine how to handle the Page Fault. Figure 37 shows the format of the page-fault error code and the interpretation of the bits. Note: Even though the bits in the error code (U/S, R/W, and P) have similar names as the bits in the Page Directory/Table entries, the interpretation of the error code bits is different. Figure 38 indicates what type of access caused the Page Fault. Figure 37. Page Fault Error Code Format U/S: The U/S bit indicates whether the access causing the fault occurred when the processor was executing the User Mode (U/S=1) or in Supervisor mode (U/S=0). R/W: The R/W bit indicates whether the access causing the fault was a Read (R/W = 0) or a Write (R/W = 1). P: The P bit indicates whether a Page Fault was caused by a not-present page (P = 0) or by a page level protection violation (P = 1). U: Undefined. | U/S | R/W | Access Type | |-----|-----|------------------| | 0 | 0 | Supervisor* Read | | 0 | 1 | Supervisor Write | | 1 | 0 | User Read | | 1 | 1 | User Write | <sup>\*</sup>Descriptor table access will fault with U/S = 0, even if the program is executing at level 3. 15021B-041 Figure 38. Type of Access Causing Page Fault ## **Operating System Responsibilities** The Am386DX/DXL microprocessor takes care of the page address translation process, relieving the burden from an operating system in a demand-paged system. The operating system is responsible for setting up the initial page tables and handling any page faults. The operating system is also required to invalidate (i.e., flush) the TLB when any changes are made to any of the Page Table entries. The operating system must reload CR3 to cause the TLB to be flushed. Setting up the tables is simply a matter of loading CR3 with the address of the Page Directory and allocating space for the Page Directory and the Page Tables. The primary responsibility of the operating system is to implement a swapping policy and handle all of the page faults. A final concern of the operating system is to ensure that the TLB cache matches the information in the paging tables. In particular, any time the operating system sets the P present bit of page table entry to zero, the TLB must be flushed. Operating systems may want to take advantage of the fact that CR3 is stored as part of a TSS to give every task or group of tasks its own set of page tables. ## Virtual 8086 Environment ## **Executing 8086 Programs** The Am386DX/DXL microprocessor allows the execution of 8086 application programs in both Real Mode and in the Virtual 8086 Mode (Virtual Mode). Of the two methods. Virtual 8086 Mode offers the system designer the most flexibility. The Virtual 8086 Mode allows the execution of 8086 applications, while still allowing the system designer to take full advantage of the Am386DX/DXL device protection mechanism. In particular, the Am386DX/DXL CPU allows the simultaneous execution of 8086 operating systems and its applications, and an Am386DX/DXL CPU operating system and both 80286 and Am386DX/DXL microprocessor applications. Thus, in a multiuser Am386DX/DXL CPU computer, one person could be running a MS-DOS spreadsheet, another person using MS-DOS, and a third person could be running multiple UNIX utilities and applications. Each person in this scenario would believe they had the computer completely to themself. Figure 39 illustrates this concept. ## Virtual 8086 Mode Addressing Mechanism One of the major differences between Am386DX/DXL microprocessor Real and Protected Modes is how the segment selectors are interpreted. When the processor is executing in Virtual 8086 Mode, the segment registers are used in an identical fashion to Real Mode. The contents of the segment register are shifted left 4 bits and added to the offset to form the segment base linear address. The Am386DX/DXL microprocessor allows the operating system to specify which programs use the 8086 style address mechanism, and which programs use Protected Mode addressing, on a per task basis. Through the use of paging, the 1-Mb address space of the Virtual Mode task can be mapped to anywhere in the 4-Gb linear address space of the Am386DX/DXL device. Like Real Mode, Virtual Mode effective addresses (i.e., segment offsets) that exceed 64 Kb will cause an Exception 13. However, these restrictions should not prove to be important because most tasks running in Virtual 8086 Mode will simply be existing 8086 application programs. ## Paging In Virtual Mode The paging hardware allows the concurrent running of multiple Virtual Mode tasks and provides protection and operating system isolation. Although it is not strictly necessary to have the paging hardware enabled to run Virtual Mode tasks, it is needed in order to run multiple Virtual Mode tasks or to relocate the address. space of a Virtual Mode task to physical address space greater than 1 Mb. The paging hardware allows the 20-bit linear address produced by a Virtual Mode program to be divided into up to 256 pages. Each one of the pages can be located anywhere within the maximum 4-Gb physical address space of the Am386DX/DXL microprocessor. In addition, since CR3 (the Page Directory Base Register) is loaded by a task switch, each Virtual Mode task can use a different mapping scheme to map pages to different physical locations. Finally, the paging hardware allows the sharing of the 8086 operating system code between multiple 8086 applications. Figure 39 shows how the Am386DX/DXL device paging hardware enables multiple 8086 programs to run under a virtual memory demand paged system. ## Protection and I/O Permission Bitmap All Virtual 8086 Mode programs execute at privilege level 3, the level of least privilege. As such, Virtual 8086 Mode programs are subject to all of the protection checks defined in Protected Mode. (This is different from Real Mode which implicitly is executing at privilege level 0, the level of greatest privilege.) Thus, an attempt to execute a privileged instruction when in Virtual 8086 Mode will cause an Exception 13 fault. The following are privileged instructions, which may be executed only at Privilege Level 0. Therefore, attempting to execute these instructions in Virtual 8086 Mode (or anytime CPL > 0) causes an Exception 13 fault. ``` LIDT; MOV DRn, reg; MOV reg, DRn; LGDT; MOV TRn, reg; MOV reg, TRn; LMSW: MOV CRn, req; MOV req, CRn; CLTS; ``` HLT: Several instructions, particularly those applying to the multitasking model and protection model, are available only in Protected Mode. Therefore, attempting to execute the following instructions in Real Mode or in Virtual 8086 Mode generates an Exception 6 fault. LTR; STR; LLDT: SLDT: LAR; VERR; VERW; LSL; ARPL. Figure 39. Virtual 8086 Environment Memory Management 15021B-042 The instructions that are IOPL-sensitive in Protected Mode are: IN; STI; OUT; CLI; INS; OUTS; REP INS; REP OUTS. In Virtual 8086 Mode, a slightly different set of instructions are made IOPL-sensitive. The following instructions are IOPL-sensitive in Virtual 8086 Mode: INT n; STI; PUSHF; CLI; POPF; IRET. The PUSHF, POPF, and IRET instructions are IOPL-sensitive in Virtual 8086 Mode only. This provision allows the IF flag (interrupt enable flag) to be virtualized to the Virtual 8086 Mode program. The INT n software interrupt instruction is also IOPL-sensitive in Virtual 8086 Mode. Note, however, that the INT 3 (op-code 0CCH), INTO, and BOUND instructions are not IOPL-sensitive in Virtual 8086 Mode (they are not IOPL sensitive in Protected Mode either). Note that the I/O instructions (IN, OUT, INS, OUTS, REP INS, and REP OUTS) are not IOPL-sensitive in Virtual 8086 Mode. Rather, the I/O instructions become automatically sensitive to the I/O Permission Bitmap contained in the Am386DX/DXL CPU TSS. The I/O Permission Bitmap, automatically used by the Am386DX/DXL microprocessor in Virtual 8086 Mode, is illustrated by Figures 29a and 29b. The I/O Permission Bitmap can be viewed as a 0–64K bit string, that begins in memory at offset Bit\_Map\_Offset in the current TSS. Bit\_Map\_Offset must be ≤ DFFFH so the entire bit map and the byte FFH that follows the bit map are all at offset ≤ FFFFH from the TSS base. The 16-bit pointer Bit\_Map\_Offset (15–0) is found in the word beginning at offset 66H (102 decimal) from the TSS base, as shown in Figure 29a. Each bit in the I/O Permission Bitmap corresponds to a single byte-side I/O port, as illustrated in Figure 29a. If a bit is 0, I/O to the corresponding byte-wide port can occur without generating an exception. Otherwise the I/O instruction causes an Exception 13 fault. Since every byte-wide I/O port must be protectable, all bits corresponding to a Word-wide or Dword-wide port must be 0 for the Word-wide or Dword-wide I/O to be permitted. If all the referenced bits are 0, the I/O will be allowed. If any referenced bits are 1, the attempted I/O will cause an Exception 13 fault. Due to the use of a pointer to the base of the I/O Permission Bitmap, the bitmap may be located anywhere within the TSS or may be ignored completely by pointing the Bit\_Map\_Offset (15–0) beyond the limit of the TSS segment. In the same manner, only a small portion of the 64K I/O space need have an associated map bit by adjusting the TSS limit to truncate the bitmap. This eliminates the commitment of 8K of memory when a complete bitmap is not required, while allowing the fully general case if desired. Example of Bitmap for I/O Ports 0–255: Setting the TSS limit to {Bit\_Map\_Offset + 31 +1\*\*} [\*\*see note below] will allow a 32-byte bitmap for the I/O ports 0–255, plus a terminator byte of all 1s [\*\*see note below]. This allows the I/O bitmap to control I/O Permission to I/O ports 0–255 while causing an Exception 13 fault on attempted I/O to any I/O port 256 through 65,565. \*\*Important Implementation Note: Beyond the last byte of I/O mapping, information in the I/O Permission Bitmap must be a byte containing all 1s. The byte of all 1s must be within the limit of the Am386DX/DXL CPU TSS segment (see Figure 29a). #### Interrupt Handling In order to fully support the emulation of an 8086 machine, interrupts in Virtual 8086 Mode are handled in a unique fashion. When running in Virtual Mode, all interrupts and exceptions involve a privilege change back to the host Am386DX/DXL CPU operating system. The Am386DX/DXL microprocessor operating system determines if the interrupt comes from a Protected Mode application or from a Virtual Mode program by examining the VM bit in the EFLAGS image stored on the stack. When a Virtual Mode program is interrupted and execution passes to the interrupt routine at level 0, the VM bit is cleared. However, the VM bit is still set in the EFLAGS image on the stack. The Am386DX/DXL microprocessor operating system in turn handles the exception or interrupt and then returns control to the 8086 program. The Am386DX/DXL CPU operating system may choose to let the 8086 operating system handle the interrupt or it may emulate the function of the interrupt handler. For example, many 8086 operating system calls are accessed by PUSHing parameters on the stack, and then executing an INT n instruction. If the IOPL is set to 0 then all INT n instructions will be intercepted by the Am386DX/DXL microprocessor operating system. The Am386DX/DXL CPU operating system could emulate the 8086 operating system's call. Figure 40 shows how the Am386DX/DXL microprocessor operating system could intercept an 8086 operating system's call to Open a File. The Am386DX/DXL microprocessor operating system can provide a Virtual 8086 Environment that is totally transparent to the application software via intercepting and then emulating 8086 operating system's calls, and intercepting IN and OUT instructions. ## Entering and Leaving Virtual 8086 Mode Virtual 8086 Mode is entered by executing an IRET instruction (at CPL = 0), or Task Switch (at any CPL) to an Am386DX/DXL microprocessor task whose Am386DX/ DXL microprocessor TSS has an EFLAGS image containing a 1 in the VM bit position while the processor is executing in Protected Mode. That is, one way to enter Virtual 8086 Mode is to switch to a task with an Am386DX/DXL device TSS that has a 1 in the VM bit in the EFLAGS image. The other way is to execute a 32-bit IRET instruction at privilege level 0, where the stack has a 1 in the VM bit in the EFLAGS image. POPF does not affect the VM bit even if the processor is in Protected Mode or level 0, and so cannot be used to enter Virtual 8086 Mode. PUSHF always pushes a 0 in the VM bit, even if the processor is in Virtual 8086 Mode, so that a program cannot tell if it is executing in Real Mode or in . Virtual 8086 Mode. The VM bit can be set by executing an IRET instruction only at privilege level 0 or by any instruction or interrupt that causes a task switch in Protected Mode (with VM = 1 in the new FLAGS image), and can be cleared only by an interrupt or exception in Virtual 8086 Mode. IRET and POPF instructions executed in Real Mode or Virtual 8086 Mode will not change the value in the VM bit. The transition out of Virtual 8086 Mode to Am386DX/DXL microprocessor Protected Mode occurs only on receipt of an interrupt or exception (such as due to a sensitive instruction). In Virtual 8086 Mode, all interrupts and exceptions vector through the Protected Mode IDT, and enter an interrupt handler in Am386DX/DXL CPU Protected Mode. That is, as part of interrupt processing, the VM bit is cleared. Because the matching IRET must occur from level 0, if an Interrupt or Trap gate is used to field an interrupt or exception out of Virtual 8086 Mode, the gate must perform an interlevel interrupt only to level 0. Interrupt or Trap gates through conforming segments or through segments with DPL>0, will raise a GP fault with the CS selector as the error code. ## Task Switches To/From Virtual 8086 Mode Tasks which can execute in Virtual 8086 Mode must be described by a TSS with the new Am386DX/DXL microprocessor format (Type 9 or 11 descriptor). 8086 Application makes "Open File Call" → causes General Protection Fault (Arrow #1) Virtual 8086 Monitor intercepts call. Calls Am386DX/DXL CPU OS (Arrow #2) Am386DX/DXL CPU OS "Opens File" returns control to 8086 OS (Arrow #3) 8086 OS returns control to application (Arrow #4) Transparent to Application 15021B-043 Figure 40. Virtual 8086 Environment Interrupt and Call Handling A task switch out of Virtual 8086 Mode will operate exactly the same as any other task switch out of a task with an Am386DX/DXL CPU TSS. All of the programmer visible state, including the FLAGS register with the VM bit set to 1, is stored in the TSS. The segment registers in the TSS will contain 8086 segment base values rather than selectors. A task switch into a task described by an Am386DX/DXL microprocessor TSS will have an additional check to determine if the incoming task should be resumed in Virtual 8086 Mode. Tasks described by 80286 format TSSs cannot be resumed in Virtual 8086 Mode, so no check is required there (the FLAGS image in 80286 format TSS has only the low-order 16 FLAGS bits), Before loading the segment register images from an Am386DX/DXL CPU TSS, the FLAGS image is loaded so that the segment registers are loaded from the TSS image as 8086 segment base values. The task is now ready to resume in Virtual 8086 Execution Mode. ## Transitions Through Trap and Interrupt Gates, and IRET A task switch is one way to enter or exit Virtual 8086 Mode. The other method is to exist through a Trap or Interrupt gate, as part of handling an interrupt, and to enter as part of executing an IRET instruction. The transition out must use an Am386DX/DXL microprocessor Trap gate (Type 14) or Interrupt gate (Type 15) that must point to a non-conforming level 0 segment (DPL = 0) in order to permit the trap handler to IRET back to the Virtual 8086 program. The gate must point to a nonconforming level 0 segment to perform a level switch to level 0 so that the matching IRET can change the VM bit. Am386DX/DXL device gates must be used, since 80286 gates save only the lower 16 bits of the FLAGS register, so that the VM bit will not be saved on transitions through the 80286 gates. Also, the 16-bit IRET (presumably) used to terminate the 80286 interrupt handler will pop only the lower 16 bits from FLAGS, and will not affect the VM bit. The action taken for an Am386DX/DXL microprocessor Trap or Interrupt gate if an interrupt occurs while the task is executing in Virtual 8086 Mode is given by the following sequence. - Save the FLAGS register in a temp to push later. Turn off the VM and TF bits, and if the interrupt is serviced by an Interrupt gate, turn off IF bit, also. - 2. Interrupt and Trap gates must perform a level switch from 3 (where the VM86 program executes) to level 0 (so IRET can return). This process involves a stack switch to the stack given in the TSS for privilege level 0. Save the Virtual 8086 Mode SS and ESP registers to push in a later step. The segment register load of SS will be done as a Protected Mode - segment load since the VM bit was turned off above. 3. Push the 8086 segment register values onto the new stack, in the order: GS, FS, DS, ES. These are pushed as 32-bit quantities with undefined values in the upper 16 bits. Then load these 4 registers with null selectors (0). - Push the old 8086 stack pointer onto the new stack by pushing the SS register (as 32-bit, high bits undefined), then pushing the 32-bit ESP register saved above. - 5. Push the 32-bit FLAGS register saved in step 1. - Push the old 8086 instruction pointer onto the new stack by pushing the CS register (as 32-bits, high bits undefined), then pushing the 32-bit EIP register. - Load up the new CS:EIP value from the interrupt gate and begin execution of the interrupt routine in Protected Am386DX/DXL Microprocessor Mode. The transition out of Virtual 8086 Mode performs a level change and stack switch, in addition to changing back to Protected Mode. In addition, all of the 8086 segment register images are stored on the stack (behind the SS:ESP image), and then loaded with null (0) selectors before entering the interrupt handler. This will permit the handler to safely save and restore the DS, ES, FS, and GS registers as 80286 selectors. This is needed so that interrupt handlers that "don't care" about the mode of the interrupted program can use the same prolog and epilog code for state saving (i.e., push all registers in prolog, pop all in epilog) regardless of whether or not a native mode or Virtual 8086 Mode program was interrupted. Restoring null selectors to these registers before executing the IRET will not cause a trap in the interrupt handler. Interrupt routines that expect values in the segment registers or return values in segment registers will have to obtain/return values from the 8086 register images pushed onto the new stack. They will need to know the mode of the interrupted program in order to know where to find/return segment registers, and also to know how to interpret segment register values. The IRET instruction will perform the inverse of the above sequence. Only the extended Am386DX/DXL microprocessor IRET instruction (operand size = 32) can be used and must be executed at level 0 to change the VM bit to 1. - If the NT bit in the FLAGS register is on, an inter-task return is performed. The current state is stored in the current TSS, and the link field in the current TSS is used to locate the TSS for the interrupted task which is to be resumed. Otherwise, continue with the following sequence. - Read the FLAGS image from SS:8[ESP] into the FLAGS register. This will set VM to the value active in the interrupted routine. - Pop off the instruction pointer CS:EIP. EIP is popped first, then a 32-bit word is popped that contains the CS value in the lower 16 bits. If VM = 0, this CS load is done as a Protected Mode segment load. If VM = 1, this will be done as an 8086 segment load. - 4. Increment the ESP register by 4 to bypass the FLAGS image which was popped in step 1. - If VM=1, load segment registers ES, DS, FS, and GS from memory locations SS:[ESP+8], SS:[ESP+12], SS:[ESP+16], and SS:[ESP+20], respectively, where the new value of ESP stored in step 4 is used. Since VM = 1, these are done as 8086 segment register loads. - Else if VM = 0, check that the selectors in ES, DS, FS, and GS are valid in the interrupted routine. Null out invalid selectors to trap if an attempt is made to access through them. - If (RPL(CS) > CPL), pop the stack pointer SS:ESP from the stack. The ESP register is popped first, followed by 32-bits containing SS in the lower 16 bits. If VM = 0, SS is loaded as a Protected Mode segment register load. If VM = 1, an 8086 segment register load is used. - Resume execution of the interrupted routine. The VM bit in the FLAGS register (restored from the interrupt routine's stack image in step 1) determines whether the processor resumes the interrupted routine in Protected Mode of Virtual 8086 Mode. # FUNCTIONAL DATA Introduction The Am386DX/DXL microprocessor features a straight forward functional interface to the external hardware. The Am386DX/DXL CPU has separate parallel buses for data and address. The data bus is 32 bits in width and is bidirectional. The address bus outputs 32-bit address values in the most directly usable form for the high-speed local bus: 4 individual Byte Enable signals and the 30 upper-order bits as a binary value. The data and address buses are interpreted and controlled with their associated control signals. A dynamic data bus sizing feature allows the processor to handle a mix of 32- and 16-bit external buses on a cycle-by-cycle basis (see Data Bus Sizing). If 16-bit bus size is selected, the Am386DX/DXL microprocessor automatically makes any adjustment needed, even performing another 16-bit bus cycle to complete the transfer if that is necessary. Any 8-bit peripheral devices may be connected to 32- or 16-bit buses with no loss of performance. A new address pipelining option is provided and applies to 32- and 16-bit buses for substantially improved memory utilization, especially for the most heavily used memory resources. The address pipelining option, when selected, typically allows a given memory interface to operate with one less wait state than would otherwise be required (see Address Pipelining). The pipelined bus is also well suited to interleaved memory designs. When address pipelining is requested by the external hardware, the Am386DX/DXL microprocessor will output the address and bus cycle definition of the next bus cycle (if it is internally available) even while waiting for the current cycle to be acknowledged. Non-pipelined address timing, however, is ideal for external cache designs, since the cache memory will typically be fast enough to allow non-pipelined cycles. For maximum design flexibility, the address pipelining option is selectable on a cycle-by-cycle basis. The processor's bus cycle is the basic mechanism for information transfer, either from system to processor or from processor to system. Am386DX/DXL microprocessor bus cycles perform data transfer in a minimum of only two clock periods. On a 32-bit data bus, the maximum Am386DX/DXL device transfer at 20-MHz bandwidth is therefore 40 Mb/s, at 25-MHz bandwidth is 50 Mb/s, at 33-MHz bandwidth is 66 Mb/s, and at 40-MHz bandwidth is 80 Mb/s. Any bus cycle will be extended for more than two clock periods, however, if external hardware withholds acknowledgment of the cycle. At the appropriate time, acknowledgment is signaled by asserting the Am386DX/DXL microprocessor READY input. The Am386DX/DXL CPU can relinquish control of its local buses to allow mastership by other devices, such as direct memory access channels. When relinquished, HLDA is the only output pin driven by the Am386DX/DXL microprocessor providing near-complete isolation of the processor from its system. The near-complete isolation characteristic is ideal when driving the system from test equipment and in fault-tolerant applications. Functional data covered in this section describes the processor's hardware interface. First, the set of signals available at the processor pins is described (see Signal Description). Following that are the signal waveforms occurring during bus cycles (see Bus Transfer Mechanism, Bus Functional Description, and Other Functional Descriptions). ## **Signal Description** #### Introduction Ahead is a brief description of the Am386DX/DXL CPU input and output signals arranged by functional groups (see Figure 41). Example signal: M/IO —High voltage indicates Memory selected -Low voltage indicates I/O selected The signal descriptions sometimes refer to AC timing parameters, such as t25 RESET Setup Time and t26 RESET Hold Time. #### Clock (CLK2) CLK2 provides the fundamental timing for the Am386DX/DXL microprocessor. It is divided by two internally to generate the internal processor clock used for instruction execution. The internal clock is comprised of two phases, phase one and phase two. Each CLK2 period is a phase of the internal clock. Figure 42 illustrates the relationship. If desired, the phase of the internal processor clock can be synchronized to a known phase by ensuring the RESET signal falling edge meets its applicable setup and hold times, t25 and t26. ## Data Bus (D31-D0) These three-state, bidirectional signals provide the general purpose data path between the Am386DX/DXL microprocessor and other devices. Data bus inputs and outputs indicate 1 when High. The data bus can transfer data on 32- and 16-bit buses using a data bus sizing feature controlled by the BS16 input. See Section Bus Control. Data bus reads require that read data setup and hold times, t21 and t22, be met for correct operation. In addition, the Am386DX/DXL microprocessor requires that all data bus pins be at a valid logic state (High or Low) at the end of each read cycle, when READY is asserted. During any write operation (and during halt cycles and shut down cycles), the Am386DX/DXL microprocessor always drives all 32 signals of the data bus even if the current bus size is 16 bits. Figure 41. Functional Signal Groups Figure 42. CLK2 Signal and Internal Processor Clock #### Address Bus (BE3-BE0, A31-A2) These three-state outputs provide physical memory addresses or I/O port addresses. The address bus is capable of addressing 4 Gb of physical memory space (00000000H–FFFFFFFFH), and 64 Kb of I/O address space (00000000H–0000FFFH) for programmed I/O. I/O transfers automatically generated for Am386DX/DXL microprocessor-to-coprocessor communication use I/O addresses 800000F8H–800000FFH, so A31 is High in conjunction with M/IO Low allows simple generation of the coprocessor select signal. The Byte Enable outputs, BE3-BE0, directly indicate which bytes of the 32-bit data bus are involved with the current transfer. This is most convenient for external hardware. BE0 applies to D7–D0 BE1 applies to D15–D8 BE2 applies to D23–D16 BE3 applies to D31–D24 The number of Byte Enables asserted indicates the physical size of the operand being transferred (1, 2, 3, or 4 bytes). Refer to Section Operand Alignment. When a memory write cycle or I/O write cycle is in progress and the operand being transferred occupies only the upper 16 bits of the data bus (D31–D16), duplicate data is simultaneously presented on the corresponding lower 16 bits of the data bus (D15–D0). This duplication is performed for optimum write performance on 16 bit buses. The pattern of write data duplication is a function of the Byte Enables asserted during the write cycle. Table 13 lists the write data present on D31–D0, as a function of the asserted Byte Enable outputs BE3–BE0. # Bus Cycle Definition Signals (W/ $\overline{R}$ , D/ $\overline{C}$ , M/ $\overline{IO}$ , LOCK) These three-state outputs define the type of bus cycle being performed. W/R distinguishes between write and read cycles, D/C between data and control cycles, M/IO between memory and I/O cycles, and LOCK between locked and unlocked bus cycles. The primary bus cycle definition signals are $W/\overline{R}$ , $D/\overline{C}$ , and $M/\overline{IO}$ , since these are the signals driven valid as the $\overline{ADS}$ (Address Status output) is driven asserted. The $\overline{LOCK}$ is driven valid at the same time as the first locked bus cycle begins, which due to address pipelining, could be later than $\overline{ADS}$ is driven asserted. See Pipelined Address. The $\overline{LOCK}$ is negated when the $\overline{READY}$ input terminates the last bus cycle that was locked. Exact bus cycle definitions, as a function of W/ $\overline{R}$ , D/ $\overline{C}$ , and M/ $\overline{IO}$ , are given in Table14. Note one combination of W/ $\overline{R}$ , D/ $\overline{C}$ , and M/ $\overline{IO}$ is never given when $\overline{ADS}$ is asserted (however, that combination, which is listed as does not occur, may occur during idle bus states when $\overline{ADS}$ is not asserted). If M/ $\overline{IO}$ , D/ $\overline{C}$ , and W/ $\overline{R}$ are qualified by $\overline{ADS}$ asserted, then a decoding scheme may be simplified by using this definition. Table 13. Write Data Duplication as a Function of BE3-BE0 | Am386 | DX/DXL C | PU Byte E | nables | A | m386DX/DXL | CPU Write Da | ta | Automatic | | |-------|----------|-----------|--------|---------|------------|--------------|-------|--------------|--| | BE3 | BE2 | BE1 | BEO | D31-D24 | D23-D16 | D15D8 | D7-D0 | Duplication? | | | High | High | High | Low | Undef | Undef | Undef | Α | No | | | High | High | Low | High | Undef | Undef | В | Undef | No | | | High | Low | High | High | Undef | С | Undef | С | Yes | | | Low | High | High | High | Ď | Undef | D | Undef | Yes | | | High | High | Low | Low | Undef | Undef | В | Α | No | | | High | Low | Low | High | Undef | С | В | Undef | No | | | Low | Low | High | High | D | C . | D | С | Yes | | | High | Low | Low | Low | Undef | С | В | Α | No | | | Low | Low | Low | High | D | С | В | Undef | No | | | Low | Low | Low | Low | D | С | В | Α | . No | | Key: D = Logical Write Data D31-D24 C = Logical Write Data D23-D16 B = Logical Write Data D15–D8 A = Logical Write Data D7–D0 **Table 14. Bus Cycle Definition** | M/IO | D/C | W/R | Bus Cycle Type | Locked? | |------|------|------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------| | Low | Low | Low | Interrupt Acknowledge | Yes | | Low | Low | High | Does Not Occur | | | Low | High | Low | I/O Data Read | No | | Low | High | High | I/O Data Write | No | | High | Low | Low | Memory Code Read | No | | High | Low | High | Halt: Shutdown: Address = 2 Address = 0 BE0 High BE0 Low BE1 High BE1 High BE2 Low BE2 High BE3 High BE3 High A31-A2 Low A31-A2 Low | No | | High | High | Low | Memory Data Read | Some Cycles | | High | High | High | Memory Data Write | Some Cycles | # Bus Control Signals (ADS, READY, NA, BS16) Introduction The following signals allow the processor to indicate when bus cycle has begun and allow other system hardware to control address pipelining, data bus width, and bus cycle termination. ### Address Status (ADS) This three-state output indicates that a valid bus cycle definition and address (W/R, D/C, M/IO, BE3-BE0, and A31-A2) is being driven at the Am386DX/DXL microprocessor pins. It is asserted during T1 and T2P bus states (see Non-pipelined Address and Pipelined Address for additional information on bus states). #### Transfer Acknowledge (READY) This input indicates the current bus cycle is complete, and the active bytes indicated by BE3-BE0 and BS16 are accepted or provided. When READY is sampled asserted during a read cycle or interrupt acknowledge cycle, the Am386DX/DXL microprocessor latches the input data and terminates the cycle. When READY is sampled asserted during a write cycle, the processor terminates the bus cycle. READY is ignored on the first bus state of all bus cycles, and sampled each bus state thereafter until asserted. READY must eventually be asserted to acknowledge every bus cycle, including Halt Indication and Shutdown Indication bus cycles. When being sampled, READY must always meet setup and hold times, t19 and t20, for correct operation. See all sections of Bus Functional Description. #### Next Address Request (NA) This is used to request address pipelining. This input indicates the system is prepared to accept new values of $\overline{BE3}-\overline{BE0}$ , A31-A2, W/R, D/C, and M/IO from the Am386DX/DXL microprocessor even if the end of the current cycle is not being acknowledged on READY. If this input is asserted when sampled, the next address is driven onto the bus provided the next bus request is already pending internally. See Address Pipelining and Read and Write Cycles. NA must always meet setup and hold times, t15 and t16, for correct operation. #### Bus Size 16 (BS16) The BS16 feature allows the Am386DX/DXL microprocessor to directly connect to 32- and 16-bit data buses. Asserting this input constrains the current bus cycle to use only the lower-order half (D15-D0) of the data bus. corresponding to BEO and BE1. Asserting BS16 has no additional effect if only BEO and/or BE1 are asserted in the current cycle. However, during bus cycles asserting BE2 or BE3, asserting BS16 will automatically cause the Am386DX/DXL microprocessor to make adjustments for correct transfer of the upper byte(s) using only physical data signals D15-D0. If the operand spans both halves of the data bus and BS16 is asserted, the Am386DX/DXL microprocessor will automatically perform another 16-bit bus cycle. BS16 must always meet setup and hold times, t17 and t18, for correct operation. Am386DX/DXL CPU I/O cycles are automatically generated for coprocessor communication. Since the Am386DX/DXL microprocessor must transfer 32-bit quantities between itself and a 387DX math coprocessor, BS16 must not be asserted during 387DX math coprocessor communication cycles. # Bus Arbitration Signals (HOLD, HLDA) #### Introduction This section describes the mechanism by which the processor relinquishes control of its local buses when requested by another bus master device. See Entering and Exiting Hold Acknowledge for additional information. #### Bus Hold Request (HOLD) This input indicates some device other than the Am386DX/DXL CPU requires bus mastership. HOLD must remain asserted as long as any other device is a local bus master. HOLD is not recognized while RESET is asserted. If RESET is asserted while HOLD is asserted, RESET has priority and places the bus into an idle state, rather than the hold acknowledge (high impedance) state. HOLD is level-sensitive and is a synchronous input. HOLD signals must always meet setup and hold times, t23 and t24, for correct operation. #### Bus Hold Acknowledge (HLDA) Assertion of this output indicates the Am386DX/DXL microprocessor has relinquished control of its local bus in response to HOLD asserted, and is in the Bus Hold Acknowledge state. The Hold Acknowledge state offers near-complete signal isolation. In the Hold Acknowledge state, HLDA is the only signal being driven by the Am386DX/DXL microprocessor. The other output signals or bidirectional signals (D31–D0, BE3–BE0, A31–A2, W/R, D/C, M/IO, LOCK, and ADS) are in a high-impedance state so the requesting bus master may control them. Pull-up resistors may be desired on several signals to avoid spurious activity when no bus master is driving them. See Resistor Recommendations. Also, one rising edge occurring on the NMI input during Hold Acknowledge is remembered for processing after the HOLD input is negated. In addition to the normal usage of Hold Acknowledge with DMA controllers or master peripherals, the near-complete isolation has particular attractiveness during system test when test equipment drives the system and in hardware-fault-tolerant applications. # Coprocessor Interface Signals (PEREQ, BUSY, ERROR) #### Introduction In the following sections are descriptions of signals dedicated to the numeric coprocessor interface. In addition to the data bus, address bus, and bus cycle definition signals, these following signals control communication between the Am386DX/DXL microprocessor and its 387DX math coprocessor extension. #### Coprocessor Request (PEREQ) When asserted, this input signal indicates a coprocessor request for a data operand to be transferred to/from memory by the Am386DX/DXL microprocessor. In response, the Am386DX/DXL CPU transfers information between the coprocessor and memory. Because Am386DX/DXL microprocessor has internally stored the coprocessor op-code being executed, it performs the requested data transfer with the correct direction and memory address. PEREQ is level-sensitive and is allowed to be asynchronous to the CLK2 signal. #### Coprocessor Busy (BUSY) When asserted, this input indicates the coprocessor is still executing an instruction and is not yet able to accept another. When the Am386DX/DXL microprocessor encounters any coprocessor instruction that operates on the numeric stack (e.g., load, pop, or arithmetic operation) or the WAIT instruction, this input is first automatically sampled until it is seen to be negated. This sampling of the BUSY input prevents overrunning the execution of a previous coprocessor instruction. The FNINIT and FNCLEX coprocessor instructions are allowed to execute even if BUSY is asserted, since these instructions are used for coprocessor initialization and exception-clearing. BUSY is level-sensitive and is allowed to be asynchronous to the CLK2 signal. BUSY serves an additional function. If BUSY is sampled Low at the falling edge of RESET, the Am386DX/DXL microprocessor performs an internal self-test (see Bus Activity During and Following Reset). If BUSY is sampled High, no self-test is performed. #### Coprocessor Error (ERROR) This input signal indicates that the previous coprocessor instruction generated a coprocessor error of a type not masked by the coprocessor's control register. This input is automatically sampled by the Am386DX/DXL microprocessor when a coprocessor instruction is encountered, and if asserted, the Am386DX/DXL device generates Exception 16 to access the error-handling software. Several coprocessor instructions, generally those that clear the numeric error flags in the coprocessor or save coprocessor state, do execute without the Am386DX/DXL microprocessor generating Exception 16 even if ERROR is asserted. These instructions are FNINIT, FNCLEX, FSTSW, FSTSWAX, FSTCW, FSTENV, FSAVE, FESTENV, and FESAVE. ERROR is level-sensitive and is allowed to be asynchronous to the CLK2 signal. ## Interrupt Signals (INTR, NMI, RESET) #### Introduction The following descriptions cover inputs that can interrupt or suspend execution of the processor's current instruction stream. #### Maskable Interrupt Request (INTR) When asserted, this input indicates a request for interrupt service, which can be masked by the Am386DX/DXL CPU Flag Register IF bit. When the Am386DX/DXL microprocessor responds to the INTR input, it performs two interrupt acknowledge bus cycles, and at the end of the second, latches an 8-bit interrupt vector on D17–D0 to identify the source of the interrupt. INTR is level-sensitive and is allowed to be asynchronous to the CLK2 signal. To assure recognition of an INTR request, INTR should remain asserted until the first interrupt acknowledge bus cycle begins. #### Non-Maskable Interrupt Request (NMI) This input indicates a request for interrupt service, which cannot be masked by software. The non-maskable interrupt request is always processed according to the pointer or gate in slot 2 of the interrupt table. Because of the fixed NMI slot assignment, no interrupt acknowledge cycles are performed when processing NMI. NMI is rising edge-sensitive and is allowed to be asynchronous to the CLK2 signal. To assure recognition of NMI, it must be negated for at least eight CLK2 periods, and then be asserted for at least eight CLK2 periods. Once NMI processing has begun, no additional NMI's are processed until after the next IRET instruction, which is typically the end of the NMI service routine. If NMI is re-asserted prior to that time, however, one rising edge on NMI will be remembered for processing after executing the next IRET instruction. #### Reset (RESET) This input signal suspends any operation in progress and places the Am386DX/DXL microprocessor in a known reset state. The Am386DX/DXL device is reset by asserting RESET for 15 or more CLK2 periods (80 or more CLK2 periods before requesting self-test). When RESET is asserted, all other input pins, except FLT, are ignored, and all other bus pins are driven to an idle bus state as shown in Table 15. If RESET and HOLD are both asserted at a point in time, RESET takes priority even if the Am386DX/DXL device was in a Hold Acknowledge state prior to RESET asserted. RESET is level-sensitive and must be synchronous to the CLK2 signal. If desired, the phase of the internal processor clock and the entire Am386DX/DXL CPU state can be completely synchronized to external circuitry by ensuring the RESET signal falling edge meets its applicable setup and hold times, t25 and t26. The signal summary is shown in Table 16. Table 15. Pin State (Idle Bus) During Reset | Pin Name | Signal Level During Reset | |----------|---------------------------| | ADS | High | | D31-D0 | High Impedance | | BE3-BE0 | Low | | A31-A2 | High | | W/R | Low | | D/C | High | | M/ĪŌ | Low | | LOCK | High | | HLDA | Low | Table 16. Am386DX/DXL Microprocessor Signal Summary | Signal<br>Name | Function | Active<br>State | Input/<br>Output | Input Synch<br>or Asynch to<br>CLK2 | Output High<br>Impedance During<br>HLDA? | |----------------|-----------------------------|-----------------|------------------|-------------------------------------|------------------------------------------| | CLK2 | Clock | <del>-</del> | ı | _ | _ | | D31-D0 | Data Bus | High | 1/0 | S | Yes | | BE3-BE0 | Byte Enables | Low | 0 | | Yes | | A31-A2 | Address Bus | High | 0 | _ | Yes | | W/R | Write-Read Indication | High | 0 | | Yes | | D/C | Data-Control Indication | High | 0 | _ | Yes | | M/ĪŌ | Memory-I/O Indication | High | 0 | _ | Yes | | LOCK | Bus Lock Indication | Low | 0 | _ | Yes | | ADS | Address Status | Low | 0 | | Yes | | NA | Next Address Request | Low | 1 | s | _ | | BS16 | Bus Size 16 | Low | l I | S | _ | | READY | Transfer Acknowledge | Low | I | S | | | HOLD | Bus Hold Request | High | l l | S | _ | | HLDA | Bus Hold Acknowledge | High | 0 | _ | No | | PEREQ | Coprocessor Request | High | I | Α | _ | | BUSY | Coprocessor Busy | Low | 1 | Α | | | ERROR | Coprocessor Error | Low | 1 | Α | - | | INTR | Maskable Interrupt Request | High | ı | Α | _ | | NMI | Non-Maskable Intrpt Request | High | l I | A | _ | | RESET | Reset | High | l | S | _ | # Bus Transfer Mechanism Introduction All data transfers occur as a result of one or more bus cycles. Logical data operands of byte, word, and Dword lengths may be transferred without restrictions on physical address alignment. Any byte boundary may be used, although two or even three physical bus cycles are performed as required for unaligned operand transfers. See Dynamic Data Bus Sizing and Operand Alignment. The Am386DX/DXL microprocessor address signals are designed to simplify external system hardware. Higher-order address bits are provided by A31–A2. Lower-order address in the form of BE3–BE0 directly provides linear selects for the four bytes of the 32-bit data bus. Physical operand size information is thereby implicitly provided for each bus cycle in the most usable form. Byte Enable outputs, BE3–BE0, are asserted when their associated data bus bytes are involved with the present bus cycle, as listed in Table 17. During a bus cycle, any possible pattern of contiguous asserted Byte Enable outputs can occur, but never patterns having a negated Byte Enable separating two or three asserted Enables. Address bits A0 and A1 of the physical operand's base address can be created when necessary (for BE1 K – Map for A0 Signal instance, for MULTIBUS I or MULTIBUS II interface), as a function of the lowest-order asserted Byte Enable. This is shown by Table 18. Logic to generate A0 and A1 is given by Figure 43. Table 17. Byte Enables and Associated Data and Operand Bytes | Byte Enable Signal | Associated Data Bus Signals | | | | |--------------------|-----------------------------------|--|--|--| | BE0 | D7-D0 (Byte 0-least significant) | | | | | BE1 | D15-D8 (Byte 1) | | | | | BE2 | D23-D16 (Byte 2) | | | | | BE3 | D31-D24 (Byte 3-most significant) | | | | Each bus cycle is composed of at least two bus states and each bus state requires one processor clock period. Additional bus states added to a single bus cycle are called wait states. See Bus Functional Description. Since a bus cycle requires a minimum of two bus states (equal to two processor clock periods), data can be transferred between external devices and the Am386DX/DXL CPU at a maximum rate of one 4-byte Dword every two processor clock periods, for a maximum bus bandwidth of 80 Mb/s (Am386DX/DXL microprocessor operating at 40-MHz processor clock rate). Figure 43. Logic to Generate A0, A1 from BE3-BE0 Table 18. Generating A31-A0 from BE3-BE0 and A31-A2 | Am386DX/DXL CPU Address Signals | | | | | | | Is | | |---------------------------------|-----------------|------------|----|------------|-----|------|------|------| | A31 | | A2 | | | BE3 | BE2 | BE1 | BE0 | | | Physica<br>Addr | | е | | | | | | | A31 | | <b>A</b> 2 | A1 | <b>A</b> 0 | | | | | | A31 | | <b>A</b> 2 | 0 | 0 | х | х | Х | Low | | A31 | | A2 | 0 | 1 | х | х | Low | High | | A31 | | <b>A</b> 2 | 1 | 0 | Х | Low | High | High | | A31 | | <b>A</b> 2 | 1 | 1 | Low | High | High | High | Note: Since A31 is High during automatic communication with coprocessor, A31 High and M/IO Low can be used to easily generate a coprocessor select signal. Figure 44. Physical Memory and I/O Spaces ### Memory and I/O Spaces Bus cycles may access physical memory space or I/O space. Peripheral devices in the system may either be memory-mapped, or I/O-mapped, or both. As shown in Figure 44, physical memory addresses range from 000000000H to FFFFFFFH (4 Gb) and I/O addresses from 00000000H to 0000FFFH (64 Kb) for programmed I/O. Note the I/O addresses used by the automatic I/O cycles for coprocessor communication are 800000F8H to 800000FFH, beyond the address range of programmed I/O, to allow easy generation of a coprocessor chip select signal using the A31 and M/IO signals. #### Memory and I/O Organization The Am386DX/DXL microprocessor datapath to memory and I/O spaces can be 32- or 16-bits wide. When 32-bits wide, memory and I/O spaces are organized naturally as arrays of physical 32-bit Dwords. Each memory or I/O Dword has four individually addressable bytes at consecutive byte addresses. The lowest-addressed byte is associated with data signals D17–D0; the highest-addressed byte with D31–D24. The Am386DX/DXL microprocessor includes a bus control input, $\overline{B}\overline{S}16$ , that also allows direct connection to 16-bit memory or I/O spaces organized as a sequence of 16-bit word. Cycles to 32- and 16-bit memory or I/O devices may occur in any sequence, since the $\overline{B}\overline{S}16$ control is sampled during each bus cycle. (See Dynamic Data Bus Sizing.) The Byte Enable signals, $\overline{B}\overline{E}3-\overline{B}\overline{E}0$ , allow byte granularity when addressing any memory or I/O structure, whether 32- or 16-bits wide. #### **Dynamic Data Bus Sizing** Dynamic Data Bus Sizing is a feature allowing direct processor connection to 32- or 16-bit data buses for memory or I/O. A single processor may connect to both size buses. Transfers to or from 32- or 16-bit ports are supported by dynamically determining the bus width during each bus cycle. During each bus cycle an address decoding circuit or the slave device itself may assert \$\overline{BS16}\$ for 16-bit ports, or negate \$\overline{BS16}\$ for 32-bit ports. With BS16 asserted, the processor automatically converts operand transfers larger than 16 bits, or misaligned 16-bit transfers, into two or three transfers as required. All operand transfers physically occur on D15–D0 when BS16 is asserted. Therefore, 16-bit memories or I/O devices only connect on data signals D15–D0. No extra transceivers are required. Asserting BS16 only affects the processor when BE2 and/or BE3 are asserted during the current cycle. If only D15–D0 are involved with the transfer, asserting BS16 has no affect since the transfer can proceed normally over a 16-bit bus whether BS16 is asserted or not. In other words, asserting BS16 has no effect when only the lower half of the bus is involved with the current cycle. There are two types of situations where the processor is affected by asserting BS16, depending on which Byte Enables are asserted during the current bus cycle. Upper Half Only: Only BE2 and/or BE3 asserted. Upper and Lower Half: At least $\overline{BE1}$ , $\overline{BE2}$ asserted (and perhaps also $\overline{BE0}$ and/or $\overline{BE3}$ ). Effect of asserting BS16 during Upper Half Only read cycles: Asserting BS16 during Upper Half Only reads causes the Am386DX/DXL microprocessor to read data on the lower 16 bits of the data bus and ignore data on the upper 16 bits of the data bus. Data that would have been read from D31–D16 (as indicated by BE2 and BE3) will instead be read from D15–D0, respectively. Effect of asserting BS16 during Upper Half Only write cycles: Asserting BS16 during Upper Half Only writes does not affect the Am386DX/DXL microprocessor. When only BE2 and/or BE3 are asserted during a Write cycle, the Am386DX/DXL microprocessor always duplicates data signals D31–D16 onto D15–D0 (see Table 13). Therefore, no further Am386DX/DXL CPU action is required to perform these writes on 32- or 16-bit buses. Effect of asserting BS16 during Upper and Lower Half read cycles: Asserting BS16 during Upper and Lower Half reads causes the processor to perform two 16-bit read cycles for complete physical operand transfer. Bytes 0 and 1 (as indicated by BE0 and BE1) are read on the first cycle using D15–D0. Bytes 2 and 3 (as indicated by BE2 and BE3) are read during the second cycle, again using D15–D0. D31–D16 are ignored during both 16-bit cycles. BE0 and BE1 are always negated during the second 16-bit cycle. See Figure 54 Cycles 2 and 2a. Effect of asserting BS16 during Upper and Lower Half write cycles: Asserting BS16 during Upper and Lower Half writes causes the Am386DX/DXL microprocessor to perform two 16-bit write cycles for complete physical operand transfer. All bytes are available the first write cycle allowing external hardware to receive Bytes 0 and 1 (as indicated by BE0 and BE1) using D15–D0. On the second cycle the Am386DX/DXL microprocessor duplicates Bytes 2 and 3 on D15–D0 and Bytes 2 and 3 (as indicated by BE2 and BE3) are written using D15–D0. BE0 and BE1 are always negated during the second 16-bit cycle. BS16 must be asserted during the second 16-bit cycle. See Figure 54 Cycles 1 and 1a. #### Interfacing with 32- and 16-Bit Memories In 32-bit-wide physical memories such as Figure 45, each physical Dword begins at a byte address that is a multiple of 4. A31–A2 are directly used as a Dword selects and $\overline{\text{BE3}}$ – $\overline{\text{BE0}}$ as byte selects. $\overline{\text{BS16}}$ is negated for all bus cycles involving the 32-bit array. When 16-bit-wide physical arrays are included in the system, as in Figure 46, each 16-bit physical word begins at an address that is a multiple of 2. Note the address is decoded to assert BS16 only during bus cycles involving the 16-bit array. If desiring to use pipelined address with 16-bit memories, then BE3-BE0 and W/R are also decoded to determine when BS16 should be asserted. (See Pipelined Address with Dynamic Data Bus Sizing.) A31-A2 are directly usable for addressing 32- and 16-bit devices. To address 16-bit devices. A1 and two Byte Enable signals are also needed. To generate an A1 signal and two Byte Enable signals for 16-bit access, BE3-BE0 should be decoded as in Table 19. Note that certain combinations of BE3-BE0 are never generated by the Am386DX/DXL microprocessor, leading to "don't care" conditions in the decoder. Any BE3-BE0 decoder, such as shown in Figure 47, may use the non-occurring BE3-BE0 combinations to its best advantage. Figure 45. Am386DX/DXL Microprocessor with 32-Bit Memory 15021B-048 Figure 46. Am386DX/DXL Microprocessor with 32-Bit and 16-Bit Memory Table 19. Generating A1, BHE, and BLE for Addressing 16-Bit Devices | Am3 | 86DX/DX | L CPU S | ignals | 16-Bit Bus Signals | | Signals | | |-----|---------|---------|--------|--------------------|-----|----------|------------------------| | BE3 | BE2 | BE1 | BEO | <b>A</b> 1 | BHE | BLE (A0) | Comments | | H* | H* | H* | H* | Х | Х | Х | X—no active bytes | | Н | Н | Н | L | L | н | L | , | | H | Н | L | Н | L | L | Н | | | Н | Н | L | L | L | L | L | | | Н | L | Н | Н | Н | н | L | | | H* | L* | H* | L* | Х | Х | Х | X-non-contiguous bytes | | Н | L | L | Н | L | L | Н | | | Н | L | L | L | L | L | L | | | L | Н | Н | Н | Н | L | Н | | | L* | H* | H* | L* | Х | Х | Х | X—non-contiguous bytes | | L* | H* | L* | H* | Х | Х | Х | X—non-contiguous bytes | | L* | H* | L* | L* | Х | Х | Х | X—non-contiguous bytes | | L | L | Н | Н | Н | L | L | | | L* | L* | H* | L* | Х | X | X | Xnon-contiguous bytes | | L | L | L | Н | L | L | н | | | L | L | L | L | L | L | L | | BLE asserted when D7-D0 of 16-bit bus is active. BHE asserted when D15–D8 of 16-bit bus is active. A1 Low for all even words; A1 High for all odd words. Key: X = "Don't Care" H = High voltage level L = Low voltage level \* = A non-occurring pattern of Byte Enables; either none are asserted or the pattern has Byte Enables asserted for non-contiguous bytes. #### **Operand Alignment** With the flexibility of memory addressing on the Am386DX/DXL microprocessor, it is possible to transfer a logical operand that spans more than one physical Dword or Word of memory or I/O. Examples are 32-bit Dword operands beginning at addresses not evenly divisible by 4- or a 16-bit Word operand split between two physical Dwords of memory array. Operand alignment and data bus size dictates when multiple bus cycles are required. Table 20 describes the transfer cycles generated for all combinations of logical operand lengths, alignment, and data bus sizing. When multiple bus cycles are required to transfer a multi-byte logical operand, the highest-order bytes are transferred first (but if BS16 asserted requires two 16-bit cycles be performed, that part of the transfer is lowest-order first). # Bus Functional Description Introduction The Am386DX/DXL microprocessor has separate, parallel buses for data and address. The data bus is 32 bits in width and is bidirectional. The address bus provides a 32-bit value using 30 signals for the 30 upper-order address bits and 4 Byte Enable signals to directly indicate the active bytes. These buses are interpreted and controlled via several associated definition or control signals. The definition of each bus cycle is given by three definition signals: $M/\overline{O}$ , $W/\overline{R}$ , and $D/\overline{C}$ . At the same time, a valid address is present on the Byte Enable signals $\overline{BE3}-\overline{BE0}$ and other address signals, A31–A2. A status signal, $\overline{ADS}$ , indicates when the Am386DX/DXL CPU issues a new bus cycle definition and address. Collectively, the address bus, data bus, and all associated control signals are referred to simply as the bus. When active, the bus performs one of the bus cycles below. - Read from memory space. - 2. Locked read from memory space. - 3. Write to memory space. - Locked write to memory space. - 5. Read from I/O space (or coprocessor). - 6. Write to I/O space (or coprocessor). - Interrupt acknowledge. - 8. Indicate halt or indicate shutdown. Table 14 shows the encoding of the bus cycle definition signals for each bus cycle. See Section Bus Cycle Definition. The data bus has a dynamic sizing feature supporting 32- and 16-bit bus size. Data bus size is indicated to the Am386DX/DXL microprocessor using its Bus Size 16 (BS16) input. All bus functions can be performed with either data bus size. K - Map for A1 Signal (same as Figure 43) K - Map for 16-bit BHE signal K - Map for 16-bit BLE signal (same as A0 signal in Figure 43). 15021B-050 Figure 47. Logic to Generate A1, BHE, and BLE for 16-Bit Buses Table 20. Transfer Bus Cycles for Bytes, Words, and Dwords | | Byte-Length of Logical Operand | | | | | | | | | |--------------------------------------------------|--------------------------------|-----|-----|----|------------|-----|-----------|-----------|-----------| | · | 1 | 1 2 | | | 4 | | | | | | Physical Byte Address in Memory (low-order bits) | xx | 00 | 01 | 10 | 11 | 00 | 01 | 10 | 11 | | Transfer Cycles over 32-bit Data Bus | b | w | w | w | hb,*<br>lb | d | hd<br>I3 | hw,<br>lw | h3,<br>lb | | | b | w | lb, | w | hb, | lw, | hb, | hw, | mw, | | Transfer Cycles over 16-bit Data Bus | | | hb | | lb | hw | lb,<br>mw | lw | hb,<br>lb | Key: b = Byte transfer 3 = 3-byte transfer w = Word transfer d = Dword transfer I = low-order portion h = high-order portion m = mid-order portion x = Don't care = BS16 asserted causes second bus cycle. <sup>\*</sup>For this case, 8086, 8088, 80186, 80188, 80286 transfer lb first, then hb. Figure 48. Fastest Read Cycles with Non-Pipelined Address Timing 15021B-051 When the Am386DX/DXL CPU bus is not performing one of the activities listed above, it is either Idle or in the Hold Acknowledge state, which may be detected by external circuitry. The Idle state can be identified by the Am386DX/DXL microprocessor giving no further assertions on its address strobe output (ADS) since the beginning of its most recent bus cycle, and the most recent bus cycle has been terminated. The Hold Acknowledge state is identified by the Am386DX/DXL CPU asserting its Hold Acknowledge (HLDA) output. The shortest time unit of bus activity is a bus state. A bus state is one processor clock period (two CLK2 periods) in duration. A complete data transfer occurs during a bus cycle, composed of two or more bus states. The fastest Am386DX/DXL microprocessor bus cycle requires only two bus states. For example, three consecutive bus read cycles, each consisting of two bus states, are shown by Figure 48. The bus states in each cycle are named T1 and T2. Any memory or I/O address may be accessed by such a two-state bus cycle, if the external hardware is fast enough. The high-bandwidth, two-clock bus cycle realizes the full potential of fast main memory, or cache memory. Every bus cycle continues until it is acknowledged by the external system hardware, using the Am386DX/DXL microprocessor READY input. Acknowledging the bus cycle at the end of the first T2 results in the shortest bus cycle, requiring only T1 and T2. If READY is not immediately asserted, however, T2 states are repeated indefinitely until the READY input is sampled asserted. #### **Address Pipelining** The address pipelining option provides a choice of bus cycle timings. Pipelined or non-pipelined address timing is selectable on a cycle-by-cycle basis with the Next Address ( $\overline{\text{NA}}$ ) input. When address pipelining is not selected, the current address and bus cycle definition remain stable throughout the bus cycle. When address pipelining is selected, the address ( $\overline{\text{BE3}}$ – $\overline{\text{BE0}}$ , A31–A2) and definition (W/ $\overline{\text{R}}$ , D/ $\overline{\text{C}}$ , and M/ $\overline{\text{IO}}$ ) of the next cycle are available before the end of the current cycle. To signal their availability, the Am386DX/DXL microprocessor address status output ( $\overline{\text{ADS}}$ ) is also asserted. Figure 49 illustrates the fastest read cycles with pipelined address timing. Note from Figure 49, the fastest bus cycles using pipelined address require only two bus states, named T1P and T2P. Therefore, cycles with pipelined address timing allow the same data bandwidth as non-pipelined cycles, but address-to-data access time is increased compared to that of a non-pipelined cycle. Fastest pipelined bus cycles consist of T1P and T2P 15021B-052 Figure 49. Fastest Read Cycles with Pipelined Address Timing By increasing the address-to-data access time, pipelined address timing reduces wait state requirements. For example, if one wait state is required with non-pipelined address timing, no wait states would be required with pipelined address. Pipelined address timing is useful in typical systems having address latches. In those systems, once an address has been latched, pipelined availability of the next address allows decoding circuitry to generate chip selects (and other necessary select signals) in advance, so selected devices are accessed immediately when the next cycle begins. In other words, the decode time for the next cycle can be overlapped with the end of the current cycle. If a system contains a memory structure of two or more interleaved memory banks, pipelined address timing potentially allows even more overlap of activity. This is true when the interleaved memory controller is designed to allow the next memory operation to begin in one memory bank while the current bus cycle is still activating another memory bank. Figure 50 shows the general structure of the Am386DX/DXL microprocessor with two-bank and four-bank interleaved memory. Note each memory bank of the interleaved memory has full data bus width (32-bit data width typically, unless 16-bit bus size is selected). Further details of pipelined address timing are given in Pipelined Address; Initiating and Maintaining Pipelined Address; Pipelined Address with Dynamic Bus Sizing; and, Maximum Pipelined Address Usage With 16-bit Bus Size. #### **Read and Write Cycles** #### Introduction Data transfers occur as a result of bus cycles, classified as Read or Write cycles. During Read cycles, data is transferred from an external device to the processor. During Write cycles, data is transferred in the other direction, from the processor to an external device. Two choices of address timing are dynamically selectable: non-pipelined or pipelined. After a bus idle state, the processor always uses non-pipelined address timing. However, the $\overline{\rm NA}$ (Next Address) input may be asserted to select pipelined address timing for the next bus cycle. When pipelining is selected and the Am386DX/DXL microprocessor has a bus request pending internally, the address and definition of the next cycle is acknowledged by $\overline{\rm READY}$ . Generally, the $\overline{\rm NA}$ input is sampled each bus cycle to select the desired address timing for the next bus cycle. #### Two-Bank Interleaved Memory: - a. Address signal A2 selects bank - b. 32-bit datapath to each bank #### Four-Bank Interleaved Memory: - a. Address signals A3 and A2 select bank - b. 32-bit datapath to each bank 15021B-053 Figure 50. Two-Bank and Four-Bank Interleaved Memory Structure Two choices of physical data bus width are dynamically selectable: 32 bits or 16 bits. Generally, the BS16 (Bus Size 16) input is sampled near the end of the bus cycle to confirm the physical data bus size applicable to the current cycle. Negation of BS16 indicates a 32-bit size and assertion indicates a 16-bit bus size. If 16-bit bus size is indicated, the Am386DX/DXL CPU automatically responds as required to complete the transfer on a 16-bit data bus. Depending on the size and alignment of the operand, another 16-bit bus cycle may be required. Table 19 provides all details. When necessary, the Am386DX/DXL microprocessor performs an additional 16-bit bus cycle, using D15–D0 in place of D31–D16. Terminating a Read cycle or Write cycle, like any bus cycle, requires acknowledging the cycle by asserting the READY input. Until acknowledged, the processor inserts wait states into the bus cycle to allow adjustment for the speed of any external device. External hardware, that has decoded the address and bus cycle type asserts the READY input at the appropriate time. At the end of the second bus state within the bus cycle, READY is sampled. At that time, if external hardware acknowledges the bus cycle by asserting READY, the bus cycle terminates as shown in Figure 51. If READY is negated as in Figure 52, the cycle continues another bus state (a wait state) and READY is sampled again at the end of that state. This continues indefinitely until the cycle is acknowledged by READY asserted. When the current cycle is acknowledged, the Am386DX/DXL microprocessor terminates it. When a Read cycle is acknowledged, the Am386DX/DXL CPU latches the information present at its data pins. When a Write cycle is acknowledged, the Am386DX/DXL CPU write data remains valid throughout phase one of the next bus state to provide write data hold time. Note: Idle states are shown here for diagram variety only. Write cycles are not always followed by an idle state. An active bus cycle can immediately follow the write cycle. 15021B-054 Figure 51. Various Bus Cycles and Idle States with Non-Pipelined Address (Zero Wait States) #### Non-Pipelined Address Any bus cycle may be performed with non-pipelined address timing. For example, Figure 51 shows a mixture of Read and Write cycles with non-pipelined address timing. Figure 51 shows that the fastest possible cycles with non-pipelined address have two bus states per bus cycle. The states are named T1 and T2. In phase one of the T1, the address signals and bus cycle definition signals are driven valid, and to signal their availability, address status (ADS) is simultaneously asserted. During Read or Write cycles, the data bus behaves as follows. If the cycle is a read, the Am386DX/DXL microprocessor floats its data signals to allow driving by the external device being addressed. The Am386DX/DXL device requires that all data bus pins be at a valid logic state (High or Low) at the end of each read cycle, when READY is asserted, even if all byte enables are not asserted. The system **must** be designed to meet this requirement. If the cycle is a write, data signals are driven by the Am386DX/DXL device beginning in phase two of T1 until phase one of the bus state following cycle acknowledgment. Figure 52 illustrates non-pipelined bus cycles with one wait added to Cycles 2 and 3. READY is sampled negated at the end of the first T2 in Cycles 2 and 3. Therefore, Cycles 2 and 3 have T2 repeated. At the end of the second T2, READY is sampled asserted. When address pipelining is not used, the address and bus cycle definition remain valid during all wait states. When wait states are added and you desire to maintain non-pipelined address timing, it is necessary to negate $\overline{\text{NA}}$ during each T2 state except the last one, as shown in Figure 52 Cycles 2 and 3. If $\overline{\text{NA}}$ is sampled asserted during a T2 other than the last one, the next state would be T2I (for pipelined address) or T2P (for pipelined address) instead of another T2 (for non-pipelined address). Note: Idle states are shown here for diagram variety only. Write cycles are not always followed by an idle state. An active bus cycle can immediately follow the Write cycle. 15021B-055 Figure 52. Various Bus Cycles and Idle States with Non-Pipelined Address (Various Number of Wait States) Bus States: T1 — First clock of a non-pipelined bus cycle (Am386DX/DXL microprocessor drives new address and asserts ADS). T2— Subsequent clocks of a bus cycle when $\overline{NA}$ has not been sampled asserted in the current bus cycle. Th— Hold Acknowledge state (Am386DX/DXL microprocessor asserts HLDA). 15022B-017 Four basic bus states describe bus operation when not using pipelined address. These states do include BS16 usage for 32-bit and 16-bit bus size. If asserting BS16 requires second 16-bit bus cycle to be performed, it is performed before HOLD asserted acknowledged. Figure 53. Bus States (Not Using Pipelined Address) Figure 53 illustrates the bus states and transitions when address pipelining is not used. The bus transitions between four possible states: T1, T2, Ti, and Th. Bus cycles consist of T1 and T2, with T2 being repeated for wait states. Otherwise, the bus may be idle in the Ti state, or in hold acknowledge, the Th state. The fastest bus cycle consists of two states: T1 and T2. When address pipelining is not used, the bus state diagram is as shown in Figure 53. When the bus is idle, it is in state Ti. Bus cycles always begin with T1. T1 always leads to T2. If a bus cycle is not acknowledged during T2 and NA is negated, T2 is repeated. When a cycle is acknowledged during T2, the following state will be T1 of the next bus cycle if a bus request is pending internally, or Ti if there is no bus request pending, or Th if the HOLD input is being asserted. The bus state diagram in Figure 53 also applies to the use of BS16. If the Am386DX/DXL microprocessor makes internal adjustments for 16-bit bus size, the adjustments do not affect the external bus states. If an additional 16-bit bus cycle is required to complete a transfer on a 16-bit bus, it also follows the state transitions shown in Figure 53. Use of pipelined address allows the Am386DX/DXL CPU to enter three additional bus states not shown in Figure 53. Figure 59 in Pipelined Address is the complete bus state diagram, including pipelined address cycles. ### Non-Pipelined Address With Dynamic Data Bus Sizing The physical data bus width for any non-pipelined bus cycle can be either 32 or 16 bits. At the beginning of the bus cycle, the processor behaves as if the data bus is 32-bits wide. When the bus cycle is acknowledged by asserting READY at the end of a T2 state, the most recent sampling of BS16 determines the data bus size for the cycle being acknowledged. If BS16 was most recently negated, the physical data bus size is defined as 32 bits. If BS16 was most recently asserted, the size is defined as 16 bits. When BS16 is asserted and two 16-bit bus cycles are required to complete the transfer, BS16 must be asserted during the second cycle; 16-bit bus size is not assumed. Like any bus cycle, the second 16-bit cycle must be acknowledged by asserting READY. Key: Dn = Physical data pin n dn = Logical data pin n Figure 54. Asserting BS16 (Zero-Wait-States, Non-Pipelined Address) Key: Dn = Physical data pin n dn = Logical data pin n Figure 55. Asserting BS16 (One-Wait-State, Non-Pipelined Address) # AMD When a second 16-bit bus cycle is required to complete the transfer over a 16-bit bus, the addresses generated for the two 16-bit bus cycles are closely related to each other. The addresses are the same, except $\overline{\text{BE0}}$ and $\overline{\text{BE1}}$ are always negated for the second cycle. This is because data on D15–D0 was already transferred during the first 16-bit cycle. Figures 54 and 55 show cases where assertion of $\overline{BS16}$ requires a second 16-bit cycle for complete operand transfer. Figure 54 illustrates cycles without wait states. Figure 55 illustrates cycles with one wait state. In Figure 55 Cycle 1, the bus cycle during which $\overline{BS16}$ is asserted, note that $\overline{NA}$ must be negated in the T2 state(s) prior to the last T2 state. This is to allow the recognition of $\overline{BS16}$ asserted in the final T2 state. The relation of $\overline{NA}$ and $\overline{BS16}$ is given fully in Pipelined Address, but Figure 55 illustrates this only precaution you need to know when using $\overline{BS16}$ with non-pipelined address. #### Pipelined Address Address pipelining is the option of requesting the address and the bus cycle definition of the next internally pending bus cycle before the current bus cycle is acknowledged with READY asserted. ADS is asserted by the Am386DX/DXL microprocessor when the next address is issued. The address pipelining option is controlled on a cycle-by-cycle basis with the NA input signal. Once a bus cycle is in progress and the current address has been valid for at least one entire bus state, the $\overline{\text{NA}}$ input is sampled at the end of every phase one until the bus cycle is acknowledged. During non-pipelined bus cycles, therefore, $\overline{\text{NA}}$ is sampled at the end of phase one in every T2. An example is Cycle 2 in Figure 56, during which $\overline{\text{NA}}$ is sampled at the end of phase one of every T2 (it was asserted once during the first T2 and has no further effect during that bus cycle). If $\overline{\text{NA}}$ is sampled asserted, the Am386DX/DXL microprocessor is free to drive the address and bus cycle definition of the next bus cycle, and assert $\overline{\text{ADS}}$ , as soon as it has a bus request internally pending. It may drive the next address as early as the next bus state, whether the current bus cycle is acknowledged at that time or not. Regarding the details of address pipelining, the Am386DX/DXL CPU has the following characteristics. For NA to be sampled asserted, BS16 must be negated at the sampling window (see Figure 56 Cycles 2 through 4, and Figure 57 Cycles 1 through 4). If NA and BS16 are both sampled asserted during - the last T2 period of a bus cycle, BS16 asserted has priority. Therefore, if both are asserted, the current bus size is taken to be 16 bits and the next address is not pipelined. - 2. The next address may appear as early as the bus state after NA was sampled asserted (see Figure 56 or 57). In that case, state T2P is entered immediately, However, when there is not an internal bus request already pending, the next address will not be available immediately after NA is asserted and T2I is entered instead of T2P (see Figure 58 Cycle 3). Provided the current bus cycle is not yet acknowledged by READY asserted, T2P will be entered as soon as the Am386DX/DXL microprocessor does drive the next address. External hardware should therefore observe the ADS output as confirmation the next address is actually being driven on the bus. - 3. Once NA is sampled asserted, the Am386DX/DXL microprocessor commits itself to the highest priority bus request that is pending internally. It can no longer perform another 16-bit transfer to the same address should BS16 be asserted externally, so thereafter must assume the current bus size is 32 bits. Therefore, if NA is sampled asserted within a bus cycle, BS16 must be negated thereafter in that bus cycle (see Figures 56, 57, 58). Consequently, do not assert NA during bus cycles that must have BS16 driven asserted. See Dynamic Bus Sizing with Pipelined Address. - Any address which is validated by a pulse on the Am386DX/DXL CPU ADS output will remain stable on the address pins for at least two processor clock periods. The Am386DX/DXL microprocessor cannot produce a new address more frequently than every two processor clock periods (see Figures 56, 57, 58). - Only the address and bus cycle definition of the very next bus cycle is available. The pipelining capability cannot look further than one bus cycle ahead (see Figure 58 Cycle 1). The complete bus state transition diagram, including operation with pipelined address is given by Figure 59. Note it is a superset of the diagram for non-pipelined address only and the three additional bus states for pipelined address are drawn in bold. The fastest bus cycle with pipelined address consists of just two bus states, T1P and T2P (recall for non-pipelined address it is T1 and T2). T1P is the first bus state of a pipelined cycle. Note: Following any idle bus state (Ti), addresses are non-pipelined. Within non-pipelined bus cycles, NA is only sampled during wait states. Therefore, to begin address pipelining during a group of non-pipelined bus cycles requires a non-pipelined cycle with at least one wait state (Cycle 2 above). Figure 56. Transitioning to Pipelined Address During Burst of Bus Cycles Note: Following any idle bus state (Ti), the address is always non-pipelined and NA is only sampled during wait states. To start address pipelining after an idle state requires a non-pipelined cycle with at least one wait state (Cycle 1 above). The pipelined cycles (2, 3, 4 above) are shown with various numbers of wait states. Figure 57. Fastest Transition to Pipelined Address Following Idle Bus State #### Initiating and Maintaining Pipelined Address Using the state diagram Figure 59, observe the transitions from an idle state, Ti, to the beginning of a pipelined bus cycle, T1P. From an idle state Ti, the first bus cycle must begin with T1, and is therefore a non-pipelined bus cycle. The next bus cycle will be pipelined, however, provided NA is asserted and the first bus cycle ends in a T2P state (the address for the next bus cycle is driven during T2P). The fastest path from an idle state to a bus cycle with pipelined address is shown in below: T1-T2-T2P are the states of the bus cycle that establishes address pipelining for the next bus cycle, which begins with T1P. The same is true after a bus hold state, shown below: | Th, Th, Th | T1-T2-T2P | T1P-T2P | |-------------------------------|------------------------|--------------------| | | | | | Hold<br>Acknowledge<br>States | Non-Pipelined<br>Cycle | Pipelined<br>Cycle | The transition to pipelined address is shown functionally by Figure 57 Cycle 1. Note that Cycle 1 is used to transition into pipelined address timing for the subsequent Cycles 2, 3, and 4 that are pipelined. The $\overline{\text{NA}}$ input is asserted at the appropriate time to select address pipelining for Cycles 2, 3, and 4. Once a bus cycle is in progress and the current address has become valid, the $\overline{\text{NA}}$ input is sampled at the end of every phase one, beginning with the next bus state, until the bus cycle is acknowledged. During Figure 57 Cycle 1 therefore, sampling begins in T2. Once $\overline{\text{NA}}$ is sampled asserted during the current cycle, the Am386DX/DXL microprocessor is free to drive a new address and bus cycle definition on the bus as early as the next bus state. In Figure 56 Cycle 1 for example, the next address is driven during state T2P. Thus, Cycle 1 makes the transition to pipelined address timing, since it begins with T1 but ends with T2P. Because the address for Cycle 2 is available before Cycle 2 begins, Cycle 2 is called a pipelined bus cycle, and it begins with T1P. Cycle 2 begins as soon as READY asserted terminates Cycle 1. Example transition bus cycles are Figure 57 Cycle 1 and Figure 56 Cycle 2. Figure 57 shows transition during the very first cycle after an idle bus state, which is the fastest possible transition into address pipelining. Figure 56 Cycle 2, shows a transition cycle occurring during a burst of bus cycles. In any case, a transition cycle is the same whenever it occurs: it consists at least of T1, T2 (you assert NA at that time), and T2P (provided the Am386DX/DXL microprocessor has an internal bus request already pending, which it almost always has). T2P states are repeated if wait states are added to the cycle. Note three states (T1, T2, and T2P) are only required in a bus cycle performing a transition from non-pipelined address into pipelined address timing; for example, Figure 57 Cycle 1. Figure 57 Cycles 2, 3, and 4 show that address pipelining can be maintained with two-state bus cycles consisting only of T1P and T2P. Once a pipelined bus cycle is in progress, pipelined timing is maintained for the next cycle by asserting $\overline{\text{NA}}$ and detecting that the Am386DX/DXL CPU enters T2P during the current bus cycle. The current bus cycle must end in state T2P for pipelining to be maintained in the next cycle. T2P is identified by the assertion of $\overline{\text{ADS}}$ . Figures 56 and 57 however, show pipelining ending after Cycle 4, because Cycle 4 ends in T2P. This indicates the Am386DX/DXL CPU did not have an internal bus request prior to the acknowledgment of Cycle 4. If a cycle ends with a T2 or T2I, the next cycle will not be pipelined. Realistically, address pipelining is almost always maintained as long as $\overline{\text{NA}}$ is sampled asserted. This is so, because in the absence of any other request a code prefetch request is always internally pending until the instruction decoder and code prefetch queue are completely full. Therefore, address pipelining is maintained for long bursts of bus cycles, if the bus is available (i.e., HOLD negated) and $\overline{\text{NA}}$ is sampled asserted in each of the bus cycles. Figure 58. Details of Address Pipelining During Cycles with Wait States Figure 59. Am386DX/DXL Microprocessor Complete Bus States (Including Pipelined Address) ### Pipelined Address With Dynamic Data Bus Sizing The BS16 feature allows easy interface to 16-bit data buses. When asserted, the Am386DX/DXL microprocessor bus interface hardware performs appropriate action to make the transfer using a 16-bit data bus connected on D15–D0. There is a degree of interaction, however, between the use of Address Pipelining and the use of Bus Size 16. The interaction results from the multiple bus cycles required when transferring 32-bit operands over a 16-bit bus. If the operand requires both 16-bit halves of the 32-bit bus, the appropriate Am386DX/DXL microprocessor action is a second bus cycle to complete the operand's transfer. This necessity conflicts with $\overline{\text{NA}}$ usage. When $\overline{\text{NA}}$ is sampled asserted, the Am386DX/DXL microprocessor commits itself to perform the next internally pending bus request, and is allowed to drive the next internally pending address onto the bus. Asserting $\overline{\text{NA}}$ therefore makes it impossible for the next bus cycle to again access the current address on A31–A2, such as may be required when $\overline{\text{BS16}}$ is asserted by the external hardware. To avoid conflict, the Am386DX/DXL microprocessor is designed with following two provisions. - To avoid conflict, BS16 must be negated in the current bus cycle if NA has already been sampled asserted in the current cycle. If NA is sampled asserted, the current data bus size is assumed to be 32 bits. - Also to avoid conflict, if NA and BS16 are both asserted during the same sampling window, BS16 asserted has priority and the Am386DX/DXL microprocessor acts as if NA was negated at that time. Certain types of 16- or 8-bit operands require no adjustment for correct transfer on a 16-bit bus. Those are read or write operands using only the lower half of the data bus, and write operands using only the upper half of the bus, since the Am386DX/DXL CPU simultaneously duplicates the write data on the lower half of the data bus. For these patterns of Byte Enables and the W/ $\overline{\rm R}$ signals, $\overline{\rm BS16}$ need not be asserted at the Am386DX/DXL CPU allowing $\overline{\rm NA}$ to be asserted during the bus cycle if desired. #### Interrupt Acknowledge (INTA) Cycles In response to an interrupt request on the INTR input when interrupts are enabled, the Am386DX/DXL microprocessor performs two interrupt acknowledge cycles. These bus cycles are similar to read cycles in that bus definition signals define the type of bus activity taking place, and each cycle continues until acknowledged by READY sampled asserted. The state of A2 distinguishes the first and second interrupt acknowledge cycles. The byte address driven during the first interrupt acknowledge cycle is 4 (A31–A3 Low, A2 High, BE3–BE1 High, and BE0 Low). The address driven during the second interrupt acknowledge cycle is 0 (A31–A2 Low, BE3–BE1 High, BE0 Low). The LOCK output is asserted from the beginning of the first interrupt acknowledge cycle until the end of the second interrupt acknowledge cycle. Four idle bus states, Ti, are inserted by the Am386DX/DXL microprocessor between the two interrupt acknowledge cycles, allowing for compatibility with spec TRHRL of the 8259A Interrupt Controller. During both interrupt acknowledge cycles, D31–D0 float. No data is read at the end of the first interrupt acknowledge cycle. At the end of the second interrupt acknowledge cycle, the Am386DX/DXL microprocessor will read an external interrupt vector from D7–D0 of the data bus. The vector indicates the specific interrupt number (from 0–255) requiring service. #### **Halt Indication Cycle** The Am386DX/DXL microprocessor halts as a result of executing a HALT instruction. Signaling its entrance into the halt state, a halt indication cycle is performed. The halt indication cycle is identified by the state of the bus definition signals shown in Bus cycle Definition and a byte address of 2. BEO and BE2 are the only signals distinguishing halt indication from shutdown indication, that drives an address of 0. During the halt cycle undefined data is driven on D31–D0. The halt indication cycle must be acknowledged by READY asserted. A halted Am386DX/DXL CPU resumes execution when INTR (if interrupts are enabled) or NMI or RESET is asserted. #### **Shutdown Indication Cycle** The Am386DX/DXL microprocessor shuts down as a result of a protection fault while attempting to process a double fault. Signaling its entrance into the shutdown state, a shutdown indication cycle is performed. The shutdown indication cycle is identified by the state of the bus definition signals shown in Bus Cycle Definition and a byte address of 0. BEO and BE2 are the only signals distinguishing shutdown indication from halt indication, which drives an address of 2. During the shutdown indication cycle, undefined data is driven on D31–D0. The shutdown indication cycle must be acknowledged by READY asserted. A shutdown Am386DX/DXL microprocessor resumes execution when NMI or RESET is asserted. Key: Dn = Physical data pin n Logical data pin n Cycle 1 is pipelined. Cycle 1A cannot be pipelined, but its address can be inferred from that of Cycle 1, to externally simulate address pipelining during Cycle 1A. Figure 60. Using NA and BS16 Interrupt Vector (0–255) is read on D7–D0 at end of second Interrupt Acknowledge bus cycle. Because each Interrupt Acknowledge bus cycle is followed by idle bus states, asserting NA has no practical effect. Choose the approach that is simplest for your system hardware design. Figure 61. Interrupt Acknowledge Cycles Figure 62. Halt Indication Cycle Figure 63. Shutdown Indication Cycle # Other Functional Descriptions Entering and Exiting Hold Acknowledge The Bus Hold Acknowledge State, Th, is entered in response to the HOLD input being asserted. In the Bus Hold Acknowledge state, the Am386DX/DXL microprocessor floats all output or bidirectional signals, except for HLDA. HLDA is asserted as long as the Am386DX/DXL CPU remains in the bus hold acknowledge state. In the Bus Hold Acknowledge state, all inputs except HOLD, FLT, RESET, BUSY, ERROR, and PEREQ are ignored (also up to one rising edge on NMI is remembered for processing when HOLD is no longer asserted). Note: For maximum design flexibility, the Am386DX/DXL CPU has no internal pullup resistors on its outputs. The design may require an external pullup on ADS and other Am386DX/DXL CPU outputs to keep them negated during float periods. Figure 64. Requesting Hold from Idle Bus Th may be entered from a bus idle state, as in Figure 64, or after the acknowledgment of the current physical bus cycle if the LOCK signal is not asserted, as in Figures 65 and 66. If HOLD is asserted during a locked bus cycle, the Am386DX/DXL microprocessor may execute one unlocked bus cycle before acknowledging HOLD. If asserting BS16 requires a second 16-bit bus cycle to complete a physical operand transfer, it is performed before HOLD is acknowledged, although the bus state diagrams in Figures 53 and 59 do not indicate that detail. Th is exited in response to the HOLD input being negated. The following state will be Ti as in Figure 64 if no bus request is pending. The following bus state will be T1 if a bus request is internally pending, as in Figures 65 and 66. Th is also exited in response to RESET being asserted. If a rising edge occurs on the edge-triggered NMI input while in Th, the event is remembered as a non-maskable interrupt 2 and is serviced when Th is exited, unless of course, the Am386DX/DXL microprocessor is reset before Th is exited. #### **RESET During HOLD Acknowledge** RESET being asserted takes priority over HOLD being asserted. Therefore, Th is exited in response to the RESET input being asserted. If RESET is asserted while HOLD remains asserted, the Am386DX/DXL microprocessor drives its pins to defined states during reset, as in Table 15 Pin State During RESET, and performs internal reset activity as usual. If HOLD remains asserted when RESET is negated, the Am386DX/DXL microprocessor enters the hold acknowledge state before performing its first bus cycle, provided HOLD is still asserted when the Am386DX/DXL microprocessor would otherwise perform its first bus cycle. If HOLD remains asserted when RESET is negated, the BUSY input is still sampled as usual to determine whether a self test is being requested, and ERROR is still sampled as usual to determine whether a 387DX math coprocessor versus an 80287 (or none) is present. #### **Float** Activating the FLT input floats all Am386DX/DXL CPU bidirectional and output signals, including HLDA. Asserting FLT isolates the Am386DX/DXL CPU from the surrounding circuitry. As the Am386DX/DXL microprocessor is packaged in a surface mount PQFP, it cannot be removed from the motherboard when In-Circuit Emulation (ICE) is needed. The FLT input allows the Am386DX/DXL CPU to be electrically isolated from the surrounding circuitry. This allows connection of an emulator to the Am386DX/DXL microprocessor PQFP without removing it from the PCB. This method of emulation is referred to as ON-Circuit Emulation (ONCE). #### **Entering and Exiting Float** FLT is an asynchronous, active Low input. It is recognized on the rising edge of CLK2. When recognized, it aborts the current bus cycle and floats the outputs of the Am386DX/DXL microprocessor (Figure 68). FLT must be held Low for a minimum of 16-CLK2 cycles. Reset should be asserted and held asserted until after FLT is deasserted. This will ensure that the Am386DX/DXL CPU will exit Float in a valid state. Asserting the FLT input unconditionally aborts the current bus cycle and forces the Am386DX/DXL microprocessor into the Float mode. Since activating FLT unconditionally forces the Am386DX/DXL CPU into Float mode, the Am386DX/DXL CPU is not guaranteed to enter Float in a valid state. After deactivating FLT, the Am386DX/DXL CPU is not guaranteed to exit Float mode in a valid state. This is not a problem, as the FLT pin is meant to be used only during ONCE. After exiting Float, the Am386DX/DXL CPU must be reset to return it to a valid state. Reset should be asserted before FLT is deasserted. This will ensure that the Am386DX/DXL CPU will exit Float in a valid state. FLT has an internal pull-up resistor, and if it is not used it should be unconnected. #### **Bus Activity During and Following Reset** RESET is the highest priority input signal capable of interrupting any processor activity when it is asserted. A bus cycle in progress can be aborted at any stage; or idle states or bus hold acknowledge states discontinued so that the RESET state is established. RESET should remain asserted for at least 15-CLK2 periods to ensure it is recognized throughout the Am386DX/DXL microprocessor, and at least 80-CLK2 periods if Am386DX/DXL device self-test is going to be requested at the falling edge. RESET asserted pulses less than 15 CLK2 periods may not be recognized. RESET pulses less than 80 CLK2 periods followed by a self-test may cause the self-test to report a failure when no true failure exists. The additional RESET pulse width is required to clear additional state prior to valid self-test. Provided the RESET falling edge meets setup and hold times, t25 and t26, the internal processor clock phase is defined at that time, as illustrated by Figure 67. An Am386DX/DXL microprocessor self-test may be requested at the time RESET is negated by having the $\overline{\text{BUSY}}$ input at a Low level, as shown in Figure 67. The self-test requires ( $2^{20}$ ) + approximately 60-CLK2 periods to complete. The self-test duration is not affected by the test results. Even if the self-test indicates a problem, the Am386DX/DXL device attempts to proceed with the reset sequence afterward. After the RESET falling edge (and after the self-test if it was requested) the Am386DX/DXL microprocessor performs an internal initialization sequence for approximately 350 to 450 CLK2 periods. The Am386DX/DXL microprocessor samples its ERROR input some time after the falling edge of RESET and before executing the first ESC instruction. During this sampling period BUSY must be High. If ERROR was sampled active, the Am386DX/DXL device employs the 32-bit protocol of a 387DX math coprocessor. Even though this protocol was selected, it is still necessary to use a software recognition test to determine the presence or identity of the coprocessor and to assure compatibility with future processors. Figure 65. Requesting Hold from Active Bus (NA Negated) Note: HOLD is a synchronous input and can be asserted at any CLK2 edge, provided setup and hold (t23 and t24) requirements are met. This waveform is useful for determining Hold Acknowledge latency. Figure 66. Requesting Hold from Active Bus (NA Asserted) **Table 21. Component and Revision Identifier History** | Intel<br>i386<br>Stepping Name | Am386DX/DXL<br>Microprocessor<br>Revision | Component<br>Identifier | Revision<br>Identifier | | | |--------------------------------|-------------------------------------------|-------------------------|------------------------|--|--| | D1 | D | 03 | 08 | | | Notes: 1. BUSY should be held stable for 8-CLK2 periods before and after the CLK2 period in which RESET falling edge occurs 2. If self-test is requested, the Am386DX/DXL microprocessor outputs remain in their reset state as shown here and in Table 14. Figure 67. Bus Activity from Reset Until First Code Fetch 15022B-029 Figure 68. Entering and Exiting FLT ### **Self-Test Signature** Upon completion of self-test, (if self-test was requested by holding BUSY Low at least eight CLK2 periods before and after the falling edge of RESET), the EAX register will contain a signature of 00000000h indicating the Am386DX/DXL CPU passed its self-test of microcode and major PLA contents with no problems detected. The passing signature in EAX, 0000000h, applies to all Am386DX/DXL microprocessor revision levels. Any non-zero signature indicates the Am386DX/DXL CPU unit is faulty. ### **Component and Revision Identifiers** To assist Am386DX/DXL microprocessor users, the microprocessor after reset holds a component identifier and a revision identifier in its DX register. The upper 8 bits of DX hold 03h as identification of the Am386DX/DXL CPU component. The lower 8 bits of DX hold an 8-bit unsigned binary number related to the component revision level. The revision identifier begins chronologically with a value zero and is subject to change (typically it will be incremented) with component steppings intended to have certain improvements or distinctions from previous steppings. These features are intended to assist Am386DX/DXL microprocessor users to a practical extent. However, the revision identifier value is not guaranteed to change with every stepping revision nor to follow a completely uniform numerical sequence, depending on the type or intention of revision or manufacturing materials required to be changed. ### Coprocessor Interfacing The Am386DX/DXL microprocessor provides an automatic interface for a 387DX floating-point math coprocessor. A 387DX math coprocessor uses an I/O-mapped interface driven automatically by the Am386DX/DXL microprocessor and assisted by three dedicated signals: BUSY, ERROR, and PEREQ. As the Am386DX/DXL CPU begins supporting a coprocessor instruction, it tests the BUSY and ERROR signals to determine if the coprocessor can accept its next instruction. Thus, the BUSY and ERROR inputs eliminate the need for any preamble bus cycles for communication between processor and coprocessor. A 387DX math coprocessor can be given its command op-code immediately. The dedicated signals provide instruction synchronization, and eliminate the need of using the Am386DX/DXL CPU WAIT op-code (9Bh) for 387DX math coprocessor instruction synchronization (the WAIT op-code was required when 8086 or 8088 was used with the 8087 coprocessor). Custom coprocessors can be included in Am386DX/ DXL microprocessor based systems, via memorymapped or I/O-mapped interfaces. Such coprocessor interfaces allow a completely custom protocol, and are not limited to a set of coprocessor protocol primitives. Instead, memory-mapped or I/O-mapped interfaces may use all applicable Am386DX/DXL microprocessor instructions for high-speed coprocessor communication. The BUSY and ERROR inputs of the Am386DX/DXL CPU may also be used for the custom coprocessor interface, if such hardware assist is desired. These signals can be tested by the Am386DX/DXL CPU WAIT opcode (9Bh). The WAIT instruction will wait until the BUSY input is negated (interruptable by an NMI or enable INTR input), but generates an Exception 16 fault if the ERROR pin is in the asserted state when the BUSY goes (or is) negated. If the custom coprocessor interface is memory-mapped, protection of the addresses used for the interface can be provided with the Am386DX/DXL microprocessor on-chip paging or segmentation mechanisms. If the custom interface is I/Omapped, protection of the interface can be provided with the Am386DX/DXL microprocessor IOPL (I/O Privilege Level) mechanism. A 387DX math coprocessor interface is I/O mapped as shown in Table 22. Note that a 387DX math coprocessor interface addresses are beyond the 0hFFFFh range for programmed I/O. When the Am386DX/DXL CPU supports a 387DX math coprocessor, the Am386DX/DXL microprocessor automatically generates bus cycles to the coprocessor interface addresses. **Table 22. Math Coprocessor Port Addresses** | Address in Am386DX/DXL<br>CPU I/O Space | 387DX<br>Coprocessor Register | |-----------------------------------------|-----------------------------------| | 800000F8h | Opcode Register<br>(32-bit port) | | 800000FCh | Operand Register<br>(32-bit port) | To correctly map a 387DX math coprocessor registers to the appropriate I/O addresses, connect a 387DX math coprocessor CMD0 pin directly to the A2 output of the Am386DX/DXL microprocessor. ### **Software Testing for Coprocessor Presence** When software is used to test for coprocessor (387DX) presence, it should use only the following coprocessor op-codes: FINIT, FNINIT, FSTCW mem, FSTSW mem, FSTSW AX. To use other coprocessor op-codes when a coprocessor is known to be not present, first set EM = 1 in Am386DX/DXL microprocessor CR0. ### **ABSOLUTE MAXIMUM RATINGS** Stresses above those listed under ABSOLUTE MAXIMUM RATINGS may cause permanent device failure. Functionality at or above these limits is not implied. Exposure to Absolute Maximum Ratings for extended periods may affect device reliability. ## DC CHARACTERISTICS over COMMERCIAL operating ranges $V_{CC} = 5 \text{ V} \pm 5\%$ ; $T_{CASE} = 0^{\circ}\text{C}$ to $+85^{\circ}\text{C}$ (PGA) $V_{CC} = 5 \text{ V} \pm 10\%$ ; $T_{CASE} = 0^{\circ}\text{C}$ to $\pm 100^{\circ}\text{C}$ (PQFP — 20, 25, and 33 MHz) $V_{CC} = 5 \text{ V} \pm 5\%$ ; $T_{CASE} = 0^{\circ}\text{C}$ to $+100^{\circ}\text{C}$ (PQFP — 40 MHz) | Symbol | Parameter Description | Notes | Min | Max | Unit | |------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------|------|-------------------------|------| | VIL | Input Low Voltage | (Note 1) | -0.3 | 0.8 | V | | ViH | Input High Voltage | | 2.0 | V <sub>cc</sub> + 0.3 | V | | V <sub>ILC</sub> | CLK2 Input Low Voltage | (Note 1) | -0.3 | 0.8 | V | | V <sub>IHC</sub> | CLK2 Input High Voltage | | 2.7 | V <sub>cc</sub> +0.3 | V | | V <sub>oL</sub> | Output Low Voltage | | | | | | | I <sub>OL</sub> = 4 mA: A31-A2, D31-D0 | (Note 6) | | 0.45 | V | | | $I_{OL} = 5 \text{ mA: } \overline{BE3} - \overline{BE0}, \text{ W/R},$ $D/\overline{C}, \text{ M/IO}, \overline{LOCK}, \overline{ADS}, \text{ HLDA}$ | | | 0.45 | V | | V <sub>он</sub> | Output High Voltage | | | | | | | l <sub>он</sub> = 1 mA: A31-A2, D31-D0 | (Note 6) | 2.4 | [ | V | | | I <sub>OH</sub> = 0.9 mA: BE3−BEō,<br>W/R, D/Ō, M/IŌ, LOCK,<br>ADS, HLDA | | 2.4 | | V | | lu | Input Leakage Current<br>(All pins except BS16, PEREQ,<br>BUSY, FLT, and ERROR) | 0 V ≤ V <sub>IN</sub> ≤ V <sub>CC</sub> | | ±15 | μА | | l <sub>itt</sub> | Input Leakage Current<br>(PEREQ Pin) | V <sub>IH</sub> = 2.4 V<br>(Note 2) | | 200 | μА | | I <sub>IL</sub> | Input Leakage Current<br>(BS16, BUSY, FLT, and ERROR) | V <sub>IL</sub> = 0.45<br>(Note 3) | | -400 | μА | | I <sub>LO</sub> | Output Leakage Current | 0.45 V ≤ V <sub>out</sub> ≤ V <sub>cc</sub> | | ±15 | μА | | loc | Supply Current | (Note 7) V <sub>cc</sub> = 5.0 V | | V <sub>cc</sub> = 5.5 V | | | | CLK2 = 40 MHz: with -20 | lcc Typ = 130 | | 155 | mA | | | CLK2 = 50 MHz: with -25 | lcc Typ = 160 | | 190 | mA | | | CLK2 = 66 MHz: with -33 | lcc Typ = 210 | | 245 | mA | | | CLK2 = 80 MHz: with -40 | lcc Typ = 330 | | 400 | mA | | IccsB | Standby Current<br>(Am386DXL microprocessor) | l <sub>ccsв</sub> Typ = 20 μA<br>(Note 5) | | 150 | μА | | C <sub>IN</sub> | Input or I/O Capacitance | F <sub>c</sub> =1 MHz (Note 4) | | 10 | pF | | Соит | Output Capacitance | F <sub>c</sub> =1 MHz (Note 4) | | 12 | pF | | C <sub>CLK</sub> | CLK2 Capacitance | F <sub>c</sub> =1 MHz (Note 4) | | 20 | pF | Notes: 1. The Min value, -0.3, is not 100% tested. 2. PEREQ input has an internal pulldown resistor. 3. BS16, BUSY, FLT, and ERROR inputs each have an internal pullup resistor. 4. Not 100% tested. 5. Measurement taken with inputs at rails, outputs unloaded, BS16, BUSY, FLT, and ERROR at Vcc voltage level, PEREQ at Gnd. 6. Outputs are CMOS and will pull to rail if load is not resistive. 7. Inputs at rails (Vcc or Vss). ## SWITCHING CHARACTERISTICS over COMMERCIAL operating range — 40 MHz $V_{CC} = 5 \text{ V} \pm 5\%$ ; $T_{CASE} = 0^{\circ}\text{C to } +85^{\circ}\text{C (PGA)}$ $V_{CC} = 5 \text{ V} \pm 5\%$ ; $T_{CASE} = 0^{\circ}\text{C to } +100^{\circ}\text{C (PQFP)}$ | No. | Parameter Description | Notes | Ref Figure | Min | Max | Unit | |-----|----------------------------------------------|------------------------------------|------------|--------------|----------|------------| | | Oper. Frequency: Am386DX CPU<br>Am386DXL CPU | Half CLK2 freq.<br>Half CLK2 freq. | | 2<br>0 | 40<br>40 | MHz<br>MHz | | 1 | CLK2 Period: Am386DX CPU<br>Am386DXL CPU | | 71<br>71 | 12.5<br>12.5 | 250 | ns<br>ns | | 2 | CLK2 High Time | at V <sub>IHC</sub> | 71 | 4 | | ns | | 3 | CLK2 Low Time | at 0.8 V | 71 | 5 | | ns | | 4 | CLK2 Fall Time | 2.7 V-0.8 V (Note 3) | 71 | | 4 | ns | | 5 | CLK2 Rise Time | 0.8 V-2.7 V (Note 3) | 71 | | 4 | ns | | 6 | A31-A2 Valid Delay | C <sub>L</sub> = 50 pF | 70, 73, 81 | 4 | 13 | ns | | 7 | A31-A2 Float Delay | (Note 1) | 81 | 4 | 20 | ns | | 8 | BE3-BE0, LOCK Valid Delay | C <sub>L</sub> =50 pF | 70, 73, 81 | 4 | 13 | ns | | 9 | BE3-BE0, LOCK Float Delay | (Note 1) | 81 | 4 | 20 | ns | | 10 | W/R, M/lŌ, D/C Valid Delay | C <sub>L</sub> =50 pF | 70, 73, 81 | 4 | 13 | ns | | 10a | ADS Valid Delay | C <sub>L</sub> =50 pF | 70, 73, 81 | 4 | 13 | ns | | 11 | W/R, M/IO, D/C, ADS Float Delay | (Note 1) | 81 | 4 | 20 | ns | | 12 | D31-D0 Write Data Valid Delay | C <sub>L</sub> = 50 pF (Note 4) | 70, 74, 81 | 7 | 18 | ns | | 12a | D31-D0 Write Data Hold Time | C <sub>L</sub> = 50 pF | 70, 75 | 2 | | ns | | 13 | D31-D0 Float Delay | (Note 1) | 81 | 4 | 17 | ns | | 14 | HLDA Valid Delay | C <sub>L</sub> =50 pF | 70, 81 | 4 | 17 | ns | | 14f | HLDA Float Delay (PQFP Only) | (Note 1) | 70, 81 | 4 | 17 | ns | | 15 | NA Setup Time | | 72 | 5 | | ns | | 16 | NA Hold Time | | 72 | 2 | | ns | | 17 | BS16 Setup Time | | 72 | 5 | | ns | | 18 | BS16 Hold Time | | 72 | 2 | | ns | | 19 | READY Setup Time | | 72 | 7 | | ns | | 20 | READY Hold Time | | 72 | 4 | | ns | | 21 | D31-D0 Read Setup Time | | 72 | 4 | | ns | | 22 | D31-D0 Read Hold Time | | 72 | 3 | | ns | | 23 | HOLD Setup Time | | 72 | 4 | | ns | | 24 | HOLD Hold Time | | 72 | 2 | | ns | | 25 | RESET Setup Time | | 82 | 4 | | ns | | 26 | RESET Hold Time | | 82 | 2 | | ns | | 27 | NMI, INTR Setup Time | (Note 2) | 72 | 5 | | ns | | 28 | NMI, INTR Hold Time | (Note 2) | 72 | 5 | | ns | | 29 | PEREQ, ERROR, BUSY,<br>FLT* Setup Time | (Note 2) | 72 | 5 | | ns | | 30 | PEREQ, ERROR, BUSY, FLT* Hold Time | (Note 2) | 72 | 4 | | ns | Notes: 1. Float condition occurs when maximum output current becomes less than ILo in magnitude. Float delay is not 100% tested. 2. These inputs are allowed to be asynchronous to CLK2. The setup and hold specifications are given for testing purposes, to assure recognition within a specific clock period. <sup>3.</sup> Rise and fall times are not tested. Min time not 100% tested. <sup>\*</sup>PQFP package only. # SWITCHING CHARACTERISTICS over COMMERCIAL operating range — 33 MHz $V_{\text{CC}} = 5~V~\pm 5\%;~T_{\text{CASE}} = 0^{\circ}\text{C to } + 85^{\circ}\text{C (PGA)}$ $V_{\text{CC}} = 5~V~\pm 10\%;~T_{\text{CASE}} = 0^{\circ}\text{C to } + 100^{\circ}\text{C (PQFP)}$ | No. | Parameter Description | Notes | Ref Figure | Min | Max | Unit | |-----|----------------------------------------------|------------------------------------|------------|----------|--------------|------------| | | Oper. Frequency: Am386DX CPU<br>Am386DXL CPU | Half CLK2 freq.<br>Half CLK2 freq. | | 2<br>0 | 33.3<br>33.3 | MHz<br>MHz | | 1 | CLK2 Period: Am386DX CPU<br>Am386DXL CPU | | 71<br>71 | 15<br>15 | 250 | ns<br>ns | | 2 | CLK2 High Time | at V <sub>IHC</sub> | 71 | 4 | | ns | | 3 | CLK2 Low Time | at 0.8 V | 71 | 5 | | ns | | 4 | CLK2 Fall Time | 2.7 V-0.8 V (Note 3) | 71 | | 4 | ns | | 5 | CLK2 Rise Time | 0.8 V-2.7 V (Note 3) | 71 | | 4 | ns | | 6 | A31-A2 Valid Delay | C <sub>L</sub> = 50 pF | 70, 73, 81 | 4 | 15 | ns | | 7 | A31-A2 Float Delay | (Note 1) | 81 | 4 | 20 | ns | | 8 | BE3-BE0, LOCK Valid Delay | C <sub>L</sub> =50 pF | 70, 73, 81 | 4 | 15 | ns | | 9 | BE3-BE0, LOCK Float Delay | (Note 1) | 81 | 4 | 20 | ns | | 10 | W/R, M/iŌ, D/C Valid Delay | C <sub>L</sub> =50 pF | 70, 73, 81 | 4 | 15 | ns | | 10a | ADS Valid Delay | C <sub>L</sub> =50 pF | 70, 73, 81 | 4 | 14.5 | ns | | 11 | W/R, M/IO, D/C, ADS Float Delay | (Note 1) | 81 | 4 | 20 | ns | | 12 | D31-D0 Write Data Valid Delay | C <sub>L</sub> = 50 pF (Note 4) | 70, 74, 81 | 7 | 23 | ns | | 12a | D31-D0 Write Data Hold Time | C <sub>L</sub> =50 pF | 70, 75 | 2 | | ns | | 13 | D31-D0 Float Delay | (Note 1) | 81 | 4 | 17 | ns | | 14 | HLDA Valid Delay | C <sub>L</sub> = 50 pF | 70, 81 | 4 | 20 | ns | | 14f | HLDA Float Delay (PQFP Only) | (Note 1) | 70, 81 | 4 | 20 | ns | | 15 | NA Setup Time | | 72 | 5 | | ns | | 16 | NA Hold Time | | 72 | 2 | | ns | | 17 | BS16 Setup Time | | 72 | 5 | | ns | | 18 | BS16 Hold Time | | 72 | 2 | | ns | | 19 | READY Setup Time | | 72 | 7 | | ns | | 20 | READY Hold Time | | 72 | 4 | | ns | | 21 | D31-D0 Read Setup Time | | 72 | 5 | | ns | | 22 | D31-D0 Read Hold Time | | 72 | 3 | | ns | | 23 | HOLD Setup Time | | 72 | 9 | | ns | | 24 | HOLD Hold Time | | 72 | 2 | | ns | | 25 | RESET Setup Time | | 82 | 5 | | ns | | 26 | RESET Hold Time | | 82 | 2 | | ns | | 27 | NMI, INTR Setup Time | (Note 2) | 72 | 5 | | ns | | 28 | NMI, INTR Hold Time | (Note 2) | 72 | 5 | | ns | | 29 | PEREQ, ERROR, BUSY,<br>FLT* Setup Time | (Note 2) | 72 | 5 | | ns | | 30 | PEREQ, ERROR, BUSY,<br>FLT* Hold Time | (Note 2) | 72 | 4 | | ns | Notes: 1. Float condition occurs when maximum output current becomes less than I<sub>LO</sub> in magnitude. Float delay is not 100% tested. 2. These inputs are allowed to be asynchronous to CLK2. The setup and hold specifications are given for testing purposes, to assure recognition within a specific clock period. <sup>3.</sup> Rise and fall times are not tested. 4. Min time not 100% tested. <sup>\*</sup>PQFP package only. # SWITCHING CHARACTERISTICS over COMMERCIAL operating range — 25 MHz $V_{CC} = 5 \text{ V} \pm 5\%$ ; $T_{CASE} = 0^{\circ}\text{C}$ to $+85^{\circ}\text{C}$ (PGA) $V_{CC} = 5 \text{ V} \pm 10\%$ ; $T_{CASE} = 0^{\circ}\text{C to} + 100^{\circ}\text{C (PQFP)}$ | No. | Parameter Description | Notes | Ref Figure | Min | Max | Unit | |-----|----------------------------------------------|------------------------------------|------------|----------|----------|------------| | | Oper. Frequency: Am386DX CPU<br>Am386DXL CPU | Half CLK2 freq.<br>Half CLK2 freq. | | 2<br>0 | 25<br>25 | MHz<br>MHz | | 1 | CLK2 Period: Am386DX CPU<br>Am386DXL CPU | | 71<br>71 | 20<br>20 | 250 | ns<br>ns | | 2 | CLK2 High Time | at V <sub>IHC</sub> | 71 | 4 | | ns | | 3 | CLK2 Low Time | at 0.8 V | 71 | 5 | | ns | | 4 | CLK2 Fall Time | 2.7 V-0.8 V (Note 3) | 71 | | 7 | ns | | 5 | CLK2 Rise Time | 0.8 V-2.7 V (Note 3) | 71 | | 7 | ns | | 6 | A31-A2 Valid Delay | C <sub>L</sub> = 50 pF | 70, 73, 81 | 4 | 17 | ns | | 7 | A31-A2 Float Delay | (Note 1) | 81 | 4 | 30 | ns | | 8 | BE3-BE0, LOCK Valid Delay | C <sub>L</sub> = 50 pF | 70, 73, 81 | 4 | 17 | ns | | 9 | BE3-BE0, LOCK Float Delay | (Note 1) | 81 | 4 | 30 | ns | | 10 | W/R, M/IO, D/C, ADS Valid Delay | C <sub>L</sub> = 50 pF | 70, 73, 81 | 4 | 17 | ns | | 11 | W/R, M/IO, D/C, ADS Float Delay | (Note 1) | 81 | 4 | 30 | ns | | 12 | D31-D0 Write Data Valid Delay | C <sub>L</sub> =50 pF | 70, 74, 81 | 7 | 23 | ns | | 12a | D31-D0 Write Data Hold Time | C <sub>L</sub> =50 pF | 70, 75 | 2 | | ns | | 13 | D31-D0 Float Delay | (Note 1) | 81 | 4 | 22 | ns | | 14 | HLDA Valid Delay | C <sub>L</sub> = 50 pF | 70, 81 | 4 | 22 | ns | | 14f | HLDA Float Delay (PQFP Only) | (Note 1) | 70, 81 | 4 | 30 | ns | | 15 | NA Setup Time | | 72 | 5 | | ns | | 16 | NA Hold Time | | 72 | 3 | | ns | | 17 | BS16 Setup Time | | 72 | 5 | | ns | | 18 | BS16 Hold Time | | 72 | 3 | | ns | | 19 | READY Setup Time | | 72 | 9 | | ns | | 20 | READY Hold Time | | 72 | 4 | | ns | | 21 | D31-D0 Read Setup Time | | 72 | 7 | | ns | | 22 | D31-D0 Read Hold Time | | 72 | 5 | | ns | | 23 | HOLD Setup Time | | 72 | 9 | | ns | | 24 | HOLD Hold Time | | 72 | 3 | | ns | | 25 | RESET Setup Time | | 82 | 8 | | ns | | 26 | RESET Hold Time | | 82 | 3 | | ns | | 27 | NMI, INTR Setup Time | (Note 2) | 72 | 6 | | ns | | 28 | NMI, INTR Hold Time | (Note 2) | 72 | 6 | | ns | | 29 | PEREQ, ERROR, BUSY, FLT*<br>Setup Time | (Note 2) | 72 | 6 | | ns | | 30 | PEREQ, ERROR, BUSY, FLT*<br>Hold Time | (Note 2) | 72 | 5 | | ns | Notes: 1. Float condition occurs when maximum output current becomes less than I<sub>LO</sub> in magnitude. Float delay is not 100% tested. 2. These inputs are allowed to be asynchronous to CLK2. The setup and hold specifications are given for testing purposes, to assure recognition within a specific clock period. <sup>3.</sup> Rise and fall times are not tested. \*PQFP package only. # SWITCHING CHARACTERISTICS over COMMERCIAL operating range — 20 MHz $V_{CC} = 5 \text{ V} \pm 5\%$ ; $T_{CASE} = 0^{\circ}\text{C}$ to $+85^{\circ}\text{C}$ (PGA) $V_{CC} = 5 \text{ V} \pm 10\%$ ; $T_{CASE} = 0^{\circ}\text{C}$ to $+100^{\circ}\text{C}$ (PQFP) | No. | Parameter Description | Notes | Ref Figure | Min | Max | Unit | |-----|----------------------------------------------|------------------------------------|------------|----------|----------|------------| | | Oper. Frequency: Am386DX CPU<br>Am386DXL CPU | Half CLK2 freq.<br>Half CLK2 freq. | | 2 | 20<br>20 | MHz<br>MHz | | 1 | CLK2 Period: Am386DX CPU<br>Am386DXL CPU | | 71<br>71 | 25<br>25 | 250 | ns<br>ns | | 2 | CLK2 High Time | at V <sub>інс</sub> | 71 | 6 | | ns | | 3 | CLK2 Low Time | at 0.8 V | 71 | 6 | | ns | | 4 | CLK2 Fall Time | 2.7 V-0.8 V (Note 3) | 71 | | 8 | ns | | 5 | CLK2 Rise Time | 0.8 V-2.7 V (Note 3) | 71 | | 8 | ns | | 6 | A31-A2 Valid Delay | C <sub>L</sub> =120 pF | 70, 73, 81 | 4 | 30 | ns | | 7 | A31-A2 Float Delay | (Note 1) | 81 | 4 | 32 | ns | | 8 | BE3-BE0, LOCK Valid Delay | C <sub>L</sub> =75 pF | 70, 73, 81 | 4 | 30 | ns | | 9 | BE3-BE0, LOCK Float Delay | (Note 1) | 81 | 4 | 32 | ns | | 10 | W/R, M/IO, D/C, ADS Valid Delay | C <sub>L</sub> =75 pF | 70, 73, 81 | 4 | 28 | ns | | 11 | W/R, M/IO, D/C, ADS Float Delay | (Note 1) | 81 | 4 | 30 | ns | | 12 | D31-D0 Write Data Valid Delay | C <sub>L</sub> = 120 pF | 70, 74, 81 | 4 | 38 | ns | | 13 | D31-D0 Float Delay | (Note 1) | 81 | 4 | 27 | ns | | 14 | HLDA Valid Delay | C <sub>L</sub> =75 pF | 70, 81 | 6 | 28 | ns | | 14f | HLDA Float Delay (PQFP Only) | (Note 1) | 70, 81 | 4 | 30 | ns | | 15 | NA Setup Time | | 72 | 9 | | ns | | 16 | NA Hold Time | | 72 | 14 | | ns | | 17 | BS16 Setup Time | | 72 | 13 | | ns | | 18 | BS16 Hold Time | | 72 | 21 | | ns | | 19 | READY Setup Time | | 72 | 12 | | ns | | 20 | READY Hold Time | | 72 | 4 | | ns | | 21 | D31-D0 Read Setup Time | | 72 | 11 | | ns | | 22 | D31-D0 Read Hold Time | | 72 | 6 | | ns | | 23 | HOLD Setup Time | | 72 | 17 | | ns | | 24 | HOLD Hold Time | | 72 | 5 | | ns | | 25 | RESET Setup Time | | 82 | 12 | | ns | | 26 | RESET Hold Time | | 82 | 4 | | ns | | 27 | NMI, INTR Setup Time | (Note 2) | 72 | 16 | | ns | | 28 | NMI, INTR Hold Time | (Note 2) | 72 | 16 | | ns | | 29 | PEREQ, ERROR, BUSY, FLT* Setup Time | (Note 2) | 72 | 14 | | ns | | 30 | PEREQ, ERROR, BUSY, FLT*<br>Hold Time | (Note 2) | 72 | 5 | | ns | Notes: 1. Float condition occurs when maximum output current becomes less than I<sub>LO</sub> in magnitude. Float delay is not 100% tested. <sup>2.</sup> These inputs are allowed to be asynchronous to CLK2. The setup and hold specifications are given for testing purposes, to assure recognition within a specific clock period. Rise and fall times are not tested. \*PQFP package only. ### SWITCHING WAVEFORMS The switching characteristics consist of output delays, input setup requirements, and input hold requirements. All characteristics are relative to the CLK2 rising edge crossing the 2.0 V level. Switching characteristic measurement is defined by Figure 69. Inputs must be driven to the voltage levels indicated by this diagram. Am386DX/DXL CPU output delays are specified with minimum and maximum limits measured as shown. The minimum Am386DX/DXL microprocessor delay times are hold times provided to external circuitry. Am386DX/DXL microprocessor input setup and hold time are specified as minimums, defining the smallest acceptable sampling window. Within the sampling window, a synchronous input signal must be stable for correct Am386DX/DXL microprocessor operation. Outputs ADS, W/R, D/C, M/IO, LOCK, BE3-BE0, A31-A2, and HLDA only change at the beginning of phase one. D31-D0 (write cycles) only change at the beginning of phase two. The READY, HOLD, BUSY, ERROR, PEREQ, FLT, and D31-D0 (read cycles) inputs are sampled at the beginning of phase one. The NA, BS16, INTR, and NMI inputs are sampled at the beginning of phase two. Figure 69. Drive Levels and Measurement Points **Am386 Microprocessors for Personal Computers** C<sub>L</sub> includes all parasitic capacitances. Figure 70. AC Test Load 15021B-072 Figure 71. CLK2 Timing Figure 73. Output Valid Delay Timing Figure 74. Write Data Valid Delay Timing (25, 33, and 40 MHz) Figure 76. Write Data Valid Delay Timing (20 MHz) Note: This graph will not be linear outside of the C<sub>L</sub> range shown. 15021B-079 Figure 77. Typical Output Valid Delay Versus Load Capacitance at Maximum Operating Temperature (C<sub>L</sub>=120 pF) Note: This graph will not be linear outside of the $\ensuremath{\text{C}_{\text{L}}}$ range shown. Figure 78. Typical Output Valid Delay Versus Load Capacitance at Maximum Operating Temperature (C<sub>L</sub>=75 pF) Note: This graph will not be linear outside of the C<sub>L</sub> range shown. Figure 79. Typical Output Valid Delay Versus Load Capacitance at Maximum Operating Temperature (C<sub>L</sub> = 50 pF) 15021B-081 Note: This graph will not be linear outside of the $\ensuremath{C_L}$ range shown. Figure 80. Typical Output Rise Time Versus Load Capacitance at Maximum Operating Temperature Figure 81. Output Float Delay and HLDA Valid Delay Timing The second internal processor phase following RESET High-to-Low transition (provided t25 and t26 are met) is $\phi$ 2. Figure 82. RESET Setup and Hold Timing and Internal Phase ### **INSTRUCTION SET** This section describes the Am386DX/DXL microprocessor instruction set. A table lists all instructions along with instruction encoding diagrams and clock counts. Further details of the instruction encoding are then provided in the following sections, which completely describe the encoding structure and the definition of all fields occurring within Am386DX/DXL CPU instructions. # Am386DX/DXL Microprocessor Instruction Encoding and Clock Count Summary To calculate elapsed time for an instruction, multiply the instruction clock count, as listed in Table 23, by the processor clock period (e.g., 50 ns for a 20-MHz, 40 ns for a 25-MHz, 30 ns for a 33-MHz, and 25 ns for a 40-MHz Am386DX/DXL microprocessor). For more detailed information on the encodings of instructions refer to Section Instruction Encodings. Section Instruction Encodings explains the general structure of instruction encodings and defines exactly the encodings of all fields contained within the instruction. ### **Instruction Clock Count Assumptions** The instruction has been prefetched and decoded, and is ready for execution. - 2. Bus cycles do not require wait states. - There are no local bus HOLD requests delaying processor access to the bus. - 4. No Exceptions are detected during instruction execution. - If an effective address is calculated, it does not use two general register components. One register, scaling, and displacement can be used within the clock counts shown. However, if the effective address calculation uses two general register components, add 1 clock to the clock count shown. ### Instruction Clock Count Notation - If two clock counts are given, the smaller refers to a register operand and the larger refers to a memory operand. - 2. n = number of times repeated. - m = number of components in the next instruction executed, where the entire displacement (if any) counts as one component; the entire immediate data (if any) counts as one component; and each of the other bytes of the instruction and prefix(es) each count as one component. Table 23. Am386DX/DXL Microprocessor Instruction Set Summary | | | | | Clock | Count | | | |----------------------------------------|--------------------|-------------------|----------------------|-------------------------|-----------------------------------------|-------------------------|-----------------------------------------| | Instruction | Format | | | Real<br>Address<br>Mode | Protected<br>Virtual<br>Address<br>Mode | Real<br>Address<br>Mode | Protected<br>Virtual<br>Address<br>Mode | | GENERAL DATA TRANSFER | | | | | | | | | MOV = Move: | | | | | | | | | Register to Register/Memory | 1000100w | mod reg r/m | | 2/2 | 2/2 | b | h | | Register/Memory to Register | 1000101w | mod reg r/m | | 2/4 | 2/4 | ь | h | | Immediate to Register/Memory | 1100011w | mod 0 0 0 r/m | immediate data | 2/2 | 2/2 | ь | h | | Immediate to Register (short form) | 1 0 1 1 w reg | immediate data | • | 2 | 2 | | | | Memory to Accumulator (short form) | 1010000w | full displacement | | 4 | 4 | b | h | | Accumulator to Memory (short form) | 1010001w | full displacement | | . 2 | 2 | b | h | | Register/Memory to Segment Register | 10001110 | mod sreg3 r/m | | 2/5 | 18, 19 | ь | h, i, j | | Segment Register to Register/Memory | 10001100 | mod reg r/m | | 2/2 | 2/2 | b | h | | MOVSX = Move with Sign Extension | | | • | | | | | | Register from Register/Memory | 00001111 | 1011111w | mod reg r/m | 3/6 | 3/6 | b | h | | MOVZX = Move with Zero Extension | | | <del></del> | | | | | | Register from Register/Memory | 00001111 | 1011011w | mod reg r/m | 3/6 | 3/6 | ь | h | | PUSH = Push: | L | | | | | | | | Register/Memory | 11111111 | mod 1 1 0 r/m | | 5 | 5 | b | h | | Register (short form) | 01010 reg | | • | 2 | 2 | ь | h | | Segment Register (ES,CS,SS, or DS) | 0 0 0 sreg 2 1 1 0 | | | 2 | 2 | ь | h | | Segment Register (FS or GS) | 00001111 | 1 0 sreg 3 0 0 0 | | 2 | 2 | ь | h | | Immediate | 011010s0 | immediate data | | 2 | 2 | b | h | | PUSHA = Push All | 01100000 | | | 18 | 18 | b | h | | POP=Pop | | ı | | | | | | | Register/Memory | 10001111 | mod 0 0 0 r/m | | 5 | 5 | b | h | | Register (short form) | 01011 reg | | | 4 | 4 | b | h | | Segment Register (ES, SS, or DS) | 0 0 0 sreg 2 1 1 1 | | | 7 | 21 | ь | h, i, j | | Segment Register (FS or GS) | 00001111 | 1 0 sreg 3 0 0 1 | | 7 | 21 | b | h, i, j | | POPA = Pop All | 01100001 | | | 24 | 24 | b | h | | XCHG = Exchange | | 4 | | | | | | | Register/Memory with Register | 1000011w | mod reg r/m | | 3/5 | 3/5 | b, f | f, h | | Register with Accumulator (short form) | 10010 reg | | Clock Count | 3 | 3 | | | | IN = Input from: | | • | Virtual<br>8086 Mode | | | | | | Fixed Port | 1110010w | port number | . ♦26 | 12 | 6*/26** | | m | | Variable Port | 1110110w | | <b>◊2</b> 7 | 13 | 7*/27** | | m | | OUT = Output to: | | | | | | | | | Fixed Port | 1110011w | port number | ◊24 | 10 | 4*/24** | | m | | Variable Port | 1110111w | | ◊25 | 11 | 5*/25** | | m | | | | T | | 2 | 2 | | i | <sup>\*</sup> If CPL≤IOPL \*\* If CPL>IOPL Clock count shown applies if I/O permission allows I/O to the port in Virtual 8086 Mode. If I/O bit map denies permission, Exception 13 fault occurs; refer to clock counts for INT 3 instruction. | | | | | | Clock Count | | Com | ments | |---------------------------------------|-----------|----------------|----------------|---|-------------------------|-----------------------------------------|-------------------------|-----------------------------------------| | Instruction | Format | | | | Real<br>Address<br>Mode | Protected<br>Virtual<br>Address<br>Mode | Real<br>Address<br>Mode | Protected<br>Virtual<br>Address<br>Mode | | SEGMENT CONTROL | | | | | | | | | | LDS=Load pointer to DS | 11000101 | mod reg r/m | | | 7 | 22 | ь | h, i, j | | LES=Load pointer to ES | 11000100 | mod reg r/m | | | 7 | 22 | b | h, i, j | | LFS=Load pointer to FS | 00001111 | 10110100 | mod reg r/m | 1 | 7 | 25 | ь | h, i, j | | LGS=Load pointer to GS | 00001111 | 10110101 | mod reg r/m | 1 | 7 | 25 | ь | h, i, j | | LSS=Load pointer to SS | 00001111 | 10110010 | mod reg r/m | 1 | 7 | 22 | ь | h, i, j | | FLAG CONTROL | | | | J | | | | | | CLC = Clear Carry Flag | 11111000 | | | | 2 | 2 | | | | CLD = Clear Direction Flag | 11111100 | | | | 2 | 2 | | | | CLI = Clear Interrupt Enable Flag | 11111010 | | | | 8 | 8 | | m | | CLTS = Clear Task Switched Flag | 00001111 | 00000110 | | | 6 | 6 | С | i | | CMC = Complement Carry Flag | 11110101 | | | | 2 | 2 | | | | LAHF = Load AH into Flag | 10011111 | | | | 2 | 2 | | | | POPF = Pop Flag | 10011101 | | | | 5 | 5 | ь | h,n | | PUSHF = Push Flag | 10011100 | | | | 4 | 4 | b | h | | SAHF = Store AH into Flag | 10011110 | | | | 3 | 3 | | | | STC = Set Carry Flag | 11111001 | | | | 2 | 2 | | | | STD = Set Direction Flag | 11111101 | | | | 2 | 2 | | | | STI = Set Interrupt Enable Flag | 11111011 | | | | 8 | 8 | | m | | ARITHMETIC | L, | | | | | | | | | ADD=Add | | | | | | | | | | Register to Register | 000000dw | mod reg r/m | | | 2 | 2 | | | | Register to Memory | 0000000w | mod reg r/m | | | 7 | 7 | ь | h | | Memory to Register | 0000001w | mod reg r/m | | | 6 | 6 | b | h | | Immediate to Register/Memory | 100000sw | mod 0 0 0 r/m | immediate data | | 2/7 | 2/7 | ь | h | | Immediate to Accumulator (short form) | 0000010w | immediate data | • | | 2 | 2 | | | | ADC = Add with carry | | | | | | | | | | Register to Register | 000100dw | mod reg r/m | | | 2 | 2 | | | | Register to Memory | 0001000w | mod reg r/m | | | 7 | 7 | ь | h | | Memory to Register | 0001001w | mod reg r/m | | | 6 | 6 | b | h | | Immediate to Register/Memory | 100000sw | mod 0 1 0 r/m | immediate data | | 2/7 | 2/7 | ь | h | | Immediate to Accumulator (short form) | 0001010w | immediate data | ı | | 2 | 2 | | | | INC =Increment | <u> </u> | J | | | | | | | | Register/Memory | 1111111w | mod 0 0 0 r/m | ] | | 2/6 | 2/6 | ь | h | | Register (short form) | 01000 reg | | | | 2 | 2 | | | | SUB =Subtract | | • | | | | | | | | Register from Register | 001010dw | mod reg r/m | | | 2 | 2 | | | | Register from Memory | 0010100w | mod reg r/m | | | 7 | 7 | ь | h | | Memory from Register | 0010101w | mod reg r/m | | | 6 | 6 | ь | h | | | L | | | | | | | | Table 23. Am386DX/DXL Microprocessor Instruction Set Summary (continued) | | | | | Clock | Count | Com | ments | |-----------------------------------------------|-----------|----------------|----------------|----------------------------|-----------------------------------------|-------------------------|-----------------------------------------| | Instruction | Format | | | Real<br>Address<br>Mode | Protected<br>Virtual<br>Address<br>Mode | Real<br>Address<br>Mode | Protected<br>Virtual<br>Address<br>Mode | | ARITHMETIC (continued) | | | | | | | | | Immediate from Register/Memory | 0010011w | mod 1 0 1 r/m | immediate data | 2/7 | 2/7 | b | h | | Immediate from Accumulator | 0001110w | immediate data | | 2 | 2 | | | | (short form) | | | | | | | | | SBB = Subtract with Borrow | | | | | | | | | Register from Register | 000110dw | mod reg r/m | | 2 | 2 | | | | Register from Memory | 0001100w | mod reg r/m | | 7 | 7 | b | h | | Memory from Register | 0001101w | mod reg r/m | | 6 | 6 | b | h | | Immediate from Register/Memory | 100000sw | mod 0 1 1 r/m | immediate data | 2/7 | 2/7 | b | h | | Immediate from Accumulator | 0001110w | immediate data | | 2 | 2 | | | | DEC = Decrement | <u> </u> | | | | | | | | Register/Memory | 1111111W | reg 0 0 1 r/m | | 2/6 | 2/6 | b | h | | Register (short form) | 01001 reg | | | 2 | 2 | | | | CMP = Compare | <b>_</b> | | | | | | | | Register with Register | 001110dw | mod reg r/m | | 2 | 2 | | | | Memory with Register | 0011100w | mod reg r/m | | 5 | 5 | b | h | | Register with Memory | 0011101w | mod reg r/m | | 6 | 6 | b | h | | Immediate with Register/Memory | 100000sw | mod 1 1 1 r/m | immediate data | 2/5 | 2/5 | b | h | | Immediate with Accumulator(short form) | 0011110w | immediate data | | 2 | 2 | | | | NEG = Change Sign | 1111011w | mod 0 1 1 r/m | | 2/6 | 2/6 | b | h | | AAA = ASCII Adjust for Add | 00110111 | | | 4 | 4 | | | | DAA = Decimal Adjust for Add | 00111111 | | | 4 | 4 | | | | AAS = ASCII Adjust for Subtract | 00100111 | | | 4 | 4 | | | | DAS = Decimal Adjust for Subtract | 00101111 | | | 4 | 4 | | | | MUL = Multiply (Unsigned) | | • | | | | | | | Accumulator with Register/Memory | 1111011w | mod 1 0 0 r/m | | | | | | | Multiplier -Byte | <u> </u> | | | 12-17/15-20 | 12-17/15-20 | b,d | d,h | | -Word<br>-Doubleword | | | | 12-25/15-28<br>12-41/15-44 | | b,d<br>b,d | d,h<br>d,h | | IMUL = Integer Multiply (signed) | | | | | | -,- | _, | | Accumulator with Register/Memory | 1111011w | mod 1 0 1 r/m | | | | | | | Multiplier-Byte | | | | | 12-17/15-20 | b,d | d,h | | -Word<br>-Doubleword | | <u> </u> | | 12-25/15-28<br>12-41/15-44 | | b,d<br>b,d | d,h<br>d,h | | Register with Register/Memory | 00001111 | 10101111 | mod reg r/m | | | | | | Multiplier-Byte<br>-Word | | | | | 12-17/15-20<br>12-25/15-28 | b,d<br>b,d | d,h<br>d,h | | -Doubleword<br>Register/Memory with Immediate | | | | 12-41/15-44 | | b,d | d,h | | to Register | 011010s1 | mod reg r/m | immediate data | | | | | | -Word<br>-Doubleword | L | | ı | 13-26/14-27<br>13-42/14-43 | 13-26/14-27<br>13-42/14-43 | b,d<br>b,d | d,h<br>d,h | | Bodoloword | | | | 10 42 14 40 | 10 42 14 40 | 0,0 | u, | | | | | | | | : | | | | | | | | | | 1 | | | | | | | Clock Count | | Com | ments | |-----------------------------------------------------------------------------------------------|-----------------------------|---------------------------------------------|-------------------|----------------------|-------------------------|-----------------------------------------|-------------------------|-----------------------------------------| | Instruction | Format | | | | Real<br>Address<br>Mode | Protected<br>Virtual<br>Address<br>Mode | Real<br>Address<br>Mode | Protected<br>Virtual<br>Address<br>Mode | | ARITHMETIC (continued) | | | | | | | | | | DIV = Divide (Unsigned) | | | | | | | | | | Accumulator by Register/Memory<br>Divisor -Byte<br>-Word<br>-Doubleword | 1111011w | mod 1 1 0 r/m | | | 14/17<br>22/25<br>38/41 | 14/17<br>22/25<br>38/41 | b,e<br>b,e<br>b,e | e, h<br>e, h<br>e, h | | IDIV = Integer Divide (Signed) Accumulator by Register/Memory Divisor -Byte -Word -Doubleword | 11110112 | mod 1 1 1 r/m | | | 19/22<br>27/30<br>43/46 | 19/22<br>27/30<br>43/46 | b,e<br>b,e<br>b,e | e, h<br>e, h<br>e, h | | AAD = ASCII Adjust for Divide | 11010101 | 00001010 | l | | 19 | 19 | | | | AAM = ASCII Adjust for Multiply | 11010100 | 00001010 | | | 17 | 17 | | | | CBW=Convert Byte to Word | 10011000 | | | | 3 | 3 | | | | CWD=Convert Word to<br>Double Word | 10011001 | | | | 2 | 2 | | | | LOGIC | | | | | | | | | | Shift/Rotate Instructions<br>Not Through Carry (ROL, ROR, SAL, | SAR, SHL, and SH | IR) | | | | | | | | Register/Memory by 1 | 1101000w | mod TTT r/m | | | 3/7 | 3/7 | b | h | | Register/Memory by CL | 1101001w | mod TTT r/m | | | 3/7 | 3/7 | b | h | | Register Memory by Immediate Count | 1100000w | mod TTT r/m | immediate 8-bit d | ata | 3/7 | 3/7 | b | h | | Through Carry (RCL and RCR) | | | | | | | | | | Register/Memory by 1 | 1101000w | mod TTT r/m | | | 9/10 | 9/10 | ь | h | | Register/Memory by CL | 1101001w | mod TTT r/m | | | 9/10 | 9/10 | b | h | | Register/Memory by Immediate Count | 1100000w | mod TTT r/m | immediate 8-bit d | ata | 9/10 | 9/10 | b | h | | SHLD≖Shift Left Double | TTT 000 001 010 011 100 101 | Instruction ROL ROR RCL RCR SHL/SAL SHR SAR | | | | | | | | Register/Memory by Immediate | 00001111 | 10100100 | mod reg r/m | immediate 8-bit data | 3/7 | 3/7 | İ | | | Register/Memory by CL | 00001111 | 10100101 | mod reg r/m | 1 | 3/7 | 3/7 | | | | SHRD=Shift Right Double | | *************************************** | | | | | 1 | | | Register/Memory by Immediate | 00001111 | 10101100 | mod reg r/m | immediate 8-bit data | 3/7 | 3/7 | | | | Register/Memory by CL | 00001111 | 10101101 | mod reg r/m | 1 | 3/7 | 3/7 | | | | AND = And | | | <u> </u> | • | | | | | | Register to Register | 001000dw | mod reg r/m | | | 2 | 2 | | | | Register to Memory | 0010000w | mod reg r/m | | | 7 | 7 | ь | h | | Memory to Register | 0010001w | mod reg r/m | | | 6 | 6 | ь | h | | Immediate to Register/Memory | 1000000w | mod 1 1 0 r/m | immediate data | | 2/7 | 2/7 | ь | h | | Immediate to Accumulator (short form) | 0010010w | immediate data | • | | 2 | 2 | | | | TEST = And Function to Flags, no Re | sult | • | | | 1 | | | | | Register/Memory and Register | 1000010w | mod reg r/m | | | 2/5 | 2/5 | b | h | | Immediate Data and Register/Memory | 1111011w | mod 0 0 0 r/m | immediate data | | 2/5 | 2/5 | b | h | | Immediate Data and Accumulator (short form) | 1010100w | immediate data | | | 2 | 2 | | | Table 23. Am386DX/DXL Microprocessor Instruction Set Summary (continued) | | *************************************** | | | | Clock | Count | Co | mments | |----------------------------------------------|-----------------------------------------|-----------------------|----------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------|---------------------------------|-----------------|---------------------------------| | | F 4 | | | | Real<br>Address | Protected<br>Virtual<br>Address | Real<br>Address | Protected<br>Virtual<br>Address | | | Format | | | | Mode | Mode | Mode | Mode | | LOGIC (continued) | | | | | | | | | | OR = Or | | I | | | | | | | | Register to Register | 000010dw | mod reg r/m | | | 2 | 2 | | | | Register to Memory | 0000100w | mod reg r/m | | | 7 | 7 | b | h | | Memory to Register | 0000101w | mod reg r/m | | | 6 | 6 | b | h | | Immediate and Register/Memory | 1000000w | mod 0 0 1 r/m | immediate data | | 2/7 | 2/7 | ь | h | | Immediate to Accumulator (short form) | 0000110w | immediate data | | | 2 | 2 | | | | XOR = Exclusive or | | T | ı | | | | | | | Register to Register | 001100dw | mod reg r/m | | | 2 | 2 | | | | Register to Memory | 0011000w | mod reg r/m | | | 7 | 7 | ь | h | | Memory to Register | 0011001w | mod reg r/m | | | 6 | 6 | ь | h | | Immediate to Register/Memory | 1000000w | mod 1 1 0 r/m | immediate data | | 2/7 | 2/7 | b | h | | Immediate to Accumulator (short form) | 0011010w | immediate data | • | | 2 | 2 | | | | NOT = Invert Register/Memory | 1111011w | mod 0 1 0 r/m | | | 2/6 | 2/6 | ь | h | | STRING MANIPULATION | | | l | Clock Count | | | | | | CMPS = Compare Byte/Word | 1010011w | | | Virtual 8086<br>Mode | 10 | 10 | b | h | | INS=Input Byte/Wd from DX Port | 0110110w | | | ◊29 | 15 | 9*/29** | ь | h, m | | LODS = Load Byte/Wd to AL/AX | 1010110w | | | | 5 | 5 | ь | h | | MOVS = Move Byte/Word | 1010010w | | | | 8 | 8 | ь | h | | OUTS = Output Byte/Wd to DX Port | 0110111w | | | ◊28 | 14 | 8*/28** | ь | h, m | | SCAS = Scan Byte/Word | 1010111w | | | | 8 | 8 | ь | h | | STOS = Store Byte/Word from AL/AX/EX | 1010101w | | | | 5 | 5 | ь | h | | XLAT = Translate String | 11010111 | | | | 5 | 5 | | h | | REPEATED STRING MANIPULA | TION Repeated | ı<br>bv Count in CX ∈ | or ECX | | | | | | | REPE CMPS = Compare string | · | | 1 | | | | | | | (Find Non-Match) REPNE CMPS = Compare String | 11110011 | 1010011w | | Clock Count<br>Virtual 8086 | 5+9n | 5+9n | b | h | | (Find Match) | 11110010 | 1010011w | | Mode | 5+9n | 5+9n | b | h | | REP INS = Input String | 11110010 | 0110110w | | 028 + 6n | 14+6n | 8+6n*/ | ь | h, m | | REP LODS = Load String | 11110010 | 1010110w | | | 5+6n | 28 + 6n**<br>5 + 6n | b | h | | REP MOVS = Move String | 11110010 | 1010010w | | | 8+4n | 8 + 4n | b | h | | REP OUTS = Output String | 11110010 | 0110111w | | ♦26 + 5n | 12+5n | 6+5n*/ | b | h, m | | REPE SCAS = Scan String | | | 1 | Contract of the last la | | 26 + 5n** | 1 | | | (Find Non-AL/AX/EAX) | 11110011 | 1010111w | | | 5+8n | 5+8n | b | h | | REPNE SCAS = Store String (Find AL/AX/EAX) | 11110010 | 1010111w | | | 5+8n | 5+9n | Ь | h | | REP STOS = Store String | 11110010 | 1010101W | | | 5+5n | 5+5n | Ь | h | | BIT MANIPULATION | | | | • | | " | | '' | | BSF=Scan Bit Forward | 00001111 | 10111100 | mod reg r/m | | 11+3n | 11+3n | ь | h | | BSR=Scan Bit Reverse | 00001111 | 10111101 | mod reg r/m | | 9+3n | 9+3n | ь | h | | | | | | | L | <u> </u> | L | L | <sup>\*</sup> If CPL≤IOPL <sup>\*\*</sup> If CPL > IOPL Clock count shown applies if I/O permission allows I/O to the port in Virtual 8086 Mode. If I/O bit map denies permission, Exception 13 fault occurs; refer to clock counts for INT 3 instruction. | | | | | | Clock | Count | <del>- </del> | | |--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------|---------------------|---------------|----------------------|-------------------------|-----------------------------------------------------------------------|-------------------------|--------------------------------------------------------------------------------------------------------------| | Instruction | Format | | | | Real<br>Address<br>Mode | Protected<br>Virtual<br>Address<br>Mode | Real<br>Address<br>Mode | Protected<br>Virtual<br>Address<br>Mode | | BIT MANIPULATION (continued | n | | | | | | | | | BT = Test Bit | -, | | | | | | | | | Register/Memory, Immediate | 00001111 | 10111010 | mod 1 0 0 r/m | immediate 8-bit data | 3/6 | 3/6 | b | h | | Register/Memory, Register | 00001111 | 10100011 | mod reg r/m | | 3/12 | 3/12 | ь | h | | BTC = Test Bit and Complement | | | | j | | | | | | Register/Memory, Immediate | 00001111 | 10111010 | mod 1 1 1 r/m | immediate 8-bit data | 6/8 | 6/8 | ь | h | | Register/Memory, Register | 00001111 | 10111011 | mod reg r/m | | 6/13 | 6/13 | ь | h | | BTR = Test Bit and Reset | | | | ı | | | | | | Register/Memory, Immediate | 00001111 | 10111010 | mod 1 1 0 r/m | immediate 8-bit data | 6/8 | 6/8 | ь | h | | Register/Memory, Register | 00001111 | 10110011 | mod reg r/m | | 6/13 | 6/13 | ь | h | | BTS = Test Bit and Set | | · | | J | | | | | | Register/Memory, Immediate | 00001111 | 10111010 | mod 1 0 1 r/m | immediate 8-bit data | 6/8 | 6/8 | b | h | | Register/Memory, Register | 00001111 | 10101011 | mod reg r/m | | 6/13 | 6/13 | b | h | | CONTROL TRANSFER | | | | J | | | | | | CALL = Call | | | | | | | | | | Direct Within Segment | 11101000 | full displacement | | | 7+m | 7+m | ь | r | | Register/Memory | 11111111 | mod 0 1 0 r/m | | | 7+m | 7 + m | ь | h, r | | Indirect Within Segment | | | | | 10+m | 10+m | | '''` | | Direct Intersegment | 10011010 | unsigned full offse | et, selector | | 17 + m | 34 + m | ь | j, k, r | | Protected Mode Only (Direct Intersegi<br>Via Call Gate to Same Privilege Lev<br>Via Call Gate to Different Privilege I<br>Via Call Gate to Different Privilege I<br>From 80286 Task to 80286 TSS<br>From 80286 Task to May866D/XDX<br>From 80286 Task to Virtual 8086 Te<br>From Am386D/X/DXL CPU Task to<br>From Am386D/X/DXL CPU Task to<br>From Am386D/X/DXL CPU Task to<br>From Am386D/X/DXL CPU Task to | rel<br>Level, (No Paramete<br>Level, (x Parameters<br>L CPU TSS<br>ask (Am386DX/DXL<br>80286 TSS<br>Am386DX/DXL CPU | CPU TSS) | TSS) | | | 52 + m<br>86 + m<br>94+4x+m<br>273<br>298<br>218<br>273<br>300<br>218 | | h, j, k, r<br>h, r | | Indirect Intersegment | 11111111 | mod 0 1 1 r/m | | | 22+m | 38 + m | ь | h, j, k, r | | Protected Mode Only (Indirect Interse | gment) | <u> </u> | 1 | | | | | | | Via Call Gate to Same Privilege Let Via Call Gate to Different Privilege I Via Call Gate to Different Privilege I From 80286 Tass to B0286 TSS From 80286 Task to Am386DX/DXI From 80286 Task to Virtual 8086 Ti From Am386DX/DXI. CPU Task to From Am386DX/DXI. CPU Task to From Am386DX/DXI. CPU Task to | Level (No Parameter<br>Level (x Parameters)<br>L CPU TSS<br>ask (Am386DX/DXL<br>80286 TSS<br>Am386DX/DXL CPU | CPU TSS) | TSS) | | | 56 + m<br>90 + m<br>98+4x+m<br>278<br>303<br>222<br>278<br>305<br>222 | | h, j, k, r<br>h, j, k, r<br>h, j, k, r<br>h, j, k, r<br>h, j, k, r<br>h, j, k, r<br>h, j, k, r | | JMP = Unconditional Jump | | | | | 1 | | | | | Short | 11101011 | 8-bit displacemer | t | | 7+m | 7+m | | r | | Direct within Segment | 11101001 | full displacement | | | 7+m | 7 + m | | r | | Register/Memory<br>Indirect within Segment | 11111111 | mod 1 0 0 r/m | | | 7+m<br>10+m | 7+m<br>10+m | b | h,r | | Direct Intersegment | 11101010 | unsigned full offs | et, selector | | 12+m | 27 + m | | j, k, r | | | | • | | | | | | | | | | | Clock | Count | Co | mments | | |----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------|---------------------|-------------------|-------------------------|--------------------------------------------------|-------------------------|------------------------------------------------------------------------------------------------| | Instruction | Format | | | Real<br>Address<br>Mode | Protected<br>Virtual<br>Address<br>Mode | Real<br>Address<br>Mode | Protected<br>Virtual<br>Address<br>Mode | | CONTROL TRANSFER (continue | d) | | | | | | | | Protected Mode Only (Direct Intersegm<br>Via Call Gate to Same Privilege Leve<br>From 80286 Task to 80286 TSS<br>From 80286 Task to Am386DX/DXL<br>From 80286 Task to Virtual 8086 Task<br>From Am386DX/DXL CPU Task to A<br>From Am386DX/DXL CPU Task to A<br>From Am386DX/DXL CPU Task to V | el<br>CPU TSS<br>sk (Am386DX/DXL<br>0286 TSS<br>m386DX/DXL CPU | TSS | rss) | | 45 + m<br>274<br>301<br>219<br>270<br>303<br>221 | | h, j, k, r<br>h, j, k, r<br>h, j, k, r<br>h, j, k, r<br>h, j, k, r<br>h, j, k, r<br>h, j, k, r | | Indirect Intersegment | 11111111 | mod 1 0 1 r/m | | 17 + m | 31+m | b | h, j, k, r | | Protected Mode Only (Indirect Interseg Via Call Gate to Same Privilege Leve From 80286 Task to 80286 TSS From 80286 Task to Am386DX/DXL From 80286 Task to Virtual 8086 Task Trom Am386DX/DXL CPU Task to 8 From Am386DX/DXL CPU Task to A From Am386DX/DXL CPU Task to V From Am386DX/DXL CPU Task to V | PI<br>CPU TSS<br>Sk (Am386DX/DXL<br>0286 TSS<br>m386DX/DXL CPU | TSS | rss) | | 49 + m<br>279<br>306<br>223<br>275<br>308<br>225 | | h, j, k, r<br>h, j, k, r<br>h, j, k, r<br>h, j, k, r<br>h, j, k, r<br>h, j, k, r<br>h, j, k, r | | RET = Return from CALL | | - | | | | | | | Within Segment | 11000011 | | | 10+m | 10+m | b | g, h, r | | Within Seg. Adding Immediate to SP | 11000010 | 16-bit displacement | | 10+m | 10+m | b | g, h, r | | Intersegment | 11001011 | | | 18+m | 32+m | b | g,h,j,k,r | | Intersegment Adding Immediate to SP | 11001010 | 16-bit displacement | | 18 + m | 32+m | b | g,h,j,k,r | | Protected Mode Only (RET) to Differer<br>Intersegment<br>Intersegment Adding Immediate to S<br>CONDITIONAL JUMPS (Note: Ti<br>JO=Jump on Overflow | Р | Taken or Not Tal | cen") | | 69<br>69 | | h, j, k, r<br>h, j, k, r | | 8-bit Displacement | 01110000 | 8-bit displacement | | 7+m or 3 | 7+m or 3 | | r | | Full Displacement | 00001111 | 10000000 | full displacement | 7+m or 3 | 7+m or 3 | | r | | JNO=Jump on Not Overflow | | | | | | | | | 8-bit Displacement | 01110001 | 8-bit displacement | | 7+m or 3 | 7+m or 3 | | , | | Full Displacement | 00001111 | 10000001 | full displacement | 7+m or 3 | 7+m or 3 | | r | | JB/JNAE = Jump on Below/Not Abov | L | 4 | | | | | | | 8-bit Displacement | 01110010 | 8-bit displacement | | 7+m or 3 | 7+m or 3 | | r | | Full Displacement | 00001111 | 10000010 | full displacement | 7+m or 3 | 7+m or 3 | | r | | JNB/JAE = Jump on Not Below/Abov | e or Equal | <u> </u> | | | | | | | 8-bit Displacement | 01110011 | 8-bit displacement | | 7+m or 3 | 7+m or 3 | | r | | Full Displacement | 00001111 | 10000011 | full displacement | 7+m or 3 | 7+m or 3 | | r | | JE/JZ=Jump on Equal/ Zero | | | | | | | | | 8-bit Displacement | 01110100 | 8-bit displacement | | 7+m or 3 | 7+m or 3 | | r | | Full Displacement | 00001111 | 10000100 | full displacement | 7+m or 3 | 7+m or 3 | | r | | JNE/JNZ = Jump on Not Equal/Not Ze | ro | | • | | | | | | 8-bit Displacement | 01110101 | 8-bit displacement | | 7+m or 3 | 7+m or 3 | | r | | Full Displacement | 00001111 | 10000101 | full displacement | 7+m or 3 | 7+m or 3 | | r | | Tall Diopiacomont | | | | | | | i . | | JBE/JNA = Jump on Below or Equal/ | Not Above | | _ | | | | | | · | Not Above<br>01110110 | 8 -bit displacement | | 7+m or 3 | 7+m or 3 | | r | | | Clock Count | | Comments | | | | | | |------------------------------------------|-----------------|---------------------|-------------------|---|-------------------------|-----------------------------------------|-------------------------|-----------------------------------------| | Instruction | Format | | | | Real<br>Address<br>Mode | Protected<br>Virtual<br>Address<br>Mode | Real<br>Address<br>Mode | Protected<br>Virtual<br>Address<br>Mode | | CONDITIONAL JUMPS (continue | | | , waren, , | | Mode | Mode | Mode | Mode | | JNBE/JA = Jump on Not Below or Eq | • | | | | | | | | | 8-bit Displacement | 01110111 | 8 -bit displacement | | | 7+m or 3 | 7+m or 3 | | r | | Full Displacement | 00001111 | 10000111 | full displacement | | 7+m or 3 | 7+m or 3 | | r | | JS≡Jump on Sign | <u> </u> | 1 | , | | | | | | | 3-bit Displacement | 01111000 | 8 -bit displacement | | | 7+m or 3 | 7+m or 3 | | r | | Full Displacement | 00001111 | 10001000 | full displacement | | 7+m or 3 | 7+m or 3 | | r | | JNS = Jump on Not Sign | | | | | | | | | | B-bit Displacement | 01111001 | 8 -bit displacement | | | 7+m or 3 | 7+m or 3 | | r | | Full Displacement | 00001111 | 10001001 | full displacement | | 7+m or 3 | 7+m or 3 | | r | | ا<br>JP/JPE = Jump on Parity/Parity Even | | | l | | | | | | | B-bit Displacement | 01111010 | 8 -bit displacement | | | 7+m or 3 | 7+m or 3 | | r | | Full Displacement | 00001111 | 10001010 | full displacement | | 7+m or 3 | 7+m or 3 | | r | | ا<br>JNP/JPO = Jump on Not Parity/Parity | Odd | · | | | | | | | | B-bit Displacement | 01111011 | 8 -bit displacement | | | 7+m or 3 | 7+m or 3 | | r | | Full Displacement | 00001111 | 10001011 | full displacement | | 7+m or 3 | 7+m or 3 | | r | | ا<br>JL/JNGE = Jump on Less/Not Greate | r or Equal | | | | | | | | | 3-bit Displacement | 01111100 | 8 -bit displacement | | | 7+m or 3 | 7+m or 3 | | r | | Full Displacement | 00001111 | 10001100 | full displacement | | 7+m or 3 | 7+m or 3 | | r | | !<br>INL/JGE≔Jump on Not Less/Greate | r or Equal | | l | | | | | | | 3-bit Displacement | 01111101 | 8-bit displacement | | | 7+m or 3 | 7+m or 3 | | r | | Full Displacement | 00001111 | 10001101 | full displacement | | 7+m or 3 | 7+m or 3 | | r | | ا<br>JLE/JNG=Jump on Less or Equal/No | ot Greater | | | | | | | | | B-bit Displacement | 01111110 | 8-bit displacement | | | 7+m or 3 | 7+m or 3 | | r | | Full Displacement | 00001111 | 10001110 | full displacement | | 7+m or 3 | 7+m or 3 | | r | | JNLE/JG=Jump on Not Less or Equ | al/Greater | | | | | | | | | B-bit Displacement | 01111111 | 8-bit displacement | | | 7+m or 3 | 7+m or 3 | | r | | Full Displacement | 00001111 | 10001111 | full displacement | | 7+m or 3 | 7+m or 3 | | r | | JCXZ=Jump on CX Zero * | 11100011 | 8-bit displacement | | | 9+m or 5 | 9+m or 5 | | r | | JECXZ = Jump on ECX Zero * | 11100011 | 8-bit displacement | | | 9+m or 5 | 9+m or 5 | | r | | LOOP = Loop CX Times | 11100010 | 8-bit displacement | | | 11 +m | 11 + m | | r | | LOOPZ/LOOPE = Loop with Zero/Equal | 11100001 | 8-bit displacement | | | 11 +m | 11 + m | | r | | LOOPNZ/LOOPNE=Loop while Not Zero | 11100000 | 8-bit displacement | | į | 11 +m | 11 + m | | r | | CONDITIONAL BYTE SET (Note: | : Times are Reg | ister/Memory) | • | | | | | | | SETO=Set Byte on Overflow | | | | | | | | | | To Register/Memory | 00001111 | 10010000 | mod 0 0 0 r/m | | 4/5 | 4/5 | | h | | SETNO=Set Byte on Not Overflow | | | | | | | | | | To Register/Memory | 00001111 | 10010001 | mod 0 0 0 r/m | | 4/5 | 4/5 | | h | | SETB/SETNAE = Set Byte on Below/f | | 1 | | | 415 | 4 | | | | To Register/Memory | 00001111 | 10010010 | mod 0 0 0 r/m | | 4/5 | 4/5 | | h | <sup>\*</sup> Address Size Prefix Differentiates JCXZ from JECXZ. | | | | | | Clock Count | | Com | Comments | | |-----------------------------------------|---------------------|-------------------|-----------------|-----|-------------------------|-----------------------------------------|-------------------------|-----------------------------------------|--| | Instruction | Format | | | | Real<br>Address<br>Mode | Protected<br>Virtual<br>Address<br>Mode | Real<br>Address<br>Mode | Protected<br>Virtual<br>Address<br>Mode | | | CONDITIONAL BYTE SET (conti | | | | | mvue | mode | mode | moue | | | SETNB = Set Byte on Not Below/Abo | • | | | | | | | | | | To Register/Memory | 00001111 | 10010011 | mod 0 0 0 r/m | | 4/5 | 4/5 | | h | | | SETE/SETZ = Set Byte on Equal/Zero | L | | | | | | | | | | To Register/Memory | 00001111 | 10010100 | mod 0 0 0 r/m | | 4/5 | 4/5 | | h | | | SETNE/SETNZ=Set Byte on Not Equ | | | | l | | | | | | | To Register/Memory | 00001111 | 10010101 | mod 0 0 0 r/m | | 4/5 | 4/5 | | h | | | SETBE/SETNA = Set Byte on Below | or Equal/Not Above | | | | | | | | | | To Register/Memory | 00001111 | 10010110 | mod 0 0 0 r/m | | 4/5 | 4/5 | | h | | | SETNBE/SETA = Set Byte on Not Bei | low or Equal/Above | ) | | 1 | | | | | | | To Register/Memory | 00001111 | 10010111 | mod 0 0 0 r/m | | 4/5 | 4/5 | | h | | | SETS = Set Byte on Sign | | | | ' | | | | | | | To Register/Memory | 00001111 | 10011000 | mod 0 0 0 r/m | | 4/5 | 4/5 | | h | | | SETNS = Set Byte on Not Sign | | | | ' | | | | | | | To Register/Memory | 00001111 | 10011001 | mod 0 0 0 r/m | | 4/5 | 4/5 | | h | | | SETP/SETPE = Set Byte on Parity/Pa | rity Even | | | 1 | | | | | | | To Register/Memory | 00001111 | 10011010 | mod 0 0 0 r/m | | 4/5 | 4/5 | | h | | | SETNP/SETPO = Set Byte on Not Par | rity/Parity Odd | | | ' | | | | | | | To Register/Memory | 00001111 | 10011011 | mod 0 0 0 r/m | | 4/5 | 4/5 | | h | | | SETL/SETNGE = Set Byte on Less/No | ot Greater or Equal | | | ' | | | | | | | To Register/Memory | 00001111 | 10011100 | mod 0 0 0 r/m | | 4/5 | 4/5 | | h | | | SETNL/SETGE = Set Byte on Not Les | ss/Greater or Equa | | | · | | | | | | | To Register/Memory | 00001111 | 01111101 | mod 0 0 0 r/m | | 4/5 | 4/5 | | h | | | SETLE/SETNG = Set Byte on Less or | Equal/Not Greate | | | ' | | | | | | | To Register/Memory | 00001111 | 10011110 | mod 0 0 0 r/m | ] | 4/5 | 4/5 | | h | | | SETNLE/SETG = Set Byte on Not Les | ss or Equal/Greater | • | | ' ! | | | | | | | To Register/Memory | 00001111 | 10011111 | mod 0 0 0 r/m | | 4/5 | 4/5 | | h | | | ENTER = Enter Procedure | 11001000 | 16-bit displaceme | nt, 8-bit level | | | | | | | | L = 0<br>L = 1<br>L > 1 | | | | • | 10<br>12<br>15+4(n-1) | 10<br>12<br>15+4(n–1) | b<br>b<br>b | h<br>h<br>h | | | LEAVE = Leave Procedure | 11001001 | | | | 4 | 4 | ь | h | | | INTERRUPT INSTRUCTIONS | L | I | | | | | | | | | INT = interrupt: | | | | | | | | | | | Type Specified | 11001101 | type | | | 37 | | ь | | | | Type 3 | 11001100 | | | | 33 | | b | | | | INTO = Interrupt 4 if Overflow Flag Set | | | | | | | | 1 | | | If OF = 1<br>If OF = 0 | | ı | | | 35<br>3 | 3 | b, e<br>b, e | | | | | | | | | | | | | | | | | | | | | | | | | | | Clock Cou | nt Comments | |------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------|---------------------------------------------------------------------------------------------------------------------------------------------------| | Instruction Format | | ual Real Virtual<br>ress Address Address | | INTERRUPT INSTRUCTIONS (continued) | | | | Bound = Interrupt 5 if Detect Value Out of Range 0 1 1 0 0 0 1 0 mod reg r/m | | | | If Out of Range<br>If in Range | 10 | b, e e, g, h, j, k b, e e, g, h, j, k | | Protected Mode Only (INT) INT: Type Specified Via Interrupt or Trap Gate to Same Privilege Level Via Interrupt or Trap Gate to Different Privilege Level From 80286 Task to 80286 TSS via Task Gate From 80286 Task to Mn386DX/DXL CPU TSS via Task Gate From 80286 Task to Virtual 8086 Mode via Task Gate From Am386DX/DXL CPU Task to 80286 TSS via Task Gate From Am386DX/DXL CPU Task to Am386DX/DXL CPU TSS via Task Gate From Am386DX/DXL CPU Task to Am386DX/DXL CPU TSS via Task Gate From Virtual 8086 Mode to 80286 TSS via Task Gate From Virtual 8086 Mode to Am386DX/DXL CPU TSS via Task Gate From Virtual 8086 Mode to Am386DX/DXL CPU TSS via Task Gate From Virtual 8086 Mode to Privilege Level 0 via Trap Gate or Interrupt Gate | | 99 g.j.k.r<br>199 g.j.k.r<br>192 g.j.k.r<br>193 g.j.k.r<br>194 g.j.k.r<br>195 g.j.k.r<br>195 g.j.k.r<br>196 g.j.k.r<br>196 g.j.k.r<br>197 g.j.k.r | | INT: Type 3 Via Interrupt or Trap Gate to Same Privilege Level Via Interrupt or Trap Gate to Different Privilege Level From 80286 Task to 80286 TSS via Task Gate From 80286 Task to Am386DX/DXL CPU TSS via Task Gate From 80286 Task to Virtual 8086 Mode via Task Gate From M366DX/DXL CPU Task to 80286 TSS via Task Gate From Am386DX/DXL CPU Task to 80286 TSS via Task Gate From Am386DX/DXL CPU Task to Am386DX/DXL CPU TSS via Task Gate From Virtual 8086 Mode to 80286 TSS via Task Gate From Virtual 8086 Mode to M286DX/DXL CPU TSS via Task Gate From Virtual 8086 Mode to Privilege Level 0 via Trap Gate or Interrupt Gate | | 99 g,j,k,r<br>199 g,j,k,r<br>78 g,j,k,r<br>05 g,j,k,r<br>22 g,j,k,r<br>07 g,j,k,r<br>07 g,j,k,r<br>124 g,j,k,r<br>15 g,j,k,r<br>16 g,j,k,r | | INTO Via Interrupt or Trap Gate to Same Privilege Level Via Interrupt or Trap Gate to Different Privilege Level From 80286 Task to 80286 TSS via Task Gate From 80286 Task to 00286 TSS via Task Gate From 80286 Task to Virtual 8086 Mode via Task Gate From 80286 Task to Virtual 8086 Mode via Task Gate From Am386DX/DXL CPU Task to 80286 TSS via Task Gate From Am386DX/DXL CPU Task to Am386DX/DXL CPU TSS via Task Gate From Am386DX/DXL CPU Task to Virtual 8086 Mode via Task Gate From Virtual 8086 Mode to 80286 TSS via Task Gate From Virtual 8086 Mode to Am386DX/DXL CPU TSS via Task Gate From Virtual 8086 Mode to Privilege Level 0 via Trap Gate or Interrupt Gate | | 9 | | BOUND Via Interrupt or Trap Gate to Same Privilege Level Via Interrupt or Trap Gate to Different Privilege Level From 80286 Task to 80286 TSS via Task Gate From 80286 Task to Ma386DX/DXL CPU TSS via Task Gate From 80286 Task to Virtual 8086 Mode via Task Gate From 80286 Task to Virtual 8086 Mode via Task Gate From Am386DX/DXL CPU Task to 80286 TSS via Task Gate From Am386DX/DXL CPU Task to Am386DX/DXL CPU TSS via Task Gate From Am386DX/DXL CPU Task to Virtual 8086 Mode via Task Gate From Virtual 8086 Mode to 80286 TSS via Task Gate From Virtual 8086 Mode to Am386DX/DXL CPU TSS via Task Gate From Virtual 8086 Mode to Privilege Level 0 via Trap Gate or Interrupt Gate | | 99 g,j,k,r<br>199 g,j,k,r<br>184 g,j,k,r<br>181 g,j,k,r<br>181 g,j,k,r<br>184 g,j,k,r<br>184 g,j,k,r<br>184 g,j,k,r<br>184 g,j,k,r<br>189 g,j,k,r | | INTERRUPT RETURN IRET=Interrupt Return 11001111 | 22 | g, h, j, k, | | Protected Mode Only (IRET) To the Same Privilege Level (within Task) To Different Privilege Level (within Task) From 80286 Task to 80286 TSS From 80286 Task to Am386DX/DXL CPU TSS From 80286 Task to Virtual 8086 Mode (within Task) From 80286 Task to Virtual 8086 Mode (within Task) From Mm386DX/DXL CPU Task to 80286 TSS From Am386DX/DXL CPU Task to Am386DX/DXL CPU TSS From Am386DX/DXL CPU Task to Virtual 8086 Task From Am386DX/DXL CPU Task to Virtual 8086 Task From Am386DX/DXL CPU Task to Virtual 8086 Mode (within Task) | 275<br>223 | 98 | | | | | | 00. 00 | Clock Count | | Comments | | |------------------------------------------|------------------|---------------------------------------|----------------------|--------|-------------------------|-----------------------------------------|-------------------------|-----------------------------------------| | Instruction | Format | | | : | Real<br>Address<br>Mode | Protected<br>Virtual<br>Address<br>Mode | Real<br>Address<br>Mode | Protected<br>Virtual<br>Address<br>Mode | | PROCESSOR CONTROL | | | | | | | | | | HLT=HALT | 11110100 | 1 ' | | | 5 | 5 | | ١, | | MOV = Move to and From Control/De | | 1 | | | Ů | | | i . | | CR0/CR2/CR3 from register | 00001111 | 00100010 | 1 1 eee reg | ] | 11/4/5 | 11/4/5 | | | | Register From CR3–0 | 00001111 | 00100000 | | | 6 | 6 | | ; | | _ | | | 1 1 eee reg | | 22 | | | ; | | DR3-0 From Register | 00001111 | 00100011 | 1 1 eee reg | | | 22 | | ' | | DR7–6 From Register | 00001111 | | 1 1 eee reg | | 16 | 16 | | | | Register from DR7-6 | 00001111 | 00100001 | 1 1 eee reg | | 14 | 14 | | | | Register from DR3-0 | 00001111 | 00100001 | 1 1 eee reg | | 22 | 22 | | ' | | TR7–6 from Register | 00001111 | 00100110 | 1 1 eee reg | | 12 | 12 | | ı | | Register from TR7–6 | 00001111 | 00100100 | 1 1 eee reg | | 12 | 12 | | 1 | | NOP = No Operation | 10010000 | | | | 3 | 3 | | | | WAIT = Wait until BUSY<br>pin is negated | 10011011 | ] | | | 7 | 7 | | | | NOP = No Operation | 10010000 | | | | 3 | 3 | | | | PROCESSOR EXTENSION INST | RUCTIONS | | | | | | | | | Processor Extension Escape | 11011TTT | mod L L L r/m | | | | | | h | | | TTT and LLL bits | are op-code inform | ation for coprocesso | or | | | | | | PREFIX BYTES | | | | | | | | | | Address Size Prefix | 01100111 | | | | 0 | 0 | | | | LOCK = Bus Lock Prefix | 11110000 | | | | 0 | 0 | | m | | Operand Size Prefix | 01100110 | | | | 0 | 0 | | | | Segment Override Prefix | | • | | | | | | | | CS: | 00101110 | | | | 0 | 0 | | | | DS: | 00111110 | 1 | | | 0 | 0 | | | | ES: | 00100110 | 1 | | | 0 | 0 | | | | FS: | 01100100 | 1 | | | 0 | 0 | | | | GS: | 01100101 | 1 | | | 0 | 0 | | | | SS: | 00110110 | 1 | | | 0 | 0 | | | | PROTECTION CONTROL | | J | | | | | | | | ARPL=Adjust Requested Privilege | Level | | | | | | | | | From Register/Memory | 01100011 | mod reg r/m | | | N/A | 20/21 | а | h | | LAR = Load Access Rights | L | · · · · · · · · · · · · · · · · · · · | | | | | | | | From Register/Memory | 00001111 | 00000010 | mod reg r/m | | N/A | 15/16 | a | g, h, j, p | | LGDT = Load Global Descriptor | | | | • | | | | | | Table Register | 00001111 | 00000001 | mod 0 1 0 r/m | | 11 | 11 | b,c | h,i | | LIDT = Load Interrupt Descriptor | | | | • | | | | | | Table Register | 00001111 | 00000001 | mod 0 1 1 r/m | | 11 | 11 | b, c | h, l | | LLDT = Load Local Descriptor | | T | | 1 | | | | | | Table Register to Register/Memory | 00001111 | 00000000 | mod 0 1 0 r/m | | N/A | 20/24 | а | g, h, j, l | | LMSW = Load Machine Status Word | r | Tanana | I | 1 | | | ١. | ١ | | From Register/Memory | 00001111 | 00000001 | mod 1 1 0 r/m | | 11/14 | 11/14 | b.c | h, i | | | | | Clock | Count | Con | nments | | | |------------------------------------------------------------------|----------|----------|---------------|-------|-------------------------|-----------------------------------------|-------------------------|----------------------------------------| | Instruction | Format | | | | Real<br>Address<br>Mode | Protected<br>Virtual<br>Address<br>Mode | Real<br>Address<br>Mode | Protecte<br>Virtual<br>Address<br>Mode | | PROTECTION CONTROL (continu | ued) | | | | | | | | | LSL = Load Segment Limit | | | | | | | | | | From Register/Memory | 00001111 | 00000011 | mod reg r/m | | | | | | | Byte-Granular Limit Page-Granular Limit LTR = Load Task Register | | | | | N/A<br>N/A | 21/22<br>25/26 | a<br>a | g, h, j, p<br>g, h, j, p | | From Register/Memory | 00001111 | 00000000 | mod 0 0 1 r/m | | N/A | 23/27 | а | g, h, j, l | | SGDT = Store Global Descriptor | | <u></u> | | | | | | | | Table Register | 00001111 | 00000001 | mod 0 0 0 r/m | | 9 | 9 | b, c | h | | SIDT = Store Interrupt Descriptor | | | | | | | | | | Table Register | 00001111 | 00000001 | mod 0 0 1 r/m | | 9 | 9 | b, c | h | | SLDT = Store Local Descriptor Table | Register | | | • | | | | | | To Register/Memory | 00001111 | 00000000 | mod 0 0 0 r/m | | N/A | 2/2 | а | h | | SMSW = Store Machine Status Word | 00001111 | 00000001 | mod 1 0 0 r/m | | 2/2 | 2/2 | b, c | h, l | | STR = Store Task Register | | | | | | | | | | To Register/Memory | 00001111 | 00000000 | mod 0 0 1 r/m | | N/A | 2/2 | а | h | | VERR = Verify Read Access | | | | | | | | | | Register/Memory | 00001111 | 00000000 | mod 1 0 0 r/m | | N/A | 10/11 | а | g, h, j, <sub>l</sub> | | VERW = Verify Write Access | 00001111 | 00000000 | mod 1 0 1 r/m | | N/A | 15/16 | а | g, h, j, | #### Instruction Notes for Table 23. # Notes a through c apply to Am386DX/DXL CPU Real Address Mode only. - a. This is a Protected Mode instruction. Attempted execution in Real Mode will result in Exception 6 (Invalid op-code). - b. Exception 13 fault (General Protection) will occur in Real Mode if an operand reference is made that partially or fully extends beyond the maximum CS, DS, ES, FS, or GS limit, FFFFH. Exception 12 (fault stack segment limit violation or not present) will occur in Real Mode if an operand reference is made that partially or fully extends beyond the maximum SS limit. - c. This instruction may be executed in Real Mode. In Real Mode, its purpose is primarily to initialize the CPU for Protected Mode. # Notes d through g apply to Am386DX/DXL CPU Real Address Mode and Am386DX/DXL CPU Protected Virtual Address Mode. - d. The Am386DX/DXL CPU uses an early-out multiply algorithm. The actual number of clocks depends on the position of the most significant bit in the operand (multiplier). - Clock counts given are minimum to maximum. To calculate actual clocks, use the following formula: Actual Clock = if m < > 0 then max ( $\lceil \log_2 |m| \rceil$ , 3) + b clocks: if m = 0 - then 3+b clocks - In this formula, m is the multiplier, and - b = 9 for register to register, - b = 12 for memory to register, - b = 10 for register with immediate to register, - b = 11 for memory with immediate to register. - e. An Exception may occur, depending on the value of the operand. - f. LOCK is automatically asserted, regardless of the presence or absence of the LOCK prefix. - g. LOCK is asserted during descriptor table accesses. # Notes h through r apply to Am386DX/DXL CPU Protected Virtual Address Mode only. - h. Exception 13 fault (General Protection Violation) will occur if the memory operand in CS, DS, ES, FS, or GS cannot be used due to either a segment limit violation or access rights violation. If a stack limit is violated, an Exception 12 (Stack Segment Limit Violation or Not Present) occurs. - i. For segment load operations, the CPL, RPL, and DPL must agree with the privilege rules to avoid an Exception 13 fault (General Protection Violation). The segment's descriptor must indicate present or Exception 11 (CS, DS, ES, FS, GS Not Present). If the SS register is loaded and a stack segment not present is detected, an Exception 12 (Stack Segment Limit Violation or Not Present) occurs. - j. All segment descriptor accesses in the GDT or LDT made by this instruction will automatically assert LOCK to maintain descriptor integrity in multiprocessor systems. - k. JMP, CALL, INT, RET, and IRET instructions referring to another code segment will cause an Exception 13 (General Protection Violation) if an applicable privilege rule is violated. - An Exception 13 fault occurs if CPL is greater than 0 (0 is the most privileged level). - m. An Exception 13 fault occurs if CPL is greater than IOPL. - The IF bit of the flag register is not updated if CPL is greater than IOPL. The IOPL and VM fields of the flag register are updated only if CPL=0. - o. The PE bit of the MSW (CR0) cannot be reset by this instruction. Use MOV into CR0 if desiring to reset the PE bit. - Any violation of privilege rules as applied to the selector operand does not cause a protection Exception; rather, the zero flag is cleared. - q. If the coprocessor's memory operand violates a segment limit or segment access rights, an Exception 13 fault (General Protection Exception) will occur before the ESC instruction is executed. An Exception 12 fault (Stack Segment Limit Violation or Not Present) will occur if the stack limit is violated by the operand's starting address. - r. The destination of a JMP, CALL, INT, RET, or IRET must be in the defined limit of a code segment or an Exception 13 fault (General Protection Violation) will occur. ### Instruction Encoding #### Overview All instruction encodings are subsets of the general instruction format shown in Figure 83. Instructions consist of one or two primary op-code bytes, possibly an address specifier consisting of the mod r/m byte and scaled index byte, a displacement if required, and an immediate data field if required. Within the primary op-code or op-codes, smaller encoding fields may be defined. These fields vary according to the class of operation. The fields define such information as direction of the operation, size of the displacements, register encoding, or sign extension. Almost all instructions referring to an operand in memory have an addressing mode byte following the primary op-code byte(s). This byte, the mod r/m byte, specifies the address mode to be used. Certain encodings of the mod r/m byte indicate a second addressing byte, the scale-index-base byte, follows the mod r/m byte to fully specify the addressing mode. Addressing modes can include a displacement immediately following the mod r/m byte, or scaled index byte. If a displacement is present, the possible sizes are 8, 16, or 32 bits. If the instruction specifies an immediate operand, the immediate operand follows any displacement bytes. The immediate operand, if specified, is always the last field of instruction. Figure 83 illustrates several of the fields that can appear in an instruction, such as the mod field and the r/m field, but the Figure does not show all fields. Several smaller fields also appear in certain instructions, sometimes within the op-code bytes themselves. Table 24 is a complete list of all fields appearing in the Am386DX/DXL microprocessor instruction set. Further ahead, following Table 24, are detailed tables for each field. #### 32-Bit Extensions of the Instruction Set With the Am386DX/DXL microprocessor, the 8086/80186/80286 instruction set is extended in two orthogonal directions: 32-bit forms of all 16-bit instructions are added to support the 32-bit data types, and 32-bit addressing modes are made available for all instructions referencing memory. This orthogonal instruction set extension is accomplished having a Default (D) bit in the code segment descriptor, and by having 2 prefixes to the instruction set. Whether the instruction defaults to operations of 16 or 32 bits depends on the setting of the D bit in the code segment descriptor, which gives the default length (either 32 or 16 bits) for both operands and effective addresses when executing that code segment. In the Real Address Mode or Virtual 8086 Mode, no code segment descriptors are used, but a D value of 0 is assumed internally by the Am386DX/DXL microprocessor when operating in those modes (for 16-bit default sizes compatible with the 8086/80186/80286). Two prefixes, the Operand Size Prefix and the Effective Address Size Prefix, allow overriding individually the Default selection of operand size and effective address size. These prefixes may precede any op-code bytes and affect only the instruction they precede. If necessary, one or both of the prefixes may be placed before the op-code bytes. The presence of the Operand Size Prefix and the Effective Address Prefix will toggle the operand size or the effective address size, respectively. to the value opposite from the Default setting. For example, if the default operand size is for 32-bit data operations, then presence of the Operand Size Prefix toggles the instruction to 16-bit data operation. As another example, if the default effective address size is 16 bits. presence of the Effective Address Size prefix toggles the instruction to use 32-bit effective address computations. Figure 83. General Instruction Format Table 24. Fields within Am386DX/DXL Microprocessor Instructions | Field Name | Description | Number of Bits | |------------|-----------------------------------------------------------------------------------|-------------------------| | w | Specifies if Data is Byte or Full Size (Full Size is either 16 or 32 bits) | 1 | | d | Specifies Direction of Data Operation | 1 | | s | Specifies if an Immediate Data Field must be Sign-Extended | 1 | | reg | General Register Specifier | 3 | | mod r/m | Address Mode Specifier (Effective Address can be a General Register) | 2 for mod;<br>3 for r/m | | ss | Scale Factor for Scaled Index Address Mode | 2 | | index | General Register to be used as Index Register | 3 | | base | General Register to be used as Base Register | 3 | | sreg2 | Segment Register Specifier for CS, SS, DS, ES | 2 | | sreg3 | Segment Register Specifier for CS, SS, DS, ES, FS, GS | 3 | | tttn | For Condition Instructions, specifies a Condition Asserted or a Condition Negated | 4 | Note: Table 23 shows encoding of individual instructions. These 32-bit extensions are available in all Am386DX/DXL microprocessor modes, including the Real Address Mode or the Virtual 8086 Mode. In these modes the default is always 16 bits, so prefixes are needed to specify 32-bit operands or addresses. For instructions with more than one prefix, the order of prefixes is unimportant. Unless specified otherwise, instructions with 8- and 16-bit operands do not affect the contents of the high-order bits of the extended registers. #### **Encoding of Instruction Fields** Within the instruction are several fields indicating register selection, addressing mode and so on. The exact encodings of these fields are defined immediately ahead. ### Encoding of Operand Length (w) Field For any given instruction performing a data operation, the instruction is executing as a 32- or 16-bit operation. Within the constraints of the operation size, the w field encodes the operand size as either one byte or the full operation size, as shown in the table below. | w Field | Operand Size<br>During 16-Bit<br>Data Operations | Operand Size<br>During 32-Bit<br>Data Operations | |---------|--------------------------------------------------|--------------------------------------------------| | 0 | 8 Bits | 8 Bits | | 1 | 16 Bits | 32 Bits | ### Encoding of The General Register (reg) Field The general register is specified by the reg field, which may appear in the primary op-code bytes, or as the reg field of the mod r/m byte, or as the r/m field of the mod r/m byte. # Encoding of reg Field When w Field is not Present in Instruction | reg Field | Register Selected<br>During 16-Bit<br>Data Operations | Register Selected<br>During 32-Bit<br>Data Operations | |-----------|-------------------------------------------------------|-------------------------------------------------------| | 000 | AX | EAX | | 001 | CX | ECX | | 010 | DX | EDX | | 011 | BX | EBX | | 100 | SP | ESP | | 101 | BP | EBP | | 110 | Si | ESI | | 111 | DI | EDI | # Encoding of reg Field When w Field is Present in Instruction | | Register Specified by reg Field<br>During 16-Bit Data Operations | | | | | | |-----|------------------------------------------------------------------|------------|--|--|--|--| | | Function | of w Field | | | | | | reg | (when w = 0) (when w = 1) | | | | | | | 000 | AL | AX | | | | | | 001 | CL | cx | | | | | | 010 | DL | DX | | | | | | 011 | BL | BX | | | | | | 100 | AH | SP | | | | | | 101 | СН | BP | | | | | | 110 | DH | SI | | | | | | 111 | BH | DI | | | | | | | Register Specified by reg Field<br>During 32-Bit Data Operations | | | | | |-----|------------------------------------------------------------------|--------------|--|--|--| | | Function | of w Field | | | | | reg | (when w = 0) | (when w = 1) | | | | | 000 | AL | EAX | | | | | 001 | CL | ECX | | | | | 010 | DL | EDX | | | | | 011 | BL | EBX | | | | | 100 | AH | ESP | | | | | 101 | CH | EBP | | | | | 110 | DH | ESI | | | | | 111 | BH | EDI | | | | ### Encoding of The Segment Register (sreg) Field The sreg field in certain instructions is a 2-bit field allowing one of the four 80286 segment registers to be specified. The sreg field in other instructions is a 3-bit field, allowing the Am386DX/DXL microprocessor FS and GS segment registers to be specified. ### 2-Bit srea2 Field | 2-Bit sreg2 Field | Segment Register Selected | |-------------------|---------------------------| | 00 | ES | | 01 | cs | | 10 | ss | | 11 | DS | ### 3-Bit sreg3 Field | 0-bit slego i leid | | |--------------------|---------------------------| | 3-Bit sreg3 Field | Segment Register Selected | | 000 | ES | | 001 | cs | | 010 | SS | | 011 | DS | | 100 | FS | | 101 | GS | | 110 | do not use | | 111 | do not use | ### **Encoding of Address Mode** Except for special instructions, such as PUSH or POP, where the addressing mode is pre-determined, the addressing mode for the current instruction is specified by addressing bytes following the primary op-code. The primary addressing byte is the mod r/m byte, and a second byte of addressing information, the scale-index-base (s-i-b) byte, can be specified. The s-i-b byte is specified when using 32-bit addressing mode and the mod r/m byte has r/m = 100 and mod = 00, 01, or 10. When the s-i-b byte is present, the 32-bit addressing mode is a function of the mod, ss, index, and base fields. The primary addressing byte, the mod r/m byte, also contains three bits (shown as TTT in Figure 83) sometimes used as an extension of the primary op-code. The three bits, however, may also be used as a register field (reg). When calculating an effective address, either 16-bit addressing or 32-bit addressing is used. 16-bit addressing uses 16-bit address components to calculate the effective address while 32-bit addressing uses 32-bit address components to calculate the effective address. When 16-bit addressing is used, the mod r/m byte is interpreted as a 16-bit addressing mode specifier. When 32-bit addressing is used, the mod r/m byte is interpreted as a 32-bit addressing mode specifier. Tables on the following pages define all encodings of all 16- and 32-bit addressing modes. # Encoding of 16-Bit Address Mode with mod r/m Byte | mod r/m | Effective Address | |---------|-------------------| | 00 000 | DS:[BX + SI] | | 00 001 | DS:[BX + DI] | | 00 010 | SS:[BP + SI] | | 00 011 | DS:[BP + DI] | | 00 100 | DS:[SI] | | 00 101 | DS:[DI] | | 00 110 | DS:d16 | | 00 111 | DS:[BX] | | | | | 01 000 | DS:[BX + SI + d8] | | 01 001 | DS:[BX + DI + d8] | | 01 010 | SS:[BP + SI + d8] | | 01 011 | SS:[BP + DI + d8] | | 01 100 | DS:[SI + d8] | | 01 101 | DS:[DI + d8] | | 01 110 | SS:[BP + d8] | | 01 111 | DS:(BX + d8) | | mod | l r/m | Effective Address | |-----|-------|--------------------| | 10 | 000 | DS:[BX + SI + d16] | | 10 | 001 | DS:[BX + DI + d16] | | 10 | 010 | SS:[BP + SI + d16] | | 10 | 011 | SS:[BP + DI + d16] | | 10 | 100 | DS:[SI + d16] | | 10 | 101 | DS:[DI + d16] | | 10 | 110 | SS:[BP + d16] | | 10 | 111 | DS:[BX + d16] | | 11 | 000 | Register—See Below | | 11 | 001 | Register—See Below | | 11 | 010 | Register—See Below | | 11 | 011 | Register—See Below | | 11 | 100 | Register—See Below | | 11 | 101 | Register—See Below | | 11 | 110 | Register—See Below | | 11 | 111 | Register—See Below | | Register Specified by r/m During 32-Bit Data Operations | | | |---------------------------------------------------------|---------------------|--------------| | | Function of w Field | | | mod r/m | (when w = 0) | (when w = 1) | | 11 000 | AL | EAX | | 11 001 | CL | ECX | | 11 010 | DL | EDX | | 11 011 | BL | EBX | | 11 100 | AH | ESP | | 11 101 | CH | EBP | | 11 110 | DH | ESI | | 11 111 | вн | EDI | | Register Specified by r/m During 16-Bit Data Operations | | | |---------------------------------------------------------|---------------------|--------------| | | Function of w Field | | | mod r/m | (when w = 0) | (when w = 1) | | 11 000 | AL | AX | | 11 001 | CL | CX | | 11 010 | DL | DX | | 11 011 | BL | BX | | 11 100 | AH | SP | | 11 101 | CH | BP | | 11 110 | DH | SI | | 11 111 | BH | DI | ### Encoding of 32-Bit Address Mode with mod r/m byte (No s-i-b Byte Present) | mod r/m | Effective Address | |---------|-------------------| | 00 000 | DS:[EAX] | | 00 001 | DS:[ECX] | | 00 010 | DS:[EDX] | | 00 011 | DS:[EBX] | | 00 100 | s-i-b is present | | 00 101 | DS:d32 | | 00 110 | DS:[ESI] | | 00 111 | DS:[EDI] | | | | | 01 000 | DS:[EAX + d8] | | 01 001 | DS:[ECX + d8] | | 01 010 | DS:[EDX + d8] | | 01 011 | DS:[EBX + d8] | | 01 100 | s-i-b is present | | 01 101 | SS:[EBP + d8] | | 01 110 | DS:[ESI + d8] | | 01 111 | DS:[EDI + d8] | | mod r/m | Effective Address | |---------|--------------------| | 10 000 | DS:[EAX + d32] | | 10 001 | DS:[ECX + d32] | | 10 010 | DS:[EDX + d32] | | 10 011 | DS:[EBX + d32] | | 10 100 | s-i-b is present | | 10 101 | SS:[EBP + d32] | | 10 110 | DS:[ESI + d32] | | 10 111 | DS:[EDI + d32] | | ! | | | 11 000 | Register—See Below | | 11 001 | Register—See Below | | 11 010 | Register—See Below | | 11 011 | Register—See Below | | 11 100 | Register—See Below | | 11 101 | Register—See Below | | 11 110 | Register—See Below | | 11 111 | Register—See Below | | Register Specified by reg or r/m<br>During 32-Bit Data Operations | | | |-------------------------------------------------------------------|---------------------|--------------| | | Function of w Field | | | mod r/m | (when w = 0) | (when w = 1) | | 11 000 | AL | EAX | | 11 001 | CL | ECX | | 11 010 | DL | EDX | | 11 011 | BL | EBX | | 11 100 | AH | ESP | | 11 101 | CH | EBP | | 11 110 | DH | ESI | | 11 111 | BH | EDI | | | Register Specified by reg or r/m<br>During 16-Bit Data Operations | | | |---------|-------------------------------------------------------------------|--------------|--| | | Function of w Field | | | | mod r/m | (when w = 0) | (when w = 1) | | | 11 000 | AL | AX | | | 11 001 | CL | CX | | | 11 010 | DL | DX | | | 11 011 | BL | BX | | | 11 100 | AH | SP | | | 11 101 | CH | BP | | | 11 110 | DH | SI | | | 11 111 | BH | DI | | ### Encoding of 32-Bit Address Mode (mod r/m Byte and s-i-b present) | mod base | Effective Address | |----------|---------------------------------| | 00 000 | DS:[EAX + (scaled index)] | | 00 001 | DS:[ECX + (scaled index)] | | 00 010 | DS:[EDX + (scaled index)] | | 00 011 | DS:[EBX + (scaled index)] | | 00 100 | SS:[ESP + (scaled index)] | | 00 101 | DS:[d32 + (scaled index)] | | 00 110 | DS:[ESI + (scaled index)] | | 00 111 | DS:[EDI + (scaled index)] | | 01 000 | DS:[EAX + (scaled index) + d8] | | 01 000 | DS:[ECX + (scaled index) + d8] | | 01 010 | DS:[EDX + (scaled index) + d8] | | 01 010 | DS:[EBX + (scaled index) + d8] | | 01 100 | SS:[ESP + (scaled index) + d8] | | 01 101 | SS:[EBP + (scaled index) + d8] | | 01 110 | DS:[ESI + (scaled index) + d8] | | 01 111 | DS:[EDI + (scaled index) + d8] | | ] | DO.[LDI + (Scaled Index) + do] | | 10 000 | DS:[EAX + (scaled index) + d32] | | 10 001 | DS:[ECX + (scaled index) + d32] | | 10 010 | DS:[EDX + (scaled index) + d32] | | 10 011 | DS:[EBX + (scaled index) + d32] | | 10 100 | SS:[ESP + (scaled index) + d32] | | 10 101 | SS:[EBP + (scaled index) + d32] | | 10 110 | DS:[ESI + (scaled index) + d32] | | 10 111 | DS:[EDI + (scaled index) + d32] | | ss | Scale Factor | |----|--------------| | 00 | x1 | | 01 | x2 | | 10 | x4 | | 11 | x8 | | Index | Index Register | |-------|-------------------------| | 000 | EAX | | 001 | ECX | | 010 | EDX | | 011 | EBX | | 100 | no index reg (see note) | | 101 | EBP | | 110 | ESI | | 111 | EDI | Note: When index field is 100, indicating no index register, then ss field must equal 00. If index is 100 and ss does not equal 00, the effective address is undefined. Note: Mod field in mod r/m byte; ss, index, base fields in s-i-b byte. ### Encoding of Operation Direction (d) Field In many two-operand instructions the d field is present to indicate which operand is considered the source and which is the destination. | d | Direction of Operation | |---|----------------------------------------------------------------------------------------------------------------------------| | 0 | Register/Memory ← Register reg Field indicates Source Operand; mod r/m or mod ss index base indicates Destination Operand. | | 1 | Register Register Memory reg Field indicates Destination Operand; mod r/m or mod ss index base indicates Source Operand. | ### Encoding of Sign-Extend (s) Field The *s* field occurs primarily to instructions with immediate data fields. The *s* field has an effect only if the size of the immediate data is 8 bits and is being placed in a 16-or 32-bit destination. | s | Effect on<br>Immediate Data 8 | Effect on<br>Immediate Data 16 32 | |---|--------------------------------------------------------------|-----------------------------------| | 0 | None | None | | 1 | Sign-Extended Data 8 to fill<br>16-Bit or 32-Bit Destination | None | ### Encoding of Conditional Test (tttn) Field For the conditional instructions (conditional jumps and set on condition), tttn is encoded with n indicating to use the condition (n=0) or its negation (n=1), and ttt giving the condition to test. | Mnemonic | Condition | tttn | |----------|-------------------------------------|------| | 0 | Overflow | 0000 | | NO | No Overflow | 0001 | | B/NAE | Below/Not Above or Equal | 0010 | | NB/AE | Not Below/Above or Equal | 0011 | | E/Z | Equal/Zero | 0100 | | NE/NZ | Not Equal/Not Zero | 0101 | | BE/NA | Below or Equal/Not Above | 0110 | | NBE/A | Not Below or Equal/Above | 0111 | | s | Sign | 1000 | | NS | Not Sign | 1001 | | P/PE | Parity/Parity Even | 1010 | | NP/PO | Not Parity/Parity Odd | 1011 | | L/NGE | Less Than/Not Greater or Equal | 1100 | | NL/GE | Not Less Than/Greater or Equal | 1101 | | LE/NG | Less Than or Equal/Not Greater Than | 1110 | | NLE/G | Not Less Than or Equal/Greater Than | 1111 | # Encoding of Control or Debug or Test Register (eee) Field For the loading and storing of the Control, Debug and Test registers. ### When Interpreted as Control Register Field | eee Code | Reg Name | |--------------|---------------------| | 000 | CR0 | | 010 | CR2 | | 011 | CR3 | | Do not use a | any other encoding. | ### When Interpreted as Debug Register Field | eee Code | Reg Name | | |--------------------------------|----------|--| | 000 | DR0 | | | 001 | DR1 | | | 010 | DR2 | | | 011 | DR3 | | | 110 | DR6 | | | 111 | DR7 | | | Do not use any other encoding. | | | ### When Interpreted as Test Register Field | eee Code | Reg Name | |--------------|--------------------| | 110 | TR6 | | 111 | TR7 | | Do not use a | ny other encoding. | ### MECHANICAL DATA ### Introduction In this section, the physical packaging and its connections are described in detail. ### **Package Dimensions and Mounting** The initial Am386DX/DXL microprocessor package is a 132-pin ceramic pin grid array (PGA). Pins of this package are arranged 0.100 inch (2.54 mm) center-to-center, in a 14 x 14 matrix, three rows around. A wide variety of available sockets allow low insertion force or zero insertion force mountings, and a choice of terminals such as soldertail, surface mount, or wire wrap. ### Package Thermal Specification The Am386DX/DXL microprocessor is specified for operation when ambient temperature is within the range of 0°C–100°C. The ambient temperature may be measured in any environment, to determine whether the Am386DX/DXL microprocessor is within specified operating range. The PGA ambient temperature should be measured at the center of the top surface opposite the pins. ### **ELECTRICAL DATA** ### Introduction The following sections describe recommended electrical connections for the Am386DX/DXL microprocessor and its electrical specifications. ### **Power and Grounding** ### **Power Connections** The Am386DX/DXL CPU is implemented in CS21S technology and has modest power requirements. However, its high clock frequency and 72 output buffers (address, data, control, and HLDA) can cause power surges as multiple output buffers drive new signal levels simultaneously. For clean on-chip power distribution at high frequency, 20 $V_{\rm CC}$ and 21 $V_{\rm SS}$ pins separately feed functional units of the Am386DX/DXL CPU. Power and ground connections must be made to all external $V_{\rm CC}$ and GND pins of the Am386DX/DXL CPU. On the circuit board, all $V_{\rm CC}$ pins must be connected on a $V_{\rm CC}$ plane. All $V_{\rm SS}$ pins must be likewise connected to the GND plane. ### **Power Decoupling Recommendations** Liberal decoupling capacitance should be placed near the Am386DX/DXL CPU. The Am386DX/DXL microprocessor driving its 32-bit parallel address and data buses at high frequencies can cause transient power surges, particularly when driving large capacitive loads. Low inductance capacitors and interconnects are recommended for best high frequency electrical performance. Inductance can be reduced by shortening circuit board traces between the Am386DX/DXL microprocessor and decoupling capacitors as much as possible. Capacitors specifically for PGA packages are also commercially available, for the lowest possible inductance. ### Resistor Recommendations The ERROR, FLT, and BUSY inputs have resistor pull-ups of approximately 20 Kohms built into the Am386DX/DXL CPU to keep these signals negated when no 387DX math coprocessor is present in the system (or temporarily removed from its socket). The BS16 input also has an internal pull-up resistor of approximately 20 Kohms, and the PEREQ input has an internal pull-down resistor of approximately 20 Kohms. In typical designs, the external pull-up resistors are recommended. However, a particular design may have reason to adjust the resistor values recommended here, or after the use of pull-up resistors in other ways. ### **Other Connection Recommendations** For reliable operation, always connect unused inputs to an appropriate signal level. NC pins should always remain unconnected. Particularly when not using interrupts or bus hold, (as when first prototyping, perhaps) prevent any chance of spurious activity by connecting these associated inputs to GND. | Pin | Signal | |-----|--------| | B7 | INTR | | B8 | NMI | | D14 | HOLD | If not using address pipelining, pull-up D13 $\overline{\text{NA}}$ to $V_{\text{CC}}.$ If not using 16-bit size, pull-up C14 $\overline{BS16}$ to $V_{CC}$ . Pull-ups in the range of 20 Kohms are recommended. ### PHYSICAL DIMENSIONS Base Plane Seating Plane .045 .055 .016 .020 .100 **BSC** <u>.105</u> .125 .060 .105 .195 .025 .055 .080 .140 Side View **Bottom View (Pins Facing Up)** 15304C BP 36 4/30/91 CD # **PHYSICAL DIMENSIONS (continued)** PQB 132—Plastic Quad Flat Pack (Trimmed and Formed) Top View Side View 11772E CF 43 7/28/92 SG ## **PHYSICAL DIMENSIONS (continued)** PQB 132—Plastic Quad Flat Pack with Molded Carrier Ring (Outer Ring measured in millimeters) 14826E CB 51 7/28/92 SG # Am386™SX/SXL Advanced Micro Devices High-Performance, 32-Bit Microprocessor with 16-Bit Data Bus ## **DISTINCTIVE CHARACTERISTICS** - 40-, 33-, 25-, and 20-MHz operating speeds - True static design for long battery life in portable PCs - --- 0 MHz (DC) minimum frequency - -Typical standby (DC) current < 20 μA - -Typical operating current < 165 mA at 20 MHz - Wide range of chip set and BIOS support take advantage of standby mode capabilities - Lower heat dissipation facilitates elimination of cooling fan in desktop PCs - Pin-for-pin replacement of the Intel i386SX - Supports 387SX-compatible math coprocessors - 100-pin PQFP package with optional protective ring for better lead coplanarity - AMD advanced 0.8 micron CMOS technology #### GENERAL DESCRIPTION The Am386SX/SXL microprocessor is a high-speed, true static implementation of the Intel i386SX. It is ideal for both desktop and battery-powered notebook personal computers. For notebooks, the Am386SXL microprocessor's true static design offers longer battery life with low operating power consumption and standby mode. At 20 MHz, this device offers a current which is 22% lower than the Intel i386SX. Standby mode allows the Am386SXL CPU to be clocked down to 0 MHz (DC) and retain full register contents. Typical current in standby mode is reduced to less than 20 $\mu\text{A}$ —nearly a $1000\times$ reduction in power consumption versus the Intel i386SX. For desktop PCs, the Am386SX/SXL microprocessor offers a 21% increase in the maximum operating speed, from 33 to 40 MHz. Also, this device offers lower heat dissipation, allowing system designers to remove or reduce the size and cost of the cooling fan. This device will be available in a standard 100-pin Plastic Quad Flat Pack (PQFP). This package may be shipped in an optional protective ring for better lead protection during manufacturing. Publication#: 15022 Rev. D Amendment:/0 Issue Date: October 1992 ## **BLOCK DIAGRAM** #### LOGIC SYMBOL #### **FUNCTIONAL DESCRIPTION** ## True Static Operation (Am386SXL CPU) The Am386SXL microprocessor incorporates a true static design. Unlike dynamic circuit design, the Am386SXL CPU eliminates the minimum operating frequency restriction. It may be clocked from its maximum speed of 40 MHz all the way down to 0 MHz (DC). System designers can use this feature to design battery-powered notebook PCs with long battery life. ## Standby Mode (Am386SXL CPU) This true static design allows for a standby mode. At any operating speed (40 to 0 MHz), the Am386SXL microprocessor will retain its state (i.e., the contents of all its registers). By shutting off the clock completely, the device enters standby mode. Since power consumption is proportional to clock frequency, operating power consumption is reduced as the frequency is lowered. In standby mode, typical current draw is reduced to less than 20 µA at DC. Not only does this feature improve battery life, but it also simplifies the design of power-conscious notebook computers in the following ways: - Eliminates the need for software in the BIOS to save and restore the contents of registers. - Allows simpler circuitry to control stopping of the clock since the system does not need to know what state the processor is in. ## Lower Operating Icc (Am386SXL CPU) True static design also allows lower operating lcc when operating at any speed. See the following graph for typical current at operating speeds. #### Performance On Demand (Am386SXL CPU) The Am386SXL microprocessor retains its state at any speed from 0 MHz (DC) to its maximum operating speed. With this feature, system designers may vary the operating speed of the system to extend the battery life in portable systems. For example, the system could operate at low speeds during inactivity or polling operations. However, upon interrupt, the system clock can be increased up to its maximum speed. After a user-defined time-out period, the system can be returned to a low (or 0 MHz) operating speed without losing its state. This design maximizes battery life while achieving optimal performance. ## **CONNECTION DIAGRAMS** ## Top Side View — 100-Lead Plastic Quad Flat Pack Notes: Pin 1 is marked for orientation. NC = Not connected; connection of an NC pin may cause a malfunction or incompatibility with future shippings of the Am386SX/SXL microprocessor. # CONNECTION DIAGRAMS (continued) Pin Side View — 100-Lead Plastic Quad Flat Pack Notes: Pin 1 is marked for orientation. NC = Not connected; connection of an NC pin may cause a malfunction or incompatibility with future shippings of the Am386SX/SXL microprocessor. # AMD ## PIN DESIGNATION TABLES (sorted by Functional Grouping) | Add | ress | Da | ıta | Cor | itrol | NC | NC Vcc | | |------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------|----------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------|----------------------------------------------------------|--------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------| | Pin Name | Pin No. | Pin Name | Pin No. | Pin Name | Pin No. | Pin No. | Pin No. | Pin No. | | A1<br>A2<br>A3<br>A4<br>A5<br>A6<br>A7<br>A8<br>A9<br>A10<br>A11<br>A12<br>A13<br>A14<br>A15<br>A16<br>A17<br>A18<br>A19<br>A20<br>A21<br>A22<br>A23 | 18<br>51<br>52<br>53<br>54<br>55<br>56<br>58<br>59<br>60<br>61<br>62<br>64<br>65<br>66<br>70<br>72<br>73<br>74<br>75<br>76<br>79<br>80 | D0 D1 D2 D3 D4 D5 D6 D7 D8 D9 D10 D11 D12 D13 D14 D15 | 1<br>100<br>99<br>96<br>95<br>94<br>93<br>92<br>90<br>89<br>88<br>87<br>86<br>83<br>82<br>81 | ADS BHE BLE BUSY CLK2 D/C ERROR FLT HLDA HOLD INTR LOCK M/IO NA NMI PEREQ READY RESET W/R | 16<br>19<br>17<br>34<br>15<br>24<br>36<br>28<br>3<br>4<br>40<br>26<br>23<br>6<br>38<br>37<br>7 | 20<br>27<br>29<br>30<br>31<br>43<br>44<br>45<br>46<br>47 | 8<br>9<br>10<br>21<br>32<br>39<br>42<br>48<br>57<br>69<br>71<br>84<br>91<br>97 | 2<br>5<br>11<br>12<br>13<br>14<br>22<br>35<br>41<br>49<br>50<br>63<br>67<br>68<br>77<br>78<br>85<br>98 | ## PIN DESIGNATION TABLES (sorted by Pin Number) | Pin No. | Pin Name | Pin No. | Pin Name | Pin No. | Pin Name | Pin No. | Pin Name | Pin No. | Pin Name | |---------|----------|---------|----------|---------|------------|---------|-----------------|---------|----------| | 1 | D0 | 21 | Vcc | 41 | Vss | 61 | A11 | 81 | D15 | | 2 | Vss | 22 | Vss | 42 | Vcc | 62 | A12 | 82 | D14 | | 3 | HLDA | 23 | M/īŌ | 43 | NC | 63 | Vss | 83 | D13 | | 4 | HOLD | 24 | D/C | 44 | NC | 64 | A13 | 84 | Vcc | | 5 | Vss | 25 | W/R | 45 | NC | 65 | A14 | 85 | Vss | | 6 | NA | 26 | LOCK | 46 | NC | 66 | A15 | 86 | D12 | | 7 | READY | 27 | NC | 47 | NC | 67 | V <sub>ss</sub> | 87 | D11 | | 8 | Vcc | 28 | FLT | 48 | Vcc | 68 | Vss | 88 | D10 | | 9 | Vcc | 29 | NC . | 49 | Vss | 69 | Vcc | 89 | D9 | | 10 | Vcc | 30 | NC | 50 | Vss | 70 | A16 | 90 | D8 | | 11 | Vss | 31 | NC | 51 | A2 | 71 | Vcc | 91 | Vcc | | 12 | Vss | 32 | Vcc | 52 | A3 | 72 | A17 | 92 | D7 | | 13 | Vss | 33 | RESET | 53 | A4 | 73 | A18 | 93 | D6 | | 14 | Vss | 34 | BUSY | 54 | <b>A</b> 5 | 74 | A19 | 94 | D5 | | 15 | CLK2 | 35 | Vss | 55 | A6 | 75 | A20 | 95 | D4 | | 16 | ADS | 36 | ERROR | 56 | A7 | 76 | A21 | 96 | D3 | | 17 | BLE | 37 | PEREQ | 57 | Vcc | 77 | Vss | 97 | Vcc | | 18 | A1 | 38 | NMI | 58 | A8 | 78 | Vss | 98 | Vss | | 19 | BHE | 39 | Vcc | 59 | A9 | 79 | A22 | 99 | D2 | | 20 | NC | 40 | INTR | 60 | A10 | 80 | A23 | 100 | D1 | #### ORDERING INFORMATION #### Standard Products AMD standard products are available in several packages and operating ranges. The order number (Valid Combination) is formed by a combination of the elements below. | | Valid Combinations | | | | | | | | | | |----|--------------------|--------------------------------------------------------|--|--|--|--|--|--|--|--| | NG | 80386SX | –40, –40/F | | | | | | | | | | NG | 80386SX/SXL | -33, -33/F<br>-25, -25/F<br>-20, -20/F<br>-16*, -16/F* | | | | | | | | | ## Valid Combinations Valid Combinations list configurations planned to be supported in volume for this device. Consult the local AMD sales office to confirm availability of specific valid combinations and to check on newly released combinations. <sup>\*</sup>Contact AMD for 16-MHz availability. #### PIN DESCRIPTIONS #### A23-A1 #### Address Bus (Outputs) Outputs physical memory or port I/O addresses. #### ADS #### Address Status (Active Low: Output) Indicates that a valid bus cycle definition and address (W/R, D/C, M/IO, BHE, BLE, and A23–A1) are being driven at the Am386SX/SXL microprocessor pins. #### BHE. BLE #### **Byte Enables (Active Low: Outputs)** Indicate which data bytes of the data bus take part in a bus cycle. #### BUSY #### **Busy (Active Low; Input)** Signals a busy condition from a processor extension. #### CLK2 #### CLK2 (Input) Provides the fundamental timing for the Am386SX/SXL microprocessor. #### D15-D0 #### **Data Bus (inputs/Outputs)** Inputs data during memory, I/O, and interrupt acknowledge read cycles; outputs data during memory and I/O write cycles. ## D/C #### **Data/Control (Output)** A bus cycle definition pin that distinguishes data cycles, either memory or I/O, from control cycles which are: interrupt acknowledge, halt, and code fetch. ## ERROR #### **Error (Active Low; Input)** Signals an error condition from a processor extension. #### FLT ## Float (Active Low; Input) An input which forces all bidirectional and output signals, including HLDA, to the three-state condition. #### **HLDA** #### **Bus Hold Acknowledge (Active High; Output)** Output indicates that the Am386SX/SXL microprocessor has surrendered control of its logical bus to another bus master. #### HOLD #### **Bus Hold Request (Active High; Input)** Input allows another bus master to request control of the local bus. #### INTR ## **Interrupt Request (Active High; Input)** A maskable input that signals the Am386SX/SXL microprocessor to suspend execution of the current program and execute an interrupt acknowledge function. #### LOCK #### **Bus Lock (Active Low; Output)** A bus cycle definition pin that indicates that other system bus masters are not to gain control of the system bus while it is active. #### M/IO #### Memory/IO (Output) A bus cycle definition pin that distinguishes memory cycles from input/output cycles. #### NA #### **Next Address (Active Low; Input)** Used to request address pipelining. #### NC #### **No Connect** Should always be left unconnected. Connection of a NC pin may cause the processor to malfunction or be incompatible with future steppings of the Am386SX/SXL microprocessor. #### NM # Non-Maskable Interrupt Request (Active High; Input) A non-maskable input that signals the Am386SX/SXL microprocessor to suspend execution of the current program and execute an interrupt acknowledge function. #### PEREQ ## **Processor Extension Request (Active High: Input)** Indicates that the processor has data to be transferred by the Am386SX/SXL microprocessor. #### READY #### **Bus Ready (Active Low; Input)** Terminates the bus cycle. #### RESET ## Reset (Active High; Input) Suspends any operation in progress and places the Am386SX/SXL microprocessor in a known reset state. $V_{cc}$ ## **System Power (Input)** Provides the +5 V nominal DC supply input. $V_{ss}$ ## System Ground (Input) Provides the 0 V connection from which all inputs and outputs are measured. W/R ## Write/Read (Output) A bus cycle definition pin that distinguishes write cycles from read cycles. ## INTRODUCTION The Am386SX/SXL microprocessor is 100% object-code compatible with the Am386DX/DXL, 286, and 8086 microprocessors. System manufacturers can provide Am386DX/DXL CPU-based systems optimized for performance and Am386SX/SXL CPU-based systems optimized for cost, both sharing the same operating systems and application software. Systems based on the Am386SX/SXL microprocessor can access the world's largest existing microcomputer software base. Instruction pipelining, high-bus bandwidth, and a very high-performance ALU ensure short average instruction execution times and high system throughput. The Am386SX/SXL CPU is capable of execution at sustained rates of 2.5–3.0 million instructions per second (MIPS). The integrated Memory Management Unit (MMU) includes an address translation cache, advanced multitasking hardware, and a four-level hardware-enforced protection mechanism to support operating systems. The virtual machine capability of the Am386SX/SXL CPU allows simultaneous execution of applications from multiple operating systems such as MS-DOS and UNIX. The Am386SX/SXL CPU offers on-chip testability and debugging features. Four breakpoint registers allow conditional or unconditional breakpoint traps on code execution or data accesses for powerful debugging of even ROM-based systems. Other testability features include self-test, three-state of output buffers, and direct access to the page translation cache. #### **BASE ARCHITECTURE** The Am386SX/SXL microprocessor consists of a central processing unit, a Memory Management Unit, and a bus interface. The central processing unit consists of the execution unit and the instruction unit. The execution unit contains the eight 32-bit general purpose registers which are used for both address calculation and data operations and a 64-bit barrel shifter used to speed shift, rotate, multiply, and divide operations. The instruction unit decodes the instruction op-codes and stores them in the decoded instruction queue for immediate use by the execution unit. The MMU consists of a segmentation unit and a paging unit. Segmentation allows the managing of the logical address space by providing an extra addressing component, one that allows easy code and data relocatability, and efficient sharing. The paging mechanism operates beneath and is transparent to the segmentation process, to allow management of the physical address space. The segmentation unit provides four levels of protection for isolating and protecting applications and the operating system from each other. The hardware enforced protection allows the design of systems with a high degree of integrity. The Am386SX/SXL microprocessor has two modes of operation: Real Address Mode (Real Mode) and Protected Virtual Address Mode (Protected Mode). In Real Mode the Am386SX/SXL CPU operates as a very fast 8086, but with 32-bit extensions, if desired. Real Mode is required primarily to set up the processor for Protected Mode operation. Within Protected Mode, software can perform a task switch to enter into tasks designated as Virtual 8086 Mode tasks. Each such task behaves with 8086 semantics, thus allowing 8086 software (an application program or an entire operating system) to execute. The Virtual 8086 tasks can be isolated and protected from one another and the host Am3865X/SXL microprocessor operating system by use of paging. Finally, to facilitate high-performance system hardware designs, the Am386SX/SXL microprocessor bus interface offers address pipelining and direct Byte Enable signals for each byte of the data bus. ## **Register Set** The Am386SX/SXL microprocessor has 34 registers as shown in Figure 1. These registers are grouped into the following seven categories: General Purpose Registers: The eight 32-bit general purpose registers are used to contain arithmetic and logical operands. Four of these (EAX, EBX, ECX, and EDX) can be used either in their entirety as 32-bit registers, as 16-bit registers, or split into pairs of separate 8-bit registers. Segment Registers: Six 16-bit special purpose registers select, at any given time, the segments of memory that are immediately addressable for code, stack, and data. Flags and Instruction Pointer Registers: The two 32-bit special purpose registers in Figure 1 record or control certain aspects of the Am386SX/SXL microprocessor state. The EFLAGS register includes status and control bits that are used to reflect the outcome of many instructions and modify the semantics of some instructions. The Instruction Pointer (EIP) is 32-bits wide. The EIP controls instruction fetching, and the processor automatically increments it after executing an instruction. Control Registers: The four 32-bit control registers are used to control the global nature of the Am386SX/SXL microprocessor. The CR0 register contains bits that set the different processor modes (Protected, Real, Paging, and Coprocessor Emulation). CR2 and CR3 registers are used in the paging operation. Reserved for future use-do not use. 15022B-004 Figure 1. Am386SX/SXL Microprocessor Registers System Address Registers: These four special registers reference the tables or segments supported by the 80286/Am386SX/SXL/Am386DX/DXL CPU's protection model. These tables or segments are: GDTR (Global Descriptor Table Register), IDTR (Interrupt Descriptor Table Register), LDTR (Local Descriptor Table Register), TR (Task State Segment Register). **Debug Registers**: The six programmer accessible debug registers provide on-chip support for debugging. The use of the debug registers is described in the section Debugging Support. **Test Registers**: Two registers are used to control the testing of the RAM/CAM (Content Addressable Memories) in the Translation Look-Aside Buffer portion of the Am386SX/SXL microprocessor. Their use is discussed in the section Testability. #### **EFLAGS Register** The flag register is a 32-bit register named EFLAGS. The defined bits and bit fields within EFLAGS, shown in Figure 2, control certain operations and indicate the status of the Am386SX/SXL microprocessor. The lower 16 bits (bits 15–0) of EFLAGS contain the 16-bit flag register named FLAGS. This is the default flag register used when executing 8086, 80286, or real mode code. The functions of the flag bits are given in Table 1. #### **Control Registers** The Am386SX/SXL microprocessor has three control registers of 32 bits, CR3–CR0, to hold the machine state of a global nature. These registers are shown in Figures 1 and 2. The defined CR0 bits are described in Table 2. #### Instruction Set The instruction set is divided into nine categories of operations: Data Transfer Arithmetic Shift/Rotate String Manipulation Bit Manipulation Control Transfer High-Level Language Support Operating System Support Processor Control These instructions are listed in the Instruction Set Clock Count Summary (pages 217 through 231). Figure 2. Status and Control Register Bit Functions 15022B-005 ## Table 1. Flag Definitions | Bit Position | Name | Function | | | |--------------|------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|--| | 0 | CF | Carry Flag—Set on high-order bit carry or borrow; cleared otherwise. | | | | 2 | PF | Parity Flag—Set if low-order 8 bits of result contain an even number of 1 bits; cleared otherwise. | | | | 4 | AF | Auxiliary Carry Flag—Set on carry from or borrow to the low-order 4 bits of AL; cleared otherwise. | | | | 6 | ZF | Zero Flag—Set if result is zero; cleared otherwise. | | | | 7 | SF | Sign Flag—Set equal to high-order bit of result (0 if positive, 1 if negative). | | | | 8 | TF | Single-Step Flag—Once set, a single-step interrupt occurs after the next instruction executes. TF is cleared by the single-step interrupt. | | | | 9 | IF | Interrupt-Enable Flag—When set, maskable interrupts will cause the CPU to transfer control to an interrupt vector specified location. | | | | 10 | DF | Direction Flag—Causes string instructions to auto-increment (default) the appropriate index registers when cleared. Setting DF causes auto-decrement. | | | | 11 | OF | Overflow Flag—Set if the operation resulted in a carry/borrow into the sign bit (high-order bit) of the result but did not result in a carry/borrow out of the high-order bit or vice-versa. | | | | 12, 13 | IOPL | I/O Privilege Level—Indicates the maximum CPL permitted to execute I/O instructions without generating an Exception 13 fault or consulting the I/O permission bit map while executing in protected mode. For virtual 8086 mode it indicates the maximum CPL allowing alteration of the IF bit. | | | | 14 | NT | Nested Task—Indicates that the execution of the current task is nested within another task. | | | | 16 | RF | Resume Flag—Used in conjunction with debug register breakpoints. It is checked at instruction boundaries before breakpoint processing. If set, any debug fault is ignored on the next instruction. | | | | 17 | VM | Virtual 8086 Mode—If set while in protected mode, the Am386SXL micro-<br>processor will switch to virtual 8086 operation, handling segment loads as<br>8086 does, but generating Exception 13 faults on privileged op-codes. | | | ## Table 2. CR0 Definitions | Bit Position | Name | Function | |--------------|------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | 0 | PE | Protection Mode Enable—Places the Am386SXL microprocessor into protected mode. If PE is reset, the processor operates again in Real Mode. PE may be set by loading MSW or CR0. PE can be reset only by loading CR0; it cannot be reset by the LMSW instruction. | | 1 | MP | Monitor Coprocessor Extension—Allows WAIT instructions to cause a processor extension Not Present exception (number 7). | | 2 | ЕМ | Emulate Processor Extension—Causes a processor extension Not Present exception (number 7) on ESC instructions to allow emulating a processor extension. | | 3 | TS | Task Switched—Indicates the next instruction using a processor extension will cause Exception 7, allowing software to test whether the current processor extension context belongs to the current task. | | 31 | PG | Paging Enable Bit—Is set to enable the on-chip paging unit. It is reset to disable the on-chip paging unit. | All Am386SX/SXL microprocessor instructions operate on either 0, 1, 2, or 3 operands; an operand resides in a register, in the instruction itself, or in memory. Most zero operand instructions (e.g., CLI, STI) take only one byte. One operand instruction generally is two bytes long. The average instruction is 3.2-bytes long. Since the Am386SX/SXL CPU has a 16-byte prefetch instruction queue, an average of 5 instructions will be prefetched. The use of two operands permits the following types of common instructions: Register to Register Memory to Register Immediate to Register Memory to Memory Register to Memory Immediate to Memory The operands can be either 8, 16, or 32 bits long. As a general rule, when executing code written for the Am386SX/SXL microprocessor (32-bit code), operands are 8 or 32 bits; when executing existing 8086 or 80286 code (16-bit code), operands are 8 or 16 bits. Prefixes can be added to all instructions which override the default length of the operands (i.e., use 32-bit operands for 16-bit code, or 16-bit operands for 32-bit code). ## **Memory Organization** Memory on the Am386SX/SXL microprocessor is divided into 8-bit quantities (Bytes), 16-bit quantities (Words), and 32-bit quantities (Dwords). Words are stored in two consecutive bytes in memory with the low-order byte at the lowest address. Dwords are stored in four consecutive bytes in memory with the low-order byte at the lowest address. The address of a Word or Dword is the byte address of the low-order byte. In addition to these basic data types, the Am386SX/SXL microprocessor supports two larger units of memory: pages and segments. Memory can be divided up into one or more variable length segments, which can be swapped to disk or shared between programs. Memory can also be organized into one or more 4-Kb pages. Finally, both segmentation and paging can be combined, gaining the advantages of both systems. The Am386SX/SXL CPU supports both pages and segmentation in order to provide maximum flexibility to the system designer. Segmentation and paging are complementary. Segmentation is useful for organizing memory in logical modules, and, as such, is a tool for the application programmer, while pages are useful to the system programmer for managing the physical memory of a system. 15021B-011 Figure 3. Address Translation #### Address Spaces The Am386SX/SXL microprocessor has three types of address spaces: logical, linear, and physical. A logical address (also known as a virtual address) consists of a selector and an offset. A selector is the contents of a segment register. An offset is formed by summing all of the addressing components (Base, Index, Displacement) discussed in the section Addressing Modes, into an effective address. This effective address, along with the selector, is known as the logical address. Since each task on the Am386SX/SXL CPU has a maximum of 16K (2<sup>14</sup>-1) selectors, and offsets can be 4 Gb (with paging enabled), this gives a total of 2<sup>46</sup> bits, or 64 tb, of logical address space per task. The programmer sees the logical address space. The segmentation unit translates the logical address space into a 32-bit linear address space. If the paging unit is not enabled then the 32-bit linear address is truncated into a 24-bit physical address. The physical address is what appears on the address pins. The primary differences between Real Mode and Protected Mode are how the segmentation unit performs the translation of the logical address into the linear address, size of the address space, and paging capability. In Real Mode, the segmentation unit shifts the selector left four bits and adds the result to the effective address to form the linear address. This linear address is limited to 1 Mb. In addition, Real Mode has no paging capability. Protected Mode will see one of two different address spaces, depending on whether or not paging is enabled. Every selector has a logical base address associated with it that can be up to 32 bits in length. This 32-bit logical base address is added to the effective address to form a final 32-bit linear address. If paging is disabled, this final linear address reflects physical memory and is truncated so that only the lower 24 bits of this address are used to address the 16-Mb memory address space. If paging is enabled, this final linear address reflects a 32-bit address that is translated through the paging unit to form a 16-Mb physical address. The logical base address is stored in one of two operating system tables (i.e., the Local Descriptor Table). Figure 3 shows the relationship between the various address spaces. #### Segment Register Usage The main data structure used to organize memory is the segment. On the Am386SX/SXL CPU, segments are variable sized blocks of linear addresses which have certain attributes associated with them. There are two main types of segments, code and data. The segments are of variable size and can be as small as 1 byte or as large as 4 Gb (2<sup>32</sup> bits). In order to provide compact instruction encoding and increase processor performance, instructions do not need to explicitly specify which segment register is used. The segment register is automatically chosen according to the rules of Table 3 (Segment Register Selection Rules). In general, data references use the selector contained in the DS register; stack references use the SS register; and, instruction fetches use the CS register. The contents of the Instruction Pointer provide the offset. Special segment override prefixes allow the explicit use of a given segment register and override the implicit rules listed in Table 3. The override prefixes also allow the use of the ES. FS. and GS segment registers. There are no restrictions regarding the overlapping of the base addresses of any segments. Thus, all six segments could have the base address set to zero and create a system with 4-Gb linear address space. This creates a system where the virtual address space is the same as the linear address space. Further details of segmentation are discussed in the section Protected Mode Architecture. ## **Addressing Modes** The Am386SX/SXL microprocessor provides a total of eight addressing modes for instructions to specify operands. The addressing modes are optimized to allow the efficient execution of high-level languages such as C and FORTRAN, and they cover the vast majority of data references needed by high-level languages. #### **Register and Immediate Modes** Two of the addressing modes provide for instructions that operate on register or immediate operands. Register Operand Mode: The operand is located in one of the 8-, 16-, or 32-bit general registers. Immediate Operand Mode: The operand is included in the instruction as part of the op-code. #### 32-Bit Memory Addressing Modes The remaining six modes provide a mechanism for specifying the effective address of an operand. The linear address consists of two components: the segment base address and an effective address. The effective address is calculated by summing any combination of the following three address elements (see Figure 3). **Displacement**: an 8-,16-, or 32-bit immediate value, following the instruction. Base: The contents of any general purpose register. The base registers are generally used by compilers to point to the start of the local variable area. Index: The contents of any general purpose register except for ESP. The index registers are used to access the elements of an array or a string of characters. The index register's value can be multiplied by a scale factor, either 1, 2, 4, or 8. The scaled index is especially useful for accessing arrays or structures. **Table 3. Segment Register Selection Rules** | Type of Memory Reference | Implied (Default) Segment Use | Segment Override Prefixes Possible | | | |-----------------------------------------------------------------------------|-------------------------------|------------------------------------|--|--| | Code Fetch | cs | None | | | | Destination of PUSH, PUSHF, INT, CALL, PUSHA Instructions | SS | None | | | | Source of POP, POPA, POPF, IRET, RET Instructions | SS | None | | | | Destination of STOS, MOVE, REP<br>STOS, REP MOVS Instructions | ES | None | | | | Other Data References, with<br>Effective Address Using Base<br>Register of: | | | | | | [EAX] | DS | CS, SS, ES, FS, GS | | | | [EBX] | DS | CS, SS, ES, FS, GS | | | | [ECX] | DS | CS, SS, ES, FS, GS | | | | [EDX] | DS | CS, SS, ES, FS, GS | | | | [ESI] | DS | CS, SS, ES, FS, GS | | | | [EDI] | DS | CS, SS, ES, FS, GS | | | | [EBP] | SS | CS, DS, ES, FS, GS | | | | [ESP] | SS | CS, DS, ES, FS, GS | | | Combinations of these three components make up the six additional addressing modes. There is no performance penalty for using any of these addressing combinations, since the effective address calculation is pipelined with the execution of other instructions. The one exception is the simultaneous use of Base and Index components which requires one additional clock. As shown in Figure 4, the Effective Address (EA) of an operand is calculated according to the following formula: EA = BaseRegister + (IndexRegister × Scaling) + Displacement - Direct Mode: The operand's offset is contained as part of the instruction as an 8-, 16-, or 32-bit displacement. - Register Indirect Mode: A Base register contains the address of the operand. - 3. **Based Mode**: A Base register's contents are added to a Displacement to form the operand's offset. - Scaled Index Mode: An Index register's contents are multiplied by a Scaling factor, and the result is added to a Displacement to form the operand's offset. - Based Scaled Index Mode: The contents of an Index register are multiplied by a Scaling factor, and the result is added to the contents of a Base register to obtain the operand's offset. - Based Scaled Index Mode with Displacement: The contents of an Index register are multiplied by a Scaling factor, and the result is added to the contents of a Base register and a Displacement to form the operand's offset. #### Differences Between 16- and 32-Bit Addresses In order to provide software compatibility with the 8086 and the 80286, the Am386SX/SXL microprocessor can execute 16-bit instructions in Real and Protected Modes. The processor determines the size of the instructions it is executing by examining the D bit in a Segment Descriptor. If the D bit is 0, then all operand lengths and effective addresses are assumed to be 16-bits long. If the D bit is 1, then the default length for operands and addresses is 32 bits. In Real Mode the default size for operands and addresses is 16 bits. Regardless of the default precision of the operands or addresses, the Am386SX/SXL microprocessor is able to execute either 16- or 32-bit instructions. This is specified through the use of override prefixes. Two prefixes, the Operand Length Prefix and the Address Length Prefix, override the value of the D bit on an individual instruction basis. These prefixes are automatically added by assemblers. The Operand Length and Address Length Prefixes can be applied separately or in combination to any instruction. The Address Length Prefix does not allow addresses over 64 Kb to be accessed in Real Mode. A memory address which exceeds 0FFFH will result in a General Protection Fault. An Address Length Prefix only allows the use of the additional Am386SX/SXL CPU addressing modes. When executing 32-bit code, the Am386SX/SXL CPU uses either 8- or 32-bit displacements, and any register can be used as Base or Index registers. When executing 16-bit code, the displacements are either 8- or 16-bits, and the Base and Index registers conform to the 80286 model. Table 4 illustrates the differences. Figure 4. Addressing Mode Calculations 15021B-012 ## **Data Types** The Am386SX/SXL microprocessor supports all of the data types commonly used in high-level languages. Bit: A single bit quantity. Bit Field: A group of up to 32 contiguous bits, which spans a maximum of four bytes. **Bit String**: A set of contiguous bits; on the Am386SX/SXL microprocessor, bit strings can be up to 4 Gbits long. Byte: A signed 8-bit quantity. Unsigned Byte: An unsigned 8-bit quantity. Integer (Word): A signed 16-bit quantity. Long Integer (Dword): A signed 32-bit quantity. All operations assume a 2's complement representation. Unsigned Integer (Word): An unsigned 16-bit quantity. **Unsigned Long Integer (Dword)**: An unsigned 32-bit quantity. Signed Quad Word: A signed 64-bit quantity. Unsigned Quad Word: An unsigned 64-bit quantity. **Pointer:** A 16- or 32-bit offset-only quantity which indirectly references another memory location. Long Pointer: A full pointer which consists of a 16-bit segment selector and either a 16- or 32-bit offset. **Char**: A byte representation of an ASCII alphanumeric or control character. String: A contiguous sequence of bytes, Words, or Dwords. A string may contain between 1 byte and 4 Gb. $\boldsymbol{\mathsf{BCD}} \colon A$ byte (unpacked) representation of decimal digits 0–9. Packed BCD: A byte (packed) representation of two decimal digits 0–9 storing one digit in each nibble. When the Am386SX/SXL microprocessor is coupled with a 387SX math coprocessor, the following common floating point types are supported. **Floating Point**: A signed 32-, 64-, or 80-bit real number representation. Floating point numbers are supported by 387SX-compatible math coprocessors. Table 4. Base and Index Registers for 16- and 32-Bit Addresses | | 16-Bit<br>Addressing | 32-Bit<br>Addressing | |---------------------------------|-----------------------|----------------------------------------------------------------| | Base Register<br>Index Register | BX, BP<br>SI, DI | Any 32-bit GP Register<br>Any 32-bit GP Register<br>Except ESP | | Scale Factor<br>Displacement | None<br>0, 8, 16 bits | 1, 2, 4, 8<br>0, 8, 32 bits | Figure 5 illustrates the data types supported by the Am386SX/SXL microprocessor and a 387SX-compatible math coprocessor. ## I/O Space The Am386SX/SXL CPU has two distinct physical address spaces: physical memory and I/O. Generally, peripherals are placed in I/O space, although the Am386SX/SXL CPU also supports memory-mapped peripherals. The I/O space consists of 64 Kb which can be divided into 64K 8-bit ports or 32K 16-bit ports, or any combination of ports which add up to no more than 64 Kb. The 64Kb I/O address space refers to physical addresses rather than linear addresses since I/O instructions do not go through the segmentation or paging hardware. The M/IO pin acts as an additional address line, thus allowing the system designer to easily determine which address space the processor is accessing. The I/O ports are accessed by the In and Out instructions, with the port address supplied as an immediate 8-bit constant in the instruction or in the DX register. All 8-bit and 16-bit port addresses are zero extended on the upper address lines. The I/O instructions cause the M/IO pin to be driven Low. I/O port addresses 00F8H through 00FFH are reserved for future use. #### Interrupts and Exceptions Interrupts and exceptions alter the normal program flow in order to handle external events, report errors, or report exceptional conditions. The difference between interrupts and exceptions is that interrupts are used to handle asynchronous external events while exceptions handle instruction faults. Although a program can generate a software interrupt via an INT n instruction, the processor treats software interrupts as exceptions. Hardware interrupts occur as the result of an external event and are classified into two types: maskable or non-maskable. Interrupts are serviced after the execution of the current instruction. After the interrupt handler is finished servicing the interrupt, execution proceeds with the instruction immediately after the interrupted instruction. Exceptions are classified as faults, traps, or aborts, depending on the way they are reported and whether or not restart of the instruction causing the exception is supported. Faults are exceptions that are detected and serviced *before* the execution of the faulting instruction. Traps are exceptions that are reported immediately *after* the execution of the instruction which caused the problem. Aborts are exceptions that do not permit the precise location of the instruction causing the exception to be determined. Thus, when an interrupt service routine has been completed, execution proceeds from the instruction immediately following the interrupted instruction. On the other hand, the return address from an exception fault routine will always point to the instruction causing the exception and will include any leading instruction prefixes. Table 5 summarizes the possible interrupts for the Am386SX/SXL microprocessor and shows where the return address points. The Am386SX/SXL CPU has the ability to handle up to 256 different interrupts/exceptions. In order to service the interrupts, a table with up to 256 interrupt vectors must be defined. The interrupt vectors are simply pointers to the appropriate interrupt service routine. In Real Mode, the vectors are 4-byte quantities, a Code Segment plus a 16-bit offset; in Protected Mode, the interrupt vectors are 8-byte quantities which are put in an Interrupt Descriptor Table. Of the 256 possible interrupts, 32 are reserved for future use and the remaining 224 are free to be used by the system designer. #### Interrupt Processing When an interrupt occurs, the following actions happen. First, the current program address and Flags are saved on the stack to allow resumption of the interrupted program. Next, an 8-bit vector is supplied to the Am386SX/SXL microprocessor which identifies the appropriate entry in the interrupt table. The table contains the starting address of the interrupt service routine. Then, the user supplied interrupt service routine is executed. Finally, when an IRET instruction is executed the old processor state is restored and program execution resumes at the appropriate instruction. The 8-bit interrupt vector is supplied to the Am386SX/SXL microprocessor in several different ways: exceptions supply the interrupt vector internally; software INT instructions contain or imply the vector; maskable hardware interrupts supply the 8-bit vector via the interrupt acknowledge bus sequence. Non-Maskable hardware interrupts are assigned to interrupt vector 2. #### Maskable Interrupt Maskable interrupts are the most common way to respond to asynchronous external hardware events. A hardware interrupt occurs when the INTR is pulled High and the Interrupt Flag bit (IF) is enabled. The processor only responds to interrupts between instructions (string instructions have an interrupt window between memory moves that allows interrupts during long string Figure 5. Am386SX/SXL Microprocessor Supported Data Types **Table 5. Interrupt Vector Assignments** | Function | Interrupt<br>Number | Instruction Which<br>Can Cause Exception | Return Address Points to Faulting Instruction | Туре | |-----------------------------|---------------------|------------------------------------------------|-----------------------------------------------|-------| | Divide Error | or 0 DIV, IDIV | | Yes | FAULT | | Debug Exception | 1 | Any Instruction | Yes | TRAP* | | NMI Interrupt | 2 | INT2 or NMI | No | NMI | | One Byte Interrupt | 3 | INT | No | TRAP | | Interrupt on Overflow | 4 | INTO | No | TRAP | | Array Bounds Check | 5 | BOUND | Yes | FAULT | | Invalid Op-code | 6 | Any Illegal Instruction | Yes | FAULT | | Device Not Available | 7 | ESC, WAIT | Yes | FAULT | | Double Fault | 8 | Any instruction that can generate an exception | | ABORT | | Coprocessor Segment Overrun | 9 | ESC | No | ABORT | | Invalid TSS | 10 | JMP, CALL, IRET, INT | Yes | FAULT | | Segment Not Present | 11 | Segment Register Instructions | Yes | FAULT | | Stack Fault | 12 | Stack References | Yes | FAULT | | General Protection Fault | 13 | Any Memory Reference | Yes | FAULT | | Page Fault | 14 | Any Memory Access or Code Fetch | Yes | FAULT | | Coprocessor Error | 16 | ESC, WAIT | Yes | FAULT | | Reserved for Future Use | 17–32 | | | | | Two Byte Interrupt | 0–255 | INT n | No | TRAP | Note: Some debug exceptions may report both traps on the previous instruction and faults on the next instruction. moves). When an interrupt occurs the processor reads an 8-bit vector supplied by the hardware which identifies the source of the interrupt (one of 224 user defined interrupts). Interrupts through interrupt gates automatically reset IF bit, disabling INTR requests. Interrupts through Trap Gates leave the state of the IF bit unchanged. Interrupts through a Task Gate change the IF bit according to the image of the EFLAGs register in the task's Task State Segment (TSS). When an IRET instruction is executed, the original state of the IF bit is restored. #### Non-Maskable Interrupt Non-maskable interrupts provide a method of servicing very high priority interrupts. When the NMI input is pulled High it causes an interrupt with an internally supplied vector value of 2. Unlike a normal hardware interrupt, no interrupt acknowledgment sequence is performed for an NMI. While executing the NMI servicing procedure, the Am386SX/SXL microprocessor will not service any further NMI request or INT requests until an Interrupt Return (IRET) instruction is executed or the processor is reset. If NMI occurs while currently servicing an NMI, its presence will be saved for servicing after executing the first IRET instruction. The IF bit is cleared at the beginning of an NMI interrupt to inhibit further INTR interrupts. #### Software Interrupts A third type of interrupt/exception for the Am386SX/SXL CPU is the software interrupt. An INT n instruction causes the processor to execute the interrupt service routine pointed to by the nth vector in the interrupt table. A special case of the two byte software interrupt INT n is the one byte INT 3, or breakpoint interrupt. By inserting this one byte instruction in a program, the user can set breakpoints in his program as a debugging tool. A final type of software interrupt is the single-step interrupt. It is discussed in section Single-Step Trap. #### Interrupt and Exception Priorities Interrupts are externally generated events. Maskable Interrupts (on the INTR input) and Non-Maskable Interrupts (on the NMI input) are recognized at instruction boundaries. When NMI and maskable INTR are both recognized at the same instruction boundary, the Am386SX/SXL microprocessor invokes the NMI service routine first. If maskable interrupts are still enabled after the NMI service routine has been invoked, then the Am386SX/SXL CPU will invoke the appropriate interrupt service routine. As the Am386SX/SXL microprocessor executes instructions, it follows a consistent cycle in checking for exceptions, as shown in Table 6. This cycle is repeated as each instruction is executed, and occurs in parallel with instruction decoding and execution. #### Instruction Restart The Am386SX/SXL microprocessor fully supports restarting all instructions after Faults. If an exception is detected in the instruction to be executed (exception categories 4 through 10 in Table 6), the Am386SX/SXL microprocessor invokes the appropriate exception service routine. The Am386SX/SXL microprocessor is in a state that permits restart of the instruction, for all cases by those given in Table 7. Note that all such cases will be avoided by a properly designed operating system. #### **Double Fault** A Double Fault (Exception 8) results when the processor attempts to invoke an exception service routine for the segment exceptions (10, 11, 12, or 13), but in the process of doing so detects an exception other than a Page Fault (Exception 14). One other cause of generating a Double Fault is the Am386SX/SXL CPU detecting any other exception when it is attempting to invoke the Page Fault (Exception 14) service routine (e.g., if a Page Fault is detected when the Am386SX/SXL microprocessor attempts to invoke the Page Fault service routine). Of course, in any functional system, not only the Am386SX/SXL CPU- based systems, the entire Page Fault service must remain present in memory. ## Reset and Initialization When the processor is initialized or Reset, the registers have the values shown in Table 8. The Am386SX/SXL CPU will then start executing instructions near the top of physical memory, at location 0FFFF0H. When the first intersegment Jump or Call is executed, address lines A23–A20 will drop Low for CS-relative memory cycles, and the Am386SX/SXL CPU will only execute instructions in the lower 1 Mb of physical memory. This allows the system designer to use a shadow ROM at the top of physical memory to initialize the system and take care of Resets. Reset forces the Am386SX/SXL microprocessor to terminate all execution and local bus activity. No instruction execution or bus activity will occur as long as Reset is active. Between 350- and 450-CLK2 periods after Reset becomes inactive, the Am386SX/SXL microprocessor will start executing instructions at the top of physical memory. ### **Table 6. Sequence of Exception Checking** Consider the case of the Am386SXL microprocessor having just completed an instruction. It then performs the following checks before reaching the point where the next instruction is completed. - Check for Exception 1 Traps from the instruction just completed (single-step via Trap Flag, or Data Breakpoints set in the Debug Registers). - 2. Check for external NMI and INTR. - 3. Check for Exception 1 Faults in the next instruction (Instruction Execution Breakpoint set in the Debug Registers for the next instruction). - 4. Check for Segmentation Faults that prevented fetching the entire next instruction (Exceptions 11 and 13). - 5. Check for Page Faults that prevented fetching the entire next instruction (Exception 14). - 6. Check for Faults decoding the next instruction (Exception 6 if illegal op-code; Exception 6 if in Real Mode or in Virtual 8086 Mode and attempting to execute an instruction for Protected Mode only; or Exception 13 if instruction is longer than 15 bytes, or privilege violation in Protected Mode (i.e., not at IOPL or at CPL = 0)). - 7. If WAIT op-code, check if TS = 1 and MP = 1 (Exception 7 if both are 1). - 8. If ESCape op-code for math coprocessor, check if EM = 1 or TS = 1 (Exception 7 if either are 1). - If WAIT op-code or ESCape op-code for math coprocessor, check ERROR input signal (Exception 16 if ERROR input is asserted). - 10. Check in the following order for each memory reference required by the instruction. - a. Check for Segmentation Faults that prevent transferring the entire memory quantity (Exceptions 11, 12, and 13). - b. Check for Page Faults that prevent transferring the entire memory quantity (Exception 14). Note: Segmentation exceptions are generated before paging exceptions. ## Table 7. Conditions Preventing Instruction Restart - An instruction causes a task switch to a task whose Task State Segment (TSS) is partially not present (an entire not present TSS is restartable). Partially present TSSs can be avoided either by keeping the TSSs of such tasks present in memory, or by aligning TSS segments to reside entirely within a single 4K page (for TSS segments of 4 Kb or less). - 2. A coprocessor operand wraps around the top of a 64-Kb segment or a 4-Gb segment and spans three pages, and the page holding the middle portion of the operand is not present. This condition can be avoided by starting at a page boundary any segments containing coprocessor operands, if the segments are approximately 64K–200K bytes or larger (i.e., large enough for wraparound of the coprocessor operand to possibly occur). Note: These conditions are avoided by using the operating system designs mentioned in this table. #### **Table 8. Register Values after Reset** | EL W. LIEFLAGO | | | |---------------------------|---------------------------|--------| | Flag Word (EFLAGS) | uuuu0002H | Note 1 | | Machine Status Word (CR0) | uuuuuu10H | | | Instruction Pointer (EIP) | 0000FFF0H | | | Code Segment (CS) | F000H | Note 2 | | Data Segment (DS) | 0000H | Note 3 | | Stack Segment (SS) | 0000H | | | Extra Segment (ES) | 0000H | Note 3 | | Extra Segment (FS) | 0000H | | | Extra Segment (GS) | 0000H | | | EAX Register | 0000H | Note 4 | | EDX Register | Component and Stepping ID | Note 5 | | All Other Registers | Undefined | Note 6 | Notes: 1. EFLAGS Register. The upper 14 bits of the EFLAGS register are undefined; all defined flag bits are zero. - 2. The Code Segment register (CS) will have its Base Address set to 0FFFF0000H and Limit set to 0FFFFH. - 3. The Data and Extra Segment registers (DS and ES) will have their Base Address set to 000000000H and Limit set to 0FFFFH. - If self-test is selected, the EAX register should contain a 0 value. If a value of 0 is not found, the self-test has detected a flaw in the part. - 5. EDX register always holds a component and stepping identifier. - 6. All undefined bits are reserved for future use and should not be used. ## **Testability** The Am386SX/SXL microprocessor, like the Am386DX/DXL microprocessor, offers testability features that include a self-test and direct access to the page translation cache. #### Self-Test The Am386SX/SXL microprocessor has the capability to perform a self-test. The self-test checks the function of all of the Control ROM and most of the non-random logic of the part. Approximately one-half of the Am386SX/SXL CPU can be tested during self-test. Self-Test is initiated on the Am386SX/SXL microprocessor when the Reset pin transitions from High to Low, and the BUSY pin is Low. The self-test takes about 2<sup>20</sup> clocks, or approximately 33 ms with a 16-MHz Am386SX/SXL CPU. At the completion of self-test the processor performs reset and begins normal operation. The part has successfully passed self-test if the contents of the EAX are zero. If the results of the EAX are not zero, then the self-test has detected a flaw in the part. #### **TLB Testing** The Am386SX/SXL microprocessor also provides a mechanism for testing the Translation Look-Aside Buffer (TLB), if desired. This particular mechanism may not be continued in the same way in future processors. There are two TLB testing operations: 1) writing entries into the TLB; and, 2) performing TLB lookups. Two test registers, shown in Figure 6 are provided for the purpose of testing. TR6 is the test command register, and TR7 is the test data register. ## **Debugging Support** The Am386SX/SXL microprocessor provides several features which simplify the debugging process. The three categories of on-chip debugging aids are: - 1. The code execution breakpoint op-code (0CCH). - The single-step capability provided by the TF bit in the flag register. - The code and data breakpoint capability provided by the Debug Registers DR3–DR0, DR6, and DR7. #### **Breakpoint Instruction** A single-byte software interrupt (INT 3) breakpoint instruction is available for use by software debuggers. The breakpoint op-code is 0CCH, and generates an Exception 3 trap when executed. #### Single-Step Trap If the single-step flag (TF, bit 8) in the EFLAGS register is found to be set at the end of an instruction, a single-step exception occurs. The single-step exception is auto-vectored to Exception 1. #### **Debug Registers** The Debug Registers are an advanced debugging feature of the Am386SX/SXL microprocessor. They allow data access breakpoints as well as code execution breakpoints. Since the breakpoints are indicated by on-chip registers, an instruction execution breakpoint can be placed in ROM code or in code shared by several tasks, neither of which can be supported by the INT 3 breakpoint op-code. The Am386SX/SXL microprocessor contains six Debug Registers, consisting of four breakpoint address Table 9. Exceptions in Real Mode | Function | Interrupt<br>Number | Related<br>Instructions | Return<br>Address Location | |-------------------------------------------------|---------------------|-----------------------------------------------------------------------------------------------|----------------------------| | Interrupt table limit too small | 8 | INT vector is not within table limit | Before<br>Instruction | | CS, DS, ES, FS, GS<br>Segment Overrun exception | 13 | Word memory reference with offset = 0FFFFH. An attempt to execute past the end of CS segment. | Before<br>Instruction | | SS Segment Overrun exception | 12 | Stack Reference<br>beyond offset = 0FFFFH. | Before<br>Instruction | Figure 6. Test Registers registers and two breakpoint control registers. Initially after reset, breakpoints are in the disabled state; therefore, no breakpoints will occur unless the Debug Registers are programmed. Breakpoints set up in the Debug Registers are auto-vectored to Exception 1. Figure 7 shows the breakpoint status and control registers. #### **REAL MODE ARCHITECTURE** When the processor is reset or powered up it is initialized in Real Mode. Real Mode has the same base architecture as the 8086, but allows access to the 32-bit register set of the Am386SX/SXL microprocessor. The addressing mechanism, memory size, and interrupt handling are all identical to the Real Mode on the 80286. The default operand size in Real Mode is 16 bits, as in the 8086. In order to use the 32-bit registers and addressing modes, override prefixes must be used. In addition, the segment size on the Am386SX/SXL microprocessor in Real Mode is 64 Kb, so 32-bit addresses must have a value less than 0000FFFFH. The primary purpose of Real Mode is to set up the processor for Protected Mode operation. ## **Memory Addressing** In Real Mode the linear addresses are the same as physical addresses (paging is not allowed). Physical addresses are formed in Real Mode by adding the contents of the appropriate segment register which is shifted left by four bits to an effective address. This addition results in a 20-bit physical address or a 1-Mb address space. Since segment registers are shifted left by 4 bits, Real Mode segments always start on 16-byte boundaries. All segments in Real Mode are exactly 64-Kb long, and may be read, written, or executed. The Am386SX/SXL microprocessor will generate an Exception 13 if a data operand or instruction fetch occurs past the end of a segment. #### **Reserved Locations** There are two fixed areas in memory that are reserved in Real Address Mode: the system initialization area and the interrupt table area. Locations 00000H through 003FFH are reserved for interrupt vectors. Each one of the 256 possible interrupts has a 4-byte jump vector reserved for it. Locations 0FFFFF0H through 0FFFFFH are reserved for system initialization. ## Interrupts Many of the exceptions discussed in section Interrupts and Exceptions are not applicable to Real Mode operation; in particular, Exceptions 10, 11, and 14 do not occur in Real Mode. Other exceptions have slightly different meanings in Real Mode; Table 9 identifies these exceptions. ## Shutdown and Halt The HLT instruction stops program execution and prevents the processor from using the local bus until restarted. Either NMI, FLT, INTR with interrupts enabled (IF=1), or Reset will force the Am386SX/SXL microprocessor out of halt. If interrupted, the saved CS:IP will point to the next instruction after the HLT. Shutdown will occur when a severe error is detected that prevents further processing. In Real Mode, shutdown can occur under two conditions: - An interrupt or an exception occurs (Exceptions 8 or 13) and the interrupt vector is larger than the Interrupt Descriptor Table. - 2. A Call, INT, or Push instruction attempts to wrap around the stack segment when SP is not even. An NMI input can bring the processor out of shutdown if the Interrupt Descriptor Table limit is large enough to contain the NMI interrupt vector (at least 000FH) and the stack has enough room to contain the vector and flag information (i.e., SP is greater than 0005H). Otherwise, shutdown can only be exited by a processor reset. ## **LOCK Operation** The LOCK prefix on the Am386SX/SXL microprocessor, even in Real Mode, is more restrictive than on the 80286. This is due to the addition of paging on the Am386SX/SXL CPU in Protected Mode and Virtual 8086 Mode. The LOCK prefix is not supported during repeat string instructions. The only instruction forms where the LOCK prefix is legal on the Am386SX/SXL microprocessor are shown in Table 10. Table 10. Legal Instructions for the LOCK Prefix | Op-Code | Operands<br>(Dest, Source) | |------------------------------------|----------------------------| | BIT Test and SET/RESET/COMPLEMENT | Mem, Reg/Immed | | XCHG | Reg, Mem | | XCHG | Mem, Reg | | ADD, OR, ADC, SBB<br>AND, SUB, XOR | Mem, Reg/Immed | | NOT, NEG, INC, DEC | Mem | An Exception 6 will be generated if a LOCK prefix is placed before any instruction form or op-code not listed above. The LOCK prefix allows indivisible read/modify/ write operations on memory operands using the instructions above. The LOCK prefix is not IOPL-sensitive on the Am386SX/SXL microprocessor. The LOCK prefix can be used at any privilege level, but only on the instruction forms listed in Table 10. #### PROTECTED MODE ARCHITECTURE The complete capabilities of the Am386SX/SXL microprocessor are unlocked when the processor operates in Protected Virtual Address Mode (Protected Mode). Protected Mode vastly increases the linear address space to 4 Gb (232 bytes), and allows the running of virtual memory programs of almost unlimited size (64 tb (246 bytes)). In addition, Protected Mode allows the Am386SX/SXL CPU to run all of the existing Am386DX/ DXL CPU (using only 16 Mb of physical memory). 80286, and 8086 CPU's software, while providing a sophisticated memory management and a hardwareassisted protection mechanism. Protected Mode allows the use of additional instructions specially optimized for supporting multitasking operating systems. The base architecture of the Am386SX/SXL microprocessor remains the same; the registers, instructions, and addressing modes described in the previous sections are retained. The main difference between Protected Mode and Real Mode from a programmer's viewpoint is the increased address space and a different addressing mechanism. ## Addressing Mechanism Like Real Mode, Protected Mode uses two components to form the logical address: a 16-bit selector is used to determine the linear base address of a segment, the base address is added to a 32-bit effective address to form a 32-bit linear address. The linear address is then either used as a 24-bit physical address, or if paging is enabled, the paging mechanism maps the 32-bit linear address into a 24-bit physical address. The difference between the two modes lies in calculating the base address. In Protected Mode, the selector is used to specify an index into an operating system defined table (see Figure 8). The table contains the 32-bit base address of a given segment. The physical address is formed by adding the base address obtained from the table to the offset. Paging provides an additional memory management mechanism which operates only in Protected Mode. Paging provides a means of managing the very large segments of the Am386SX/SXL microprocessor, as paging operates beneath segmentation. The page mechanism translates the protected linear address which comes from the segmentation unit into a physical address. Figure 9 shows the complete Am386SX/SXL CPU addressing mechanism with paging enabled. 15022B-007 Figure 7. Debug Registers ## Segmentation Segmentation is one method of memory management. It provides the basis for software protection and is used to encapsulate regions of memory that have common attributes. For example, all of the code of a given program could be contained in a segment, or an operating system table may reside in a segment. All information about each segment is stored in an 8-byte data structure called a descriptor. All of the descriptors in a system are contained in descriptor tables which are recognized by hardware. #### Terminology The following terms are used throughout the discussion of descriptors, privilege levels, and protection: PL: Privilege Level—One of the four hierarchical privilege levels. Level 0 is the most privileged level and level 3 is the least privileged. RPL: Requestor Privilege Level—The privilege level of the original supplier of the selector. RPL is determined by the least two significant bits of a selector. DPL: Descriptor Privilege Level—This is the least privileged level at which a task may access that descriptor (and the segment associated with that descriptor). Descriptor Privilege Level is determined by bits 6:5 in the Access Right Byte of a descriptor. CPL: Current Privilege Level—The privilege level at which a task is currently executing, which equals the privilege level of the code segment being executed. CPL can also be determined by examining the lowest 2 bits of the CS register, except for conforming code segments. EPL: Effective Privilege Level —The effective privilege level is the least privileged of the RPL and the DPL. EPL is the numerical maximum of RPL and DPL. Task: One instance of the execution of a program. Tasks are also referred to as processes. #### **Descriptor Tables** The descriptor tables define all of the segments which are used in an Am386SX/SXL microprocessor system. There are three types of tables which hold descriptors: the Global Descriptor Table, Local Descriptor Table, and Interrupt Descriptor Table. All of the tables are variable length memory arrays and can vary in size from 8 bytes to 64 Kb. Each table can hold up to 8192 8-byte descriptors. The upper 13 bits of a selector are used as an index into the descriptor table. The tables have registers associated with them which hold the 32-bit linear base address and the 16-bit limit of each table. Figure 8. Protected Mode Addressing Figure 9. Paging and Segmentation Figure 10. Descriptor Table Registers 15021B-020 Each of the tables has a register associated with it: GDTR, LDTR, and IDTR (see Figure 1). The LGDT, LLDT, and LIDT instructions load the base and limit of the Global, Local, and Interrupt Descriptor Tables into the appropriate register. The SGDT, SLDT, and SIDT store the base and limit values. These are privileged instructions. #### Global Descriptor Table The Global Descriptor Table (GDT) contains descriptors which are available to all of the tasks in a system. The GDT can contain any type of segment descriptor except for interrupt and trap descriptors. Every Am386SX/SXL CPU system contains a GDT. The first slot of the Global Descriptor Table corresponds to the null selector and is not used. The null selector defines a null pointer value. ## Local Descriptor Table LDTs contain descriptors which are associated with a given task. Generally, operating systems are designed so that each task has a separate LDT. The LDT may contain only code, data, stack, task gate, and call gate descriptors. LDTs provide a mechanism for isolating a given task's code and data segments from the rest of the operating system, while the GDT contains descriptors for segments which are common to all tasks. A segment cannot be accessed by a task if its segment descriptor does not exist in either the current LDT or the GDT. This provides both isolation and protection for a task's segments while still allowing global data to be shared among tasks. Unlike the 6-byte GDT or IDT registers which contain a base address and limit, the visible portion of the LDT register contains only a 16-bit selector. This selector refers to a Local Descriptor Table descriptor in the GDT (see Figure 1). #### Interrupt Descriptor Table The third table needed for Am386SX/SXL microprocessor systems is the Interrupt Descriptor Table. The IDT contains the descriptors which point to the location of the up to 256 interrupt service routines. The IDT may contain only task gates, interrupt gates, and trap gates. The IDT should be at least 256 bytes in size in order to hold the descriptors for the 32 interrupts reserved for future use. Every interrupt used by a system must have an entry in the IDT. The IDT entries are referenced by INT instructions, external interrupt vectors, and exceptions. ## **Descriptors** The object to which the segment selector points to is called a descriptor. Descriptors are eight byte quantities which contain attributes about a given region of linear address space. These attributes include the 32-bit base linear address of the segment, the 20-bit length and granularity of the segment, the protection level, read, write, or execute privileges, the default size of the operands (16 bit or 32 bit), and the type of segment. All of the attribute information about a segment is contained in 12 bits in the seament descriptor. Figure 11 shows the general format of a descriptor. All segments on the Am386SX/SXL microprocessor have three attribute fields in common: the P bit, the DPL bit, and the S bit. The P (Present) Bit is 1 if the segment is loaded in physical memory. If P = 0, then any attempt to access this segment causes a Not Present exception (number 11). The Descriptor Privilege Level (DPL) is a two bit field which specifies the protection level, 0-3, associated with a segment. The Am386SX/SXL microprocessor has two main categories of segments: system segments and non-system segments (for code and data). The segment bit (S) determines if a given segment is a system segment or a code or data segment. If the S bit is 1, then the segment is either a code or data segment; if it is 0, then the segment is a system segment. ## Code and Data Descriptors (S=1) Figure 12 shows the general format of a code and data descriptor, and Table 11 illustrates how the bits in the Access Right Byte are interpreted. Code and data segments have several descriptor fields in common. The accessed bit (A) is set whenever the processor accesses a descriptor. The granularity bit (G) specifies if a segment length is byte-granular or page-granular. **Figure 11. Segment Descriptors** | 31 | Segm | ent Ba | se ' | 15–0 | ) | | Segment Limit 1 | 5–0 | 0 Byte Address<br>0 | |-----|------------------------------------------|--------|------|------|-----|----------------|----------------------------------------|------------------------------------|----------------------------| | | Base 31–24 | G | D | 0 | AVL | Limit<br>19–16 | Access Rights Bytes | Base<br>23–16 | +4 | | D/B | 1 = Default Instru<br>0 = Default Instru | | | | | | Granularity Bit 1 = Segme<br>0 = Segme | nt length is pa<br>nt length is by | ge-granular<br>te-granular | | AVL | | | | | | | Bit must be zero for compatil | • | - | 15021B-023 **Byte Address** Figure 12. Code and Data Descriptors 15021B-024 Figure 13. System Descriptors Table 11. Access Rights Byte Definition for Code and Data Descriptors | Bit<br>Position | Name | Function | | | | | | |-----------------|-----------------------------------|----------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------|--|--|--| | 7 | Present (P) | P=1 Segment is mapped into physical memory. No mapping to physical memory exists, Base and Limit are not used. | | | | | | | 6–5 | Descriptor Privilege Levels (DPL) | Segment privilege attribute used in privilege tests. | | | | | | | 4 | Segment Descriptor (S) | S = 1 Code or Data (includes stacks) Segment Descriptor. S = 0 System Segment Descriptor or Gate Descriptor. | | | | | | | 3 | Executable (E) | E=0 | Descriptor type is data segment: | If Data | | | | | 2 | Expansion Direction (ED) | ED=0<br>ED=1 | Expand up segment, offsets must be ≤ limit. Expand down segment, offsets must be > limit. | Segment<br>(S = 1,<br>E = 0) | | | | | 1 | Writeable (W) | W = 0<br>W = 1 | Data segment may not be written into. Data segment may be written into. | | | | | | 3 | Executable (E) | E=1 | Descriptor type is code segment: | # Codo | | | | | 2 | Conforming (C) | C=1 | Code segment may only be executed when CPL≥DPL and CPL remains unchanged. | If Code<br>Segment<br>(S = 1,<br>E = 1) | | | | | 1 | Readable (R) | R=0<br>R=1 | Code segment may not be read. Code segment may be read. | | | | | | 0 | Accessed (A) | A = 0<br>A = 1 | Segment has not been accessed.<br>Segment selector has been loaded into segment reg<br>or used by selector test instructions. | ister | | | | #### System Descriptor Formats (S=0) System segments describe information about operating system tables, task, and gates. Figure 13 shows the general format of system segment descriptors, and the various types of system segments. Am386SX/SXL CPU system descriptors (which are the same as Am386DX/DXL CPU system descriptors) contain a 32-bit base linear address and a 20-bit segment limit. 80286 system descriptors have a 24-bit base address and a 16-bit segment limit. 80286 system descriptors are identified by the upper 16 bits being all zero. # Differences Between Am386SX/SXL Microprocessor and 80286 Descriptors In order to provide operating system compatibility with the 80286, the Am386SX/SXL CPU supports all of the 80286 segment descriptors. The 80286 system segment descriptors contain a 24-bit base address and 16-bit limit, while the Am386SX/SXL CPU system segment descriptors have a 32-bit base address, a 20-bit limit field, and a granularity bit. The word count field specifies the number of 16-bit quantities to copy for 80286 call gates and 32-bit quantities for Am386SX/SXL CPU call gates. #### Selector Fields A selector in Protected Mode has three fields: Local or Global Descriptor Table Indicator (TI), Descriptor Entry Index (Index), and Requestor (the selector's) Privilege Level (RPL), as shown in Figure 14. The TI bit selects either the Global Descriptor Table or the Local Descriptor Table. The Index selects one of 8K descriptors in the appropriate descriptor table. The RPL bits allow high speed testing of the selector's privilege attributes. #### Seament Descriptor Cache In addition to the selector value, every segment register has a segment descriptor cache register associated with it. Whenever a segment register's contents are changed, the 8-byte descriptor associated with the selector is automatically loaded (cached) on the chip. Once loaded, all references to that segment use the cached descriptor information instead of reaccessing the descriptor. The contents of the descriptor cache are not visible to the programmer. Since descriptor caches only change when a segment register is changed, programs which modify the descriptor tables must reload the appropriate segment registers after changing a descriptor's value. #### **Protection** The Am386SX/SXL microprocessor has four levels of protection which are optimized to support a multitasking operating system and to isolate and protect user programs from each other and the operating system. The privilege levels control the use of privileged instructions, I/O instructions, and access to segments and segment descriptors. The Am386SX/SXL microprocessor also offers an additional type of protection on a page basis when paging is enabled. The four-level hierarchical privilege system is an extension of the user/supervisor privilege mode commonly used by minicomputers. The user/supervisor mode is fully supported by the Am386SX/SXL microprocessor paging mechanism. The Privilege Levels (PL) are numbered 0 through 3. Level 0 is the most privileged level. Figure 14. Example Descriptor Selection 15021B-027 Table 12. Descriptor Types Used for Control Transfer | Control Transfer Types | Operation Types | Descriptor<br>Referenced | Descriptor<br>Table | | | |------------------------------------------------------------|----------------------------------------------------------------------|---------------------------|---------------------|--|--| | Intersegment within the same privilege level | JMP, CALL, RET, IRET* | Code Segment | GDT/LDT | | | | Intersegment to the same or higher privilege level | CALL | Call Gate | GDT/LDT | | | | Interrupt within task may change CPL | Interrupt Instruction,<br>Exception, External<br>Interrupt | Trap or Interrupt<br>Gate | IDT | | | | Intersegment to a lower privilege level (changes task CPL) | RET, IRET* | Code Segment | GDT/LDT | | | | | CALL, JMP | Task State<br>Segment | GDT | | | | | CALL, JMP | Task Gate | GDT/LDT | | | | Task Switch | IRET**<br>Interrupt Instruction,<br>Exception, External<br>Interrupt | Task Gate | IDT | | | <sup>\*</sup>NT (Nested Task bit of flag register) = 0 #### **Rules of Privilege** The Am386SX/SXL microprocessor controls access to both data and procedures between levels of a task. according to the following rules: - -Data stored in a segment with privilege level p can be accessed only by code executing at a privilege level at least as privileged as p. - —A code segment/procedure with privilege level p can only be called by a task executing at the same or lesser privilege level than p. #### **Privilege Levels** At any point in time, a task on the Am386SX/SXL microprocessor always executes at one of the four privilege levels. The Current Privilege Level (CPL) specifies what the task's privilege level is. A task's CPL may only be changed by control transfers through gate descriptors to a code segment with a different privilege level. Thus, an application program running at PL=3 may call an operating system routine at PL=1 (via a gate) which would cause the task's CPL to be set to 1 until the operating system routine was finished. #### Selector Privilege (RPL) The privilege level of a selector is specified by the RPL field. The selector's RPL is only used to establish a less trusted privilege level than the current privilege level of the task for the use of a segment. This level is called the task's Effective Privilege Level (EPL). The EPL is defined as being the least privileged (numerically larger) level of a task's CPL and a selector's RPL. The RPL is most commonly used to verify that pointers passed to an operating system procedure do not access data that is of higher privilege than the procedure that originated the pointer. Since the originator of a selector can specify any RPL value, the Adjust RPL (ARPL) instruction is provided to force the RPL bits to the originator's CPL. #### I/O Privilege The I/O Privilege Level (IOPL) lets the operating system code executing at CPL = 0 define the least privileged level at which I/O instructions can be used. An Exception 13 (General Protection Violation) is generated if an I/O instruction is attempted when the CPL of the task is less privileged then the IOPL. The IOPL is stored in bits 13 and 14 of the EFLAGS register. The following instructions cause an Exception 13 if the CPL is greater than IOPL: IN, INS, OUT, OUTS, STI, CLI, and LOCK prefix. ## **Descriptor Access** There are basically two types of segment accesses: those involving code segments such as control transfers, and those involving data accesses. Determining the ability of a task to access a segment involves the type of segment to be accessed, the instruction used, the type of descriptor used, and CPL, RPL, and DPL as described above. Any time an instruction loads a data segment register (DS, ES, FS, GS) the Am386SX/SXL CPU makes protection validation checks. Selectors loaded in the DS. ES, FS, GS registers must refer only to data segment or readable code segments. Finally, the privilege validation checks are performed. The CPL is compared to the EPL and if the EPL is more privileged than the CPL, an Exception 13 (General Protection Fault) is generated. <sup>\*\*</sup>NT (Nested Task bit of flag register) = 1 Figure 15. TSS and TSS Registers Figure 16. Sample I/O Permission Bit Map 15022B-032b The rules regarding the stack segment are slightly different than those involving data segments. Instructions that load selectors into SS must refer to data segment descriptors for writeable data segments. The DPL and RPL must equal the CPL of all other descriptor types or a privilege level violation will cause an Exception 13. A stack not present fault causes an Exception 12. #### **Privilege Level Transfers** Inter-segment control transfers occur when a selector is loaded in the CS register. For a typical system most of these transfers are simply the result of a call or a jump to another routine. There are five types of control transfers which are summarized in Table 12. Many of these transfers result in a privilege level transfer. Changing privilege levels is done only by control transfers, using gates, task switches, and interrupt or trap gates. Control transfers can only occur if the operation which loaded the selector references the correct descriptor type. Any violation of these descriptor usage rules will cause an Exception 13. ## **CALL Gates** Gates provide protected indirect CALLs. One of the major uses of gates is to provide a secure method of privilege transfers within a task. Since the operating system defines all of the gates in a system, it can ensure that all gates only allow entry into a few trusted procedures. #### Task Switching A very important attribute of any multitasking/multi-user operating system is its ability to rapidly switch between tasks or processes. The Am386SX/SXL microprocessor directly supports this operation by providing a task switch instruction in hardware. The task switch operation saves the entire state of the machine (all of the registers, address space, and a link to the previous task), loads a new execution state, performs protection checks, and commences execution in the new task. Like transfer of control by gates, the task switch operation is invoked by executing an intersegment JMP or CALL instruction which refers to a Task State Segment (TSS), or a task gate descriptor in the GDT or LDT. An INT n instruction, exception, trap, or external interrupt may also invoke the task switch operation if there is a task gate descriptor in the associated IDT descriptor slot. The TSS descriptor points to a segment (see Figure 15) containing the entire execution state. A task gate descriptor contains a TSS selector. The Am386SX/SXL microprocessor supports both 80286 and Am386SX/SXL CPU TSSs. The limit of an Am386SX/SXL CPU TSS must be greater than 64H (2BH for a 80286 TSS), and can be as large as 16 Mb. In the additional TSS space, the operating system is free to store additional information such as the reason the task is inactive, the time the task has spent running, or open files belonging to the task. Each task must have a TSS associated with it. The current TSS is identified by a special register in the Am386SX/SXL microprocessor called the Task State Segment Register (TR). This register contains a selector referring to the task state segment descriptor that defines the current TSS. A hidden base and limit register associated with TSS descriptor are loaded whenever TR is loaded with a new selector. Returning from a task is accomplished by the IRET instruction. When IRET is executed, control is returned to the task which was interrupted. The currently executing task's state is saved in the TSS and the old task state is restored from its TSS. Several bits in the flag register and machine status word (CR0) give information about the state of a task which is useful to the operating system. The Nested Task bit (NT) controls the function of the IRET instruction. If NT = 0, the IRET instruction performs the regular return. If NT = 1, IRET performs a task switch operation base to the previous task. The NT bit is set or reset in the following fashion: When a CALL or INT instruction initiates a task switch, the new TSS will be marked busy and the back link field of the new TSS set to the old TSS selector. The NT bit of the new task is set by CALL or INT initiated task switches. An interrupt that does not cause a task switch will clear NT (the NT bit will be restored after execution of the interrupt handler). NT may also be set or cleared by POPF or IRET instructions. The Am386SX/SXL microprocessor task state segment is marked busy by changing the descriptor type field from Type 9 to Type 0BH. An 80286 TSS is marked busy by changing the descriptor type field from Type 1 to Type 3. Use of a selector that references a busy task state segment causes an Exception 13. The VM (Virtual Mode) bit is used to indicate if a task is a Virtual 8086 task. If VM = 1 then the tasks will use the Real Mode addressing mechanism. The Virtual 8086 environment is only entered and exited by a task switch. The coprocessor's state is not automatically saved when a task switch occurs. The Task Switched Bit (TS) in the CR0 register helps deal with the coprocessor's state in a multitasking environment. Whenever the Am386SX/SXL microprocessor switches tasks, it sets the TS bit. The Am386SX/SXL CPU detects the first use of a processor extension instruction after a task switch and causes the processor extension Not Available Exception 7. The exception handler for Exception 7 may then decide whether to save the state of the coprocessor. The T bit in the Am386SX/SXL microprocessor TSS indicates that the processor should generate a debug exception when switching to a task. If T=1, then upon entry to a new task a debug Exception 1 will be generated. #### Initialization and Transition To Protected Mode Since the Am386SX/SXL microprocessor begins executing in Real Mode immediately after RESET, it is necessary to initialize the system tables and registers with the appropriate values. The GDT and IDT registers must refer to a valid GDT and IDT. The IDT should be at least 256 bytes long, and the GDT must contain descriptors for the initial code and data segments. Protected Mode is enabled by loading CR0 with PE bit set. This can be accomplished by using the MOV CR0, R/M instruction. After enabling Protected Mode, the next instruction should execute an intersegment JMP to load the CS register and flush the instruction decode queue. The final step is to load all of the data segment registers with the initial selector values. An alternate approach to entering Protected Mode is to use the built in task-switch to load all of the registers. In this case the GDT would contain two TSS descriptors in addition to the code and data descriptors needed for the first task. The first JMP instruction in Protected Mode would jump to the TSS causing a task switch and loading all of the registers with the values stored in the TSS. The Task State Segment Register should be initialized to point to a valid TSS descriptor. ## **Paging** Paging is another type of memory management useful for virtual memory multitasking operating systems. Unlike segmentation, which modularizes programs and data into variable length segments, paging divides programs into multiple uniform size pages. Pages bear no direct relation to the logical structure of a program. While segment selectors can be considered the logical name of a program module or data structure, a page most likely corresponds to only a portion of a module or data structure. ## **Page Organization** The Am386SX/SXL microprocessor uses two levels of tables to translate the linear address (from the segmentation unit) into a physical address. There are three components to the paging mechanism of the Am386SX/SXL CPU: the Page Directory, the Page Tables, and the page itself (Page Frame). All memory-resident elements of the Am386SX/SXL microprocessor paging mechanism are the same size, namely 4 Kb. A uniform size for all of the elements simplifies memory allocation and reallocation schemes, since there is no problem with memory fragmentation. Figure 17 shows how the paging mechanism works. Figure 17. Paging Mechanism | 31 | 12 | 11 | 10 | 9 | 8 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 | |--------------------------|----|----|------------------------|---|---|---|---|---|---|---|--------|--------|---| | Page Table Address 31–12 | | So | stem<br>ftwar<br>finab | | 0 | 0 | D | А | 0 | 0 | U<br>S | R<br>W | Р | Figure 18. Page Directory Entry (Points to Page Table) 15022B-037 | 31 | 12 | 11 | 10 | 9 | 8 | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 | | |--------------------------|----|----|------------------------|---|---|---|---|---|---|---|-------|--------|---|--| | Page Frame Address 31–12 | | Só | stem<br>ftwar<br>finab | е | 0 | 0 | D | А | 0 | 0 | D S | R<br>W | Р | | Figure 19. Page Table Entry (Points to Page) 15022B-038 #### Page Fault Register CR2 is the Page Fault Linear Address register. It holds the 32-bit linear address which caused the last Page Fault detected. #### Page Descriptor Base Register CR3 is the Page Directory Physical Base Address Register. It contains the physical starting address of the Page Directory (this value is truncated to a 24-bit value associated with the Am386SX/SXL CPU's 16-Mb physical memory limitation). The lower 12 bits of CR3 are always zero to ensure that the Page Directory is always page aligned. Loading it with a MOV CR3, reg instruction causes the Page Table entry cache to be flushed, as will a task switch through a TSS which changes the value of CR0. #### Page Directory The Page Directory is 4-Kb long and allows up to 1024 Page Directory entries. Each Page Directory entry contains information about the Page Table and the address of the next level of tables, the Page Tables. The contents of a Page Directory entry are shown in Figure 18. The upper 10 bits of the linear address (A31-A22) are used as an index to select the correct Page Directory entry. The Page Table address contains the upper 20 bits of a 32-bit physical address that is used as the base address for the next set of tables, the Page Tables. The lower 12 bits of the Page Table addresses appear on 4-Kb boundaries. For an Am386DX/DXL CPU system, the upper 20 bits will select one of 2<sup>20</sup> Page Tables, but for an Am386SX/SXL microprocessor system, the upper 20 bits only select one of 2<sup>12</sup> Page Tables. Again, this is because the Am386SX/SXL CPU is limited to a 24-bit physical address, and the upper 8 bits (A31–A24) are truncated when the address is output on its 24 address pins. #### Page Tables Each Page Table is 4-Kb long and allows up to 1024 Page Table entries. Each Page Table entry contains information about the Page Frame and its address. The contents of a Page Table entry are shown in Figure 19. The middle 10 bits of the linear address (A21–A12) are used as an index to select the correct Page Table The Page Frame address contains the upper 20 bits of a 32-bit physical address which is used as the base address for the Page Frame. The lower 12 bits of the Page Frame address are zero so that the Page Frame addresses appear on 4-Kb boundaries. For an Am386DX/DXL CPU system, the upper 20 bits will select one of 2<sup>20</sup> Page Frames, but for an Am386SX/SXL microprocessor system, the upper 20 bits only select one of 2<sup>12</sup> Page Frames. Again, this is because the Am386SX/SXL CPU is limited to a 24-bit physical address space, and the upper 8 bits (A31–A24) are truncated when the address is output on its 24 address pins. #### Page Directory/Table Entries The lower 12 bits of the Page Table entries and Page Directory entries contain statistical information about pages and Page Tables, respectively. The P (Present) bit indicates if a Page Directory or Page Table entry can be used in address translation. If P=1, the entry can be used for address translation. If P=0, the entry cannot be used fortranslation. All of the other bits, are available for use by the software. For example, the remaining 31 bits could be used to indicate where on disk the page is stored. The A (Accessed) bit is set by the Am386SX/SXL CPU for both types of entries before a read or write access occurs to an address covered by the entry. The D (Dirty) bit is set to 1 before a write to an address covered by that Page Table entry occurs. The D bit is undefined for Page Directory entries. When the P, A, and D bits are updated by the Am386SX/SXL CPU, the processor generates a Read-Modify-Write cycle which locks the bus and prevents conflicts with other processors or peripherals. Software which modifies these bits should use the LOCK prefix to ensure the integrity of the Page Tables in multi-master systems. The 3 bits marked system software definable (in Figures 18 and 19) are software definable. System software writers are free to use these bits for whatever purpose they wish. #### Page Level Protection (R/W, U/S Bits) The Am386SX/SXL microprocessor provides a set of protection attributes for paging systems. The paging mechanism distinguishes between two levels of protection: User, which corresponds to level 3 of the segmentation based protection; and Supervisor, which encompasses all of the other protection levels (0, 1, and 2). Programs executing at Level 0, 1, or 2 bypass the page protection, although segmentation-based protection is still enforced by the hardware. The U/S and R/W bits are used to provide User/Supervisor and Read/Write protection for individual pages, or for all pages covered by a Page Table Directory entry. The U/S and R/W bits in the second level Page Table entry apply only to the page described by that entry. The U/S and R/W bits in the first level Page Directory Table apply to all pages described by the Page Table pointed to by that directory entry. The U/S and R/W bits for a given page are obtained by taking the most restrictive of the U/S and R/W bits from the Page Directory Table entries and using these bits to address the page. #### **Translation Look-Aside Buffer** The Am386SX/SXL microprocessor paging hardware is designed to support demand paged virtual memory systems. However, performance would degrade substantially if the processor was required to access two levels of tables for every memory reference. To solve this problem, the Am386SX/SXL CPU keeps a cache of the most recently accessed pages; this cache is called the Translation Look-Aside Buffer (TLB). The TLB is a four-way set associative 32-entry Page Table cache. It automatically keeps the most commonly used Page Table entries in the processor. The 32-entry TLB coupled with a 4K page size results in coverage of 128 Kb of memory addresses. For many common multitasking systems, the TLB will have a hit rate of greater than 98%. This means that the processor will only have to access the two-level page structure for less than 2% of all memory references. #### **Paging Operation** The paging hardware operates in the following fashion. The paging unit hardware receives a 32-bit linear address from the segmentation unit. The upper 20 linear address bits are compared with all 32 entries in the TLB to determine if there is a match. If there is a match (i.e., a TLB hit), then the 24-bit physical address is calculated and is placed on the address bus. If the Page Table entry is not in the TLB, the Am386SX/SXL microprocessor will read the appropriate Page Directory entry. If P=1 on the Page Directory entry, indicating that the Page Table is in memory, then the Am386SX/SXL CPU will read the appropriate Page Table entry and set the Access bit. If P=1 on the Page Table entry, indicating that the page is in memory, the Am386SX/SXL microprocessor will update the Access and Dirty bits as needed and fetch the operand. The upper 20 bits of the linear address, read from the Page Table, will be stored in the TLB for future accesses. If P = 0 for either the Page Directory entry or the Page Table entry, then the processor will generate a Page Fault (Exception 14). The processor will also generate a Page Fault (Exception 14) if the memory reference violated the page protection attributes. CR2 will hold the linear address which caused the Page Fault. Since Exception 14 is classified as a fault, CS:EIP will point to the instruction causing the Page Fault. The 16-bit error code, pushed as part of the Page Fault handler, will contain status bits which indicate the cause of the Page Fault. The 16-bit error code is used by the operating system to determine how to handle the Page Fault. Figure 20 shows the format of the Page Fault error code and the interpretation of the bits. Even though the bits in the error code (U/S, W/R, and P) have similar names as the bits in the Page Directory/Table Entries, the interpretation of the error code bits is different. Figure 21 indicates what type of access caused the Page Fault. U/S: The U/S bit indicates whether the access causing the fault occurred when the processor was executing in User Mode (U/S = 1) or in Supervisor mode (U/S = 0). W/R: The W/R bit indicates whether the access causing the fault was a Read (W/R = 0) or a Write (W/R = 1). **P**: The P bit indicates whether a Page Fault was caused by a not-present page (P = 0), or by a page level protection violation (P=1). U = Undefined Figure 20. Page Fault Error Code Format | U/S | W/R | Access Type | |-----|-----|------------------| | 0 | 0 | Supervisor* Read | | 0 | 1 | Supervisor Write | | 1 | 0 | User Read | | 1 | 1 | User Write | <sup>\*</sup>Descriptor table access will fault with U/S = 0, even if the program is executing at level 3. Figure 21. Type of Access Causing Page Fault #### **Operating System Responsibilities** When the operating system enters or exits paging mode (by setting or resetting bit 31 in the CR0 register), a short JMP must be executed to flush the Am386SX/SXL microprocessor's prefetch queue. This ensures that all instructions executed after the address mode change will generate correct addresses. The Am386SX/SXL microprocessor takes care of the page address translation process, relieving the burden from an operating system in a demand-paged system. The operating system is responsible for setting up the initial Page Tables and handling any Page Faults. The operating system is also required to invalidate (i.e., flush) the TLB when any changes are made to any of the Page Table entries. The operating system must reload CR3 to cause the TLB to be flushed. Setting up the tables is simply a matter of loading CR3 with the address of the Page Directory, and allocating space for the Page Directory and the Page Tables. The primary responsibility of the operating system is to implement a swapping policy and handle all of the Page Faults. A final concern of the operating system is to ensure that the TLB cache matches the information in the paging tables. In particular, any time the operating systems sets the P (Present) bit of Page Table entry to zero, the TLB must be flushed by reloading CR3. Operating systems may want to take advantage of the fact that CR3 is stored as part of a TSS, to give every task or group of tasks its own set of Page Tables. #### Virtual 8086 Environment The Am386SX/SXL microprocessor allows the execution of 8086 application programs in both Real Mode and in Virtual 8086 Mode. The Virtual 8086 Mode allows the execution of 8086 applications, while still allowing the system designer to take full advantage of the Am386SX/SXL CPU's protection mechanism. #### Virtual 8086 Addressing Mechanism One of the major differences between Am386SX/SXL CPU Real and Protected modes is how the segment selectors are interpreted. When the processor is executing in Virtual 8086 Mode, the segment registers are used in a fashion identical to Real Mode. The contents of the segment register are shifted left four bits and added to the offset to form the segment base linear address. The Am386SX/SXL microprocessor allows the operating system to specify which programs use the 8086 address mechanism and which programs use Protected Mode addressing on a per task basis. Through the use of paging, the 1-Mb address space of the Virtual Mode task can be mapped to anywhere in the 4-Gb linear address space of the Am386SX/SXL CPU. Like Real Mode, Virtual Mode addresses that exceed 1 Mb will cause an Exception 13. However, these restrictions should not prove to be important, because most tasks running in Virtual 8086 Mode will simply be existing 8086 application programs. #### Paging in Virtual Mode The paging hardware allows the concurrent running of multiple Virtual Mode tasks, and provides protection and operating system isolation. Although it is not strictly necessary to have the paging hardware enabled to run Virtual Mode tasks, it is needed in order to run multiple Virtual Mode tasks or to relocate the address space of a Virtual Mode task to physical address space greater than 1 Mb. The paging hardware allows the 20-bit linear address produced by a Virtual Mode program to be divided into as many as 256 pages. Each one of the pages can be located anywhere within the maximum 16-Mb physical address space of the Am386SX/SXL microprocessor. In addition, since CR3 (the Page Directory Base Register) is loaded by a task switch, each Virtual Mode task can use a different mapping scheme to map pages to different physical locations. Finally, the paging hardware allows the sharing of the 8086 operating system code between multiple 8086 applications. #### Protection and I/O Permission Bit Map All Virtual Mode programs execute at privilege level 3. As such, Virtual Mode programs are subject to all of the protection checks defined in Protected Mode. This is different from Real Mode, which implicitly is executing at privilege level 0. Thus, an attempt to execute a privileged instruction in Virtual Mode will cause an Exception 13 fault. The following are privileged instructions, which may be executed only at Privilege level 0. Attempting to execute these instructions in Virtual 8086 Mode (or anytime $CPL \ge 0$ ) causes an Exception 13 fault: ``` LIDT; MOV DRn,REG; MOV reg,DRn; LGDT; MOV TRn,reg; MOV reg,TRn; LMSW; MOV CRn,reg; MOV reg,CRn; CLTS; HLT; ``` Several instructions, particularly those applying to the multitasking and the protection model, are available only in Protected Mode. Therefore, attempting to execute the following instructions in Real Mode or in Virtual 8086 Mode generates an Exception 6 fault: ``` LTR; STR; LLDT; SLDT; LAR; VERR; LSL; VERW; ARPL: ``` The instructions which are IOPL sensitive in Protected Mode are: ``` IN; STI; OUT; CLI INS; OUTS; REP INS; REP OUTS; ``` In Virtual 8086 Mode the following instructions are IOPL sensitive: ``` INT n; STI; PUSHF; CLI; POPF; IRET; ``` The PUSHF, POPF, and IRET instructions are IOPL sensitive in Virtual 8086 Mode only. This provision allows the IF flag to be virtualized to the virtual 8086 Mode program. The INT n software interrupt instruction is also IOPL sensitive in Virtual 8086 Mode. Note that the INT 3, INTO, and BOUND instructions are not IOPL sensitive in Virtual 8086 Mode. The I/O instructions that directly refer to addresses in the processor's I/O space are IN, INS, OUT, and OUTS. The Am386SX/SXL microprocessor has the ability to selectively trap references to specific I/O addresses. The structure that enables selective trapping is the I/O Permission Bit Map in the TSS segment (see Figures 15 and 16). The I/O permission map is a bit vector. The size of the map and its location in the TSS segment are variable. The processor locates the I/O permission map by means of the I/O map base field in the fixed portion of the TSS. The I/O map base field is 16-bits wide and contains the offset of the beginning of the I/O permission In protected mode, when an I/O instruction (IN, INS, OUT, or OUTS) is encountered, the processor first checks whether CPL < IOPL. If this condition is true, the I/O operation may proceed. If not true, the processor checks the I/O permission map (in Virtual 8086 Mode, the processor consults the map without regard for the IOPL). Each bit in the map corresponds to an I/O port byte address; for example, the bit for port 41 is found at I/O map base +5, bit offset 1. The processor tests all the bits that correspond to the I/O addresses spanned by an I/O operation; for example, a Dword operation tests four bits corresponding to four adjacent byte addresses. If any tested bit is set, the processor signals a general protection exception. If all the tested bits are zero, the I/O operations may proceed. It is not necessary for the I/O permission map to represent all the I/O addresses. I/O addresses not spanned by the map are treated as if they had one-bits in the map. The I/O map base should be at least one byte less than the TSS limit: the last byte beyond the I/O mapping information must contain all 1s. Because the I/O permission map is in the TSS segment. different tasks can have different maps. Thus, the operating system can allocate ports to a task by changing the I/O permission map in the task's TSS. Important Implementation Note: Beyond the last byte of I/O mapping information in the I/O permission bit map must be a byte containing all 1s. The byte of all 1s must be within the limit of the Am386SX/SXL CPU TSS segment (see Figure 15). #### Interrupt Handling In order to fully support the emulation of an 8086 machine, interrupts in Virtual 8086 Mode are handled in a unique fashion. When running in Virtual Mode all interrupts and exceptions involve a privilege change back to the host Am386SX/SXL microprocessor operating system. The Am386SX/SXL CPU operating system deter-mines if the interrupt comes from a Protected Mode application, or from a Virtual Mode program, by examining the VM bit in the EFLAGS image stored on the stack. When a Virtual Mode program is interrupted, and execution passes to the interrupt routine at level 0, the VM bit is cleared. However, the VM bit is still set in the EFLAGS image on the stack. The Am386SX/SXL microprocessor operating system in turn handles the exception or interrupt and then returns control to the 8086 program. The Am386SX/ SXL microprocessor operating system may choose to let the 8086 operating system handle the interrupt, or it may emulate the function of the interrupt handler. For example, many 8086 operating system calls are accessed by PUSHing parameters on the stack, and then executing an INT n instruction. If the IOPL is set to 0, then all INT n instructions will be intercepted by the Am386SX/SXL CPU operating system. An Am386SX/SXL microprocessor operating system can provide a Virtual 8086 environment which is totally transparent to the application software by intercepting and then emulating 8086 operating system's calls, and intercepting IN and OUT instructions. #### Entering and Leaving Virtual 8086 Mode Virtual 8086 Mode is entered by executing a 32-bit IRET instruction at CPL = 0, where the stack has a 1 in the VM bit of its EFLAGS image, or a Task Switch (at any CPL) to an Am386SX/SXL microprocessor task whose Am386SX/SXL CPU TSS has an EFLAGS image containing a 1 in the VM bit position, while the processor is executing in the Protected Mode. POPF does not affect the VM bit, but a PUSHF always pushes a 0 in the VM bit. The transition out of Virtual 8086 Mode to Protected Mode occurs only on receipt of an interrupt or exception. In Virtual 8086 Mode, all interrupts and exceptions vector through the Protected Mode IDT, and enter an interrupt handler in Protected Mode. As part of the interrupt processing the VM bit is cleared. Because the matching IRET must occur from Level 0. Interrupt or Trap Gates used to field an interrupt or exception out of Virtual 8086 Mode must perform an inter-level interrupt only to Level 0. Interrupt or Trap Gates through conforming segments, or through segments with DPL > 0, will raise a GP fault with the CS selector as the error code. #### Task Switches To/From Virtual 8086 Mode Tasks which can execute in Virtual 8086 Mode must be described by a TSS with the Am386SX/SXL CPU format (Type 9 or 11 descriptor). A task switch out of Virtual 8086 Mode will operate exactly the same as any other task switch out of a task with an Am386SX/SXL CPU TSS. All of the programmer visible state, including the EFLAGS register with the VM bit set to 1, is stored in the TSS. The segment registers in the TSS will contain 8086 segment base values rather than selectors. A task switch into a task described by an Am386SX/SXL CPU TSS will have an additional check to determine if the incoming task should be resumed in Virtual 8086 Mode. Tasks described by 80286 format TSSs cannot be resumed in Virtual 8086 Mode, so no check is required there (the FLAGS image in 80286 format TSS has only the low-order 16 FLAGS bits). Before loading the segment register images from an Am386SX/SXL CPU TSS, the FLAGS image is loaded, so that the segment registers are loaded from the TSS image as 8086 segment base values. The task is now ready to resume in Virtual 8086 Mode. ## Transitions Through Trap and Interrupt Gates, and IRET A task switch is one way to enter or exit Virtual 8086 Mode. The other method is to exit through a Trap or Interrupt gate, as part of handling an interrupt, and to enter as part of executing an IRET instruction. The transition out must use an Am386SX/SXL CPU Trap Gate (Type 14), or Am386SX/SXL CPU interrupt Gate (Type 15), which must point to a non-conforming Level 0 segment (DPL = 0) in order to permit the trap handler to IRET back to the Virtual 8086 program. The Gate must point to a non-conforming Level 0 segment to perform a level switch to Level 0 so that the matching IRET can change the VM bit. Am386SX/SXL CPU gates must be used since 80286 gates save only the lower 16 bits of the EFLAGS register (the VM bit will not be saved). Also. the 16-bit IRET used to terminate the 80286 interrupt handler will pop only the lower 16 bits from FLAGS, and will not affect the VM bit. The action taken for an Am386SX/SXL CPU Trap or Interrupt gate, if an interrupt occurs while the task is executing in Virtual 8086 Mode, is given by the following sequence: - Save the FLAGS register in a temp to push later. Turn off the VM. TF. and IF bits. - Interrupt and Trap gates must perform a level switch from 3 (where the Virtual 8086 Mode program executes) to 0 (so IRET can return). - Push the 8086 segment register values onto the new stack, in this order: GS, FS, DS, and ES. These are pushed as 32-bit quantities. Then load these 4 registers with null selectors (0). - 4. Push the old 8086 stack pointer onto the new stack by pushing the SS register (as 32 bits), then pushing the 32-bit ESP register saved above. - 5. Push the 32-bit EFLAGS register saved in step 1. - Push the old 8086 instruction onto the new stack by pushing the CS register (as 32 bits), then pushing the 32-bit EIP register. - Load up the new CS:EIP value from the interrupt gate, and begin execution of the interrupt routine in protected mode. The transition out of Virtual 8086 Mode performs a level change and stack switch, in addition to changing back to protected mode. Also, all of the 8086 segment register images are stored on the stack (behind the SS:ESP image), and then loaded with null (0) selectors before entering the interrupt handler. This will permit the handler to safely save and restore the DS, ES, FS, and GS registers as 80286 selectors. This is needed so that interrupt handlers, which do not care about the mode of the interrupted program, can use the same prologue and epilogue code for state saving, regardless of whether or not a native mode or Virtual 8086 Mode program was interrupted. Restoring null selectors to these registers before executing the IRET will cause a trap in the interrupt handler. Interrupt routines which expect or return values in the segment registers will have to obtain/return values from the 8086 register images pushed onto the new stack. They will need to know the mode of the interrupted program in order to know where to find/return segment registers, and also to know how to interpret segment register values. The IRET instruction will perform the inverse of the above sequence. Only the extended IRET instruction (operand size = 32) can be used, and must be executed at Level 0, to change the VM bit to 1. - If the NT bit in the FLAGS register is On, an intertask return is performed. The current state is stored in the current TSS, and the link field in the current TSS is used to locate the TSS for the interrupted task which is to be resumed. Otherwise, continue with the following sequence. - Read the FLAGS image from SS:8[ESP] into the FLAGS register. This will set VM to the value active in the interrupted routine. - Pop off the instruction pointer CS:EIP. EIP is popped first, then a 32-bit word is popped which contains the CS value in the lower 16 bits. If VM = 0, this CS load is done as a protected mode segment load. If VM = 1, this will be done as an 8086 segment load. - 4. Increment the ESP register by 4 to bypass the FLAGS image which was popped in step 1. - If VM = 1, load segment registers ES, DS, FS, and GS from memory locations SS:[ESP+8], SS:[ESP+ 12], SS:[ESP+16], and SS:[ESP=20], respectively, where the new value of ESP stored in step 4 is used. Since VM = 1, these are done as 8086 segment register loads. - Else if VM=0, check that the selectors in ES, DS, FS, and GS are valid in the interrupted routines. Null out invalid selectors to trap, if an attempt is made to access through them. - If RPL (CS) > CPL, pop the stack pointer SS:ESP from the stack. The ESP register is popped first, followed by 32 bits containing SS in the lower 16 bits. If VM = 0, SS is loaded as a protected mode segment register load. If VM = 1, an 8086 segment register load is used. - Resume execution of the interrupted routine. The VM bit in the FLAGS register (restored from the interrupt routine's stack image in step 1) determines whether the processor resumes the interrupted routine in Protected Mode or Virtual 8086 Mode. #### **FUNCTIONAL DATA** The Am386SX/SXL microprocessor features a straightforward functional interface to the external hardware. The Am386SX/SXL CPU has separate parallel buses for data and address. The data bus is 16-bits in width, and bidirectional. The address bus outputs 24-bit address values using 23 address lines and two Byte Enable signals. The Am386SX/SXL microprocessor has two selectable address bus cycles: address pipelined and non-address pipelined. The address pipelining option allows as much time as possible for data access by starting the pending bus cycle before the present bus cycle is finished. A non-pipelined bus cycle gives the highest bus performance by executing every bus cycle in two processor CLK cycles. For maximum design flexibility, the address pipelining option is selectable on a cycle-by-cycle basis. The processor's bus cycle is the basic mechanism for information transfer, either from system to processor, or from processor to system. The Am386SX/SXL microprocessor bus cycles perform data transfer in a minimum of only two clock periods. The maximum transfer band-width at 16 MHz is therefore 16 Mb/s. However, any bus cycle will be extended for more than two clock periods if external hardware withholds acknowledgment of the cycle. The Am386SX/SXL microprocessor can relinquish control of its local buses to allow mastership by other devices, such as Direct Memory Access (DMA) channels. When relinquished, HLDA is the only output pin driven by the Am386SX/SXL microprocessor, providing near complete isolation of the processor from its system (all other output pins are in a float condition). ### **Signal Description Overview** Below is a brief description of the Am386SX/SXL microprocessor input and output signals arranged by functional groups. Example signal: M/IO—High voltage indicates memory selected ---Low voltage indicates I/O selected The signal descriptions sometimes refer to Switching timing parameters, such as t25 Reset Setup Time and t26 Reset Hold Time. The values of these parameters can be found in the Switching Characteristics table. #### Clock (CLK2) CLK2 provides the fundamental timing for the Am386SX/SXL microprocessor. It is divided by two internally to generate the internal processor clock used for instruction execution. The internal clock is comprised of two phases, phase one and phase two. Each CLK2 period is a phase of the internal clock. Figure 23 illustrates the relationship. If desired, the phase of the internal processor clock can be synchronized to a known phase by ensuring the falling edge of the RESET signal meets the applicable setup and hold times, t25 and t26. Figure 22. Functional Signal Groups 15022B-011 Figure 23. CLK2 Signal and Internal Processor Clock #### Data Bus (D15-D0) These three-state, bidirectional signals provide the general purpose data path between the Am386SX/SXL microprocessor and other devices. The data bus outputs are active High and will float during Bus Hold Acknowledge. Data bus reads require that read-data setup and hold times (t21 and t22) be met relative to CLK2 for correct operation. #### Address Bus (A23-A1, BHE, BLE) These three-state outputs provide physical memory addresses or I/O port addresses. A23–A16 are Low during I/O transfers, except for I/O transfers automatically generated by coprocessor instructions. During coprocessor I/O transfers, A22–A16 are driven Low and A23 is driven High, so that this address line can be used by external logic to generate the coprocessor select signal. Thus, the I/O address driven by the Am386SX/SXL microprocessor for coprocessor commands is 8000FBH, the I/O addresses driven by the Am386SX/SXL CPU for coprocessor data are 8000FCH or 8000FEH or cycles to a 387SX math coprocessor. The address bus is capable of addressing 16 Mb of physical memory space (000000H through FFFFFFH), and 64 Kb of I/O address space (000000H through 00FFFFH) for programmed I/O. The address bus is active High and will float during Bus Hold Acknowledge. The Byte Enable outputs, BHE and BLE, directly indicate which bytes of the 16-bit data bus are involved with the current transfer. BHE applies to D15–D8 and BLE applies to D7–D0. If both BHE and BLE are asserted, then 16 bits of data are being transferred. See Table 13 for a complete decoding of these signals. The Byte Enables are active Low and will float during Bus Hold Acknowledge. # Bus Cycle Definition Signals (W/ $\overline{R}$ , D/ $\overline{C}$ , M/ $\overline{IO}$ , LOCK) These three-state outputs define the type of bus cycle being performed: W/R̄ distinguishes between write and read cycles; D/C̄ distinguishes between data and control cycles; M/IŌ distinguishes between memory and I/O cycles; and, LOCK̄ distinguishes between locked and unlocked bus cycles. All of these signals are active Low and will float during Bus Acknowledge. The primary bus cycle definition signals are $W/\overline{R}$ , $D/\overline{C}$ , and $M/\overline{IO}$ , since these are the signals driven valid as $\overline{ADS}$ (Address Status output) becomes active. The $\overline{LOCK}$ is driven valid at the same time the bus cycle begins, which, due to address pipelining, could be after $\overline{ADS}$ becomes active. Exact bus cycle definitions, as a function of $W/\overline{R}$ , $D/\overline{C}$ , and $M/\overline{IO}$ , are given in Table 14. **Table 13. Byte Enable Definitions** | BHE | BLE | Function | |-----|-----|-----------------------------------------------------| | 0 | 0 | Word Transfer | | 0 | 1 | Byte transfer on upper byte of the data bus, D15–D8 | | 1 | 0 | Byte transfer on lower byte of the data bus, D7–D0 | | 1 | 1 | Never occurs | LOCK indicates that other system bus masters are not to gain control of the system bus while it is active. LOCK is activated on the CLK2 edge that begins the first locked bus cycle (i.e., it is not active at the same time as the other bus cycle definition pins) and is deactivated when READY is returned at the end of the last bus cycle M/IO $D/\overline{C}$ W/R **Bus Cycle Type** Locked? O 0 O Interrupt Acknowledge Yes ٥ 0 1 Does not occur 0 1 O I/O Data Read No 0 1 1 I/O Data Write Nο 1 0 0 Memory Code Read No 1 0 1 Halt: Shutdown: No Address = 2Address = 0 $\overline{BHE} = 1$ $\overline{BHE} = 1$ $\overline{BIF} = 0$ $\overline{BLE} = 0$ 1 1 0 Memory Data Read Some Cycles Memory Data Write Some Cycles **Table 14. Bus Cycle Definition** which is to be locked. The beginning of a bus cycle is determined when READY is returned in a previous bus cycle and another is pending (ADS is active), or the clock in which ADS is driven active if the bus was idle. This means that it follows more closely with the write data rules when it is valid, but may cause the bus to be locked longer than desired. The LOCK signal may be explicitly activated by the LOCK prefix on certain instructions **LOCK** is always asserted when executing the XCHG instruction, during descriptor updates, and during the interrupt acknowledge sequence. #### Bus Control Signals (ADS, READY, NA) The following signals allow the processor to indicate when a bus cycle has begun, and allow other system hardware to control address pipelining and bus cycle termination. #### Address Status (ADS) This three-state output indicates that a valid bus cycle definition and address (W/R, D/C, M/IO, BHE, BLE, and A23-A1) are being driven at the Am386SX/SXL microprocessor pins. ADS is an active Low output. Once ADS is driven active, valid address, Byte Enables, and definition signals will not change. In addition, ADS will remain active until its associated bus cycle begins (when READY is returned for the previous bus cycle when running pipelined bus cycles). When address pipelining is utilized, maximum throughput is achieved by initiating bus cycles when ADS and READY are active in the same clock cycle. ADS will float during Bus Hold Acknowledge. See sections Non-Pipelined Address and Pipelined Address for additional information on how ADS is asserted for different bus states. #### Transfer Acknowledge (READY) This input indicates the current bus cycle is complete. and the active bytes indicated by BHE and BLE are accepted or provided. When READY is sampled active during a read cycle or interrupt acknowledge cycle, the Am386SX/SXL microprocessor latches the input data and terminates the cycle. When READY is sampled active during a write cycle, the processor terminates the bus cycle. READY is ignored on the first bus state of all bus cycles. and sampled each bus state thereafter until asserted. READY must eventually be asserted to acknowledge every bus cycle, including Halt Indication and Shutdown Indication bus cycles. When being sampled, READY must always meet setup and hold times (t19 and t20) for correct operation. ### Next Address Request (NA) This is used to request address pipelining. This input indicates the system is prepared to accept new values of BHE, BLE, A23-A1, W/R, D/C, and M/IO from the Am386SX/SXL CPU even if the end of the current cycle is not being acknowledged on READY. If this input is active when sampled, the next address is driven onto the bus, provided the next bus request is already pending internally. NA is ignored in CLK cycles in which ADS or READY is activated. This signal is active Low and must satisfy setup and hold times (t15 and t16) for correct operation. See sections Read and Write Cycles and Pipelined Address for additional information. #### **Bus Arbitration Signals (HOLD, HLDA)** This section describes the mechanism by which the processor relinquishes control of its local buses when requested by another bus master device. See section Entering and Exiting Hold Acknowledge for additional information. #### Bus Hold Request (HOLD) This input indicates some device other than the Am386SX/SXL microprocessor requires bus mastership. When control is granted, the Am386SX/SXL CPU floats A23–A1, BHE, BLE, D15–D0, LOCK, M/IO, D/C, W/R, and ADS, and then activates HLDA, thus entering the Bus Hold Acknowledge state. The local bus will remain granted to the requesting master until HOLD becomes inactive. When HOLD becomes inactive, the Am386SX/SXL microprocessor will deactivate HLDA and drive the local bus (at the same time), thus terminating the Hold Acknowledge condition. HOLD must remain asserted as long as any other device is a local bus master. External pull-up resistors may be required when in the Hold Acknowledge (HLDA) state, since none of the Am386SX/SXL microprocessor floated outputs have internal pull-up resistors. See section Resistor Recommendations for additional information. HOLD is not recognized while RESET is active. If RESET is asserted while HOLD is asserted, RESET has priority and places the bus into an idle state, rather than the Hold Acknowledge (high impedance) state. HOLD is a level-sensitive, active High, synchronous input. HOLD signals must always meet setup and hold times (t23 and t24) for correct operation. #### Bus Hold Acknowledge (HLDA) When active (High), this output indicates the Am386SX/SXL microprocessor has relinquished control of its local bus in response to an asserted HOLD signal, and is in the Bus Hold Acknowledge state. The Bus Hold Acknowledge state offers near complete signal isolation. In the HLDA state is the only signal being driven by the Am386SX/SXL microprocessor. The other output or bidirectional signals (D15–D0, BHE, BLE, A23–A1, W/R, D/C, M/IO, LOCK, and ADS) are in a high-impedance state so the requesting bus master may control them. These pins remain Off throughout the time that HLDA remains active (see Table 15). Pull-up resistors may be desired on several signals to avoid spurious activity when no bus master is driving them. See section Resistor Recommendations for additional information. When the HOLD signal is made inactive, the Am386SX/SXL microprocessor will deactivate HLDA and drive the bus. One rising edge on the NMI input is remembered for processing after the HOLD input is negated. **Table 15. Output Pin State During HOLD** | Pin Value | Pin Names | |-----------|--------------------------------------------------------| | 1 | HLDA | | Float | LOCK, M/IO, D/C, W/R, ADS,<br>A23-A1, BHE, BLE, D15-D0 | In addition to the normal usage of Hold Acknowledge with DMA controllers or master peripherals, the near complete isolation has particular attractiveness during system test when test equipment drives the system, and in hardware fault-tolerant applications. #### **HOLD Latencies** The maximum possible HOLD latency depends on the software being executed. The actual HOLD latency at any time depends on the current bus activity, the state of the LOCK signal (internal to the CPU) activated by the LOCK prefix, and interrupts. The Am386SX/SXL microprocessor will not honor a HOLD request until the current bus operation is complete. The Am386SX/SXL microprocessor breaks 32-bit data or I/O accesses into 2 internally locked 16-bit bus cycles; the LOCK signal is not asserted. The Am386SX/SXL microprocessor breaks unaligned 16-bit or 32-bit data or I/O accesses into 2 or 3 internally locked 16-bit bus cycles. Again, the LOCK signal is not asserted but a HOLD request will not be recognized until the end of the entire transfer. Wait states affect HOLD latency. The Am386SX/SXL microprocessor will not honor a HOLD request until the end of the current bus operation, no matter how many wait states are required. Systems with DMA where data transfer is critical must insure that READY returns promptly. ## Coprocessor Interface Signals (PEREQ, $\overline{\text{BUSY}}$ , $\overline{\text{ERROR}}$ ) In the following sections are descriptions of signals dedicated to the math coprocessor interface. In addition to the data bus, address bus, and bus cycle definition signals, the following signals control communication between the Am386SX/SXL microprocessor and its 387SX math coprocessor extension. #### Coprocessor Request (PEREQ) When asserted (High), this input signal indicates a coprocessor request for a data operand to be transferred to/from memory by the Am386SX/SXL Microprocessor. In response, the Am386SX/SXL microprocessor transfers information between the math coprocessor and memory. Because the Am386SX/SXL CPU has internally stored the math coprocessor opcode being executed, it performs the requested data transfer with the correct direction and memory address. PEREQ is level-sensitive active High asynchronous signal. Setup and hold times (t29 and t30) relative to the CLK2 signal must be met to guarantee recognition at a particular clock edge. This signal is provided with a weak internal pull-down resistor of around 20 Kohms to Ground so that it will not float active when left unconnected. #### Coprocessor Busy (BUSY) When asserted Low, this input indicates that the coprocessor is still executing an instruction, and is not yet able to accept another. When the Am386SX/SXL microprocessor encounters any coprocessor instruction which operates on the numerics stack (e.g., load, pop, or arithmetic operation), or the WAIT instruction, this input is first automatically sampled until it is seen to be inactive. This sampling of the BUSY input prevents overrunning the execution of a previous coprocessor instruction. The FNINIT, FNSTENV, FNSAVE, FNSTSW, FNSTCW, and FNCLEX coprocessor instructions are allowed to execute even if $\overline{\text{BUSY}}$ is active, since these instructions are used for coprocessor initialization and exception-clearing. $\overline{\text{BUSY}}$ is an active Low, level-sensitive, asynchronous signal. Setup and hold times (t29 and t30), relative to the CLK2 signal, must be met to guarantee recognition at a particular clock edge. This pin is provided with a weak internal pull-up resistor of around 20 Kohms to $V_{cc}$ so that it will not float active when left unconnected. BUSY serves an additional function. If BUSY is sampled Low at the falling edge of RESET, the Am386SX/SXL microprocessor performs an internal self-test (see section Bus Activity During and Following Reset). If BUSY is sampled High, no self-test is performed. #### Coprocessor Error (ERROR) When asserted Low, this input signal indicates that the previous coprocessor instruction generated a coprocessor error of a type not masked by the coprocessor's control register. This input is automatically sampled by the Am386SX/SXL microprocessor when a coprocessor instruction is encountered, and if active, the Am386SX/SXL CPU generates Exception 16 to access the error-handling software. Several coprocessor instructions, generally those which clear the numeric error flags in the coprocessor or save coprocessor state, do execute without the Am386SX/SXL CPU generating Exception 16 even if ERROR is active. These instructions are FNINIT, FNCLEX, FNSTSW, FNSTSWAX, FNSTCW, FNSTENV, and FNSAVE ERROR is an active Low, level-sensitive, asynchronous signal. Setup and hold times (t29 and t30), relative to the CLK2 signal, must be met to guarantee recognition at a particular clock edge. This pin is provided with a weak internal pull-up resistor of around 20 Kohms to $V_{\infty}$ so that it will not float active when left unconnected. #### Interrupt Signals (INTR, NMI, RESET) The following descriptions cover inputs that can interrupt or suspend execution of the processor's current instruction stream. #### Maskable Interrupt Request (INTR) When asserted, this input indicates a request for interrupt service, which can be masked by the Am386SX/SXL microprocessor Flag Register IF bit. When the Am386SX/SXL CPU responds to the INTR input, it performs two interrupt acknowledge bus cycles and, at the end of the second, latches an 8-bit interrupt vector on D7–D0 to identify the source of the interrupt. INTR is an active High, level-sensitive, asynchronous signal. Setup and hold times (t27 and t28), relative to the CLK2 signal, must be met to guarantee recognition at a particular clock edge. To assure recognition of an INTR request, INTR should remain active until the first interrupt acknowledge bus cycle begins. INTR is sampled at the beginning of every instruction in the Am386SX/SXL microprocessor's Execution Unit. In order to be recognized at a particular instruction boundary, INTR must be active at least eight CLK2 clock periods before the beginning of the instruction. If recognized, the Am386SX/SXL CPU will begin execution of the interrupt. #### Non-Maskable Interrupt Request (NMI) This input indicates a request for interrupt service which cannot be masked by software. The non-maskable interrupt request is always processed according to the pointer or gate in slot 2 of the interrupt table. Because of the fixed NMI slot assignment, no interrupt acknowledge cycles are performed when processing NMI. NMI is an active High, rising edge-sensitive, asynchronous signal. Setup and hold times (t27 and t28), relative to the CLK2 signal, must be met to guarantee recognition at a particular clock edge. To assure recognition of NMI, it must be inactive for at least eight CLK2 periods, and then be active for at least eight CLK2 periods before the beginning of the instruction boundary in the Am386SX/SXL microprocessor's Execution Unit. Once NMI processing has begun, no additional NMI's are processed until after the next IRET instruction, which is typically the end of the NMI service routine. If NMI is re-asserted prior to that time, however, one rising edge on NMI will be remembered for processing after executing the next IRET instruction. #### Interrupt Latency The time that elapses before an interrupt request is serviced (interrupt latency) varies according to several factors. This delay must be taken into account by the interrupt source. Any of the following factors can affect interrupt latency: - If interrupts are masked, an INTR request will not be recognized until interrupts are re-enabled. - If an NMI is currently being serviced, an incoming NMI request will not be recognized until the Am386SX/SXL microprocessor encounters the IRET instruction. - An interrupt request is recognized only on an instruction boundary of the Am386SX/SXL microprocessor's Execution Unit except for the following cases: - Repeat string instructions can be interrupted after each iteration. - If the instruction loads the Stack Segment register, an interrupt is not processed until after the following instruction, which should be an ESP. This allows the entire stack pointer to be loaded without interruption. - If an instruction sets the interrupt flag (enabling interrupts), an interrupt is not processed until after the next instruction. The longest latency occurs when the interrupt request arrives while the Am386SX/SXL microprocessor is executing a long instruction such as multiplication, division, or a task switch in the Protected Mode. - 4. Saving the Flags register and CS:EIP registers. - 5. If interrupt service routine requires a task switch. time must be allowed for the task switch. - 6. If the interrupt service routine saves registers that are not automatically saved by the Am386SX/ SXL microprocessor. #### Reset This input signal suspends any operation in progress and places the Am386SX/SXL microprocessor in a known reset state. The Am386SX/SXL CPU is reset by asserting RESET for 15 or more CLK2 periods (80 or more CLK2 periods before requesting self-test). When RESET is active, all other input pins, except FLT, are ignored, and all other bus pins are driven to an Idle Bus state, as shown in Table 16. If RESET and HOLD are both active at a point in time. RESET takes priority even if the Am386SX/SXL microprocessor was in a Hold Acknowledge state prior to RESET active. Reset is an active High, level-sensitive, synchronous signal. Setup and hold times (t25 and t26) must be met in order to assure proper operation of the Am386SX/ SXL microprocessor. #### **Bus Transfer Mechanism** All data transfers occur as a result of one or more bus cycles. Logical data operands of byte and word lengths may be transferred without restrictions on physical address alignment. Any byte boundary may be used, although two physical bus cycles are performed as required for unaligned operand transfers. The Am386SX/SXL microprocessor address signals are designed to simplify external system hardware. Higher-order address bits are provided by A23-A1. BHE and BLE provide linear selects for the two bytes of the 16-bit data bus. Byte Enable outputs BHE and BLE are asserted when their associated data bus bytes are involved with the present bus cycle, as listed in Table 17. Each bus cycle is composed of at least two bus states. Each bus state requires one processor clock period. Additional bus states added to a single bus cycle are called wait states. See section Bus Functional Description. Table 16. Pin State (Bus Idle) During Reset | Pin Name | Signal Level During Reset | |------------------|---------------------------| | ADS | 1 | | D15-D0 | Float | | BHE, BLE | 0 | | A23-A1 | 1 | | W/R | 0 | | D/C | 1 | | M/ <del>IO</del> | 0 | | LOCK | 1 | | HLDA | 0 | Table 17. Byte Enables and Associated **Data and Operand Bytes** | Byte Enable Signal | Associated Data Bus Signals | |--------------------|----------------------------------| | BLE | D7-D0 (Byte 0-least significant) | | BHE | D15-D8 (Byte 1-most significant) | ### Memory and I/O Spaces Bus cycles may access physical memory space or I/O space. Peripheral devices in the system may either be memory-mapped, I/O-mapped, or both. As shown in Figure 24, physical memory addresses range from 000000H to 0FFFFFFH (16 Mb) and I/O addresses from 000000H to 00FFFFH (64 Kb). Note the I/O addresses used by the automatic I/O cycles for coprocessor communication are 8000F8H to 8000FFH, beyond the address range of programmed I/O, to allow easy generation of a coprocessor chip select signal using the A23 and M/IO signals. #### **Bus Functional Description** The Am386SX/SXL microprocessor has separate, parallel buses for data and address. The data bus is 16-bits in width, and bidirectional. The address bus provides a 24-bit value using 23 signals for the 23 upper-order address bits and 23 Byte Enable signals to directly indicate the active bytes. These buses are interpreted and controlled by several definition signals. Note: Since A23 is High during automatic communication with coprocessor, A23 High and M/IO Low can be used to easily generate a coprocessor select signal. 15022B-012 Figure 24. Physical Memory and I/O Spaces Fastest non-pipelined bus cycles consist of T1 and T2 Figure 25. Fastest Read Cycles with Non-Pipelined Address Timing The definition of each bus cycle is given by three signals: $M/\overline{O}$ , $W/\overline{R}$ , and $D/\overline{C}$ . At the same time, a valid address is present on the Byte Enable signals, $\overline{BHE}$ and $\overline{BLE}$ , and the other address signals, A23–A1. A status signal, $\overline{ADS}$ , indicates when the Am386SX/SXL microprocessor issues a new bus cycle definition and address. Collectively, the address bus, data bus, and all associated control signals are referred to simply as the bus. When active, the bus performs one of the bus cycles below: - 1. Read from memory space; - 2. Locked read from memory space; - 3. Write to memory space: - 4. Locked write to memory space; - 5. Read from I/O space (or math coprocessor); - 6. Write to I/O space (or math coprocessor); - 7. Interrupt acknowledge (always locked); - 8. Indicate halt, or indicate shutdown. Table 14 shows the encoding of the bus cycle definition signals for each bus cycle. See section Bus Cycle Definition Signals for additional information. When the Am386SX/SXL microprocessor bus is not per-forming one of the activities listed above, it is either idle or in the Hold Acknowledge state, which may be detected externally. The idle state can be identified by the Am386SX/SXL CPU giving no further assertions on its address strobe output (ADS) since the beginning of its most recent cycle, and the most recent bus cycle having been terminated. The Hold Acknowledge state is identified by the Am386SX/SXL microprocessor asserting its Hold Acknowledge (HLDA) output. ## **Bus Functional Description** The Am386SX/SXL microprocessor has separate, parallel buses for data and address. The data bus is 16-bits in width, and bidirectional. The address bus provides a 24-bit value using 23 signals for the 23 upper-order address bits and 2 Byte Enable signals to directly indicate the active bytes. These buses are interpreted and controlled by several definition signals. Fastest pipelined bus cycles consist of T1P and T2P 15022B-014 Figure 26. Fastest Read Cycles with Pipelined Address Timing The shortest time unit of bus activity is a bus state. A bus state is one processor clock period (two CLK2 periods) in duration. A complete data transfer occurs during a bus cycle, composed of two or more bus states. The fastest Am386SX/SXL microprocessor bus cycle requires only two bus states. For example, three consecutive bus read cycles, each consisting of two bus states, are shown in Figure 25. The bus states in each cycle are named T1 and T2. Any memory or I/O address may be accessed by such a two-state bus cycle, if the external hardware is fast enough. Every bus cycle continues until it is acknowledged by the external system hardware, using the Am386SX/SXL microprocessor READY input. Acknowledging the bus cycle at the end of the first T2 results in the shortest bus cycle, requiring only T1 and T2. If READY is not immediately asserted however, T2 states are repeated indefinitely until the READY input is sampled active. The address pipelining option provides a choice of bus cycle timings. Pipelined or non-pipelined address timing is selectable on a cycle-by-cycle basis with the Next Address $(\overline{NA})$ input. When address pipelining is selected, the address ( $\overline{BHE}$ , $\overline{BLE}$ , and A23–A1) and definition ( $W/\overline{R}$ , $D/\overline{C}$ , $M/\overline{IO}$ , and $\overline{LOCK}$ ) of the next cycle are available before the end of the current cycle. To signal their availability, the Am386SX/SXL microprocessor address status output ( $\overline{ADS}$ ) is asserted. Figure 26 illustrates the fastest read cycles with pipelined address timing. Note from Figure 26 the fastest bus cycles using pipelined address require only two bus states, named T1P and T2P. Therefore, cycles with pipelined address timing allow the same data bandwidth as non-pipelined cycles, but address-to-data access time is increased by one T-state time compared to that of a non-pipelined cycle. Note: Idle states are shown here for diagram variety only. Write cycles are not always followed by an idle state; an active bus cycle can immediately follow the write cycle. Figure 27. Various Bus Cycles with Non-Pipelined Address (Zero Wait States) #### **Read and Write Cycles** Data transfers occur as a result of bus cycles, classified as read or write cycles. During read cycles, data is transferred from an external device to the processor. During write cycles, data is transferred from the processor to an external device. Two choices of address timing are dynamically selectable: non-pipelined or pipelined. After an idle bus state, the processor always uses non-pipelined address timing. However, the $\overline{\text{NA}}$ (Next Address) input may be asserted to select pipelined address timing for the next bus cycle. When pipelining is selected and the Am386SX/SXL microprocessor has a bus request pending internally, the address and definition of the next cycle is made available even before the current bus cycle is acknowledged by $\overline{\text{READY}}$ . Terminating a read or write cycle, like any bus cycle, requires acknowledging the cycle by asserting the READY input. Until acknowledged, the processor inserts wait states into the bus cycle, to allow adjustment for the speed of any external device. External hardware, which has decoded the address and bus cycle type, asserts the READY input at the appropriate time. At the end of the second bus state within the bus cycle, READY is sampled. At that time, if external hardware acknowledges the bus cycle by asserting READY, the bus cycle terminates as shown in Figure 27. If READY is negated, as in Figure 28, the Am386SX/SXL microprocessor executes another bus state (a wait state) and READY is sampled again at the end of that state. This continues indefinitely until the cycle is acknowledged by READY asserted. When the current cycle is acknowledged, the Am386SX/SXL microprocessor terminates it. When a read cycle is acknowledged, the Am386SX/SXL CPU latches the information present at its data pins. When a write cycle is acknowledged, the Am386SX/SXL microprocessor's write data remains valid throughout phase one of the next bus state, to provide write data hold time. Note: Idle states are shown here for diagram variety only. Write cycles are not always followed by an idle state; an active bus cycle can immediately follow the write cycle. Figure 28. Various Bus Cycles with Non-Pipelined Address (Various Number of Walt States) #### Non-Pipelined Address Any bus cycle may be performed with non-pipelined address timing. For example, Figure 27 shows a mixture of read and write cycles with non-pipelined address timing. Figure 27 shows that the fastest possible cycles with non-pipelined address have two bus states per bus cycle. The states are named T1 and T2. In phase one of T1, the address signals and bus cycle definition signals are driven valid and, to signal their availability, address strobe (ADS) is simultaneously asserted. During read or write cycles the data bus behaves as follows. If the cycle is a read, the Am386SX/SXL microprocessor floats its data signal to allow driving by the external device being addressed. The Am386SX/SXL microprocessor requires that all data bus pins be at a valid logic state (High or Low) at the end of each read cycle, when READY is asserted. The system must be designed to meet this requirement. If the cycle is a write, data signals are driven by the Am386SX/SXL CPU beginning in phase two of T1 until phase one of the bus state following cycle acknowledgment. Figure 28 illustrates non-pipelined bus cycles with one wait state added to Cycles 2 and 3. READY is sampled inactive at the end of the first T2 in Cycles 2 and 3. Therefore, Cycles 2 and 3 have T2 repeated again. At the end of the second T2, READY is sampled active. When address pipelining is not used, the address and bus cycle definition remain valid during all wait states. When wait states are added, and it is desirable to maintain non-pipelined address timing, it is necessary to negate NA during each T2 state, except the last one, as shown in Figure 28, Cycles 2 and 3. If NA is sampled active during a T2 other than the last one, the next state would be T2I or T2P instead of another T2. The bus states and transitions, when address pipelining is not used, are completely illustrated by Figure 29. The bus transitions between four possible states, T1, T2, Ti, and Th. Bus cycles consist of T1 and T2, with T2 being repeated for wait states. Otherwise the bus may be idle, Ti, or in the Hold Acknowledge state Th. **Bus States**: T1 — First clock of a non-pipelined bus cycle (Am386SX/SXL CPU drives new address and asserts ADS). T2—Subsequent clocks of a bus cycle when $\overline{\text{NA}}$ has not been sampled asserted in the current bus cycle. Ti — Idle state. Th- Hold Acknowledge state (Am386SX/SXL CPU asserts HLDA). The fastest bus cycle consists of two states: T1 and T2. Four basic bus states describe bus operation when not using pipelined address. Figure 29. Bus States (Not Using Pipelined Address) Bus cycles always begin with T1. T1 always leads to T2. If a bus cycle is not acknowledged during T2 and $\overline{\text{NA}}$ is inactive, T2 is repeated. When a cycle is acknowledged during T2, the following state will be T1 of the next bus cycle, if a bus request is pending internally, or Ti, if there is no bus request pending, or Th, if the HOLD input is being asserted. Use of pipelined address allows the Am386SX/SXL microprocessor to enter three additional bus states not shown in Figure 29. Figure 33 is the complete bus state diagram, including pipelined address cycles. #### Pipelined Address Address pipelining is the option of requesting the address and the bus cycle definition of the next internally pending bus cycle before the current bus cycle is acknowledged with READY asserted. ADS is asserted by the Am386SX/SXL microprocessor when the next address is issued. The address pipelining option is controlled on a cycle-by-cycle basis with the NA input signal. Once a bus cycle is in progress and the current address has been valid for at least one entire bus state, the $\overline{\text{NA}}$ input is sampled at the end of every phase one until the bus cycle is acknowledged. During non-pipelined bus cycles, $\overline{\text{NA}}$ is sampled at the end of phase one in every T2. An example is Cycle 2 in Figure 30, during which $\overline{\text{NA}}$ is sampled at the end of phase one of every T2 (it was asserted once during the first T2 and has no further effect during that bus cycle). Note: Following any idle bus state (Ti), addresses are non-pipelined. Within non-pipelined bus cycles, NA is only sampled during wait states. Therefore, to begin address pipelining during a group of non-pipelined bus cycles requires a non-pipelined cycle with at least one wait state (Cycle 2 above). Figure 30. Transitioning to Pipelined Address During Burst of Bus Cycles If $\overline{\text{NA}}$ is sampled active, the Am386SX/SXL microprocessor is free to drive the address and bus cycle definition of the next bus cycle, and assert $\overline{\text{ADS}}$ , as soon as it has a bus request internally pending. It may drive the next address as early as the next bus state, whether the current bus cycle is acknowledged at that time or not. Regarding the details of address pipelining, the Am386SX/SXL CPU has the following characteristics: - The next address may appear as early as the bus state after NA was sampled active (see Figures 30 and 31). In that case, state T2P is entered immediately. However, when there is not an internal bus request already pending, the next address will not be available immediately after NA is asserted and T2I is entered instead of T2P (see Figure 32, Cycle 3). - Provided the current bus cycle is not yet acknow-ledged by READY asserted, T2P will be entered as soon as the Am386SX/SXL microprocessor does drive the next address. External hardware should therefore observe the ADS output as confirmation the next address is actually being driven on the bus. - Any address which is validated by a pulse on the ADS output will remain stable on the address pins for at least two processor clock periods. The Am386SX/SXL CPU cannot produce a new address more frequently than every two processor clock periods (see Figures 30, 31, and 32). - Only the address and bus cycle definition of the very next bus cycle is available. The pipelining capability cannot look further than one bus cycle ahead (see Figure 32, Cycle 1). Note: Following any idle bus state (Ti) the address is always non-pipelined and NA is only sampled during wait states. To start address pipelining after an idle state requires a non-pipelined cycle with at least one wait state (Cycle 1 above). The pipelined cycles (2, 3, and 4 above) are shown with various numbers of wait states. Figure 31. Fastest Transition to Pipelined Address Following Idle Bus State The complete bus state transition diagram, including operation with pipelined address, is given in Figure 33. Note that it is a superset of the diagram for non-pipelined address only, and the three additional bus states for pipelined address are drawn in bold. The fastest bus cycle with pipelined address consists of just two bus states, T1P and T2P (recall for non-pipelined address it is T1 and T2). T1P is the first bus state of a pipelined cycle. 15022A-020 Figure 32. Details of Address Pipelining During Cycles with Wait States Figure 33. Complete Bus States (Including Pipelined Address) #### Initiating and Maintaining Pipelined Address Using the state diagram Figure 33, observe the transitions from an idle state (Ti) to the beginning of a pipelined bus cycle (T1P). From an idle state (Ti) the first bus cycle must begin with T1, and is therefore a non-pipelined bus cycle. The next bus cycle will be pipelined, however, provided $\overline{\rm NA}$ is asserted and the first bus cycle ends in a T2P state (the address for the next bus cycle is driven during T2P). The fastest path from an idle state to a bus cycle with pipelined address is shown in bold below: #### Ti, Ti, Ti, T1-T2-T2P, T1P-T2P, Idle Non-Pipelined Pipelined States Cycle Cycle T1-T2-T2P are the states of the bus cycle that establish address pipelining for the next bus cycle, which begins with T1P. The same is true after a bus hold state, shown below: Th, Th, Th, T1-T2-T2P, T1P-T2P, Hold Acknowledge Non-Pipelined Pipelined States Cycle Cycle The transition to pipelined address is shown functionally by Figure 31, Cycle 1. Note that Cycle 1 is used to transition into pipelined address timing for the subsequent Cycles 2, 3, and 4, which are pipelined. The NA input is asserted at the appropriate time to select address pipelining for Cycle 2, 3, and 4. Once a bus cycle is in progress and the current address has been valid for one entire bus state, the $\overline{\text{NA}}$ input is sampled at the end of every phase one until the bus cycle is acknowledged. Sampling begins in T2 during Cycle 1 in Figure 31. Once $\overline{\text{NA}}$ is sampled active during the current cycle, the Am386SX/SXL microprocessor is free to drive a new address and bus cycle definition on the bus as early as the next bus state. In Figure 31, Cycle 1 for example, the next address is driven during state T2P. Thus, Cycle 1 makes the transition to pipelined address timing, since it begins with T1 but ends with T2P. Because the address for Cycle 2 is available before Cycle 2 begins, Cycle 2 is called a pipelined bus cycle, and it begins with T1P. Cycle 2 begins as soon as $\overline{\text{READY}}$ asserted terminates Cycle 1. Examples of transition bus cycles are Figure 31, Cycle 1 and Figure 30, Cycle 2. Figure 31 shows transition during the very first cycle after an idle bus state, which is the fastest possible transition into address pipelining. Figure 30, Cycle 2 shows a transition cycle occurring during a burst of bus cycles. In any case, a transition cycle is the same whenever it occurs: it consists at least of T1, T2 (NA is asserted at that time), and T2P (provided the Am386SX/SXL microprocessor has an internal bus request already pending, which it almost always has). T2P states are repeated if wait states are added to the cycle. Note that only three states (T1, T2, and T2P) are required in a bus cycle performing a transition from non-pipelined address into pipelined address timing (e.g., Figure 31, Cycle 1). Figure 31, Cycles 2, 3, and 4 show that address pipelining can be maintained with two-state bus cycles consisting only of T1P and T2P. Once a pipelined bus cycle is in progress, pipelined timing is maintained for the next cycle by asserting $\overline{\text{NA}}$ and detecting that the Am386SX/SXL microprocessor enters T2P during the current bus cycle. The current bus cycle must end in state T2P for pipelining to be maintained in the next cycle. T2P is identified by the assertion of $\overline{\text{ADS}}$ . Figures 30 and 31, however, each show pipelining ending after Cycle 4, because Cycle 4 ends in T2I. This indicates the Am386SX/SXL CPU did not have an internal bus request prior to the acknowledgment of Cycle 4. If a cycle ends with a T2 or T2I, the next cycle will not be pipelined. Realistically, address pipelining is almost always maintained as long as $\overline{\text{NA}}$ is sampled asserted. This is so because in the absence of any other request, a code prefetch request is always internally pending until the instruction decoder and code prefetch queue are completely full. Therefore, address pipelining is maintained for long bursts of bus cycles, if the bus is available (i.e., HOLD inactive), and $\overline{\text{NA}}$ is sampled active in each of the bus cycles. #### Interrupt Acknowledge (INTA) Cycles In response to an interrupt request on the INTR input when interrupts are enabled, the Am386SX/SXL microprocessor performs two interrupt acknowledge cycles. These bus cycles are similar to read cycles in that bus definition signals define the type of bus activity taking place, and each cycle continues until acknowledged by READY sampled active. The state of A2 distinguishes the first and second interrupt acknowledge cycles. The byte address driven during the first interrupt acknowledge cycle is 4 (A23–A3, A1, BLE Low, A2 and BHE High). The byte address driven during the second interrupt acknowledge cycle is 0 (A23–A1, BLE Low, and BHE High). The LOCK output is asserted from the beginning of the first interrupt acknowledge cycle until the end of the second interrupt acknowledge cycle. Four idle bus states (Ti) are inserted by the Am386SX/SXL microprocessor between the two interrupt acknowledge cycles for compatibility with spec TRHRL of the 8259A Interrupt Controller. During both interrupt acknowledge cycles, D15–D0 float. No data is read at the end of the first interrupt acknowledge cycle. At the end of the second interrupt acknowledge cycle, the Am386SX/SXL microprocessor will read an external interrupt vector from D7–D0 of the data bus. The vector indicates the specific interrupt number (from 0–255) requiring service. ## **Halt Indication Cycle** The execution unit halts as a result of executing a HLT instruction. Signaling its entrance into the halt state, a halt indication cycle is performed. The halt indication cycle is identified by the state of the bus definition signals shown on page 39, Bus Cycle Definition Signals, and an address of 2. The halt indication cycle must be acknowledged by READY asserted. A halted Am386SX/SXL CPU resumes execution when INTR (if interrupts are enabled), NMI, or RESET is asserted. #### **Shutdown Indication Cycle** The Am386SX/SXL microprocessor shuts down as a result of a protection fault while attempting to process a double fault. Signaling its entrance into the shutdown state, a shutdown indication cycle is performed. The shutdown indication cycle is identified by the state of the bus definition signals shown in section Bus Cycle Definition Signals and an address of 0. The shutdown indication cycle must be acknowledged by READY asserted. A shut-down Am386SX/SXL microprocessor resumes execution when NMI or RESET is asserted. #### **Entering and Exiting Hold Acknowledge** The Bus Hold Acknowledge state (Th) is entered in response to the HOLD input being asserted. In the Bus Hold Acknowledge state, the Am386SX/SXL microprocessor floats all outputs or bidirectional signals, except for HLDA. HLDA is asserted as long as the Am386SX/SXL CPU remains in the Bus Hold Acknowledge state. In the Bus Hold Acknowledge state, all inputs except HOLD, FLT, and RESET are ignored. Interrupt Vector (0–255) is read on D7–D0 at end of second Interrupt Acknowledge bus cycle. Because each Interrupt Acknowledge bus cycle is followed by idle bus states, asserting NA has no practical effect. Choose the approach which is simplest for your system hardware design. Figure 34. Interrupt Acknowledge Cycles Th may be entered from a bus idle state, as in Figure 37, or after the acknowledgment of the current physical bus cycle, if the $\overline{\text{LOCK}}$ signal is not asserted, as in Figures 38 and 39 Th is exited in response to the HOLD input being negated. The following state will be Ti if no bus request is pending, as in Figure 37. The following bus state will be T1 if a bus request is internally pending, as in Figures 38 and 39. Th is exited in response to RESET being asserted. If a rising edge occurs on the edge-triggered NMI input while in Th, the event is remembered as a nonmaskable interrupt 2 and is serviced when Th is exited, unless the Am386SX/SXL microprocessor is reset before Th is exited. #### **Reset During Hold Acknowledge** RESET being asserted takes priority over HOLD being asserted. If RESET is asserted while HOLD remains asserted, the Am386SX/SXL microprocessor drives its pins to defined states during reset, as in Table 16 (Pin State During Reset), and performs internal reset activity as usual. If HOLD remains asserted when RESET is inactive, the Am386SX/SXL CPU enters the Hold Acknowledge state before performing its first bus cycle, provided HOLD is still asserted when the Am386SX/SXL microprocessor would otherwise perform its first bus cycle. #### **FLOAT** Activating the FLT input floats all Am386SX/SXL microprocessor bidirectional and output signals, including HLDA. Asserting FLT isolates the Am386SX/SXL microprocessor from the surrounding circuitry. As the Am386SX/SXL microprocessor is packaged in a surface mount PQFP, it cannot be removed from the motherboard when In-Circuit Emulation (ICE) is needed. The FLT input allows the Am386SX/SXL CPU to be electrically isolated from the surrounding circuitry. This allows connection of an emulator to the Am386SX/SXL microprocessor PQFP without removing it from the PCB. This method of emulation is referred to as ON-Circuit Emulation (ONCE). Figure 35. Example Halt Indication Cycle from Non-Pipelined Cycle #### **Entering and Exiting FLOAT** FLT is an asynchronous, active Low input. It is recognized on the rising edge of CLK2. When recognized, it aborts the current bus cycle and floats the outputs of the Am386SX/SXL microprocessor (Figure 41). FLT must be held Low for a minimum of 16-CLK2 cycles. Reset should be asserted and held asserted until after FLT is deasserted. This will ensure that the Am386SX/SXL CPU will exit FLOAT in a valid state. Asserting the FLT input unconditionally aborts the current bus cycle and forces the Am386SX/SXL CPU into the FLOAT mode. Since activating FLT unconditionally forces the Am386SX/SXL CPU into FLOAT mode, the Am386SX/SXL microprocessor is not guaranteed to enter FLOAT in a valid state. After deactivating FLT, the Am386SX/SXL CPU is not guaranteed to exit FLOAT mode in a valid state. This is not a problem, as the FLT pin is meant to be used only during ONCE. After exiting FLOAT, the Am386SX/SXL microprocessor must be reset to return it to a valid state. Reset should be asserted before FLT is deasserted. This will ensure that the Am386SX/SXL CPU will exit FLOAT in a valid state. FLT has an internal pull-up resistor, and if it is not used it should be unconnected. #### **Bus Activity During and Following Reset** RESET is the highest priority input signal, capable of interrupting any processor activity when it is asserted. A bus cycle in progress can be aborted at any stage, or idle states and Bus Hold Acknowledge states discontinued, so that the reset state is established. RESET should remain asserted for at least 15-CLK2 periods to ensure it is recognized throughout the Am386SX/SXL microprocessor, and at least 80-CLK2 periods if self-test is going to be requested at the falling edge. RESET asserted pulses less than 15-CLK2 periods may not be recognized. RESET pulses less than 80-CLK2 periods followed by a self-test may cause the self-test to report a failure when no true failure exists. Figure 36. Example Shutdown Indication Cycle from Non-Pipelined Cycle Provided the RESET falling edge meets setup and hold times (t25 and t26), the internal processor clock phase is defined at that time as illustrated by Figure 40 and Figure 48. A self-test may be requested at the time RESET goes inactive by having the BUSY input at a Low level, as shown in Figure 40. The self-test requires approximately (2<sup>20</sup> + 60) CLK2 periods to complete. The self-test duration is not affected by the test results. Even if the self-test indicates a problem, the Am386SX/SXL microprocessor attempts to proceed with the reset sequence afterwards. After the RESET falling edge (and after the self-test if it was requested), the Am386SX/SXL microprocessor performs an internal initialization sequence for approximately 350- to 450-CLK2 periods. #### Self-Test Signature Upon completion of self-test (if self-test was requested by driving BUSY Low at the falling edge of RESET) the EAX register will contain a signature of 00000000H, indicating the Am386SX/SXL microprocessor passed its self-test of microcode and major PLA contents with no problems detected. The passing signature in EAX, 00000000H, applies to all revision levels. Any non-zero signature indicates the unit is faulty. ### **Component and Revision Identifiers** To assist users the Am386SX/SXL microprocessor, after reset, holds a component identifier and revision identifier in its DX register. The upper 8 bits of DX hold 23H as identification of the Am386SX/SXL CPU (the lower nibble, 03H, refers to the Am386DX/DXL microprocessor architecture. The upper nibble, 02H, refers to the second member of the Am386DX/DXL microprocessor Family). The lower 8 bits of DX hold an 8-bit unsigned binary number related to the component revision level. The revision identifier will, in general, chronologically track those component steppings which are intended to have certain improvements or distinction from previous steppings. The Am386SX/SXL microprocessor revision identifier will track that of the Am386DX/DXL CPU where possible. Note: For maximum design flexibility the Am386SX/SXL CPU has no internal pull-up resistors on its outputs. The design may require an external pull-up on ADS and other outputs to keep them negated during float periods. Figure 37. Requesting Hold from Idle Bus Note: HOLD is a synchronous input and can be asserted at any CLK2 edge, provided setup and hold (t23 and t24) requirements are met. This waveform is useful for determining Hold Acknowledge latency. Figure 38. Requesting Hold from Active Bus (NA Inactive) Figure 39. Requesting Hold from Idle Bus (NA Active) #### Notes: - 1. BUSY should be held stable for 8-CLK2 periods before and after the CLK2 period in which RESET falling edge occurs. - 2. If self-test is requested, the outputs remain in their reset state as shown here. 15022B-028 Figure 40. Bus Activity from Reset Until First Code Fetch Figure 41. Entering and Exiting FLT The revision identifier is intended to assist users to a practical extent. However, the revision identifier value is not guaranteed to change with every stepping revision, or to follow a completely uniform numerical sequence, depending on the type or intention of revision, or manufacturing materials required to be changed. Table 18. Component and Revision Identifier History | Intel i386SX<br>Stepping | | Revision Identifier | |--------------------------|----|---------------------| | В | A1 | 05H | | - С | В | 08H | #### Coprocessor Interfacing The Am386SX/SXL microprocessor provides an automatic interface for a 387SX math coprocessor. A 387SX math coprocessor uses an I/O mapped interface driven automatically by the Am386SX/SXL CPU and assisted by three dedicated signals: BUSY, ERROR, and PEREQ. As the Am386SX/SXL microprocessor begins supporting a math coprocessor instruction, it tests the BUSY and ERROR signals to determine if the coprocessor can accept its next instruction. Thus, the BUSY and ERROR inputs eliminate the need for any preamble bus cycles for communication between processor and math coprocessor. A 387SX math coprocessor can be given its command op-code immediately. The dedicated signals provide instruction synchronization and eliminate the need of using the WAIT op-code (9BH) for 387SX math coprocessor instruction synchronization (the WAIT opcode was required when the 8086 or 8088 was used with the 8087 math coprocessor). Custom math coprocessors can be included in Am386SX/SXL microprocessor based systems by memory-mapped or I/O-mapped interfaces. Such math coprocessor interfaces allow a completely custom protocol, and are not limited to a set of math coprocessor protocol primitives. Instead, memory-mapped or I/O-mapped interfaces may use all applicable instructions for high-speed math coprocessor communication. The BUSY and ERROR inputs of the Am386SX/SXL microprocessor may also be used for the custom math coprocessor interface, if such hardware assist is desired. These signals can be tested by the WAIT opcode (9BH). The WAIT instruction will wait until the BUSY input is inactive (interruptable by an NMI or enabled INTR input), but generates an Exception 16 fault if the ERROR pin is active when the BUSY goes (or is) inactive. If the custom math coprocessor interface is memory-mapped, protection of the addresses used for the interface can be provided with the Am386SX/SXL CPU's on-chip paging or segmentation mechanisms. If the custom interface is I/O-mapped, protection of the interface can be provided with the IOPL (I/O Privilege Level) mechanism. A 387SX math coprocessor interface is I/O mapped as shown in Table 19. Note that a 387SX math coprocessor interface addresses are beyond the 0H–0FFFFH range for programmed I/O. When the Am386SX/SXL microprocessor supports the 387SX math coprocessor, the Am386SX/SXL CPU automatically generates bus cycles to the coprocessor interface addresses. Table 19. Math Coprocessor Port Address | Address in Am386SXL | 387SX-Compatible Math | |---------------------|-----------------------| | CPU I/O Space | Coprocessor Register | | 8000F8H | Op-code Register | | 8000FCH/8000FEH* | Operand Register | \*Generated as 2nd bus cycle during Dword transfer. To correctly map a 387SX math coprocessor registers to the appropriate I/O addresses, connect the CMDO and CMD1 lines of a 387SX math coprocessor, as listed in Table 20. Table 20. Connections for CMD0 and CMD1 Inputs for a 387SX | | Signal | Connection | |---|--------|-----------------------------------------------| | | CMD0 | Connected directly to Am386SXL CPU A2 signal. | | - | CMD1 | Connect to ground. | #### Software Testing for Math Coprocessor Presence When software is used to test for math coprocessor (387SX) presence, it should use only the following math coprocessor op-codes: FINIT, FNINIT, FSTCW mem, FSTSW mem, and FSTSW AX. To use other math coprocessor op-codes when a math coprocessor is known to be not present, first set EM = 1 in the Am386SX/SXL CPU's CR0 register. #### PACKAGE THERMAL SPECIFICATIONS The Am386SX/SXL microprocessor is specified for operation when case temperature is within the range of 0°C-100°C. The case temperature may be measured in any environment to determine whether the Am386SX/SXL CPU is within specified operating range. The case temperature should be measured at the center of the top surface opposite the pins. The ambient temperature is guaranteed as long as Tc is not violated. The ambient temperature can be calculated from the $\theta$ ic and $\theta$ ia from the following equations: $$Tj = Tc + P \bullet \theta jc$$ $$Ta = Tj - P \bullet \theta ja$$ $$Tc = Ta + P \bullet [\theta ja - \theta jc]$$ #### **ELECTRICAL SPECIFICATIONS** The following sections describe recommended electrical connections for the Am386SX/SXL microprocessor, and its electrical specifications. ## **Power and Grounding** The Am386SX/SXL CPU has modest power requirements. However, its high clock frequency and 47 output buffers (address, data, control, and HLDA) can cause power surges as multiple output buffers drive new signal levels simultaneously. For clean on-chip power distribution at high frequency, 14 V<sub>cc</sub> and 18 V<sub>ss</sub> pins separately feed functional units of the Am386SX/SXL microprocessor. Power and ground connections must be made to all external $V_{\text{cc}}$ and $V_{\text{ss}}$ pins of the Am386SX/SXL microprocessor. On the circuit board, all $V_{\text{cc}}$ pins should be connected on a $V_{\text{cc}}$ plane, and $V_{\text{ss}}$ pins should be connected on a GND plane. #### **Power Decoupling Recommendations** Liberal decoupling capacitors should be placed near the Am386SX/SXL microprocessor. The Am386SX/SXL CPU driving its 24-bit address bus and 16-bit data bus at high frequencies can cause transient power surges, particularly when driving large capacitive loads. Low inductance capacitors and interconnects are recommended for best high frequency electrical performance. Inductance can be reduced by shortening circuit board traces between the Am386SX/SXL microprocessor and decoupling capacitors as much as possible. #### **Resistor Recommendations** The ERROR, FLT, and BUSY inputs have internal pullup resistors of approximately 20 Kohms, and the PEREQ input has an internal pull-down resistor of approximately 20 Kohms, built into the Am386SX/SXL microprocessor to keep these signals inactive when a 387SX-compatible math coprocessor is not present in the system (or temporarily removed from its socket). In typical designs, the external pull-up resistors shown in Table 21 are recommended. However, a particular design may have reason to adjust the resistor values recommended here, or alter the use of pull-up resistors in other ways. #### Other Connection Recommendations For reliable operation, always connect unused inputs to an appropriate signal level. NC pins should always remain unconnected. Connection of NC pins to $V_{\rm cc}$ or $V_{\rm ss}$ will result in component malfunction or incompatibility with future steppings of the Am386SX/SXL CPU. Particularly when not using the interrupts or bus hold (as when first prototyping), prevent any chance of spurious activity by connecting these associated inputs to GND. | Pin | Signal | |-----|--------| | 40 | INTR | | 38 | NMI | | 4 | HOLD | If not using address pipelining, connect pin 6 $(\overline{NA})$ through a pull-up in the range of 20 Kohms to $V_{cc}$ . Table 21. Recommended Resistor Pull-Ups to Vcc | Pin | Signal | Pull-Up Value | Purpose | |-----|--------|---------------|----------------------------------------------------------------------------| | 16 | ADS | 20 Kohms ±10% | Lightly pull ADS inactive during Am386SX/SXL CPU Hold Acknowledge states. | | 26 | LOCK | 20 Kohms ±10% | Lightly pull LOCK inactive during Am386SX/SXL CPU Hold Acknowledge states. | #### **ABSOLUTE MAXIMUM RATINGS** Ambient Temperature under bias .....-65 to 125°C Storage Temperature .....-65 to 150°C Stresses above those listed may cause permanent damage to the device. Exposure to absolute maximum rating conditions for extended periods of time may affect device reliability. #### **OPERATING RANGES** Supply Voltage with respect to $V_{ss}$ . . . . -0.5 V to 7 V Voltage on other pins . . . . . . -0.5 V to $(V_{co}$ + 0.5)V Operating ranges define those limits between which the functionality of the device is guaranteed. ## DC CHARACTERISTICS over COMMERCIAL operating ranges $V_{CC}$ = 5 V ±10%; $T_{CASE}$ = 0°C to +100°C (16, 20, 25, and 33 MHz) $V_{CC}$ = 5 V ±5%; $T_{CASE}$ = 0°C to +100°C (40 MHz) | Symbol | Parameter Description | Notes | Min | Max | Unit | |------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------|--------------------------|----------------------------| | V <sub>IL</sub> | Input Low Voltage | (Note 1) | -0.3 | +0.8 | V | | V <sub>IH</sub> | Input High Voltage | | 2.0 | V <sub>cc</sub> + 0.3 | V | | V <sub>ILC</sub> | CLK2 Input Low Voltage | (Note 1) | -0.3 | +0.8 | ٧ | | V <sub>IHC</sub> | CLK2 Input High Voltage | | 2.7 | V <sub>cc</sub> + 0.3 | ٧ | | V <sub>oL</sub> | Output Low Voltage $I_{OL} = 4$ mA: A23–A1, D15–D0 $I_{OL} = 5$ mA: BHE, BLE, W/R, D/G, M/TO, LOCK, ADS, HLDA | | | 0.45<br>0.45 | <b>V V</b> | | V <sub>он</sub> | Output High Voltage loн = 1.0 mA: A23—A1, D15—D0 loн = 0.2 mA: A23—A1, D15—D0 loн = 0.9 mA: BHE, BLE, W/R, D/G, M/IO, LOCK, ADS, HLDA loн = 0.18 mA: BHE, BLE, W/R, D/G, M/IO, LOCK, ADS, HLDA | | 2.4<br>V <sub>cc</sub> - 0.5<br>2.4<br>V <sub>cc</sub> - 0.5 | | V<br>V<br>V | | lu | Input Leakage Current (for all pins except PEREQ, BUSY, FLT, and ERROR) | $0~V \leq V_{\text{IN}} \leq V_{\text{CC}}$ | | ±15 | μΑ | | lıн | Input Leakage Current (PEREQ pin) | V <sub>IH</sub> = 2.4 V (Note 2) | | 200 | μА | | l <sub>IL</sub> | Input Leakage Current<br>(BUSY, ERROR, and FLT pins) | V <sub>IL</sub> = 0.45 V (Note 3) | | -400 | μΑ | | ILO | Output Leakage Current | $0.45~V \le V_{\text{OUT}} \le V_{\text{CC}}$ | | ±15 | μА | | lcc | Supply Current CLK2 = 32 MHz: with -16* CLK2 = 40 MHz: with -20 CLK2 = 50 MHz: with -25 CLK2 = 66 MHz: with -33 CLK2 = 80 MHz: with -40 | V <sub>cc</sub> = 5.0 V<br>lcc Typ =<br>lcc Typ = 130 mA<br>lcc Typ = 160 mA<br>lcc Typ = 210 mA<br>lcc Typ = 255 mA | | 155<br>190<br>245<br>295 | mA<br>mA<br>mA<br>mA<br>mA | | Іссѕв | Standby Current (Am386SXL CPU) | I <sub>CCSB</sub> Typ = 20 μA (Note 5) | | 150 | μΑ | | C <sub>IN</sub> | Input Capacitance | Fc = 1 MHz (Note 4) | | 10 | pF | | Соит | Output or I/O Capacitance | F <sub>c</sub> = 1 MHz (Note 4) | | 12 | pF | | C <sub>CLK</sub> | CLK2 Capacitance | F <sub>c</sub> = 1 MHz (Note 4) | | 20 | pF | Notes: Tested at the minimum operating frequency of the part. \*Contact AMD for 16-MHz availability. - 1. The Min value, -0.3, is not 100% tested. - 2. PEREQ input has an internal pull-down resistor. - BUSY, FLT, and ERROR inputs each have an internal pull-up resistor. - 4. Not 100% tested. Inputs at rails, outputs unloaded, PEREQ Low, ERROR High, BUSY High, and FLT High. #### **SWITCHING CHARACTERISTICS** The switching characteristics given consist of output delays, input setup requirements, and input hold requirements. All switching characteristics are relative to the CLK2 rising edge crossing the 2.0 V level. Switching characteristic measurement is defined by Figure 42. Inputs must be driven to the voltage levels indicated by Figure 42 when switching characteristics are measured. Output delays are specified with minimum and maximum limits measured, as shown. The minimum delay times are hold times provided to external circuitry. Input setup and hold times are specified as minimums, defining the smallest acceptable sampling window. Within the sampling window, a synchronous input signal must be stable for correct operation. Outputs $\overline{\text{ADS}}$ , $W/\overline{\text{R}}$ , $D/\overline{\text{C}}$ , $M/\overline{\text{IO}}$ , $\overline{\text{LOCK}}$ , $\overline{\text{BHE}}$ , $\overline{\text{BLE}}$ , A23–A1, and HLDA only change at the beginning of phase one. D15–D0 (write cycles) only change at the beginning of phase two. The $\overline{\text{READY}}$ , HOLD, $\overline{\text{BUSY}}$ , $\overline{\text{ERROR}}$ , PEREQ, $\overline{\text{FLT}}$ , and D15–D0 (read cycles) inputs are sampled at the beginning of phase one. The $\overline{\text{NA}}$ , INTR, and NMI inputs are sampled at the beginning of phase two. Figure 42. Drive Levels and Measurement Points for Switching Characteristics Switching Characteristics at 40 MHz (Am386SX Microprocessor Only): $V_{cc} = 5 \text{ V} \pm 5\%$ ; $T_{\text{CASE}} = 0^{\circ}\text{C}$ to $100^{\circ}\text{C}$ | Symbol | Parameter Description | Notes | | Ref.<br>Figure | Min | Max | Unit | |--------|-------------------------------------|------------------------------------------|-----------------------------------------|----------------|------|-----|------| | | Operating Frequency | Half CLK2 frequence | су | | 2 | 40 | MHz | | 1 | CLK2 Period | | | 44 | 12.5 | 250 | ns | | 2 | CLK2 High Time | at 2.7 V | *************************************** | 44 | 4.5 | | ns | | 3 | CLK2 Low Time | at 0.8 V | | 44 | 4.5 | | ns | | 4 | CLK2 Fall Time | 2.7 V to 0.8 V | (Note 3) | 44 | | 4 | ns | | 5 | CLK2 Rise Time | 0.8 V to 2.7 V | (Note 3) | 44 | | 4 | ns | | 6 | A23-A1 Valid Delay | C <sub>L</sub> = 50 pF | | 47 | 4 | 13 | ns | | 7 | A23-A1 Float Delay | | (Note 1) | 51 | 4 | 20 | ns | | 8 | BHE, BLE, LOCK Valid Delay | C <sub>L</sub> = 50 pF | | 47 | 4 | 13 | ns | | 9 | BHE, BLE, LOCK Float Delay | | (Note 1) | 51 | 4 | 20 | ns | | 10 | M/IO, D/C, W/R, ADS Valid Delay | C <sub>L</sub> = 50 pF | | 47 | 4 | 13 | ns | | 11 | W/R, M/IO, D/C, ADS Float Delay | | (Note 1) | 51 | 4 | 20 | ns | | 12 | D15-D0 Write Data Valid Delay | C <sub>L</sub> = 50 pF | (Note 4) | 47 | 7 | 18 | ns | | 12a | D15-D0 Write Data Hold Time | C <sub>L</sub> = 50 pF | | 49 | 2 | | ns | | 13 | D15-D0 Write Data Float Delay | | (Note 1) | 51 | 4 | 17 | ns | | 14 | HLDA Valid Delay | C <sub>L</sub> = 50 pF | | 51 | 4 | 17 | ns | | 14f | HLDA Float Delay | 1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1, | | 51 | 4 | 17 | ns | | 15 | NA Setup Time | | | 46 | 5 | | ns | | 16 | NA Hold Time | | | 46 | 2 | | ns | | 19 | READY Setup Time | | | 46 | 7 | | ns | | 20 | READY Hold Time | | | 46 | 4 | | ns | | 21 | D15-D0 Read Data Setup Time | | | 46 | 4 | | ns | | 22 | D15-D0 Read Data Hold Time | | | 46 | 3 | | ns | | 23 | HOLD Setup Time | | | 46 | 4 | | ns | | 24 | HOLD Hold Time | | | 46 | 2 | | ns | | 25 | RESET Setup Time | | | 52 | 4 | | ns | | 26 | RESET Hold Time | | | 52 | 2 | | ns | | 27 | NMI, INTR Setup Time | | (Note 2) | 46 | 5 | | ns | | 28 | NMI, INTR Hold Time | | (Note 2) | 46 | 5 | | ns | | 29 | PEREQ, ERROR, BUSY, FLT* Setup Time | | (Note 2) | 46 | 5 | | ns | | 30 | PEREQ, ERROR, BUSY, FLT* Hold Time | | (Note 2) | 46 | 4 | | ns | Notes: \*Float feature is available in Rev. B0 and later. <sup>1.</sup> Float condition occurs when maximum output current becomes less than ILO in magnitude. Float delay is not 100% tested. <sup>2.</sup> These inputs are allowed to be asynchronous to CLK2. The setup and hold specifications are given for testing purposes to assure recognition within a specific CLK2 period. <sup>3.</sup> Rise and Fall times are not tested. They are guaranteed by design characterization. <sup>4.</sup> Min time is not 100% tested. Switching Characteristics at 33 MHz: $V_{cc} = 5 \text{ V} \pm 10\%$ ; $T_{case} = 0^{\circ}\text{C}$ to $100^{\circ}\text{C}$ | Symbol | Parameter Description | Notes | | Ref.<br>Figure | Min | Max | Unit | |--------|--------------------------------------------------|------------------------------------|----------|----------------|----------|----------|------------| | | Operating Frequency: Am386SX CPU<br>Am386SXL CPU | Half CLK2 freq.<br>Half CLK2 freq. | | | 2 0 | 33<br>33 | MHz<br>MHz | | 1 | CLK2 Period: Am386SX CPU<br>Am386SXL CPU | | | 43<br>43 | 15<br>15 | | ns<br>ns | | 2a | CLK2 High Time | at 2 V | | 43 | 6.25 | | ns | | 2b | CLK2 High Time | at 3.7 V | | 43 | 4 | | ns | | За | CLK2 Low Time | at 2 V | | 43 | 6.25 | | ns | | 3b | CLK2 Low Time | at 0.8 V | | 43 | 4.5 | | ns | | 4 | CLK2 Fall Time | 3.7 V to 0.8 V | (Note 3) | 43 | | 4 | ns | | 5 | CLK2 Rise Time | 0.8 V to 3.7 V | (Note 3) | 43 | | 4 | ns | | 6 | A23-A1 Valid Delay | C <sub>L</sub> = 50 pF | | 47 | 4 | 15 | ns | | 7 | A23-A1 Float Delay | | (Note 1) | 51 | 4 | 20 | ns | | 8 | BHE, BLE, LOCK Valid Delay | C <sub>L</sub> = 50 pF | | 47 | 4 | 15 | ns | | 9 | BHE, BLE, LOCK Float Delay | | (Note 1) | 51 | 4 | 20 | ns | | 10 | M/IO, D/C, W/R, ADS Valid Delay | C <sub>L</sub> = 50 pF | , | 47 | 4 | 15 | ns | | 11 | W/R, M/IO, D/C, ADS Float Delay | | (Note 1) | 51 | 4 | 20 | ns | | 12 | D15-D0 Write Data Valid Delay | C <sub>L</sub> = 50 pF | (Note 4) | 47 | 7 | 23 | ns | | 12a | D15-D0 Write Data Hold Time | C <sub>L</sub> = 50 pF | | 49 | 2 | | ns | | 13 | D15-D0 Write Data Float Delay | | (Note 1) | 51 | 4 | 17 | ns | | 14 | HLDA Valid Delay | C <sub>L</sub> = 50 pF | | 47 | 4 | 20 | ns | | 14f | HLDA Float Delay | | | 51 | 4 | 20 | ns | | 15 | NA Setup Time | | | 46 | 5 | | ns | | 16 | NA Hold Time | | | 46 | 2 | | ns | | 19 | READY Setup Time | | | 46 | 7 | | ns | | 20 | READY Hold Time | | | 46 | 4 | | ns | | 21 | D15-D0 Read Data Setup Time | | | 46 | 5 | | ns | | 22 | D15-D0 Read Data Hold Time | | | 46 | 3 | | ns | | 23 | HOLD Setup Time | | | 46 | 9 | | ns | | 24 | HOLD Hold Time | | | 46 | 2 | | ns | | 25 | RESET Setup Time | | | 52 | 5 | | ns | | 26 | RESET Hold Time | | | 52 | 2 | | ns | | 27 | NMI, INTR Setup Time | | (Note 2) | 46 | 5 | | ns | | 28 | NMI, INTR Hold Time | | (Note 2) | 46 | 5 | | ns | | 29 | PEREQ, ERROR, BUSY Setup Time | | (Note 2) | 46 | 5 | | ns | | 30 | PEREQ, ERROR, BUSY Hold Time | | (Note 2) | 46 | 4 | | ns | Notes: 1. Float condition occurs when maximum output current becomes less than ILo in magnitude. Float delay is not 100% tested. <sup>2.</sup> These inputs are allowed to be asynchronous to CLK2. The setup and hold specifications are given for testing purposes to assure recognition within a specific CLK2 period. <sup>3.</sup> Rise and Fall times are not tested. They are guaranteed by design characterization. <sup>4.</sup> Min time is not 100% tested. Switching Characteristics at 25 MHz: $V_{co} = 5 \text{ V} \pm 10\%$ ; $T_{case} = 0^{\circ}\text{C}$ to $100^{\circ}\text{C}$ | Symbol | Parameter Description | Notes | Ref.<br>Figure | Min | Max | Unit | |--------|--------------------------------------------------|--------------------------------------------|----------------|----------|----------|------------| | | Operating Frequency: Am386SX CPU<br>Am386SXL CPU | Half CLK2 freq.<br>Half CLK2 freq. | | 2<br>0 | 25<br>25 | MHz<br>MHz | | 1 | CLK2 Period: Am386SX CPU<br>Am386SXL CPU | | 43<br>43 | 20<br>20 | | ns<br>ns | | 2a | CLK2 High Time | at 2 V | 43 | 7 | | ns | | 2b | CLK2 High Time | at (Vcc-0.8 V) | 43 | 4 | | ns | | За | CLK2 Low Time | at 2 V | 43 | 7 | | ns | | 3b | CLK2 Low Time | at 0.8 V | 43 | 5 | | ns | | 4 | CLK2 Fall Time | (Vcc-0.8 V) to 0.8 V (Note 3) | 43 | | 7 | ns | | 5 | CLK2 Rise Time | 0.8 V to (V <sub>cc</sub> -0.8 V) (Note 3) | 43 | | 7 | ns | | 6 | A23-A1 Valid Delay | C <sub>L</sub> = 50 pF | 47 | 4 | 17 | ns | | 7 | A23-A1 Float Delay | (Note 1) | 51 | 4 | 30 | ns | | 8 | BHE, BLE, LOCK Valid Delay | C <sub>L</sub> = 50 pF | 47 | 4 | 17 | ns | | 9 | BHE, BLE, LOCK Float Delay | (Note 1) | 51 | 4 | 30 | ns | | 10 | M/IO, D/C, W/R, ADS Valid Delay | C <sub>L</sub> = 50 pF | 47 | 4 | 17 | ns | | 11 | W/R, M/IO, D/C, ADS Float Delay | (Note 1) | 51 | 4 | 30 | ns | | 12 | D15-D0 Write Data Valid Delay | C <sub>L</sub> = 50 pF | 47 | 7 | 23 | ns | | 12a | D15-D0 Write Data Hold Time | C <sub>L</sub> = 50 pF | 49 | 2 | | ns | | 13 | D15-D0 Write Data Float Delay | (Note 1) | 51 | 4 | 22 | ns | | 14 | HLDA Valid Delay | C <sub>L</sub> = 50 pF | 47 | 4 | 22 | ns | | 14f | HLDA Float Delay | | 51 | 4 | 22 | ns | | 15 | NA Setup Time | | 46 | 5 | | ns | | 16 | NA Hold Time | | 46 | 3 | | ns | | 19 | READY Setup Time | | 46 | 9 | | ns | | 20 | READY Hold Time | | 46 | 4 | | ns | | 21 | D15-D0 Read Data Setup Time | | 46 | 7 | | ns | | 22 | D15-D0 Read Data Hold Time | | 46 | 5 | | ns | | 23 | HOLD Setup Time | | 46 | 9 | | ns | | 24 | HOLD Hold Time | | 46 | 3 | | ns | | 25 | RESET Setup Time | | 52 | 8 | | ns | | 26 | RESET Hold Time | | 52 | 3 | | ns | | 27 | NMI, INTR Setup Time | (Note 2) | 46 | 6 | | ns | | 28 | NMI, INTR Hold Time | (Note 2) | 46 | 6 | | ns | | 29 | PEREQ, ERROR, BUSY, FLT Setup Time | (Note 2) | 46 | 6 | | ns | | 30 | PEREQ, ERROR, BUSY, FLT Hold Time | (Note 2) | 46 | 5 | | ns | Notes: 1. Float condition occurs when maximum output current becomes less than ILo in magnitude. Float delay is not 100% tested. These inputs are allowed to be asynchronous to CLK2. The setup and hold specifications are given for testing purposes, to assure recognition within a specific CLK2 period. <sup>3.</sup> These are not tested. They are guaranteed by design characterization. Switching Characteristics at 20 MHz: $V_{cc} = 5 \text{ V} \pm 10\%$ ; $T_{case} = 0^{\circ}\text{C}$ to $100^{\circ}\text{C}$ | Symbol | Parameter Description | Notes | Ref.<br>Figure | Min | Max | Unit | |--------|--------------------------------------------------|--------------------------------------------|----------------|----------|----------|------------| | | Operating Frequency: Am386SX CPU<br>Am386SXL CPU | Half CLK2 freq.<br>Half CLK2 freq. | | 2<br>0 | 20<br>20 | MHz<br>MHz | | 1 | CLK2 Period: Am386SX CPU<br>Am386SXL CPU | | 43<br>43 | 25<br>25 | | ns<br>ns | | 2a | CLK2 High Time | at 2 V | 43 | 8 | | ns | | 2b | CLK2 High Time | at (Vcc-0.8 V) | 43 | 5 | | ns | | 3a | CLK2 Low Time | at 2 V | 43 | 8 | | ns | | 3b | CLK2 Low Time | at 0.8 V | 43 | 6 | | ns | | 4 | CLK2 Fall Time | (Vcc-0.8 V) to 0.8 V (Note 3 | ) 43 | | 8 | ns | | 5 | CLK2 Rise Time | 0.8 V to (V <sub>cc</sub> - 0.8 V) (Note 3 | ) 43 | | 8 | ns | | 6 | A23-A1 Valid Delay | C <sub>L</sub> = 120 pF (Note 4 | ) 47 | 4 | 30 | ns | | 7 | A23-A1 Float Delay | (Note 1 | ) 51 | 4 | 32 | ns | | 8 | BHE, BLE, LOCK Valid Delay | C <sub>L</sub> = 75 pF (Note 4 | ) 47 | 4 | 30 | ns | | 9 | BHE, BLE, LOCK Float Delay | (Note 1 | ) 51 | 4 | 32 | ns | | 10a | M/IO, D/C Valid Delay | $C_L = 75 pF$ (Note 4 | ) 47 | 4 | 28 | ns | | 10b | W/R, ADS Valid Delay | $C_L = 75 pF$ (Note 4 | ) 47 | 4 | 26 | ns | | 11 | W/R, M/IO, D/C, ADS Float Delay | (Note 1 | ) 51 | 6 | 30 | ns | | 12 | D15-D0 Write Data Valid Delay | C <sub>L</sub> = 120 p (Note 4 | ) 47 | 4 | 38 | ns | | 13 | D15-D0 Write Data Float Delay | (Note 1 | ) 51 | 4 | 27 | ns | | 14 | HLDA Valid Delay | $C_L = 75 pF$ (Note 4 | ) 47 | 4 | 28 | ns | | 14f | HLDA Float Delay | | 51 | 4 | 28 | ns | | 15 | NA Setup Time | | 46 | 5 | | ns | | 16 | NA Hold Time | | 46 | 12 | | ns | | 19 | READY Setup Time | | 46 | 12 | | ns | | 20 | READY Hold Time | | 46 | 4 | | ns | | 21 | D15-D0 Read Data Setup Time | | 46 | 9 | | ns | | 22 | D15-D0 Read Data Hold Time | | 46 | 6 | | ns | | 23 | HOLD Setup Time | | 46 | 17 | | ns | | 24 | HOLD Hold Time | | 46 | 5 | | ns | | 25 | RESET Setup Time | | 52 | 12 | | ns | | 26 | RESET Hold Time | | 52 | 4 | | ns | | 27 | NMI, INTR Setup Time | (Note 2 | ) 46 | 16 | | ns | | 28 | NMI, INTR Hold Time | (Note 2 | ) 46 | 16 | | ns | | 29 | PEREQ, ERROR, BUSY, FLT Setup Time | (Note 2 | ) 46 | 14 | | ns | | 30 | PEREQ, ERROR, BUSY, FLT Hold Time | (Note 2 | ) 46 | 5 | | ns | Notes: 1. Float condition occurs when maximum output current becomes less than I<sub>LO</sub> in magnitude. Float delay is not 100% tested. 3. These are not tested. They are guaranteed by design characterization. <sup>2.</sup> These inputs are allowed to be asynchronous to CLK2. The setup and hold specifications are given for testing purposes, to assure recognition within a specific CLK2 period. <sup>4.</sup> Tested with C<sub>L</sub> set at 50 pF and derated to support the indicated distributed capacitive load. See Figures 53 and 54 for the capacitive derating curve. Switching Characteristics at 16 MHz\*: $V_{cc} = 5 \text{ V} \pm 10\%$ ; $T_{case} = 0^{\circ}\text{C}$ to $100^{\circ}\text{C}$ | Symbol | Parameter Description | Notes | Ref.<br>Figure | Min | Max | Unit | |--------|--------------------------------------------------|--------------------------------------------|----------------|----------|----------|------------| | | Operating Frequency: Am386SX CPU<br>Am386SXL CPU | Half CLK2 freq.<br>Half CLK2 freq. | | 2<br>0 | 16<br>16 | MHz<br>MHz | | 1 | CLK2 Period: Am386SX CPU<br>Am386SXL CPU | | 43<br>43 | 31<br>31 | | ns<br>ns | | 2a | CLK2 High Time | at 2 V | 43 | 9 | | ns | | 2b | CLK2 High Time | at (Vcc-0.8 V) | 43 | 5 | | ns | | 3a | CLK2 Low Time | at 2 V | 43 | 9 | | ns | | 3b | CLK2 Low Time | at 0.8 V | 43 | 7 | | ns | | 4 | CLK2 Fall Time | (Vcc-0.8 V) to 0.8 V (Note 3) | 43 | | 8 | ns | | 5 | CLK2 Rise Time | 0.8 V to (V <sub>cc</sub> -0.8 V) (Note 3) | 43 | | 8 | ns | | 6 | A23-A1 Valid Delay | C <sub>L</sub> = 120 pF (Note 4) | 47 | 4 | 36 | ns | | 7 | A23-A1 Float Delay | (Note 1) | 51 | 4 | 40 | ns | | 8 | BHE, BLE, LOCK Valid Delay | $C_L = 75 pF$ (Note 4) | 47 | 4 | 36 | ns | | 9 | BHE, BLE, LOCK Float Delay | (Note 1) | 51 | 4 | 40 | ns | | 10 | W/R, M/IO, D/C, ADS Valid Delay | $C_L = 75 pF$ (Note 4) | 47 | 4 | 33 | ns | | 11 | W/R, M/IO, D/C, ADS Float Delay | (Note 1) | 51 | 6 | 35 | ns | | 12 | D15-D0 Write Data Valid Delay | C <sub>L</sub> = 120 pF (Note 4) | 47 | 4 | 40 | ns | | 13 | D15-D0 Write Data Float Delay | (Note 1) | 51 | 4 | 35 | ns | | 14 | HLDA Valid Delay | C <sub>L</sub> = 75 pF (Note 4) | 47 | 4 | 33 | ns | | 14f | HLDA Float Delay | | 51 | 4 | 33 | ns | | 15 | NA Setup Time | | 46 | 5 | | ns | | 16 | NA Hold Time | | 46 | 21 | | ns | | 19 | READY Setup Time | | 46 | 19 | | ns | | 20 | READY Hold Time | | 46 | 4 | | ns | | 21 | D15-D0 Read Data Setup Time | | 46 | 9 | | ns | | 22 | D15-D0 Read Data Hold Time | | 46 | 6 | | . ns | | 23 | HOLD Setup Time | | 46 | 26 | | ns | | 24 | HOLD Hold Time | | 46 | 5 | | ns | | 25 | RESET Setup Time | | 52 | 13 | | ns | | 26 | RESET Hold Time | | 52 | 4 | | ns | | 27 | NMI, INTR Setup Time | (Note 2) | 46 | 16 | | ns | | 28 | NMI, INTR Hold Time | (Note 2) | 46 | 16 | | ns | | 29 | PEREQ, ERROR, BUSY, FLT Setup Time | (Note 2) | 46 | 16 | | ns | | 30 | PEREQ, ERROR, BUSY, FLT Hold Time | (Note 2) | 46 | 5 | | ns | Notes: \*Contact AMD for 16-MHz availability. <sup>1.</sup> Float condition occurs when maximum output current becomes less than I<sub>LO</sub> in magnitude. Float delay is not 100% tested. These inputs are allowed to be asynchronous to CLK2. The setup and hold specifications are given for testing purposes, to assure recognition within a specific CLK2 period. <sup>3.</sup> These are not tested. They are guaranteed by design characterization. Tested with C<sub>L</sub> set at 50 pF and derated to support the indicated distributed capacitive load. See Figures 53 and 54 for the capacitive derating curve. 15022B-031 Figure 43. CLK2 Timing (16, 20, 25, and 33 MHz) 15022B-031a Figure 44. CLK2 Timing (40 MHz) Figure 45. AC Test Circuit ## **SWITCHING WAVEFORMS** Figure 46. Input Setup and Hold Timing 15022B-033 Figure 47. Output Valid Delay Timing 15022B-034 15021B-076 15021B-077 ## **SWITCHING WAVEFORMS (continued)** Figure 48. Write Data Valid Delay Timing (20 and 25 MHz) Figure 49. Write Data Hold Timing Figure 50. Write Data Valid Delay Timing (20 MHz) 15021B-078 Figure 51. Output Float Delay and HLDA Valid Delay Timing 15022B-036 Figure 52. RESET Setup and Hold Timing and Internal Phase Figure 53. Typical Output Valid Delay Versus Load Capacitance at Maximum Operating Temperature (C<sub>L</sub> = 120 pF) 15022B-038 Figure 54. Typical Output Valid Delay Versus Load Capacitance at Maximum Operating Temperature (C<sub>L</sub> = 75 pF) Figure 55. Typical Output Valid Delay Versus Load Capacitance at Maximum Operating Temperature (C<sub>L</sub> = 50 pF) Figure 56. Typical Output Rise Time Versus Load Capacitance at Maximum Operating Temperature ## DIFFERENCES BETWEEN THE Am386SX/ SXL CPU AND THE Am386DX/DXL CPU The following are the major differences between the Am386SX/SXL CPU and the Am386DX/DXL CPU: - The Am386SX/SXL CPU generates byte selects on BHE and BLE (like the 8086 and 80286) to distinguish the upper and lower bytes on its 16-bit data bus. The Am386DX/DXL CPU uses four byte - selects, BE3-BE0, to distinguish between the different bytes on its 32-bit bus. - The Am386SX/SXL CPU has no bus sizing option. The Am386DX/DXL CPU can select between either a 32-bit bus or a 16-bit bus by use of the BS16 input. The Am386SX/SXL CPU has a 16-bit bus size. - 3. The $\overline{\text{NA}}$ pin operation in the Am386SX/SXL CPU is identical to that of the $\overline{\text{NA}}$ pin on the Am386DX/DXL CPU with one exception: the Am386DX/DXL CPU $\overline{\text{NA}}$ pin cannot be activated on 16-bit bus cycles (where $\overline{\text{BS16}}$ is Low in the Am386DX/DXL CPU case), whereas $\overline{\text{NA}}$ can be activated on any Am386SX/SXL CPU bus cycle. The contents of all Am386SX/SXL CPU registers at reset are identical to the contents of the Am386DX/ DXL CPU registers at reset, except the DX register. The DX register contains a component-stepping identifier at reset, that is, in Am386DX/DXL CPU, after reset DH = 3 indicates Am386DX/DXL CPU DI = revision number; in Am386SX/SXL CPU, after reset DH = 23H indicates Am386SX/SXL CPU DL = revision number. - The Am386DX/DXL CPU uses A31 and M/IO as selects for the math coprocessor. The Am386SX/ SXL CPU uses A23 and M/IO as selects. - 6. The Am386DX/DXL CPU prefetch unit fetches code in four-byte units. The Am386SX/SXL CPU prefetch unit reads two bytes as one unit (like the 80286). In BS16 mode, the Am386DX/DXL CPU takes two consecutive bus cycles to complete a prefetch request. If there is a data read or write request after the prefetch starts, the Am386DX/DXL CPU will fetch all four bytes before addressing the new request. - 7. Both Am386DX/DXL CPU and Am386SX/SXL CPU have the same logical address space. The only difference is that the Am386DX/DXL CPU has a 32-bit physical address space and the Am386SX/ SXL CPU has a 24-bit physical address space. The Am386SX/SXL CPU has a physical memory address space of up to 16 Mb instead of the 4 Gb available to the Am386DX/DXL CPU. Therefore, in Am386SX/SXL CPU systems, the operating system must be aware of this physical memory limit and should allocate memory for applications programs within this limit. If an Am386DX/DXL CPU system uses only the lower 16 Mb of physical address, then there will be no extra effort required to migrate Am386DX/DXL CPU software to the Am386SX/SXL CPU. Any application which uses more than 16 Mb of memory can run on the Am386SX/SXL CPU, if the operating system utilizes the Am386SX/SXL CPU's paging mechanism. In spite of this difference in physical address space, the Am386SX/SXL CPU and Am386DX/DXL CPU can run the same operating systems and applications within their respective physical memory constraints. - The Am386SX/SXL CPU has an input called FLT which three-states all bi-directional and output pins, including HLDA, when asserted. It is used with ON-Circuit Emulation (ONCE). #### **INSTRUCTION SET** This section describes the instruction set. The Instruction Set Clock Count Summary lists all instructions along with instruction encoding diagrams and clock counts. Further details of the instruction encoding are then provided in the following sections, which completely describe the encoding structure and the definition of all fields occurring within instructions. ## The Am386SX/SXL CPU Instruction Encoding and Clock Count Summary To calculate elapsed time for an instruction, multiply the instruction clock count, as listed in the Instruction Set Clock Count Summary, by the processor clock period (e.g., 40 ns for a 25-MHz, 50 ns for a 20-MHz, and 62.5 ns for a 16-MHz Am386SX/SXL microprocessor). The actual clock count of an Am386SX/SXL CPU program will average 5% more than the calculated clock count due to instruction sequences which execute faster than they can be fetched from memory. #### **Instruction Clock Count Assumptions** - The instruction has been prefetched, decoded, and is ready for execution. - 2. Bus cycles do not require wait states. - There are no local bus HOLD requests delaying processor access to the bus. - 4. No exceptions are detected during instruction execution. - If an effective address is calculated, it does not use two general register components. One register, scaling and displacement can be used within the clock counts shown. However, if the effective address calculation uses two general register components, add 1 clock to the clock count shown. #### **Instruction Clock Count Notation** - If two clock counts are given, the smaller refers to a register operand and the larger refers to a memory operand. - 2. n = number of times repeated. - m = number of components in the next instruction executed, where the entire displacement (if any) counts as one component, the entire immediate data (if any) counts as one component, and all other bytes of the instruction and prefix(es) each count as one component. ### Misaligned or 32-Bit Operand Accesses - If instructions access a misaligned 16-bit operand or 32-bit operand on even address add: - 2x clocks for read or write - 4x clocks for read and write - If instructions access a 32-bit operand on odd address add: - 4x clocks for read or write - 8 x clocks for read and write #### Wait States Wait states add 1 clock per wait state to instruction execution for each data access. Am386SX/SXL Instruction Set Clock Count Summary | | | Clock | Count | No | tes | |-------------------------------------------------|----------------------------------------------|--------------------------------------------------|-----------------------------------------|--------------------------------------------------|-----------------------------------------| | Instruction | Format | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | | GENERAL DATA TRANSFER | | | | | | | MOV=Move: | | | | | | | Register to Register/Memory | 1 0 0 0 1 0 0 w mod reg r/m | 2/2 | 2/2* | b | h | | Register/Memory to Register | 1 0 0 0 1 0 1 w mod reg r/m | 2/4 | 2/4* | b | h | | Immediate to Register/Memory | 1 1 0 0 0 1 1 w mod 0 0 0 r/m immediate data | 2/2 | 2/2* | b | h | | Immediate to Register (short form) | 1 0 1 1 w reg immediate data | 2 | 2 | | | | Memory to Accumulator (short form) | 1 0 1 0 0 0 0 w full displacement | 4* | 4* | b | h | | Accumulator to Memory (short form) | 1 0 1 0 0 0 1 w full displacement | 2* | 2* | b | h | | Register/Memory to Segment Register | 1 0 0 0 1 1 1 0 mod sreg 3 r/m | 2/5 | 22/23 | b | h, i, j | | Segment Register to Register/Memory | 1 0 0 0 1 1 0 0 mod sreg r/m | 2/2 | 2/2 | b | ħ | | MOVSX = Move with Sign Extension | | | | | | | Register from Register/Memory | 0 0 0 0 1 1 1 1 1 0 1 1 1 1 1 w mod reg r/m | 3/6* | 3/6* | b | h | | MOVZX = Move with Zero Extension | | | | | | | Register from Register/Memory | 0 0 0 0 1 1 1 1 1 0 1 1 0 1 1 w mod reg r/m | 3/6* | 3/6* | b | h | | PUSH = Push: | | | | | | | Register/Memory | 1 1 1 1 1 1 1 mod 1 1 0 r/m | 5/7* | 7/9* | b | h | | Register (short form) | 0 1 0 1 0 reg | 2 | 4 | b | h | | Segment Register (ES,CS,SS, or DS) (short form) | 000 sreg 2 110 | 2 | 4 | b | h | | Segment Register (ES, CS, SS, DS, FS, or GS) | 0 0 0 0 1 1 1 1 10 sreg 3 000 | 2 | 4 | b | h | | Immediate | 0 1 1 0 1 0 s 0 immediate data | 2 | 4 | b | h | | PUSHA = Push All | 0 1 1 0 0 0 0 0 | 18 | 34 | b | h | | POP=Pop | | | | | | | Register/Memory | 1 0 0 0 1 1 1 1 mod 0 0 0 r/m | 5/7 | 7/9 | b | h | | Register (short form) | 0 1 0 1 1 reg | 6 | 6 | b | h | | Segment Register (ES, CS, SS, or DS) | 000 sreg 2 111 | 7 | 25 | b | h, i, j | | Segment Register (ES, CS, SS, DS, FS, or GS) | 0 0 0 0 1 1 1 1 10 sreg 3 001 | 7 | 25 | b | h, i, j | | POPA=Pop All | 0 1 1 0 0 0 0 1 | 24 | 40 | b | h | | XCHG = Exchange | | | | | | | Register/Memory with Register | 1 0 0 0 0 1 1 w mod reg r/m | 3/5** | 3/5** | b, f | f, h | | Register with Accumulator (short form) | 1 0 0 1 0 reg | 3 | 3 | | | | IN = Input From: | CLK Count<br>Virtual 8086 Mode | | | | | | Fixed Port | 1 1 1 0 0 1 0 w port number 28*** | 12* | 6*/26* | | s/t, m | | Variable Port | 1 1 1 0 1 1 0 w 27*** | 13* | 7*/27* | | s/t, m | <sup>\*</sup> If CPL \leq IOPL \*\* If CPL \leq IOPL \*\* If CPL \leq IOPL \*\*\*Clock count shown applies if I/O permission allows I/O to the port in Virtual 8086 Mode. If I/O bit map denies permission Exception 13 fault occurs; refer to clock counts for INT3 instruction. | | | | Clock | Count | Not | es | |---------------------------------------|--------------------------------|--------------------------------|--------------------------------------------------|-----------------------------------------|--------------------------------------------------|-----------------------------------------| | Instruction | Format | | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | | OUT = Output To: | | CLK Count<br>Virtual 8086 Mode | | | | | | Fixed Port | 1 1 1 0 0 1 1 w port number | 24*** | 10° | 4*/24* | | s/t, m | | Variable Port | 1 1 1 0 1 1 1 W | 25*** | 11* | 5*/25* | | s/t, m | | LEA = Load EA to Register | 1 0 0 0 1 1 0 1 mod reg r/m | | 2 | 2 | | | | SEGMENT CONTROL | | | | | | | | LDS = Load Pointer to DS | 1 1 0 0 0 1 0 1 mod reg r/m | | 7* | 26*/28* | ь | h, i, j | | LES=Load Pointer to ES | 1 1 0 0 0 1 0 0 mod reg r/m | | 7* | 26*/28* | ь | h, i, j | | LFS = Load Pointer to FS | 00001111 10110100 | mod reg r/m | 7* | 26*/28* | ь | h, i, j | | LGS = Load Pointer to GS | 00001111 10110101 | mod reg r/m | 7* | 26*/28* | ь | h, i, j | | LSS = Load Pointer to SS | 00001111 10110010 | mod reg r/m | 7* | 26*/28* | ь | h, i, j | | FLAG CONTROL | | | | | | | | CLC = Clear Carry Flag | 1 1 1 1 0 0 0 | | 2 | 2 | | | | CLD = Clear Direction Flag | 1111100 | | 2 | 2 | | | | CLI = Clear Interrupt Enable Flag | 1 1 1 1 0 1 0 | | 8 | 8 | | m | | CLTS = Clear Task Switched Flag | 00001111 00000110 | | 5 | 5 | С | 1 | | CMC = Complement Carry Flag | 1 1 1 1 0 1 0 1 | | 2 | 2 | | | | LAHF = Load AH into Flag | 10011111 | | 2 | 2 | | | | POPF = Pop Flags | 10011101 | | 5 | 5 | ь | h, n | | PUSHF = Push Flags | 10011100 | | 4 | 4 | ь | h | | SAHF = Store AH into Flags | 10011110 | | 3 | 3 | | | | STC = Set Carry Flag | 1 1 1 1 1 0 0 1 | | 2 | 2 | | | | STD = Set Direction Flag | 1111101 | | | | | | | STI = Set Interrupt Enable Flag | 1 1 1 1 1 0 1 1 | | 8 | 8 | | m | | ARITHMETIC | | | | | | | | ADD = Add | | | | | | | | Register to Register | 0 0 0 0 0 0 d w mod reg r/m | | 2 | 2 | | | | Register to Memory | 0 0 0 0 0 0 0 w mod reg r/m | | 7** | 7** | ь | h | | Memory to Register | 0 0 0 0 0 0 1 w mod reg r/m | | 6 <b>*</b> | 6* | b | h | | Immediate to Register/Memory | 1 0 0 0 0 0 s w mod 0 0 0 r/m | immediate data | 2/7** | 2/7** | b | h | | Immediate to Accumulator (short form) | 0 0 0 0 0 1 0 w immediate data | | 2 | 2 | | | | ADC = Add with Carry | | | | | | | | Register to Register | 0 0 0 1 0 0 d w mod reg r/m | | 2 | 2 | | | <sup>\*</sup>If CPL > IOPL \*\* If CPL > IOPL \*\*\* If CPL > IOPL \*\*\*Clock count shown applies if I/O permission allows I/O to the port in Virtual 8086 Mode. If I/O bit map denies permission Exception 13 fault occurs; refer to clock counts for INT3 instruction. | Automotive ( ) | | | Count | No | tes | |-----------------------------------------|----------------------------------------------|--------------------------------------------------|-----------------------------------------|--------------------------------------------------|-----------------------------------------| | Instruction | Format | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | | ADC = Add with Carry (continued) | | | | | | | Register to Memory | 0 0 0 1 0 0 0 w mod reg r/m | 7** | 7** | b | h | | Memory to Register | 0 0 0 1 0 0 1 w mod reg r/m | 6* | 6 <b>*</b> | ь | h | | Immediate to Register/Memory | 1 0 0 0 0 0 s w mod 0 1 0 r/m immediate data | 2/7** | 2/7** | b | h | | Immediate to Accumulator (short form) | 0 0 0 1 0 1 0 w immediate data | 2 | 2 | | | | INC = increment | | | | | | | Register/Memory | 1 1 1 1 1 1 1 w mod 0 0 0 r/m | 2/6** | 2/6** | b | h | | Register (short form) | 0 1 0 0 0 reg | 2 | 2 | | | | SUB=Subtract | | | | | | | Register from Register | 0 0 1 0 1 0 d w mod reg r/m | 2 | 2 | | | | Register from Memory | 0 0 1 0 1 0 0 w mod reg r/m | 7** | 7** | ь | h | | Memory from Register | 0 0 1 0 1 0 1 w mod reg r/m | 6* | 6* | b | h | | Immediate from Register/Memory | 1 0 0 0 0 0 s w mod 1 0 1 r/m immediate data | 2/7** | 2/7** | b | h | | Immediate from Accumulator (short form) | 0 0 1 0 1 1 0 w immediate data | 2 | 2 | : | | | SBB=Subtract with Borrow | | | | | <b> </b> | | Register from Register | 0 0 0 1 1 0 d w mod reg r/m | 2 | 2 | | | | Register from Memory | 0 0 0 1 1 0 0 w mod reg r/m | 7** | 7** | b | h | | Memory from Register | 0 0 0 1 1 0 1 w mod reg r/m | 6* | 6* | ь | h | | Immediate from Register/Memory | 1 0 0 0 0 0 s w mod 0 1 1 r/m immediate data | 2/7** | 2/7** | b | h | | Immediate from Accumulator (short form) | 0 0 0 1 1 1 0 w immediate data | 2 | 2 | | | | DEC=Decrement | | | | | | | Register/Memory | 1 1 1 1 1 1 1 w reg 0 0 1 r/m | 2/6 | 2/6 | ь | h | | Register (short form) | 0 1 0 0 1 reg | 2 | 2 | | ļ | | CMP = Compare | | | | | | | Register with Register | 0 0 1 1 1 0 d w mod reg r/m | 2 | 2 | | | | Memory with Register | 0 0 1 1 1 0 0 w mod reg r/m | 5* | 5* | b | h | | Register with Memory | 0 0 1 1 1 0 1 w mod reg r/m | 6* | 6* | ь | h | | Immediate with Register/Memory | 1 0 0 0 0 0 s w mod 1 1 1 r/m immediate data | 2/5* | 2/5* | b | h | | Immediate with Accumulator (short form) | 0 0 1 1 1 1 0 w immediate data | 2 | 2 | | | | NEG = Change Sign | 1 1 1 1 0 1 1 w mod 0 1 1 r/m | 2/6* | 2/6* | b | h | | AAA = ASCII Adjust for Add | 0 0 1 1 0 1 1 1 | 4 | 4 | | | <sup>\*</sup> If CPL≤IOPL \*\* If CPL>IOPL | | | Clock | Count | | tes | |--------------------------------------------|---------------------------------------------|--------------------------------------------------|-----------------------------------------|--------------------------------------------------|----------------------------------------| | Instruction | Format | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protecte<br>Virtual<br>Address<br>Mode | | AAS = ASCII Adjust for Subtract | 0 0 1 1 1 1 1 1 | 4 | 4 | | | | DAA = Decimal Adjust for Add | 00100111 | 4 | 4 | | | | DAS = Decimal Adjust for Subtract | 00101111 | 4 | 4 | | | | MUL = Multiply (unsigned) | | | | | | | Accumulator with Register Memory | 1 1 1 1 0 1 1 w mod 1 0 0 r/m | | } | | | | Multiplier - Byte | | 12-17/15-20* | | b, d | d, h | | -Word<br>-Doubleword | | 12-25/15-28*<br>12-41/17-46* | | b, d<br>b, d | d, h<br>d, h | | IMUL = Integer Multiply (signed) | | | | | | | Accumulator with Register Memory | 1 1 1 1 0 1 1 w mod 1 0 1 r/m | | | | | | Multiplier – Byte | | 12-17/15-20* | | b, d | d, h | | – Word<br>– Doubleword | | 12-25/15-28*<br>12-41/17-46* | | b, d<br>b, d | d, h<br>d, h | | Register with Register/Memory | 0 0 0 0 1 1 1 1 1 0 1 0 1 1 1 1 mod reg r/m | | | | | | Multiplier – Byte | | 12-17/15-20* | 2-17/15-20* | b, d | d, h | | Word<br>Doubleword | | 12-25/15-28*<br>12-41/17-46* | | b, d<br>b, d | d, h<br>d, h | | Register/Memory with Immediate to Register | 0 1 1 0 1 0 s 1 mod reg r/m immediate data | 12-4//// | 2 4,11,1 | 0,0 | 0,11 | | -Word<br>-Doubleword | | 13–26<br>13–42 | 13-26/14-27<br>13-42/16-45 | b, d<br>b, d | d, h<br>d, h | | DIV = Divide (unsigned) | | 1,0 42 | 10 42 10 40 | 0,0 | G, | | Accumulator by Register/Memory | 1 1 1 1 0 1 1 w mod 1 1 0 r/m | | | | | | Divisor - Byte | | 14/17 | 14/17 | b, e | e, h | | – Word<br>– Doubleword | | 22/25<br>38/43 | 22/25<br>38/43 | b, e<br>b, e | e, h<br>e, h | | IDIV = Integer Divide (signed) | | | | | | | Accumulator by Register/Memory | 1 1 1 1 0 1 1 w mod 1 1 1 r/m | | | | | | Divisor - Byte | | 19/22 | 19/22 | b, e | e, h | | <ul><li>Word</li><li>Doubleword</li></ul> | | 27/30<br>43/48 | 27/30<br>43/48 | b, e<br>b, e | e, h<br>e, h | | AAD = ASCII Adjust for Divide | 11010101 00001010 | 19 | 19 | | | | AAM = ASCII Adjust for Multiply | 1 1 0 1 0 1 0 0 0 0 0 0 1 0 1 0 | 17 | 17 | | | | CBW=Convert Byte to Word | 1 0 0 1 1 0 0 0 | 3 | 3 | | | | CWD = Convert Word to Double Word | 1 0 0 1 1 0 0 1 | 2 | 2 | | | | LOGIC | | | | | | | Shift/Rotate Instruction | | | | | | | Not Through Carry (ROL, ROR, SAL, SAR, SH | IL, and SHR) | | | | | | Register/Memory by 1 | 1 1 0 1 0 0 0 w mod TTT r/m | 3/7** | 3/7** | ь | h | <sup>\*</sup> If CPL ≤ IOPL \*\* If CPL > IOPL | | | | | Clock | Count | No | tes | |---------------------------------------------|----------------------------------------|---------------------------------------------|-----------------|--------------------------------------------------|-----------------------------------------|--------------------------------------------------|-----------------------------------------| | Instruction | Format | | | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | | LOGIC (continued) | | | | | | | | | Not Through Carry (ROL, ROR, SAL, SAR, S | HL, and SHR) –(cont | inued) | | | | | | | Register/Memory by CL | 1 1 0 1 0 0 1 w | mod TTT r/m | | 3/7* | 3/7* | b | h | | Register/Memory by Immediate Count | 1 1 0 0 0 0 0 w | mod TTT r/m | (1) | 3/7* | 3/7* | b | h | | Through Carry (RCL and RCR) | | | | | | | | | Register/Memory by 1 | 1 1 0 1 0 0 0 w | mod TTT r/m | | 9/10* | 9/10* | b | h | | Register/Memory by CL | 1 1 0 1 0 0 1 w | mod TTT r/m | | 9/10* | 9/10* | ь | h | | Register/Memory by Immediate Count | 1 1 0 0 0 0 0 w | mod TTT r/m | (1) | 9/10* | 9/10* | ь | h | | SHLD = Shift Left Double | 000<br>001<br>010<br>011<br>100<br>101 | Instruction ROL ROR RCL RCR SHL/SAL SHR SAR | | | | | | | Register/Memory by Immediate | 00001111 | 10100100 | mod reg r/m (1) | 3/7** | 3/7** | | | | Register/Memory by CL | 00001111 | 10100101 | mod reg r/m | 3/7** | 3/7** | | | | SHRD=Shift Right Double | | | | | | | | | Register/Memory by Immediate | 00001111 | 10101100 | mod reg r/m (1) | 3/7** | 3/7** | | | | Register/Memory by CL | 00001111 | 10101101 | mod reg r/m | 3/7** | 3/7** | | | | AND=And | | | | | | | | | Register to Register | 0 0 1 0 0 0 d w | mod reg r/m | | 2 | 2 | | 1 | | Register to Memory | 0 0 1 0 0 0 0 w | mod reg r/m | | 7** | 7 <b>**</b> | b | h | | Memory to Register | 0 0 1 0 0 0 1 w | mod reg r/m | | 6 <b>*</b> | 6* | ь | h | | Immediate to Register/Memory | 1 0 0 0 0 0 0 w | mod 1 0 0 r/m | immediate data | 2/7* | 2/7** | ь | h | | Immediate to Accumulator (short form) | 0 0 1 0 0 1 0 w | immediate data | | 2 | 2 | | | | TEST = And Function to Flags, No Result | | | | , | | | | | Register/Memory and Register | 1 0 0 0 0 1 0 w | mod reg r/m | | 2/5* | 2/5* | b | h | | Immediate Data and Register/Memory | 1 1 1 1 0 1 1 w | mod 0 0 0 r/m | immediate data | 2/5* | 2/5* | ь | h | | Immediate Data and Accumulator (short form) | 1010100w | immediate data | | 2 | 2 | | | | OR = Or | | | | | | | | | Register to Register | 0 0 0 0 1 0 d w | mod reg r/m | | 2 | 2 | | | | Register to Memory | 0 0 0 0 1 0 0 w | mod reg r/m | | 7** | 7** | Ь. | h | | Memory to Register | 0 0 0 0 1 0 1 w | mod reg r/m | | 6* | 6* | ь | h | <sup>\*</sup> If CPL≤IOPL \*\* If CPL>IOPL (1) Immediate 8-Bit Data | | | Clock | Count | Notes | | |----------------------------------------------|----------------------------------------------|--------------------------------------------------|-----------------------------------------|--------------------------------------------------|-----------------------------------------| | Instruction | Format | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | | LOGIC (continued) | | | | | | | Immediate to Register/Memory | 1 0 0 0 0 0 0 w mod 0 0 1 r/m immediate data | 2/7** | 2/7** | b | h | | Immediate to Accumulator (short form) | 0 0 0 0 1 1 0 w immediate data | 2 | 2 | | | | XOR = Exclusive Or | | | 1 | | | | Register to Register | 0 0 1 1 0 0 d w mod reg r/m | 2 | 2 | | | | Register to Memory | 0 0 1 1 0 0 0 w mod reg r/m | 7** | 7** | b | h | | Memory to Register | 0 0 1 1 0 0 1 w mod reg r/m | 6* | 6 <b>*</b> | ь | h | | Immediate to Register/Memory | 1 0 0 0 0 0 0 w mod 1 1 0 r/m immediate data | 2/7** | 2/7** | ь | h | | Immediate to Accumulator (short form) | 0 0 1 1 0 1 0 w immediate data | 2 | 2 | | | | NOT=Invert Register/Memory | 1 1 1 1 0 1 1 w mod 0 1 0 r/m | 2/6** | 2/6** | b | h | | STRING MANIPULATION | Clock Coun | | <b>.</b><br><b>.</b> | | | | CMPS = Compare Byte/Word | 1 0 1 0 0 1 1 w Virtual 8086 Mode | 10* | 10* | b | h | | INS=Input Byte/Word from DX Port | 0 1 1 0 1 1 0 w | 15 | 9*/29** | ь | s/t, h, m | | LODS = Load Byte/Word to AL/AX/EAX | 1 0 1 0 1 1 0 w | 5 | 5* | b | h | | MOVE = Move Byte/Word | 1 0 1 0 0 1 0 w | 7 | 7** | b | h | | OUTS = Output Byte/Word to DX Port | 0 1 1 0 1 1 1 W 28*** | 14 | 8*/28** | b | s/t, h, m | | SCAS = Scan Byte/Word | 1 0 1 0 1 1 1 w | 7* | 7* | ь | h | | STOS = Store Byte/Word from AL/AX/EX | 1 0 1 0 1 0 1 W | 4* | 4* | b | h | | XLAT = Translate String | 1 1 0 1 0 1 1 1 | 5* | 5* | | h | | REPEATED STRING MANIPULATION | | | | | | | Repeated by Count in CX or ECX | | | | | | | REPE CMPS = Compare String (Find non-match) | 1 1 1 1 0 0 1 1 1 0 1 0 0 1 1 w | 5+9n** | 5+9n** | ь | h | | REPNE CMPS = Compare String (Find match) | 1 1 1 1 0 0 1 0 1 0 1 0 0 1 1 w | 5+9n** | 5+9n** | b | h | | REP INS = Input String | 1 1 1 1 0 0 1 0 0 1 1 0 1 1 0 w | 13+6n* | 7+6n*/<br>27+6n** | 6 | s/t, h, m | | REP LODS=Load String | 1 1 1 1 0 0 1 0 1 0 1 0 1 1 0 w | 5+6n* | 5+6n* | ь | h | | REP MOVS = Move String | 1 1 1 1 0 0 1 0 1 0 1 0 0 1 0 w | 7+4n* | 7+4n* | b | h | | REP OUTS = Output String | 1 1 1 1 0 0 1 0 0 1 1 0 1 1 1 w | 12+5n* | 6+5n*/<br>26+5n** | b | s/t, h, m | | REPE SCAS = Scan String (Find non-AL/AX/EAX) | 1 1 1 1 0 0 1 1 1 0 1 0 1 1 1 w | 5+8n* | 5 + 8n* | b | h | | REPNE SCAS = Scan String (Find AL/AX/EAX) | 1 1 1 1 0 0 1 0 1 0 1 0 1 1 1 w | 5+8n* | 5 + 8n* | b | h | | REP STOS = Store String | 1 1 1 1 0 0 1 0 1 0 1 0 1 0 1 w | 5 + 5n* | 5 + 5n* | b | h | | | | | | | | | | | | | | <u> </u> | <sup>\*</sup> If CPL≤IOPL \*\* If CPL>IOPL <sup>\*\*\*</sup>Clock count shown applies if I/O permission allows I/O to the port in Virtual 8086 Mode. If I/O bit map denies permission Exception 13 fault occurs; refer to clock counts for INT3 instruction. | | , (00.000 | Cissis | Count | Notes | | |-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------|-------------------------------------|--------------------------------------------------------------------|-------------------------------------|--------------------------------------------------------------------------------------------------------------| | | _ | Real<br>Address<br>Mode/<br>Virtual | Protected<br>Virtual<br>Address | Real<br>Address<br>Mode/<br>Virtual | Protected<br>Virtual<br>Address | | Instruction | Format | 8086 Mode | Mode | 8086 Mode | Mode | | BIT MANIPULATION | | ĺ | į | | | | BSF=Scan Bit Forward | 0 0 0 0 1 1 1 1 1 0 1 1 1 1 0 0 mod reg r/m (1) | 10+3n* | 10+3n* | b | h | | BSR=Scan Bit Reverse | 0 0 0 0 1 1 1 1 1 0 1 1 1 0 1 mod reg r/m | 10+3n* | 10+3n* | b | h | | BT = Test Bit | | | | į | | | Register/Memory, Immediate | 0 0 0 0 1 1 1 1 1 0 1 1 1 0 1 0 mod 1 0 0 r/m (1) | 3/6* | 3/6* | b | h | | Register/Memory, Register | 0 0 0 0 1 1 1 1 1 0 1 0 0 0 1 1 mod reg r/m | 3/12* | 3/12* | ь | h | | BTC = Test Bit and Complement | | | | | | | Register/Memory, Immediate | 0 0 0 0 1 1 1 1 1 0 1 1 1 0 1 0 mod 1 1 1 r/m (1) | 6/8* | 6/8* | ь | h | | Register/Memory, Register | 0 0 0 0 1 1 1 1 1 0 1 1 1 0 1 1 mod reg r/m | 6/13* | 6/13* | b | h | | BTR=Test Bit and Reset | | | | | | | Register/Memory, Immediate | 0 0 0 0 1 1 1 1 1 0 1 1 1 0 1 0 mod 1 1 0 r/m (1) | 6/8* | 6/8* | ь | h | | Register/Memory, Register | 0 0 0 0 1 1 1 1 1 0 1 1 0 0 1 1 mod reg r/m | 6/13* | 6/13* | b | h | | BTS = Test Bit and Set | | | | | | | Register/Memory, Immediate | 0 0 0 0 1 1 1 1 1 0 1 1 1 0 1 0 mod 1 0 1 r/m (1) | 6/8* | 6/8* | ь | h | | Register/Memory, Register | 0 0 0 0 1 1 1 1 1 0 1 0 1 0 1 1 mod reg r/m | 6/13 <b>*</b> | 6/13* | b | h | | CONTROL TRANSFER | | | | | | | CALL = Call | | : | | | | | Direct Within Segment | 1 1 1 0 1 0 0 0 full displacement | 7 + m* | 9+m* | b | r | | Register/Memory<br>Indirect Within Segment | 1 1 1 1 1 1 1 1 mod 0 1 0 r/m | 7+m*/<br>10+m* | 9+m*/<br>12+m* | ь | h, r | | Direct Intersegment | 1 0 0 1 1 0 1 0 unsigned full offset, selector | 17+m* | 42+m* | b | j, k, r | | Protected Mode Only (Direct Intersegmer Via Call Gate to Same Privilege Level Via Call Gate to Different Privilege Level Via Call Gate to Different Privilege Level From 80286 Task to 80286 TSS From 80286 Task to Am386SX/SXL CFTOM 80286 Task to Virtual 8086 Task From Am386SX/SXL CPU Task to AmFrom Am386SX/SXL CPU Task to AmFrom Am386SX/SXL CPU Task to AmFrom Am386SX/SXL CPU Task to Virtual Rosk to Virtual Rosk to Virtual Rosk to AmFrom Am386SX/SXL CPU Task to AmFrom Am386SX/SXL CPU Task to Vir | rel (No Parameters) rel (x Parameters) PU TSS x (Am386SX/SXL CPU TSS) 286 TSS | | 64+m<br>98+m<br>106+8x+m<br>285<br>310<br>229<br>285<br>392<br>309 | | h, j, k, r<br>h, r | | Indirect Intersegment | 1 1 1 1 1 1 1 1 mod 0 1 1 r/m | 30+m | 46+m | b | h, j, k, r | | Protected Mode Only (Indirect Intersegme<br>Via Call Gate to Same Privilege Level<br>Via Call Gate to Different Privilege Lev<br>Via Call Gate to Different Privilege Lev<br>From 80286 Task to 80286 TSS<br>From 80286 Task to Am386SX/SXL<br>From Am386SX/SXL CPU Task to 80:<br>From Am386SX/SXL CPU Task to 80: | ent) vel (No Parameters) vel (x Parameters) PU TSS x (Am386SX/SXL CPU TSS) 286 TSS | | 68+m<br>102+m<br>110+8x+m | | h, j, k, r<br>h, j, k, r<br>h, j, k, r<br>h, j, k, r<br>h, j, k, r<br>h, j, k, r | | | tual 8086 Task (Am386SX/SXL CPU TSS) | | 399 | | h, j, k, r<br>h, j, k, r | <sup>\*</sup> If CPL≤IOPL \*\* If CPL>IOPL <sup>(1)</sup> Immediate 8-bit data | | | Clock | Count | ļ | tes | |-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------|--------------------------------------------------|-----------------------------------------|--------------------------------------------------|----------------------------------------------------------------------------------| | Instruction | Format | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | | CONTROL TRANSFER (continued) | | | | | | | JMP = Unconditional Jump | | | | | | | Short | 1 1 1 0 1 0 1 1 8-bit displacement | 7 + m | 7 + m | | r | | Direct within Segment | 1 1 0 1 0 0 0 1 full displacement | 7+m | 7 + m | | r | | Register/Memory<br>Indirect Within Segment | 1 1 1 1 1 1 1 1 mod 1 0 0 r/m | 9+m/<br>14+m | 9+m/<br>14+m | ь | h, r | | Direct Intersegment | 1 1 1 0 1 0 1 0 unsigned full offset, selector | 16+m | 31 + m | | j, k, r | | Protected Mode Only (Direct Intersegment) Via Call Gate to Same Privilege Level From 80286 Task to 80286 TSS From 80286 Task to Am386SX/SXL CPU From 80286 Task to Virtual 8086 Task (Ar From Am386SX/SXL CPU Task to 80286 From Am386SX/SXL CPU Task to Am386 From Am386SX/SXL CPU Task to Virtual | n386SX/SXL CPU TSS)<br>TSS<br>SX/SXL CPU TSS | 17+m | 53 + m<br>395<br>31 + m | b | h, j, k, r<br>h, j, k, r<br>h, j, k, r<br>h, j, k, r<br>h, j, k, r<br>h, j, k, r | | Indirect Intersegment | 1 1 1 1 1 1 1 1 mod 1 0 1 r/m | 17+111 | 31+111 | " | h, j, k, r | | Protected Mode Only (Indirect Intersegment) Via Call Gate to Same Privilege Level From 80286 Task to 80286 TSS From 80286 Task to Am386SX/SXL CPU From 80286 Task to Virtual 8086 Task (Ar From Am386SX/SXL CPU Task to 80286 | n386SX/SXL CPU TSS)<br>TSS | | 49 + m<br>328 | | h, j, k, r<br>h, j, k, r<br>h, j, k, r<br>h, j, k, r<br>h, j, k, r<br>h, j, k, r | | From Am386SX/SXL CPU Task to Am386<br>From Am386SX/SXL CPU Task to Virtual<br>RET = Return from Call | | | 12 · m | ь | h, j, k, r | | Within Segment | | | 12+m | l | g, h, r | | Within Segment Adding Immediate to SP | 1 1 0 0 0 0 1 1 1 1 1 0 0 0 0 1 0 16-bit displacement | | 12+m<br>36+m | b | g, h, r | | Intersegment | 1 1 0 0 1 0 1 1 | | 36+m | ь | g, h, j, k, r | | Intersegment Adding Immediate to SP | 1 1 0 0 1 0 1 0 1 16-bit displacement | | 30+111 | " | g, h, j, k, r | | Protected Mode Only (RET): to Different Privil<br>Intersegment<br>Intersegment Adding Immediate to SP<br>CONDITIONAL JUMPS (Note: Times are Ju | ege Level | | 72<br>72 | | h, j, k, r<br>h, j, k, r | | JO = Jump on Overflow | | 7+m or 3 | 7 + m or 3 | | , | | 8-bit Displacement | | | 7 + m or 3 | | ,<br>, | | Full Displacement | 0 1 1 1 0 0 0 0 8-bit displacement | | | | ' | | JNO = Jump on Not Overflow | 0 0 0 0 1 1 1 1 1 0 0 0 0 0 0 0 0 full displacement | 7+m or 3 | 7 + m or 3 | | r | | 8-bit Displacement | | 7+m or 3 | | | r | | Full Displacement | 0 1 1 1 0 0 0 1 8-bit displacement | 7 +111 01 3 | 7 +111 01 0 | | ' | | | 0 0 0 0 1 1 1 1 1 0 0 0 0 0 0 1 full displacement | | | | | <sup>\*</sup> If CPL≤IOPL \*\* If CPL>IOPL | | | 1 | Clock C | Count | Not | otes | |----------------------------------------|---------------------------------------------|-----------|--------------------------------------------------|-----------------------------------------|--------------------------------------------------|-----------------------------------------| | Instruction | Format | | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | | CONDITIONAL JUMPS (continued) | | | | | | | | JB/JNAE = Jump on Below/Not Above or B | qual | | | | | | | 8-bit Displacement | 0 1 1 1 0 0 1 0 8-bit displacement | | 7 + m or 3 | 7 + m or 3 | | r | | Full Displacement | 0 0 0 0 1 1 1 1 1 0 0 0 0 0 1 0 full disp | placement | 7 + m or 3 | 7 + m or 3 | | r | | JNB/JAE = Jump on Not Below/Above or E | qual | | | | | | | 8-bit Displacement | 0 1 1 1 0 0 1 1 8-bit displacement | | 7 + m or 3 | 7 + m or 3 | | r | | Full Displacement | 0 0 0 0 1 1 1 1 1 0 0 0 0 0 1 1 full disp | placement | 7 + m or 3 | 7 + m or 3 | | r | | JE/JZ≔Jump on Equal/Zero | | | | | | | | 8-bit Displacement | 0 1 1 1 0 1 0 0 8-bit displacement | | 7 + m or 3 | 7 + m or 3 | | r | | Full Displacement | 0 0 0 0 1 1 1 1 1 0 0 0 0 1 0 0 full disp | placement | 7 + m or 3 | 7 + m or 3 | | r | | JNE/JNZ = Jump on Not Equal/Not Zero | | | | | | | | 8-bit Displacement | 0 1 1 1 0 1 0 1 8-bit displacement | | 7 + m or 3 | 7 + m or 3 | | r | | Full Displacement | 0 0 0 0 1 1 1 1 1 0 0 0 0 1 0 1 full disp | placement | 7 + m or 3 | 7 + m or 3 | | r | | JBE/JNA = Jump on Below or Equal/Not A | bove | | | | | | | 8-bit Displacement | 0 1 1 1 0 1 1 0 8-bit displacement | | 7 + m or 3 | 7 + m or 3 | | r | | Full Displacement | 0 0 0 0 1 1 1 1 1 0 0 0 0 1 1 0 full disp | placement | 7 + m or 3 | 7 + m or 3 | | r | | JNBE/JA = Jump on Not Below or Equal/A | bove | | | | | | | 8-bit Displacement | 0 1 1 1 0 1 1 1 8-bit displacement | | 7 + m or 3 | 7 + m or 3 | | r | | Full Displacement | 0 0 0 0 1 1 1 1 1 0 0 0 0 1 1 1 full disp | placement | 7 + m or 3 | 7 + m or 3 | | r | | JS=Jump on Sign | | | | | | | | 8-bit Displacement | 0 1 1 1 1 0 0 0 8-bit displacement | | 7 + m or 3 | 7 + m or 3 | | r | | Full Displacement | 0 0 0 0 1 1 1 1 1 0 0 0 1 0 0 0 full disp | placement | 7 + m or 3 | 7 + m or 3 | | r | | JNS = Jump on Not Sign | | | | | | | | 8-bit Displacement | 0 1 1 1 1 0 0 1 8-bit displacement | | 7 + m or 3 | 7 + m or 3 | | r | | Full Displacement | 0 0 0 0 1 1 1 1 1 1 0 0 0 1 0 0 1 full disp | placement | 7 + m or 3 | 7 + m or 3 | | r | | JP/JPE = Jump on Parity/Parity Even | | | | | | | | 8-bit Displacement | 0 1 1 1 1 0 1 0 8-bit displacement | | 7 + m or 3 | 7 + m or 3 | | r | | Full Displacement | 0 0 0 0 1 1 1 1 1 1 0 0 0 1 0 1 0 full disp | placement | 7 + m or 3 | 7 + m or 3 | | r | | JNP/JPO=Jump on Not Parity/Parity Odd | | | | | | | | 8-bit Displacement | 0 1 1 1 1 0 1 1 8-bit displacement | | 7 + m or 3 | 7 + m or 3 | | r | | Full Displacement | 0 0 0 0 1 1 1 1 1 0 0 0 1 0 1 1 full disp | placement | 7 + m or 3 | 7 + m or 3 | | r | | | | | | | | | | | | | | | i | L | | | | | Clock Count | | es | |-----------------------------------------|---------------------------------------------------|--------------------------------------------------|-----------------------------------------|--------------------------------------------------|-----------------------------------------| | Instruction | Format | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | | CONDITIONAL JUMPS (continued) | | | | | | | JL/JNGE = Jump on Less/Not Greater or E | qual | | | | | | 8-bit Displacement | 0 1 1 1 1 1 0 0 8-bit displacement | 7+m or 3 | 7+m or 3 | | r | | Full Displacement | 0 0 0 0 1 1 1 1 1 0 0 0 1 1 0 0 full displacement | 7 + m or 3 | 7 + m or 3 | | r | | JNL/JGE = Jump on Net Less/Greater or E | iqual | | | | | | 8-bit Displacement | 0 1 1 1 1 0 1 8-bit displacement | 7 + m or 3 | 7 + m or 3 | | r | | Full Displacement | 0 0 0 0 1 1 1 1 1 0 0 0 1 1 0 1 full displacement | 7 + m or 3 | 7 + m or 3 | | r | | JLE/JNG = Jump on Less or Equal/Not Gr | eater | | | | | | 8-bit Displacement | 0 1 1 1 1 1 0 8-bit displacement | 7 + m or 3 | 7 + m or 3 | | r | | Full Displacement | 0 0 0 0 1 1 1 1 1 0 0 0 1 1 1 0 full displacement | 7 + m or 3 | 7 + m or 3 | | r | | JNLE/JG = Jump on Not Less or Equal/Gr | eater | | | | | | 8-bit Displacement | 0 1 1 1 1 1 1 1 8-bit displacement | 7 + m or 3 | 7 + m or 3 | | r | | Full Displacement | 0 0 0 0 1 1 1 1 1 0 0 0 1 1 1 1 full displacement | 7+m or 3 | 7 + m or 3 | | r | | JCXZ=Jump on CX Zero* | 1 1 1 0 0 0 1 1 8-bit displacement | 9+m or 5 | 9+m or 5 | | r | | JECXZ = Jump on ECX Zero | 1 1 1 0 0 0 1 1 8-bit displacement | 9+m or 5 | 9+m or 5 | | r | | LOOP = Loop CX Times | 1 1 1 0 0 0 1 0 8-bit displacement | 11 + m | 11 + m | | r | | LOOPZ/LOOPE = Loop with Zero/Equal | 1 1 1 0 0 0 0 1 8-bit displacement | 11+m | 11 + m | | r | | LOOPNZ/LOOPNE = Loop while Not Zero | 1 1 1 0 0 0 0 0 8-bit displacement | 11+m | 11 + m | | r | | CONDITIONAL BYTE SET (Note: Times A | re Register/Memory) | | | | | | SETO=Set Byte on Overflow | | | | | | | To Register/Memory | 0 0 0 0 1 1 1 1 1 0 0 1 0 0 0 0 mod 0 0 0 r/m | 4/5* | 4/5* | | h | | SETNO=Set Byte on Not Overflow | | | | | | | To Register/Memory | 0 0 0 0 1 1 1 1 1 0 0 1 0 0 0 1 mod 0 0 0 r/m | 4/5* | 4/5* | | h | | SETB/SETNAE = Set Byte on Below/Not A | bove or Equal | | | | | | To Register/Memory | 0 0 0 0 1 1 1 1 1 0 0 1 0 0 1 0 mod 0 0 0 r/m | 4/5* | 4/5* | | h | | SETNB = Set Byte on Not Below/Above or | Equal | | | | | | To Register/Memory | 0 0 0 0 1 1 1 1 1 0 0 1 0 0 1 1 mod 0 0 0 r/m | 4/5* | 4/5* | | h | | SETE/SETZ = Set Byte on Equal/Zero | | | | | | | To Register/Memory | 0 0 0 0 1 1 1 1 1 0 0 1 0 1 0 0 mod 0 0 0 r/m | 4/5* | 4/5* | | h | | SETNE/SETNZ = Set Byte on Not Equal/No | ot Zero | | | | | | To Register/Memory | 0 0 0 0 1 1 1 1 1 0 0 1 0 1 0 1 mod 0 0 0 r/m | 4/5* | 4/5* | | h | | | | | | | <u>L</u> | <sup>\*</sup> Address Size Prefix differentiates JCXZ from JECXZ | | | Clock | Count | Not | tes | |------------------------------------------|--------------------------------------------------|--------------------------------------------------|-----------------------------------------|--------------------------------------------------|-----------------------------------------| | Instruction | Format | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | | CONDITIONAL BYTE SET (continued) | | | | | | | SETBE/SETNA = Set Byte on Below or Equ | al/Not Above | | | | | | To Register/Memory | 0 0 0 0 1 1 1 1 1 0 0 1 0 1 1 0 mod 0 0 0 r/m | 4/5* | 4/5* | | h | | SETNBE/SETA = Set Byte on Not Below or | Equal/Above | | | | | | To Register/Memory | 0 0 0 0 1 1 1 1 1 0 0 1 0 1 1 1 mod 0 0 0 r/m | 4/5* | 4/5* | | h | | SETS=Set Byte on Sign | | | | | | | To Register/Memory | 0 0 0 0 1 1 1 1 1 1 0 0 1 1 0 0 0 mod 0 0 0 r/m | 4/5* | 4/5* | | h | | SETNS = Set Byte on Not Sign | | | | | | | To Register/Memory | 0 0 0 0 1 1 1 1 1 0 0 1 1 0 0 1 mod 0 0 0 r/m | 4/5* | 4/5* | | h | | SETP/SETPE = Set Byte on Parity/Parity E | ven | | | | | | To Register/Memory | 0 0 0 0 1 1 1 1 1 1 0 0 1 1 0 1 0 mod 0 0 0 r/m | 4/5* | 4/5* | | h | | SETNP/SETPO=Set Byte on Not Parity/Pa | rity Odd | | | | | | To Register/Memory | 0 0 0 0 1 1 1 1 1 1 0 0 1 1 0 1 1 mod 0 0 0 r/m | 4/5* | 4/5* | | h | | SETL/SETNGE = Set Byte on Less/Not Gre | ater or Equal | | | | | | To Register/Memory | 0 0 0 0 1 1 1 1 1 1 0 0 1 1 1 0 0 mod 0 0 0 r/m | 4/5* | 4/5* | | h | | SETNL/SETGE = Set Byte on Not Less/Gre | <u> </u> | | | | | | To Register/Memory | 0 0 0 0 1 1 1 1 0 1 1 1 1 0 1 mod 0 0 0 r/m | 4/5* | 4/5* | : | h | | SETLE/SETNG = Set Byte on Less or Equa | | | | | | | To Register/Memory | 0 0 0 0 1 1 1 1 1 1 0 0 1 1 1 1 0 mod 0 0 0 r/m | 4/5* | 4/5* | | h | | SETNLE/SETG=Set Byte on Not Less or E | | | | | | | To Register/Memory | 0 0 0 0 1 1 1 1 1 1 0 0 1 1 1 1 1 mod 0 0 0 r/m | 4/5* | 4/5* | | h | | ENTER = Enter Procedure | 1 1 0 0 1 0 0 0 16-bit displacement, 8-bit level | | | | | | L=0 | | 10 | 10 | b | h | | L=1<br>L>1 | | 14<br>17+8(n-1) | 14<br>17+8 (n – 1) | b<br>b | h<br>h | | LEAVE = Leave Procedure | [11001001] | | | | | | INTERRUPT INSTRUCTIONS | 1 1 0 0 1 0 0 1 | 4 | 4 | b | h | | INT = interrupt: | | | | | | | Type Specified | 11001101 190 | 07 | | | | | Туре 3 | 1 1 0 0 1 1 0 1 type | 37 | | b | | | INTO=Interrupt 4 if Overflow Flag Set | 11001100 | 33 | | b | | | If OF = 1 If OF = 0 | 11001110 | 35 | | b, e | | | II OF=U | | 3 | 3 | b, e | | | | | | | | | <sup>\*</sup> If CPL≤IOPL \*\* If CPL>IOPL | | | | | Notes | | |--------------------------------------------------------------------------|--------------------------------------------|--------------------------------------------------|-----------------------------------------|--------------------------------------------------|-----------------------------------------| | Instruction | Format | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | | INTERRUPT INSTRUCTIONS (contin | ued) | | | | | | INT = Interrupt: | | | | | | | Type Specified | | | ľ | | | | Туре 3 | | | | | | | Bound = Interrupt 5 if Detected Value | e Out of Range 0 1 1 0 0 0 1 0 mod reg r/m | | | | | | If Out of Range | | 44 | | b, e | e,g,h,j,k,r | | If In Range | | 10 | 10 | b, e | e,g,h,j,k,r | | Protected Mode Only (INT) | | | | | | | INT: Type Specified | | | | | ļ | | Via Interrupt or Trap Gate to Same | Privilege Level | | | | | | Via Interrupt or Trap Gate to Differ | - | 1 | 71 | | g, j, k, r | | From 80286 Task to 80286 TSS vi | | | 111 | | g, j, k, r | | From 80286 Task to Am386SX/SX | | | 438 | | g, j, k, r | | From 80286 Task to Virtual 8086 N | | | 465 | ŀ | g, j, k, r | | From Am386SX/SXL CPU Task to | | | 382 | : | g, j, k, r | | | Am386SX/SXL CPU TSS via Task Gate | | 440 | | g, j, k, r | | From Am386SX/SXL CPU Task to | | | 467 | , | g, j, k, r | | From Virtual 8086 Mode to 80286 | | | 384 | ŀ | g, j, k, r | | From Virtual 8086 Mode to Am386 | | 1 | 445<br>472 | | g, j, k, r | | From Virtual 8086 Mode to Privileg | e Level 0 via Trap Gate or Interrupt Gate | | 275 | | g, j, k, r<br>g, j, k, r | | INT: Type 3 | | | "" | | 9,,,.,, | | Via Interrupt or Trap Gate to Same | Privilege Level | | | | l | | Via Interrupt or Trap Gate to Differe | <u> </u> | | 71 | | g, j, k, r | | From 80286 Task to 80286 TSS vi | • | | 111 | | g, j, k, r | | From 80286 Task to Am386SX/SX | L CPU TSS via Task Gate | 1 | 382 | | g, j, k, r | | From 80286 Task to Virtual 8086 N | Mode via Task Gate | | 409 | | g, j, k, r | | From Am386SX/SXL CPU Task to | | | 326 | | g, j, k, r | | From Am386SX/SXL CPU Task to | Am386SX/SXL CPU TSS via Task Gate | | 384 | • | g, j, k, r | | From Am386SX/SXL CPU Task to | Virtual 8086 Mode via Task Gate | | 411 | | g, j, k, r | | From Virtual 8086 Mode to 80286 | TSS via Task Gate | | 328 | | g, j, k, r | | From Virtual 8086 Mode to Am386 | SX/SXL CPU TSS via Task Gate | | 389 | | g, j, k, r | | From Virtual 8086 Mode to Privileg | e Level 0 via Trap Gate or Interrupt Gate | | 416 | | g, j, k, r | | INTO | | | 223 | | g, j, k, r | | Via Interrupt or Trap Gate to Same | Privilege Level | | | | | | Via Interrupt or Trap Gate to Differe | | 1 | 71 | | g, j, k, r | | From 80286 Task to 80286 TSS vi | 9 | | 111 | | g, j, k, r | | From 80286 Task to Am386SX/SX | | 1 | 384 | | g, j, k, r | | From 80286 Task to Virtual 8086 N | | | 411 | | g, j, k, r | | From Am386SX/SXL CPU Task to | | | 328 | | g, j, k, r | | | Am386SX/SXL CPU TSS via Task Gate | | Am386DX | | g, j, k, r | | From Am386SX/SXL CPU Task to | | | 413 | 1 | g, j, k, r | | From Virtual 8086 Mode to 80286 | TSS via Task Gate | | 329 | | g, j, k, r | | From Virtual 8086 Mode to Am386 | SX/SXL CPU TSS via Task Gate | | 391 | | g, j, k, r | | From Virtual 8086 Mode to Privileg | e Level 0 via Trap Gate or Interrupt Gate | | 418 | | g, j, k, r | | BOUND | | | 223 | | g, j, k, r | | Via Interrupt or Trap Gate to Same | Privilege Level | | | | | | | <u> </u> | | 71 | 1 | g, j, k, r | | Via interrupt or Trap Gate to Difference From 80286 Task to 80286 TSS vi | | | 111 | | g, j, k, r | | From 80286 Task to Am386SX/SX | | | 358 | 1 | g, j, k, r | | | L OI O 100 VIR 1RON GRID | | 388 | | g, j, k, r | | | | 1 | 1 | | 1 **** | | | | | Clock Count | | tes | |----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------|------------------------------------------------------------------|--------------------------------------------------|------------------------------------------------------------------------------------------------| | Instruction | Format | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | | INTERRUPT INSTRUCTIONS (continued | 3) | | | | | | BOUND (continued) From 80286 Task to Virtual 8086 Mod From Am386SX/SXL CPU Task to 802 From Am386SX/SXL CPU Task to Am From Am386SX/SXL CPU Task to Vir From Virtual 8086 Mode to 80286 TSS From Virtual 8086 Mode to Am386SX/ From Virtual 8086 Mode to Privilege L | 286 TSS via Task Gate<br>386SX/SXL CPU TSS via Task Gate<br>Jual 8086 Mode via Task Gate<br>5 via Task Gate<br>SXL CPU TSS via Task Gate | | 335<br>368<br>398<br>347<br>368<br>398<br>223 | | g, j, k, r<br>g, j, k, r<br>g, j, k, r<br>g, j, k, r<br>g, j, k, r<br>g, j, k, r<br>g, j, k, r | | INTERRUPT RETURN | | | | | | | IRET = Interrupt Return | 1 1 0 0 1 1 1 1 | 24 | | | g,h,j,k,r | | Protected Mode Only (IRET) Via Interrupt or Trap Gate to Same Privilege Level (within Task) Via Interrupt or Trap Gate to Different Privilege Level (within Task) From 80286 Task to 80286 TSS From 80286 Task to Am386SX/SXL CPU TSS From 80286 Task to Virtual 8086 Task From 80286 Task to Virtual 8086 Mode (within Task) From Am386SX/SXL CPU Task to Virtual 8086 TSS From Am386SX/SXL CPU Task to 80286 TSS From Am386SX/SXL CPU Task to Am386SX/SXL CPU TSS From Am386SX/SXL CPU Task to Virtual 8086 Mode (within Task) | | | 42<br>86<br>285<br>318<br>267<br>113<br>324<br>328<br>377<br>113 | | g,h,j,k,r<br>g,h,j,k,r<br>h,j,k,r<br>h,j,k,r<br>h,j,k,r<br>h,j,k,r<br>h,j,k,r<br>h,j,k,r | | PROCESSOR CONTROL | | | _ | | , | | HLT = Halt MOV = Move To and From Control/Debu | 1 1 1 1 0 1 0 0 | 5 | 5 | ;<br>} | | | CR0/CR2/CR3 from Register | 0 0 0 0 1 1 1 1 0 0 1 0 0 0 1 0 1 1 eee reg | 10/4/5 | 10/4/5 | | | | Register from CR3-CR0 | 0 0 0 0 1 1 1 1 0 0 1 0 0 0 0 0 1 1 eee reg | 5 | 6 | | 1 | | DR3-DR0 from Register | 0 0 0 0 1 1 1 1 0 0 1 0 0 0 1 1 1 1 eee reg | 22 | 22 | | ı | | DR7-DR6 from Register | 0 0 0 0 1 1 1 1 0 0 1 0 0 0 1 1 1 1 eee reg | 16 | 16 | | 1 | | Register from DR7–DR6 | 0 0 0 0 1 1 1 1 0 0 1 0 0 0 0 1 1 1 eee reg | 14 | 14 | | 1 | | Register from DR3-DR0 | 0 0 0 0 1 1 1 1 0 0 1 0 0 0 0 1 1 1 eee reg | 22 | 22 | | 1 | | TR7-TR6 from Register | 0 0 0 0 1 1 1 1 0 0 1 0 0 1 1 0 1 1 eee reg | 12 | 12 | | - 1 | | Register from TR7–TR6 | 0 0 0 0 1 1 1 1 0 0 1 0 0 1 0 0 1 1 eee reg | 12 | 12 | | 1 | | NOP = No Operation | 1 0 0 1 0 0 0 0 | 3 | 3 | | 1 | | WAIT=Wait until BUSY pin is negated | 1 0 0 1 1 0 1 1 | 6 | 6 | | 1 | | PROCESSOR EXTENSION INSTRUCTION | ons | | | | | | Processor Extension Escape | 1 1 0 1 1 T T T mod L L L r/m | See 387SX<br>Data sheet | | | | | PREFIX BYTES | TTT and LLL bits are op-code information for coprocessor. | for clock<br>counts | | | h | | | | 0001110 | | 4 | | | | | Clock | Count | No | tes | |--------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------|-----------------------------------------|--------------------------------------------------|-----------------------------------------| | Instruction | Format | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | | PREFIX BYTES (continued) | | | | | | | LOCK = Bus Lock Prefix | 1 1 1 1 0 0 0 0 | 0 | 0 | | m | | Operand Size Prefix | 0 1 1 0 0 1 1 0 | 0 | 0 | | | | Segment Override Prefix | | | | | | | CS | 00101110 | О | 0 | | | | DS | 0 0 1 1 1 1 1 0 | 0 | 0 | | | | ES | 0 0 1 0 0 1 1 0 | 0 | 0 | | 1 | | FS | 0 1 1 0 0 1 0 0 | 0 | 0 | | | | GS | 01100101 | 0 | 0 | | | | SS | 0 0 1 1 0 1 1 0 | 0 | 0 | | | | PROTECTION CONTROL | | | | | | | ARPL = Adjust Requested Privilege Level | | | | | | | From Register/Memory | 0 1 1 0 0 0 1 1 mod reg r/m | N/A | 20/21** | а | h | | LAR = Load Access Rights | | | | | | | From Register/Memory | 0 0 0 0 1 1 1 1 0 0 0 0 0 1 0 mod reg r/m | N/A | 15/16* | а | g, h, j, | | LGDT = Load Global Descriptor | | | | | | | Table Register | 0 0 0 0 1 1 1 1 0 0 0 0 0 0 1 mod 0 1 0 r/m | 11* | 11* | b, c | h, i | | LIDT = Load Interrupt Descriptor | | | | | | | Table Register | 0 0 0 0 1 1 1 1 0 0 0 0 0 0 1 mod 0 1 1 r/m | 11* | 11* | b, c | h, I | | LLDT = Load Local Descriptor | | | | | | | Table Register to Register/Memory | 0 0 0 0 1 1 1 1 0 0 0 0 0 0 0 0 mod 0 1 0 r/m | N/A | 20/24* | a | g, h, j, | | LMSW = Load Machine Status Word | | | | | | | From Register/Memory | 0 0 0 0 1 1 1 1 0 0 0 0 0 0 1 mod 1 1 0 r/m | 10/13 | 10/13* | b, c | h, I | | LSL = Load Segment Limit | | | | | | | From Register/Memory | 0 0 0 0 1 1 1 1 0 0 0 0 0 0 1 1 mod reg r/m | | | | | | Byte-Granular Limit<br>Page-Granular Limit | | N/A | 20/21* | a | g, h, j, į | | LTR=Load Task Register | | N/A | 25/26* | а | g, h, j, p | | From Register/Memory | 0 0 0 0 1 1 1 1 0 0 0 0 0 0 0 0 mod 0 0 1 r/m | N/A | 23/27* | a | g, h, j, | | SGDT = Store Global Descriptor | The state of s | 100 | 20,2, | | 9, 11, 1 | | Table Register | 0 0 0 0 1 1 1 1 0 0 0 0 0 0 1 mod 0 0 0 r/m | 9* | 9* | b, c | h | | SIDT = Store Interrupt Descriptor | o o o o o o o o o o o o o o o o o o o | | , | 0,0 | " | | | | | 64 | | | | Table Register | 0 0 0 0 1 1 1 1 0 0 0 0 0 0 1 mod 0 0 1 r/m | 9* | 9* | b, c | h | <sup>\*</sup> If CPL≤IOPL \*\* If CPL>IOPL | | | Clock | Count | No | ies | |----------------------------------------|-----------------------------------------------|--------------------------------------------------|-----------------------------------------|--------------------------------------------------|-----------------------------------------| | Instruction | Format | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | Real<br>Address<br>Mode/<br>Virtual<br>8086 Mode | Protected<br>Virtual<br>Address<br>Mode | | PROTECTION CONTROL (continued) | | | | | | | SLDT=Store Local Descriptor Table Regi | ster | | | | | | To Register/Memory | 0 0 0 0 1 1 1 1 0 0 0 0 0 0 0 0 mod 0 0 0 r/m | N/A | 2/2* | а | h | | SMSW = Store Machine Status Word | 0 0 0 0 1 1 1 1 0 0 0 0 0 0 1 mod 1 0 0 r/m | 2/2* | 2/2* | b, c | h, I | | STR=Store Task Register | | | | | | | To Register/Memory | 0 0 0 0 1 1 1 1 0 0 0 0 0 0 0 0 mod 0 0 1 r/m | N/A | 2/2* | а | h | | VERR = Verify Read Access | | | | | | | Register/Memory | 0 0 0 0 1 1 1 1 0 0 0 0 0 0 0 mod 1 0 0 r/m | N/A | 10/11* | a | g, h, j, p | | VERW = Verify Write Access | 0 0 0 0 1 1 1 1 0 0 0 0 0 0 0 mod 1 0 1 r/m | N/A | 15/16* | a | g, h, j, p | | * If CPL≤IOPL ** If CPL>IOPL | | | | | | #### Instruction Notes for Instruction Set Summary #### Notes a through c apply to Real Address Mode only: - a. This is a Protected Mode instruction. Attempted execution in Real Mode will result in Exception 6 (invalid op-code). - b. Exception 13 fault (general protection) will occur in Real Mode if an operand reference is made that partially or fully extends beyond the maximum CS, DS, ES, FS, or GS limit (FFFFH). Exception 12 fault (stack segment limit violation or not present) will occur in Real Mode if an operand reference is made that partially or fully extends beyond the maximum SS limit. - c. This instruction may be executed in Real Mode. In Real Mode, its purpose is primarily to initialize the CPU for Protected Mode. #### Notes d through g apply to Real Address Mode and Protected Virtual Address Mode: d. The Am386SX/SXL CPU uses an early-out multiply algorithm. The actual number of clocks depends on the position of the most significant bit in the operand (multiplier). Clock counts given are minimum to maximum. To calculate actual clocks use the following formula: Actual Clock = if m < > 0, then max ( $[log_2 | m]$ , 3) + b clocks; = if m = 0, then 3 + b clocks In this formula, m is the multiplier, and b = 9 for register to register; b = 12 for memory to register; b = 10 for register with immediate to register; - b = 11 for memory with immediate to register. - f. LOCK is automatically asserted, regardless of the presence or absence of the LOCK prefix. - g. LOCK is asserted during descriptor table accesses. #### Notes h through r apply to Protected Virtual Address Mode only: e. An exception may occur, depending on the value of the operand. - h. Exception 13 fault will occur if the memory operand in CS, DS, ES, FS, or GS cannot be used due to either a segment limit violation or an access rights violation. If a stack limit is violated, an Exception 12 occurs. - For segment load operations, the CPL, RPL, and DPL must agree with the privilege rules to avoid an Exception 13 fault. The segment's descriptor must indicate "present" or Exception 11 (CS, DS, ES, FS, GS not present). If the SS register is loaded and a stack segment not present is detected, an Exception 12 occurs. - All segment descriptor accesses in the GDT or LDT made by this instruction will automatically assert \(\overline{LOCK}\) to maintain descriptor integrity in multiprocessor systems. - JMP, CALL, INT, RET, and IRET instructions referring to another code segment will cause an Exception 13, if an applicable privilege rule is violated. - I. An Exception 13 fault occurs if CPL is greater than 0 (0 is the most privileged level). - m. An Exception 13 fault occurs if CPL is greater than IOPL. - n. The IF bit of the flag register is not updated if CPL is greater than IOPL. The IOPL and VM fields of the flag register are updated only if CPL = 0. - o. The PE bit of the MSW (CR0) cannot be reset by this instruction. Use MOV into CR0 if desiring to reset the PE bit. - p. Any violation of privilege rules as applied to the selector operand does not cause a protection exception; rather, the zero flag is cleared. - q. If the coprocessor's memory operand violates a segment limit or segment access rights, an Exception 13 fault will occur before the ESC instruction is executed. An Exception 12 fault will occur if the stack limit is violated by the operand's starting address. - r. The destination of a JMP, CALL, INT, RET, or IRET must be in the defined limit of a code segment or an Exception 13 fault will occur. - s/t. The instruction will execute in s clocks if CPL ≤ IOPL. If CPL > IOPL, the instruction will take t clock. ## **Instruction Encoding** #### Overview All instruction encodings are subsets of the general instruction format shown in the Am386SX/SXL Instruction Set Clock Count Summary (pages 217 thru 231). Instructions consist of one or two primary op-code bytes, possibly an address specifier consisting of the mod r/m byte and scaled index byte, a displacement if required, and an immediate data field if required. Within the primary op-code(s), smaller encoding fields may be defined. These fields vary according to the class of operation. The fields define such information as direction of the operation, size of the displacements, register encoding, or sign extension. Almost all instructions referring to an operand in memory have an addressing mode byte following the primary op-code byte(s). This byte (mod r/m) specifies the address mode to be used. Certain encodings of the mod r/m byte indicate a second addressing byte (scale-index-base byte) follows the mod r/m byte to fully specify the addressing mode. Addressing modes can include a displacement immediately following the mod r/m byte, or scaled index byte. If a displacement is present, the possible sizes are 8. 16. or 32 bits. If the instruction specifies an immediate operand, the immediate operand follows any displacement bytes. The immediate operand, if specified, is always the last field of the instruction. Figure 57 illustrates several of the fields that can appear in an instruction, such as the mod field and the r/m field. but Figure 57 does not show all fields. Several smaller fields also appear in certain instructions, sometimes within the op-code bytes themselves. Table 22 is a complete list of all fields appearing in the Instruction Set. Further ahead, following Table 22, are detailed tables for each field. #### 32-Bit Extensions of the Instruction Set With the Am386SX/SXL CPU, the 8086/80186/80286 Instruction Set is extended in two orthogonal directions: 32-bit forms of all 16-bit instructions are added to support the 32-bit data types; and, 32-bit addressing modes are made available for all instructions referencing memory. This orthogonal instruction set extension is accomplished having a Default (D) bit in the code segment descriptor, and by having 2 prefixes to the instruction set. Whether the instruction defaults to operations of 16 bits or 32 bits depends on the setting of the D bit in the code segment descriptor, which gives the default length (either 32 bits or 16 bits) for both operands and effective addresses, when executing that code segment. In the Real Address Mode or Virtual 8086 Mode, no code segment descriptors are used, but a D value of 0 is assumed internally by the Am386SX/SXL CPU when operating in those modes (for 16-bit default sizes compatible with the 8086/80186/80286). Two prefixes, the Operand Size Prefix and the Effective Address Size Prefix, allow overriding individually the Default selection of operand size and effective address size. These prefixes may precede any op-code bytes and affect only the instruction they precede. If necessary, one or both of the prefixes may be placed Table 22. Fields within Instructions | Field Name | Description | Number of Bits | |------------|-------------------------------------------------------------------------------------|----------------------| | w | Specifies if data is byte or full size (full size is either 16 or 32 bits) | 1 | | d | Specifies direction of data operation | 1 | | s | Specifies if an immediate data field must be sign-extended | 1 | | reg | General Register Specifier | 3 | | mod r/m | Address Mode Specifier (effective address can be a General Register) | 2 for mod; 3 for r/m | | ss | Scale Factor for Scaled Index Address Mode | 2 | | index | General Register to be used as Index Register | 3 | | base | General Register to be used as Base Register | 3 | | sreg2 | Segment Register Specifier for CS, SS, DS, and ES | 2 | | sreg3 | Segment Register Specifier for CS, SS, DS, ES, FS, and GS | 3 | | tttn | For Conditional Instructions, specifies a condition asserted or a condition negated | 4 | Note: Table 21 shows encoding of individual instructions. Figure 57. General Instruction Format before the op-code bytes. The presence of the Operand Size Prefix and the Effective Address Prefix will toggle the operand size or the effective address size, respectively, to the value opposite from the Default setting. For example, if the default operand size is for 32-bit data operations, then presence of the Operand Size Prefix toggles the instruction to 16-bit data operation. As another example, if the default effective address size is 16 bits, presence of the Effective Address Size prefix toggles the instruction to use 32-bit effective address computations. These 32-bit extensions are available in all modes, including the Real Address Mode and the Virtual 8086 Mode. In these modes the default is always 16 bits, so prefixes are needed to specify 32-bit operands or addresses. For instructions with more than one prefix, the order of prefixes is unimportant. Unless specified otherwise, instructions with 8-bit and 16-bit operands do not affect the contents of the high-order of the extended registers. ### **Encoding of Instruction Fields** Within the instruction are several fields indicating register selection, addressing mode and so on. The exact encodings of these fields are defined immediately ahead. ### Encoding of Operand Length (w) Field For any given instruction performing a data operation, the instruction is executing as a 32-bit operation or a 16-bit operation. Within the constraints of the operation size, the w field encodes the operand size as either one byte or the full operation size, as shown in the table below. | w Field | Operand Size<br>During 16-Bit<br>Data Operations | Operand Size<br>During 32-Bit<br>Data Operations | |---------|--------------------------------------------------|--------------------------------------------------| | 0 | 8 Bits | 8 Bits | | 1 | 16 Bits | 32 Bits | #### Encoding of the General Register (reg) Field The general register is specified by the reg field, which may appear in the primary op-code bytes, or as the reg field of the mod r/m byte, or as the r/m field of the mod r/m byte. ### Encoding of reg Field When w Field is not Present in Instruction | reg Field | Register Selected Register Sel During 16-Bit During 32- Data Operations Data Operations | | | |-----------|-----------------------------------------------------------------------------------------|-----|--| | 000 | AX | EAX | | | 001 | CX | ECX | | | 010 | DX | EDX | | | 011 | BX | EBX | | | 100 | SP | ESP | | | 101 | BP | EBP | | | 101 | SI | ESI | | | 101 | DI | EDI | | #### Encoding of reg Field When w Field is Present in Instruction | Register Specified by reg Field<br>During 16-Bit Data Operations | | | | |------------------------------------------------------------------|--------------|--------------|--| | | Function | of w Field | | | reg | (when w = 0) | (when w = 1) | | | 000 | AL | AX | | | 001 | CL | cx | | | 010 | DL | DX | | | 011 | BL | BX | | | 100 | AH | SP | | | 101 | CH | BP | | | 110 | DH | SI | | | 111 | BH | DI | | | Register Specified by reg Field<br>During 32-Bit Data Operations | | | | |------------------------------------------------------------------|--------------|--------------|--| | | Function | of w Field | | | reg | (when w = 0) | (when w = 1) | | | 000 | AL | EAX | | | 001 | CL | ECX | | | 010 | DL | EDX | | | 011 | BL | EBX | | | 100 | AH | ESP | | | 101 | CH | EBP | | | 110 | DH | ESI | | | 111 | BH | EDI | | ### Encoding of the Segment Register (sreg) Field The sreg field in certain instructions is a 2-bit field, allowing one of the four 80286 segment registers to be specified. The sreg field in other instructions is a 3-bit field, allowing the Am386SX/SXL CPU FS and GS segment registers to be specified. #### 2-Bit sreg2 Field | 2-Bit sreg2 Field | Segment Register Selected | |-------------------|---------------------------| | 00 | ES | | 01 | cs | | 10 | ss | | 11 | DS | #### 3-Bit sreg3 Field | 3-Bit sreg3 Field | Segment Register Selected | | |-------------------|---------------------------|--| | 000 | ES | | | 001 | cs | | | 010 | SS | | | 011 | DS | | | 100 | FS | | | 101 | GS | | | 110 | do not use | | | 111 | do not use | | | | | | ### Encoding of Address Mode Except for special instructions, such as PUSH or POP, where the addressing mode is predetermined, the addressing mode for the current instruction is specified by addressing bytes following the primary op-code. The primary addressing byte is the mod r/m byte, and a second byte of addressing information, the s-i-b (scale-in-dex-base) byte, can be specified. The s-i-b byte is specified when using 32-bit addressing mode, the mod r/m byte has r/m = 100, and mod = 00, 01, or 10. When the s-i-b byte is present, the 32-bit addressing mode is a function of the mod, ss, index, and base fields. The primary addressing byte, the mod r/m byte, also contains three bits (shown as TTT in Figure 57) sometimes used as an extension of the primary opcode. The three bits, however, may also be used as a register field (reg). When calculating an effective address, either 16-bit addressing or 32-bit addressing is used. 16-bit addressing uses 16-bit address components to calculate the effective address, while 32-bit addressing uses 32-bit address components to calculate the effective address. When 16-bit addressing is used, the mod r/m byte is interpreted as a 16-bit addressing mode specifier. When 32-bit addressing is used, the mod r/m byte is interpreted as a 32-bit addressing mode specifier. Tables on the following pages define all encodings of all 16-bit addressing modes and 32-bit addressing modes. ### Encoding of 16-Bit Address Mode with mod r/m Byte | mod r/m | Effective Address | |----------|-------------------| | 00 000 | DS:[BX + SI] | | 00 001 | DS:[BX + DI] | | . 00 010 | SS:[BP + SI] | | 00 011 | DS:[BP + DI] | | 00 100 | DS:[SI] | | 00 101 | DS:[DI] | | 00 110 | DS:d16 | | 00 111 | DS:[BX] | | | | | 01 000 | DS:[BX + SI + d8] | | 01 001 | DS:[BX + DI + d8] | | 01 010 | SS:[BP + SI + d8] | | 01 011 | SS:[BP + DI + d8] | | 01 100 | DS:[SI + d8] | | 01 101 | DS:[DI + d8] | | 01 110 | SS:[BP + d8] | | 01 111 | DS:[BX + d8] | | mod r/m | Effective Address | |---------|--------------------| | 10 000 | DS:[BX + SI + d16] | | 10 001 | DS:[BX + DI + d16] | | 10 010 | SS:[BP + SI + d16] | | 10 011 | SS:[BP+SI+d16] | | 10 100 | DS:[SI + d16] | | 10 101 | DS:[DI + d16] | | 10 110 | SS:[BP + d16] | | 10 111 | DS:[BX + d16] | | | | | 11 000 | Register—See Below | | 11 001 | Register—See Below | | 11 010 | Register—See Below | | 11 011 | Register—See Below | | 11 100 | Register—See Below | | 11 101 | Register—See Below | | 11 110 | Register—See Below | | 11 111 | Register—See Below | | Register Specified by r/m<br>During 16-Bit Data Operations | | | |------------------------------------------------------------|--------------|--------------| | Function of w Field | | | | mod r/m | (when w = 0) | (when w = 1) | | 11 000 | AL | AX | | 11 001 | CL | cx | | 11 010 | DL | DX | | 11 011 | BL | BX | | 11 100 | AH | SP | | 11 101 | CH | BP | | 11 110 | DH | SI | | 11 111 | BH | DI | | Register Specified by r/m<br>During 32-Bit Data Operations | | | | |------------------------------------------------------------|---------------------|--------------|--| | | Function of w Field | | | | mod r/m | (when w = 0) | (when w = 1) | | | 11 000 | AL | EAX | | | 11 001 | CL | ECX | | | 11 010 | DL | EDX | | | 11 011 | BL | EBX | | | 11 100 | AH | ESP | | | 11 101 | CH | EBP | | | 11 110 | DH | ESI | | | 11 111 | BH | EDI | | ## Encoding of 32-Bit Address Mode with mod r/m Byte (no s-i-b byte present) | mod | d r/m | Effective Address | |-----|-------|-------------------| | 00 | 000 | DS:[EAX] | | 00 | 001 | DS:[ECX] | | 00 | 010 | DS:[EDX] | | 00 | 011 | DS:[EBX] | | 00 | 100 | s-i-b is present | | 00 | 101 | DS:d32 | | 00 | 110 | DS:[ESI] | | 00 | 111 | DS:[EDI] | | | | | | 01 | 000 | DS:[EAX + d8] | | 01 | 001 | DS:[ECX + d8] | | 01 | 010 | DS:[EDX + d8] | | 01 | 011 | DS:[EBX + d8] | | 01 | 100 | s-i-b is present | | 01 | 101 | SS:[EBP + d8] | | 01 | 110 | DS:[ESI + d8] | | 01 | 111 | DS:[EDI + d8] | | mod | r/m | Effective Address | |------|-----|--------------------| | 10 | 000 | DS:[EAX + d32] | | 10 | 001 | DS:[ECX + d32] | | 10 | 010 | DS:[EDX + d32] | | 10 | 011 | DS:[EBX + d32] | | 10 | 100 | s-i-b is present | | 10 | 101 | SS:[EBP + d32] | | 10 | 110 | DS:[ESI + d32] | | 10 | 111 | DS:[EDI + d32] | | 11 | 000 | Register—See Below | | 1 11 | 001 | Register—See Below | | 11 | 010 | Register—See Below | | 11 | 010 | Register—See Below | | 11 | 100 | Register—See Below | | 11 | 101 | Register—See Below | | 11 | 110 | | | | | Register—See Below | | 11 | 111 | Register—See Below | | | Register Specified by reg or r/m<br>During 32-Bit Data Operations | | | | |---|-------------------------------------------------------------------|---------------------|--------------|--| | | | Function of w Field | | | | | mod r/m | (when w = 0) | (when w = 1) | | | | 11 000 | AL | EAX | | | | 11 001 | CL | ECX | | | | 11 010 | DL | EDX | | | | 11 011 | BL | EBX | | | | 11 100 | AH | ESP | | | | 11 101 | CH | EBP | | | | 11 110 | DH | ESI | | | 1 | 11 111 | BH | EDI | | | Register Specified by reg or r/m<br>During 16-Bit Data Operations | | | | | |-------------------------------------------------------------------|-----------------------------------------|----|--|--| | | Function of w Field | | | | | mod r/m | mod r/m (when $w = 0$ ) (when $w = 1$ ) | | | | | 11 000 | AL | AX | | | | 11 001 | CL | CX | | | | 11 010 | DL | DX | | | | 11 011 | BL | BX | | | | 11 100 | AH | SP | | | | 11 101 | CH | BP | | | | 11 110 | DH | SI | | | | 11 111 | BH | DI | | | ## Encoding of 32-Bit Address Mode (mod r/m byte and s-i-b byte present): | mod base | Effective Address | |----------|---------------------------------| | 00 000 | DS:[EAX + (scaled index)] | | 00 001 | DS:[ECX + (scaled index)] | | 00 010 | DS:[EDX + (scaled index)] | | 00 011 | DS:[EBX + (scaled index)] | | 00 100 | SS:[ESP + (scaled index)] | | 00 101 | DS:[d32 + (scaled index)] | | 00 110 | DS:[ESI + (scaled index)] | | 00 111 | DS:[EDI + (scaled index)] | | 01 000 | DS:[EAX + (scaled index) + d8] | | 01 001 | DS:[ECX + (scaled index) + d8] | | 01 010 | DS:[EDX + (scaled index) + d8] | | 01 010 | DS:[EBX + (scaled index) + d8] | | 01 100 | SS:[ESP + (scaled index) + d8] | | 01 101 | SS:[EBP + (scaled index) + d8] | | 01 110 | DS:[ESI + (scaled index) + d8] | | 01 111 | DS:[EDI + (scaled index) + d8] | | | , | | 10 000 | DS:[EAX + (scaled index) + d32] | | 10 001 | DS:[ECX + (scaled index) + d32] | | 10 010 | DS:[EDX + (scaled index) + d32] | | 10 011 | DS:[EBX + (scaled index) + d32] | | 10 100 | SS:[ESP + (scaled index) + d32] | | 10 101 | SS:[EBP + (scaled index) + d32] | | 10 110 | DS:[ESI + (scaled index) + d32] | | 10 111 | DS:[EDI + (scaled index) + d32] | | ss | Scale Factor | |----|--------------| | 00 | x1 | | 01 | x2 | | 10 | x4 | | 11 | x8 | | Index | Index Register | |-------|-------------------------| | 000 | EAX | | 001 | ECX | | 010 | EDX | | 011 | EBX | | 100 | no index reg (see note) | | 101 | EBP | | 110 | ESI | | 111 | EDI | Note: When index field is 100, indicating no index register, then ss field must equal 00. If index is 100 and ss does not equal 00, the effective address is undefined. Note: Mod field in mod r/m byte; ss, index, and base fields in s-i-b byte. ### Encoding of Operation Direction (d) Field In many two-operand instructions, the d field is present to indicate which operand is considered the source and which is the destination. | d | Direction of Operation | |---|----------------------------------------------------------------------------------------------------------------------------| | 0 | Register/Memory ← Register reg Field indicates Source Operand; mod r/m or mod ss index base indicates Destination Operand. | | 1 | Register ← Register/Memory reg Field indicates Destination Operand; mod r/m or mod ss index base indicates Source Operand. | ### Encoding of Sign-Extend (s) Field The s field occurs primarily to instructions with immediate data fields. The s field has an effect only if the size of the immediate data is 8 bits and is being placed in a 16-bit or 32-bit destination. | s | Effect on<br>Immediate Data8 | Effect on<br>Immediate Data 16 32 | |---|-------------------------------------------------------------|-----------------------------------| | 0 | None | None | | 1 | Sign-Extended Data8 to Fill<br>16-Bit or 32-Bit Destination | None | ## Encoding of Conditional Test (tttn) Field For the conditional instructions (conditional jumps and set on condition), tttn is encoded with n indicating to use the condition (n = 0), or its negation (n = 1), and ttt giving the condition to test. | Mnemonic | Condition | tttn | |----------|-------------------------------------|------| | 0 | Overflow | 0000 | | NO | No Overflow | 0001 | | B/NAE | Below/Not Above or Equal | 0010 | | NB/AE | Not Below/Above or Equal | 0011 | | E/Z | Equal/Zero | 0100 | | NE/NZ | Not Equal/Not Zero | 0101 | | BE/NA | Below or Equal/Not Above | 0110 | | NBE/A | Not Below or Equal/Above | 0111 | | S | Sign | 1000 | | NS | Not Sign | 1001 | | P/PE | Parity/Parity Even | 1010 | | NP/PO | No Parity/Parity Odd | 1011 | | L/NGE | Less Than/Not Greater or Equal | 1100 | | NL/GE | Not Less Than/Greater or Equal | 1101 | | LE/NG | Less Than or Equal/Not Greater Than | 1110 | | NLE/G | Not Less Than or Equal/Greater Than | 1111 | # Encoding of Control or Debug or Test Register (eee) Field For the loading and storing of the Control, Debug, and Test registers. ### When Interpreted as Control Register Field | eee Code | Reg Name | | | |-------------------------------|----------|--|--| | 000 | CR0 | | | | 010 | CR2 | | | | 011 | CR3 | | | | Do not use any other encoding | | | | ### When Interpreted as Debug Register Field | eee Code | eee Code Reg Name | | | | | | |-------------------------------|-------------------|--|--|--|--|--| | 000 | DRO | | | | | | | 001 | DR1 | | | | | | | 010 | DR2 | | | | | | | 011 | DR3<br>DR6 | | | | | | | 110 | | | | | | | | 111 | DR7 | | | | | | | Do not use any other encoding | | | | | | | ### When Interpreted as Test Register Field | eee Code | Reg Name | | | |------------|----------------------|--|--| | 110 | TR6 | | | | 111 | TR7 | | | | Do not use | e any other encoding | | | ### PHYSICAL DIMENSIONS For reference only. All dimensions measured in inches unless otherwise noted. BSC is an ANSI standard for Basic Space Centering. PQB 100 — Plastic Quad Flat Pack; Trimmed and Formed (all measurements are in inches) Top View Side View 15679G CF 43 7/16/92 SG ## **PHYSICAL DIMENSIONS (continued)** PQB 100 — Plastic Quad Flat Pack with Molded Carrier Ring (Inner device measured in inches; outer ring measured in millimeters) #### Sales Offices | Sales Offices | | | International (C<br>Latin America, | | | |--------------------------|-----------------------------------------|----------------------------------|------------------------------------|-----------------------------------------|----------------------------------------| | North American | 7 | | Ft. Lauderdale | TEL | (305) 484-8600 | | ALABAMA | | (205) 882-9122 | | FAX | (305) 485-9736 | | ARIZONA | | | SINGAPORE | TEL | (65) 3481188 | | CALIFORNIA, | | /A/A: : | SWEDEN, | FAX | (65) 3480161 | | | | (310) 645-1524 | | TEL | (08) 98 61 80 | | | | (714) 752-6262 | | | | | | | (916) 786-6700 | TAIWAN, TaipeiTELTEL | | (886) 2-7153536 | | | San Diego<br>San Jose | | FAX | | (886) 2-7122183 | | | | (818) 992-4155 | UNITED KINGDOM, | TEI | (000E) 00000 | | CANADA, Ontario. | | (0.10) 002 4100 | (Warrington) | FAX | (0925) 830380 | | | | (613) 592-0060 | London area | TEL | (0483) 740440 | | | | (416) 222-7800 | (Woking) | FAX | (0483) 756196 | | COLORADO | | | | | | | CONNECTICUT | | | North Americ | an Repres | entatives | | FLORIDA, | | ` , | CANADA | | | | | | (813) 530-9971 | | | NG(604) 430-3680 | | | | (305) 776-2001 | | | ICS (613) 592-0060 | | | | (407) 862-9292 | | | RONICS (416) 564-9720 | | GEORGIA | | (404) 449-7920 | | ITEL ELECTRON | NICS(514) 636-595 | | IDAHO | | (208) 377-0393 | ILLINOIS | | | | ILLINOIS, | | | Skokie – INDUSTR | | (700) 007 0 10 | | | | (708) 773-4422 | | =5,INC | (708) 967-8430 | | Naperville | | (708) 505-9517 | INDIANA | TDONIO MACI | CTINO | | MARYLAND | | | Huntington – ELEC | | ETING<br>(317) 921-3450 | | MASSACHUSETTS | | | Indianapolis – ELE | | | | MINNESOTA<br>NEW JERSEY. | | (612) 938-0001 | | | (317) 921-3450 | | | | (609) 662-2900 | IOWA | NO | (317) 921-3450 | | | | (201) 299-0002 | | | (319) 377-4666 | | NEW YORK. | *************************************** | (201) 200-0002 | KANSAS | • • • • • • • • • • • • • • • • • • • • | (319) 377-4000 | | | | (914) 279-8323 | | 7 SALES | (913) 469-1312 | | | | (716) 425-8050 | | | (316) 721-0500 | | NORTH CAROLINA | | (110) 420 0000 | KENTUCKY | . 0/1220 | (010) 721 0000 | | Charlotte | | (704) 875- 3091 | ELECTRONIC MAI | RKETING | | | | | (919) 878- 8111 | | | (317) 921-3452 | | OHIO, | ••••• | (010) 070 0111 | MICHIGAN | | (017) 021 0402 | | | e) | (614) 891-6455 | | K SALES. INC | (616) 335-8418 | | | Dayton(513) 439-026 | | | | (313) 227-0007 | | OREGON | | (503) 245-0080 | MINNĚSOTA | • | | | PENNSYLVANIA | | | | Sales, Inc | (612) 941-9790 | | TEXAS, | | | MISSOURI | • | ······································ | | | | (512) 346-7830 | LORENZ SALES | | (314) 997-4558 | | | | (214) 934-9099 | NEBRASKA | | , , | | Houston | | (713) 376-8084 | LORENZ SALES | | (402) 475-4660 | | International | | | NEW MEXICO | | | | BELGIUM, Antwerpen | | | THORSON DESER | RT STATES | (505) 883-4343 | | • | EAV | (00) 040 40 40 | NEW YORK | | | | FRANCE. Paris | TEL | (1) 49-75-10-10 | | | (315) 437-8343 | | , | FAX | (1) 49-75-10-13 | Hauppauge – COM | IPONENT | | | GERMANY, | | | | NC | (516) 273-5050 | | • | | (06172)-24061 | OHIO | | | | FA | FAX | (06172)-23195<br>(089) 4114-0 | Centerville – DOLF | USS ROOT & C | O(513) 433-6776 | | wiunchen | I EL | (089) 4114-0 | Columbus - DOLF | USS ROOT & C | O(614) 885-4844 | | HONG KONG, | TFI | (852) 865-4525 | | JSS HOOT & CC | D(216) 899-9370 | | Wanchai | FAX | (852) 865-4335 | OREGON | ICAL CALEC 111 | 0 (500) 040 507 | | Wanchai<br>ITALY, Milano | TEL | (02) 3390541 | | IUAL SALES, IN | C(503) 643-5074 | | | | (02) 38103458 | PENNSYLVANIA | OK CO INC | (412) 242-9500 | | JAPAN, | | | | 1N CO.,INC | (412) 242-9500 | | Atsugi | | (0462) 29-8460 | PUERTO RICO | C INC | (809) 746-6550 | | Kanagawa | | (0462) 29-8458<br>(0462) 47-2911 | | O, INO | (009) /45-6550 | | Nanayawa | | (0462) 47-2911 | UTAH<br>Front Bongo Marke | otina | (801) 288-2500 | | | | (03) 3346-7550 | WASHINGTON | euily | (001) 200-2500 | | Tokyo | | | | | | | Tokyo | FAX | (03) 3342-5196 | ELECTEN TECHN | ICAL GALES | | | · | FAX<br>TEL | (03) 3342-5196<br>(06) 243-3250 | | ICAL SALES | (206) 821-7442 | | Osaka | TEL<br>FAX | (06) 243-3250<br>(06) 243-3253 | WISCONSIN | | (206) 821-7442 | | Osaka | TEL<br>FAX<br>TEL | (06) 243-3250 | WISCONSIN<br>Brookfield – INDUS | STRIAL | (206) 821-7442 | Advanced Micro Devices reserves the right to make changes in its product without notice in order to improve design or performance characteristics. The performance characteristics listed in this document are guaranteed by specific tests, guard banding, design and other practices common to the industry. For specific testing details, contact your local AMD sales representative. The company assumes no responsibility for the use of any circuits described herein. ADVANCED MICRO DEVICES, INC. 901 Thompson Place P.O. Box 3453 Sunnyvale California 94088-3453 (408) 732-2400 (800) 538-8450 TWX: 910-339-9280 TELEX: 34-6306 CORPORATE APPLICATIONS HOTLINE USA (408) 749-5703 JAPAN 81-3-3346-7561 UK & EUROPE 44-(0)256-811101 TOLL FREE USA (800) 222-9323 FRANCE 0590-8621 GERMANY 0130-813875 ITALY 1678-77224 PC PROCESSORS LITERATURE ORDERING USA (800) 292-9263 USA (512) 462-6882 RECYCLED & RECYCLABLE Printed in USA Ban-35.5M-10/92-0 11339C