# SPARC Assembly Language Reference Manual Sun Microsystems, Inc. 901 San Antonio Road Palo Alto, CA 94303-4900 U.S.A. Part Number 806-3774 February 2000 Copyright 2000 Sun Microsystems, Inc. 901 San Antonio Road, Palo Alto, California 94303-4900 U.S.A. All rights reserved. This product or document is protected by copyright and distributed under licenses restricting its use, copying, distribution, and decompilation. No part of this product or document may be reproduced in any form by any means without prior written authorization of Sun and its licensors, if any. Third-party software, including font technology, is copyrighted and licensed from Sun suppliers. Parts of the product may be derived from Berkeley BSD systems, licensed from the University of California. UNIX is a registered trademark in the U.S. and other countries, exclusively licensed through X/Open Company, Ltd. Sun, Sun Microsystems, the Sun logo, SunDocs, Java, the Java Coffee Cup logo, and Solaris are trademarks, registered trademarks, or service marks of Sun Microsystems, Inc. in the U.S. and other countries. All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International, Inc. in the U.S. and other countries. Products bearing SPARC trademarks are based upon an architecture developed by Sun Microsystems, Inc. The OPEN LOOK and $Sun^{TM}$ Graphical User Interface was developed by Sun Microsystems, Inc. for its users and licensees. Sun acknowledges the pioneering efforts of Xerox in researching and developing the concept of visual or graphical user interfaces for the computer industry. Sun holds a non-exclusive license from Xerox to the Xerox Graphical User Interface, which license also covers Sun's licensees who implement OPEN LOOK GUIs and otherwise comply with Sun's written license agreements. **RESTRICTED RIGHTS**: Use, duplication, or disclosure by the U.S. Government is subject to restrictions of FAR 52.227-14(g)(2)(6/87) and FAR 52.227-19(6/87), or DFAR 252.227-7015(b)(6/95) and DFAR 227.7202-3(a). DOCUMENTATION IS PROVIDED "AS IS" AND ALL EXPRESS OR IMPLIED CONDITIONS, REPRESENTATIONS AND WARRANTIES, INCLUDING ANY IMPLIED WARRANTY OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE OR NON-INFRINGEMENT, ARE DISCLAIMED, EXCEPT TO THE EXTENT THAT SUCH DISCLAIMERS ARE HELD TO BE LEGALLY INVALID. Copyright 2000 Sun Microsystems, Inc. 901 San Antonio Road, Palo Alto, Californie 94303-4900 Etats-Unis. Tous droits réservés. Ce produit ou document est protégé par un copyright et distribué avec des licences qui en restreignent l'utilisation, la copie, la distribution, et la décompilation. Aucune partie de ce produit ou document ne peut être reproduite sous aucune forme, par quelque moyen que ce soit, sans l'autorisation préalable et écrite de Sun et de ses bailleurs de licence, s'il y en a. Le logiciel détenu par des tiers, et qui comprend la technologie relative aux polices de caractères, est protégé par un copyright et licencié par des fournisseurs de Sun. Des parties de ce produit pourront être dérivées du système Berkeley BSD licenciés par l'Université de Californie. UNIX est une marque déposée aux Etats-Unis et dans d'autres pays et licenciée exclusivement par X/Open Company, Ltd. Sun, Sun Microsystems, le logo Sun, SunDocs, Java, le logo Java Coffee Cup, et Solaris sont des marques de fabrique ou des marques déposées, ou marques de service, de Sun Microsystems, Inc. aux Etats-Unis et dans d'autres pays. Toutes les marques SPARC sont utilisées sous licence et sont des marques de fabrique ou des marques déposées de SPARC International, Inc. aux Etats-Unis et dans d'autres pays. Les produits portant les marques SPARC sont basés sur une architecture développée par Sun Microsystems, Inc. L'interface d'utilisation graphique OPEN LOOK et Sun<sup>TM</sup> a été développée par Sun Microsystems, Inc. pour ses utilisateurs et licenciés. Sun reconnaît les efforts de pionniers de Xerox pour la recherche et le développement du concept des interfaces d'utilisation visuelle ou graphique pour l'industrie de l'informatique. Sun détient une licence non exclusive de Xerox sur l'interface d'utilisation graphique Xerox, cette licence couvrant également les licenciés de Sun qui mettent en place l'interface d'utilisation graphique OPEN LOOK et qui en outre se conforment aux licences écrites de Sun. CETTE PUBLICATION EST FOURNIE "EN L'ETAT" ET AUCUNE GARANTIE, EXPRESSE OU IMPLICITE, N'EST ACCORDEE, Y COMPRIS DES GARANTIES CONCERNANT LA VALEUR MARCHANDE, L'APTITUDE DE LA PUBLICATION A REPONDRE A UNE UTILISATION PARTICULIERE, OU LE FAIT QU'ELLE NE SOIT PAS CONTREFAISANTE DE PRODUIT DE TIERS. CE DENI DE GARANTIE NE S'APPLIQUERAIT PAS, DANS LA MESURE OU IL SERAIT TENU JURIDIQUEMENT NUL ET NON AVENU. # Contents ### Preface | 1. | SPARC Assembler for SunOS 5.x 11 | | | | |----|----------------------------------|-------|----------------------------------------------|----| | | 1.1 | Inti | roduction 11 | | | | 1.2 | Op | erating Environment 11 | | | | 1.3 | SPA | ARC Assembler for SunOS 4.1 Versus SunOS 5.x | 12 | | | | 1.3.1 | Labeling Format 12 | | | | | 1.3.2 | Object File Format 12 | | | | | 1.3.3 | Pseudo-Operations 12 | | | | | 1.3.4 | Command Line Options 12 | | | 2. | Assembler Syntax 13 | | | | | | 2.1 | Syr | ntax Notation 13 | | | | 2.2 | Ass | sembler File Syntax 14 | | | | | 2.2.1 | Lines Syntax 14 | | | | | 2.2.2 | Statement Syntax 14 | | | | 2.3 | Lex | cical Features 14 | | | | | 2.3.1 | Case Distinction 14 | | | | | 2.3.2 | Comments 15 | | | | | 2.3.3 | Labels 15 | | | | | 2.3.4 | Numbers 15 | | 2.3.6 Symbol Names 16 2.3.7 Special Symbols - Registers 17 2.3.8 Operators and Expressions 19 2.3.9 SPARC V9 Operators and Expressions 20 2.4 Assembler Error Messages 21 3. **Executable and Linking Format** 3.1 ELF Header 24 3.2 Sections 26 3.2.1 Section Header 26 3.2.2 Predefined User Sections 3.2.3 Predefined Non-User Sections 32 3.3 Locations 33 3.4 Addresses 34 Relocation Tables 3.5 34 3.6 **Symbol Tables** 3.7 **String Tables** 36 **Assembler Directives** 36 3.8 3.8.1 Section Control Directives 37 3.8.2 Symbol Attribute Directives 3.8.3 Assignment Directive 37 3.8.4 Data Generating Directives 4. Converting Files to the New Format 39 Introduction 39 4.1 4.2 Conversion Instructions 39 4.3 Examples 40 **5**. **Instruction-Set Mapping** 41 Table Notation 41 5.1 SPARC Assembly Language Reference Manual • February 2000 Strings 15 2.3.5 4 | 5.4 Coprocessor Instructions 56 | | | | | |-------------------------------------------------------------------------------|--|--|--|--| | 5.5 Synthetic Instructions 56 | | | | | | 5.6 V8/V9 Natural Pseudo Instructions 59 | | | | | | Pseudo-Operations 61 | | | | | | A.1 Alphabetized Listing with Descriptions 61 | | | | | | Examples of Pseudo-Operations 69 | | | | | | B.1 Example 1 69 | | | | | | B.2 Example 2 70 | | | | | | B.3 Example 3 70 | | | | | | B.4 Example 4 71 | | | | | | B.5 Example 5 71 | | | | | | Using the Assembler Command Line 73 | | | | | | C.1 Assembler Command Line 73 | | | | | | C.2 Assembler Command Line Options 74 | | | | | | Disassembling Object Code 77 | | | | | | An Example Language Program 79 | | | | | | SPARC-V9 Instruction Set 85 | | | | | | E.1 SPARC-V9 Changes 85 | | | | | | E.1.1 Registers 85 | | | | | | E.1.2 Alternate Space Access 87 | | | | | | E.1.3 Byte Order 88 | | | | | | E.2 SPARC-V9 Instruction Set Changes 88 | | | | | | E.2.1 Extended Instruction Definitions to Support the 64-bit Model 8 | | | | | | E.2.2 Added Instructions to Support 64 bits 89 | | | | | | E.2.3 Added Instructions to Support High-Performance System Implementation 89 | | | | | | | | | | | | C | | | | | 5.2 5.3 A. B. C. D. E. Integer Instructions 43 Floating-Point Instruction 53 - E.2.4 Deleted Instructions 90 - E.2.5 Miscellaneous Instruction Changes 91 - E.3 SPARC-V9 Instruction Set Mapping 91 - E.4 SPARC-V9 Floating-Point Instruction Set Mapping 102 - E.5 SPARC-V9 Synthetic Instruction-Set Mapping 103 - E.6 UlraSPARC and VIS Instruction Set Extensions 106 - E.6.1 Graphics Data Formats 106 - E.6.2 Eight-bit Format 106 - E.6.3 Fixed Data Formats 107 - E.6.4 SHUTDOWN Instruction 107 - E.6.5 Graphics Status Register (GSR) 107 - E.6.6 Graphics Instructions 108 - E.6.7 Memory Access Instructions 113 ### **Preface** The SunOS assembler that runs on the SPARC operating environment, referred to as the "SunOS SPARC" in this manual, translates source files that are in assembly language format into object files in linking format. In the program development process, the assembler is a tool to use in producing program modules intended to exploit features of the SPARC architecture in ways that cannot be easily done using high level languages and their compilers. Whether assembly language is chosen for the development of program modules depends on the extent to which and the ease with which the language allows the programmer to control the architectural features of the processor. The assembly language described in this manual offers full direct access to the SPARC instruction set. The assembler may also be used in connection with SunOS 5.x macro preprocessors to achieve full macro-assembler capability. Furthermore, the assembler responds to directives that allow the programmer direct control over the contents of the relocatable object file. This document describes the language in which the source files must be written. The nature of the machine mnemonics governs the way in which the program's executable portion is written. This document includes descriptions of the pseudo operations that allow control over the object file. This facilitates the development of programs that are easy to understand and maintain. ## Before You Read This Book You should also become familiar with the following: - Manual pages: as(1), ld(1), cpp(1), elf(3f), dis(1), a.out(1) - SPARC Architecture Manual (Version 8 and Version 9) - ELF-related sections of the *Programming Utilities Guide* manual - SPARC Applications Binary Interface (ABI) # How This Book is Organized This book is organized as follows: Chapter 1, discusses features of the SunOS 5.x SPARC Assembler. Chapter 2, describes the syntax of the SPARC assembler that takes assembly programs and produces relocatable object files for processing by the link editor. Chapter 3, describes the relocatable ELF files that hold code and data suitable for linking with other object files. Chapter 4, describes how to convert existing SunOS 4.1 SPARC assembly files to the SunOS 5.x assembly file format. Chapter 5, describes the relationship between hardware instructions of the SPARC architecture and the assembly language instruction set. Appendix A, lists the pseudo-operations supported by the SPARC assembler. Appendix B, shows some examples of ways to use various pseudo-operations. Appendix C, describes the available assembler command-line options. Appendix D, describes an example C language program with comments to show correspondence between the assembly code and the C code. Appendix E, describes the SPARC-V9 instruction set and the changes due to the SPARC-V9 implementation. # What Typographic Changes Mean The following table describes the typographic changes used in this book. ### TABLE P-1 | Typeface or<br>Symbol | Meaning | Example | |-----------------------|--------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------| | AaBbCc123 | The names of commands, files, and directories; on-screen computer output | Edit your .login file. Use ls -a to list all files. machine_name% You have mail. | | AaBbCc123 | What you type, contrasted with on-screen computer output | machine_name% <b>su</b> Password: | | AaBbCc123 | Command-line placeholder: replace with a real name or value | To delete a file, type rm filename. | | AaBbCc123 | Book titles, new words or<br>terms, or words to be<br>emphasized | Read Chapter 6 in <i>User's Guide</i> . These are called <i>class</i> options. You <i>must</i> be root to do this. | # Shell Prompts in Command Examples The following table shows the default system prompt and superuser prompt for the C shell, Bourne shell, and Korn shell. TABLE P-2 | Shell | Prompt | |----------------------------------------------|---------------| | C shell prompt | machine_name% | | C shell superuser prompt | machine_name# | | Bourne shell and Korn shell prompt | \$ | | Bourne shell and Korn shell superuser prompt | # | # SPARC Assembler for SunOS 5.x # 1.1 Introduction This chapter discusses features of the *SunOS 5.x* SPARC assembler. This document is distributed as part of the developer documentation set with every SunOS operating system release. This document is also distributed with the on-line documentation set for the convenience of SPARCworks<sup>TM</sup> and SPARCompiler<sup>TM</sup> 4.0 users who have products that run on the *SunOS 5.x* operating system. It is included as part of the SPARCworks/SPARCompiler Floating Point and Common Tools AnswerBook, which is the on-line information retrieval system. This document contains information from *The SPARC Architecture Manual*, Version 8. Information about Version 9 support is summarized in Appendix E. # 1.2 Operating Environment The SunOS SPARC assembler runs under the *SunOS 5.x* operating system or the Solaris<sup>TM</sup> 2.x operating environment. *SunOS 5.x* refers to SunOS 5.2 operating system and later releases. *Solaris 2.x* refers to the Solaris 2.2 operating environment and later releases. # 1.3 SPARC Assembler for SunOS 4.1 Versus SunOS 5.x This section describes the differences between the SunOS 4.1 SPARC assembler and the *SunOS 5.x* SPARC assembler. ### 1.3.1 Labeling Format - Symbol names beginning with a dot (.) are assumed to be local symbols. - Names beginning with an underscore (\_) are reserved by ANSI C. ### 1.3.2 Object File Format The type of object files created by the SPARC assembler are ELF (*Executable and Linking Format*) files. These relocatable object files hold code and data suitable for linking with other object files to create an executable file or a shared object file, and are the assembler normal output. ## 1.3.3 Pseudo-Operations See Appendix A, for a detailed description of the pseudo-operations (pseudo-ops). ### 1.3.4 Command Line Options See Appendix C, for a detailed description of command line options and a list of SPARC architectures. # **Assembler Syntax** The *SunOS 5.x* SPARC assembler takes assembly language programs, as specified in this document, and produces relocatable object files for processing by the *SunOS 5.x* SPARC link editor. The assembly language described in this document corresponds to the SPARC instruction set defined in the *SPARC Architecture Manual* (Version 8 and Version 9) and is intended for use on machines that use the SPARC architecture. This chapter is organized into the following sections: - Section 2.1 "Syntax Notation" on page 13 - Section 2.2 "Assembler File Syntax" on page 14 - Section 2.3 "Lexical Features " on page 14 - Section 2.4 "Assembler Error Messages" on page 21 # 2.1 Syntax Notation In the descriptions of assembly language syntax in this chapter: - Brackets ([])enclose optional items. - Asterisks (\*) indicate items to be repeated zero or more times. - Braces ({ }) enclose alternate item choices, which are separated from each other by vertical bars (|). - Wherever blanks are allowed, arbitrary numbers of blanks and horizontal tabs may be used. Newline characters are not allowed in place of blanks. # 2.2 Assembler File Syntax The syntax of assembly language files is: [line]\* ## 2.2.1 Lines Syntax The syntax of assembly language lines is: [statement [ ; statement]\*] [!comment] ### 2.2.2 Statement Syntax The syntax of an assembly language statement is: [label:] [instruction] where: label is a symbol name. instruction is an encoded pseudo-op, synthetic instruction, or instruction. # 2.3 Lexical Features This section describes the lexical features of the assembler syntax. ### 2.3.1 Case Distinction Uppercase and lowercase letters are distinct everywhere *except* in the names of special symbols. Special symbol names have no case distinction. #### 2.3.2 Comments A comment is preceded by an exclamation mark character (!); the exclamation mark character and all following characters up to the end of the line are ignored. C language-style comments ("/\*...\*/") are also permitted and may span multiple lines. #### Labels 2.3.3 A label is either a symbol or a single decimal digit n (0...9). A label is immediately followed by a colon (:). Numeric labels may be defined repeatedly in an assembly file; normal symbolic labels may be defined only once. A numeric label n is referenced after its definition (backward reference) as nb, and before its definition (forward reference) as nf. #### **Numbers** 2.3.4 Decimal, hexadecimal, and octal numeric constants are recognized and are written as in the C language. However, integer suffixes (such as L) are not recognized. For floating-point pseudo-operations, floating-point constants are written with 0r or OR (where r or R means REAL) followed by a string acceptable to atof(3); that is, an optional sign followed by a non-empty string of digits with optional decimal point and optional exponent. The special names Ornan and Orinf represent the special floating-point values Not-A-Number (NaN) and INFinity. Negative Not-A-Number and Negative INFinity are specified as Or-nan and Or-inf. Note - The names of these floating-point constants begin with the digit zero, not the letter "O." #### 2.3.5 Strings A string is a sequence of characters quoted with either double-quote mark (") or single-quote mark (') characters. The sequence must not include a *newline* character. When used in an expression, the numeric value of a string is the numeric value of the ASCII representation of its first character. The suggested style is to use single quote mark characters for the ASCII value of a single character, and double quote mark characters for quoted-string operands such as used by pseudo-ops. An example of assembly code in the suggested style is: ``` add %g1,'a'-'A',%g1 ! g1 + ('a' - 'A') --> g1 ``` The escape codes described in Table 2–1, derived from ANSI C, are recognized in strings. TABLE 2-1 | Escape Code | Description | |-------------|----------------------| | \a | Alert | | \b | Backspace | | \f | Form feed | | \n | Newline (line feed) | | \r | Carriage return | | \t | Horizontal tab | | \v | Vertical tab | | \nnn | Octal value nnn | | \xnn | Hexadecimal value nn | # 2.3.6 Symbol Names The syntax for a symbol name is: In the above syntax: - Uppercase and lowercase letters are distinct; the underscore (\_), dollar sign (\$), and dot (.) are treated as alphabetic characters. - Symbol names that begin with a dot ( . )are assumed to be local symbols. To simplify debugging, avoid using this type of symbol name in hand-coded assembly language routines. - The symbol dot ( . )is predefined and always refers to the address of the beginning of the current assembly language statement. ■ External variable names beginning with the underscore character are reserved by the ANSI C Standard. Do not begin these names with the underscore; otherwise, the program will not conform to ANSI C and unpredictable behavior may result. ### 2.3.7 Special Symbols - Registers Special symbol names begin with a percentage sign (%) to avoid conflict with user symbols. Table 2–2 lists these special symbol names. TABLE 2-2 | Symbol Object | Name | Comment | |----------------------------------|----------|--------------------| | General-purpose registers | %r0 %r31 | | | General-purpose global registers | %g0 %g7 | Same as %r0 %r7 | | General-purpose out registers | %o0 %o7 | Same as %r8 %r15 | | General-purpose local registers | %10 %17 | Same as %r16 %r23 | | General-purpose in registers | %i0 %i7 | Same as %r24 %r31 | | Stack-pointer register | %sp | (%sp = %o6 = %r14) | | Frame-pointer register | %fp | (%fp = %i6 = %r30) | | Floating-point registers | %f0 %f31 | | | Floating-point status register | %fsr | | | Front of floating-point queue | %fq | | | Coprocessor registers | %c0 %c31 | | | Coprocessor status register | %csr | | | Coprocessor queue | %cq | | | Program status register | %psr | | TABLE 2–2 (continued) | Symbol Object | Name | Comment | |-----------------------------------|--------------|-------------------------------------------| | Trap vector base address register | %tbr | | | Window invalid mask | %wim | | | Y register | %у | | | Unary operators | %lo | Extracts least significant 10 bits | | | %hi | Extracts most significant 22 bits | | | %r_disp32 | Used only in Sun compiler-generated code. | | | %r_plt32 | Used only in Sun compiler-generated code. | | Ancillary state registers | %asrl %asr31 | | There is no case distinction in special symbols; for example, %PSR ### is equivalent to %psr The suggested style is to use lowercase letters. The lack of case distinction allows for the use of non-recursive preprocessor substitutions, for example: ``` #define psr %PSR ``` The special symbols %hi and %lo are true unary operators which can be used in any expression and, as other unary operators, have higher precedence than binary operations. For example: ``` hi a+b = (hi a)+b a+b = (hi a)+b ``` To avoid ambiguity, enclose operands of the hi or lo operators in parentheses. For example: ### Operators and Expressions 2.3.8 The operators described in Table 2–3 are recognized in constant expressions. TABLE 2-3 | Binary | Operators | Unary | Operators | |--------|------------------------|------------------------|------------------------------------------------------------------------------------------------------------------------------------------| | + | Integer addition | + | (No effect) | | _ | Integer subtraction | | 2's Complement | | * | Integer multiplication | ~ | 1's Complement | | / | Integer division | %lo(address) | Extract least significant 10 bits as computed by: (address & 0x3ff) | | % | Modulo | %hi(address) | Extract most significant 22 bits as computed by: (address >>10) | | ^ | Exclusive OR | %r_disp32<br>%r_disp64 | Used in Sun compiler-generated code only to instruct the assembler to generate specific relocation information for the given expression. | | << | Left shift | %r_plt32<br>%r_plt64 | Used in Sun compiler-generated code only to instruct the assembler to generate specific relocation information for the given expression. | | >> | Right shift | | | | & | Bitwise AND | | | | 1 | Bitwise OR | | | Since these operators have the same precedence as in the C language, put expressions in parentheses to avoid ambiguity. To avoid confusion with register names or with the hi, 10, $r_disp32/64$ , or %r\_plt32/64 operators, the modulo operator % must not be immediately followed by a letter or digit. The modulo operator is typically followed by a space or left parenthesis character. # 2.3.9 SPARC V9 Operators and Expressions The following V9 64-bit operators and expressions in Table 2-4 ease the task of converting from V8/V8plus assembly code to V9 assembly code.. TABLE 2-4 | Unary | Calculation | Operators | |-------|--------------------------------|-------------------------------------| | %hh | (address) >> 42 | Extract bits 42-63 of a 64-bit word | | %hm | ((address) >> 32) & 0x3ff | Extract bits 32-41 of a 64-bit word | | %lm | (((address) >> 10) & 0x3fffff) | Extract bits 10-31 of a 64-bit word | ### For example::: ``` sethi %hh (address), %l1 or %l1, %hm (address), %l1 sethi %lm (address), %l2 or %l2, %lo (address), %l2 sllx %l1, 32, %l1 or %l1, %l2, %l1 ``` The V9 high 32-bit operators and expressions are identified in Table 2–5. TABLE 2-5 | Unary | Calculation | Operators | |-------|----------------------------------------------------|--------------------------------------------------| | %hix | ((((address) ^ 0xfffffffffffffff >> 10) &0x4fffff) | Invert every bit and extract bits 10-31 | | %lox | ((address) & 0x3ff 0x1c00 | Extract bits 0-9 and sign extend that to 13 bits | ### For example: ``` %sethi %hix (address), %ll or %11, %lox (address), %11 ``` The V9 low 44-bit operators and expressions are identified in Table 2-6.. #### TABLE 2-6 | Unary | Calculation | Operators | |-------|---------------------------|-------------------------------------| | %h44 | ((address) >> 22) | Extract bits 22-43 of a 64-bit word | | %m44 | ((address) >> 12) & 0x3ff | Extract bits 12-21 of a 64-bit word | | 144 | (address) & 0xfff | Extract bits 0-11 of a 64-bit word | ### For example:: ``` %sethi %h44 (address), %l1 or %11, %m44 (address), %11 sllx %11, 12, %11 or %11, %144 (address), %11 ``` ### 2.4 **Assembler Error Messages** Messages generated by the assembler are generally self-explanatory and give sufficient information to allow correction of a problem. Certain conditions will cause the assembler to issue warnings associated with delay slots following Control Transfer Instructions (CTI). These warnings are: - Set synthetic instructions in delay slots - Labels in delay slots - Segments that end in control transfer instructions These warnings point to places where a problem could exist. If you have intentionally written code this way, you can insert an .empty pseudo-operation immediately after the control transfer instruction. The .empty pseudo-operation in a delay slot tells the assembler that the delay slot can be empty or can contain whatever follows because you have verified that either the code is correct or the content of the delay slot does not matter. # **Executable and Linking Format** The type of object files created by the SPARC assembler version for *SunOS 5.x* are now *Executable and Linking Format* (ELF) files. These relocatable ELF files hold code and data suitable for linking with other object files to create an executable or a shared object file, and are the assembler normal output. The assembler can also write information to standard output (for example, under the -S option) and to standard error (for example, under the -V option). The SPARC assembler creates a default output file when standard input or multiple files are used. This chapter is organized into the following sections: - Section 3.1 "ELF Header" on page 24 - Section 3.2 "Sections" on page 26 - Section 3.3 "Locations " on page 33 - Section 3.5 "Relocation Tables " on page 34 - Section 3.6 "Symbol Tables" on page 34 - Section 3.4 "Addresses " on page 34 - Section 3.7 "String Tables " on page 36 - Section 3.8 "Assembler Directives " on page 36 The ELF object file format consists of: - Header - Sections - Locations - Addresses - Relocation tables - Symbol tables - String tables # 3.1 ELF Header The *ELF header* is always located at the beginning of the ELF file. It describes the ELF file organization and contains the actual sizes of the object file control structures. The initial bytes of an ELF header specify how the file is to be interpreted. The ELF header contains the following information: ehsize ELF header size in bytes. entry Virtual address at which the process is to start. A value of 0 indicates no associated entry point. flag Processor-specific flags associated with the file. ident Marks the file as an object file and provides machine-independent data to decode and interpret the file contents. machine Specifies the required architecture for an individual file. A value of 2 specifies SPARC. phentsize Size in bytes of entries in the program header table. All entries are the same size. phnum Number of entries in program header table. A value of 0 indicates the file has no program header table. phoff Program header table file offset in bytes. The value of $\boldsymbol{0}$ indicates no program header. #### shentsize Size in bytes of the section header. A section header is one entry in the section header table; all entries are the same size. Number of entries in section header table. A value of 0 indicates the file has no section header table. ### shoff Section header table file offset in bytes. The value of 0 indicates no section header. ### shstrndx Section header table index of the entry associated with the section name string table. A value of Shn\_undef indicates the file does not have a section name string table. ### type Identifies the object file type. Table 3-1 describes the reserved object file types. ### version Identifies the object file version. Table 3-1 shows reserved object file types: TABLE 3-1 | Туре | Value | Description | |--------|--------|--------------------| | none | 0 | No file type | | rel | 1 | Relocatable file | | exec | 2 | Executable file | | dyn | 3 | Shared object file | | core | 4 | Core file | | loproc | 0xff00 | Processor-specific | | hiproc | 0xffff | Processor-specific | # 3.2 Sections A section is the smallest unit of an object that can be relocated. The following sections are commonly present in an ELF file: - Section header - Executable text - Read-only data - Read-write data - Read-write uninitialized data (section header only) Sections do not need to be specified in any particular order. The *current section* is the section to which code is generated. These sections contain all other information in an object file and satisfy several conditions. - 1. Every section must have one section header describing the section. However, a section header does not need to be followed by a section. - 2. Each section occupies one contiguous sequence of bytes within a file. The section may be empty (that is, of zero-length). - 3. A byte in a file can reside in only one section. Sections in a file cannot overlap. - 4. An object file may have inactive space. The contents of the data in the inactive space are unspecified. Sections can be added for multiple text or data segments, shared data, user-defined sections, or information in the object file for debugging. **Note -** Not all of the sections need to be present. ### 3.2.1 Section Header The *section header* allows you to locate all of the file sections. An entry in a section header table contains information characterizing the data in a section. The section header contains the following information: addr Address at which the first byte resides if the section appears in the memory image of a process; the default value is 0. ### addralign Aligns the address if a section has an address alignment constraint; for example, if a section contains a double-word, the entire section must be ensured double-word alignment. Only 0 and positive integral powers of 2 are currently allowed. A value of 0 or 1 indicates no address alignment constraints. #### entsize Size in bytes for entries in fixed-size tables such as the symbol table. One-bit descriptions of section attributes. Table 3-2 describes the section attribute TABLE 3-2 | Flag | Default<br>Value | Description | |---------------|------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------| | SHF_WRITE | 0x1 | Contains data that is writable during process execution. | | SHF_ALLOC | 0×2 | Occupies memory during process execution. This attribute is <i>off</i> if a control section does not reside in the memory image of the object file. | | SHF_EXECINSTR | 0x4 | Contains executable machine instructions. | | SHF_MASKPROC | 0xf0000000 | Reserved for processor-specific semantics. | #### info Extra information. The interpretation of this information depends on the section type, as described in Table 3-3. #### link Section header table index link. The interpretation of this information depends on the section type, as described in Table 3-3. name Specifies the section name. An index into the section header string table section specifies the location of a null-terminated string. #### offset Specifies the byte offset from the beginning of the file to the first byte in the section. **Note** - If the section type is SHT\_NOBITS, *offset* specifies the conceptual placement of the file. #### size Specifies the size of the section in bytes. **Note -** If the section type is $SHT_NOBITS$ , *size* may be non-zero; however, the section still occupies no space in the file. #### type Categorizes the section contents and semantics. Table 3–3 describes the section types. TABLE 3-3 | | | | Interpretation by | | |----------|-------|----------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------|-------------------------------------------------------------------| | Name | Value | Description | info | link | | null | 0 | Marks section header as inactive. | | | | progbits | 1 | Contains information defined explicitly by the program. | | | | symtab | 2 | Contains a symbol table for link editing. This table may also be used for dynamic linking; however, it may contain many unnecessary symbols. | One greater than the symbol table index of the last local symbol. | The section<br>header index of<br>the associated<br>string table. | | | | Note: Only one section of this type is allowed in a file | | | | strtab | 3 | Contains a string table. A file may have multiple string table sections. | | | TABLE 3-3 (continued) | | | | Interpretation by | | | |---------|-------|--------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------|-------------------------------------------------------------------|--| | Name | Value | Description | info | link | | | rela | 4 | Contains relocation entries with explicit addends. A file may have multiple relocation sections. | The section header index of the section to which the relocation applies. | The section<br>header index of<br>the associated<br>symbol table. | | | hash | 5 | Contains a symbol rehash table. | 0 | The section header index of | | | | | Note: Only one section of this type is allowed in a file | | the symbol table<br>to which the hash<br>table applies. | | | dynamic | 6 | Contains dynamic linking information. | 0 | The section header index of | | | | | Note: Only one section of this type is allowed in a file | | the string table used by entries in the section. | | | note | 7 | Contains information that marks the file. | | | | | nobits | 8 | Contains information defined explicitly by the program; however, a section of this type does not occupy any space in the file. | | | | | rel | 9 | Contains relocation entries without explicit addends. A file may have multiple relocation sections. | The section header index of the section to which the relocation applies. | The section<br>header index of<br>the associated<br>symbol table. | | | shlib | 10 | Reserved. | | | | | dynsym | 11 | Contains a symbol table with a minimal set of symbols for dynamic linking. | One greater than the symbol table index of the last local symbol. | The section<br>header index of<br>the associated | | | | | Note: Only one section of this type is allowed in a file | | string table. | | TABLE 3–3 (continued) | | | | Interpretation by | | |--------|------------|---------------------------------------------------------------------------------------------------------------------------------|-------------------|------| | Name | Value | Description | info | link | | loproc | 0x70000000 | Lower and upper bound of range reserved for | | | | hiproc | 0x7fffffff | processor-specific semantics. | | | | louser | 0x80000000 | Lower and upper bound of range reserved for application | | | | hiuser | 0xffffffff | programs. | | | | | | Note: Section types in this range<br>may be used by an application<br>without conflicting with<br>system-defined section types. | | | **Note -** Some section header table indexes are reserved and the object file will not contain sections for these special indexes. ### 3.2.2 Predefined User Sections A section that can be manipulated by the section control directives is known as a *user section*. You can use the section control directives to change the user section in which code or data is generated. Table 3–4 lists the predefined user sections that can be named in the section control directives. TABLE 3-4 | Section Name | Description | |----------------|-------------------------------------------------| | .bss | Section contains uninitialized read-write data. | | .comment | Comment section. | | .data & .data1 | Section contains initialized read-write data. | | .debug | Section contains debugging information. | TABLE 3-4 (continued) | Section Name | Description | |--------------------|-------------------------------------------------------| | .fini | Section contains runtime finalization instructions. | | .init | Section contains runtime initialization instructions. | | .rodata & .rodata1 | Section contains read-only data. | | .text | Section contains executable text. | | .line | Section contains line # info for symbolic debugging. | | .note | Section contains note information. | #### Creating an .init Section in an Object File 3.2.2.1 program is executed. To create an .init section in an object file, use the assembler pseudo-ops shown in Code Example 3-1. ### CODE EXAMPLE 3-1 Creating an .init Section ``` .section ".init" .align 4 <instructions> ``` At link time, the .init sections in a sequence of .o files are concatenated into an .init section in the linker output file. The code in the .init section are executed before the main program is executed. Because the whole .init section is treated as a single function body, it is recommented that the only code added to these sections be in the following form:. ``` call routine_name nop ``` The called routine should be located in another section. This will prevent conflicting register and stack usage within the .init sections. ### 3.2.2.2 Creating a .fini Section in an Object File .fini sections contain codes that are to be executed after the the main program is executed. To create an .fini section in an object file, use the assembler pseudo-ops shown in Code Example 3–2. ### CODE EXAMPLE 3-2 Creating an .fini Section ``` .section ".fini" .align 4 <instructions> ``` At link time, the .fini sections in a sequence of .o files are concatenated into a .fini section in the linker output file. The codes in the .fini section are executed after the main program is executed. Because the whole .fini section is treated as a single function body, it is recommended that the only code added to these section be in the following form:. ``` call routine_name nop ``` The called routine should be located in another section. This will prevent conflicting register and stack usage within the .fini sections. ### 3.2.3 Predefined Non-User Sections Table 3–5 lists sections that are predefined but cannot be named in the section control directives because they are not under user control. TABLE 3-5 | Section Name | Description | |--------------|------------------------------------------------------| | ".dynamic" | Section contains dynamic linking information. | | .dynstr | Section contains strings needed for dynamic linking. | | .dynsym | Section contains the dynamic linking symbol table. | | .got | Section contains the global offset table. | TABLE 3-5 (continued) | Section Name | Description | |----------------------|-------------------------------------------------------------------------------------------------------------------------------------------| | .hash | Section contains a symbol hash table. | | .interp | Section contains the path name of a program interpreter. | | .plt | Section contains the procedure linking table. | | .relname & .relaname | Section containing relocation information. <i>name</i> is the section to which the relocations apply, that is, ".rel.text", ".rela.text". | | .shstrtab | String table for the section header table names. | | .strtab | Section contains the string table. | | .symtab | Section contains a symbol table. | ### 3.3 Locations A location is a specific position within a section. Each location is identified by a section and a byte offset from the beginning of the section. The current location is the location within the current section where code is generated. A location counter tracks the current offset within each section where code or data is being generated. When a section control directive (for example, the .section pseudo-op) is processed, the location information from the location counter associated with the new section is assigned to and stored with the name and value of the current location. The current location is updated at the end of processing each statement, but can be updated during processing of data-generating assembler directives (for example, the .word pseudo-op). Note - Each section has one location counter; if more than one section is present, only one location can be current at any time. # 3.4 Addresses Locations represent *addresses in memory* if a section is allocatable; that is, its contents are to be placed in memory at program runtime. Symbolic references to these locations must be changed to addresses by the SPARC link editor. ### 3.5 Relocation Tables The assembler produces a companion *relocation table* for each relocatable section. The table contains a list of relocations (that is, adjustments to data in the section) to be performed by the link editor. # 3.6 Symbol Tables A *symbol table* contains information to locate and relocate symbolic definitions and references. The SPARC assembler creates a symbol table section for the object file. It makes an entry in the symbol table for each symbol that is defined or referenced in the input file and is needed during linking. The symbol table is then used by the SPARC link editor during relocation. The section header contains the symbol table index for the first non-local symbol. A symbol table contains the following information: name Index into the object file symbol string table. A value of zero indicates the symbol table entry has no name; otherwise, the value represents the string table index that gives the symbol name. value Value of the associated symbol. This value is dependent on the context; for example, it may be an address, or it may be an absolute value. size Size of symbol. A value of $\boldsymbol{0}$ indicates that the symbol has either no size or an unknown size. info Specifies the symbol type and binding attributes. Table 3-6 and Table 3-7 describes these values. ### other Undefined meaning. Current value is 0. ### shndx Contains the section header table index to another relevant section, if specified. As a section moves during relocation, references to the symbol will continue to point to the same location because the value of the symbol will change as well. TABLE 3-6 | Value | Туре | Description | |-------|---------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | 0 | notype | Type not specified. | | 1 | object | Symbol is associated with a data object; for example, a variable or an array. | | 2 | func | <i>Symbol</i> is associated with a function or other executable code. When another object file references a function from a shared object, the link editor automatically creates a procedure linkage table entry for the referenced symbol. | | 3 | section | <i>Symbol</i> is associated with a section. These types of symbols are primarily used for relocation. | | 4 | file | Gives the name of the source file associated with the object file. | | 13 | loproc | Values reserved for processor-specific semantics. | | 15 | hiproc | | Table 3–7 shows the symbol binding attributes. TABLE 3-7 | Value | Binding | Description | |-------|---------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | 0 | local | <i>Symbol</i> is defined in the object file and not accessible in other files. Local symbols of the same name may exist in multiple files. | | 1 | global | <i>Symbol</i> is either defined externally or defined in the object file and accessible in other files. | | 2 | weak | <i>Symbol</i> is either defined externally or defined in the object file and accessible in other files; however, these definitions have a lower precedence than globally defined symbols. | | 13 | loproc | Values reserved for processor-specific semantics. | | 15 | hiproc | | # 3.7 String Tables A *string table* is a section which contains null-terminated variable-length character sequences, or strings, in the object file; for example, symbol names and file names. The strings are referenced in the section header as indexes into the string table section. - A string table index may refer to any byte in the section. - Empty string table sections are permitted; however, the index referencing this section must contain zero. A string may appear multiple times and may also be referenced multiple times. References to substrings may exist, and unreferenced strings are allowed. ## 3.8 Assembler Directives Assembler directives, or pseudo-operations (pseudo-ops), are commands to the assembler that may or may not result in the generation of code. The different types of assembler directives are: ■ Section Control Directives - Symbol Attribute Directives - Assignment Directives - Data Generating Directives - Optimizer Directives See Appendix A, for a complete description of the pseudo-ops supported by the SPARC assembler. ### 3.8.1 Section Control Directives When a section is created, a section header is generated and entered in the ELF object file section header table. The *section control pseudo-ops* allow you to make entries in this table. Sections that can be manipulated with the section control directives are known as *user sections*. You can also use the section control directives to change the user section in which code or data is generated. **Note** - The *symbol table*, *relocation table*, and *string table* sections are created implicitly. The section control pseudo-ops cannot be used to manipulate these sections. The section control directives also create a section symbol which is associated with the location at the beginning of each created section. The section symbol has an offset value of zero. ### 3.8.2 Symbol Attribute Directives The *symbol attribute* pseudo-ops declare the symbol type and size and whether it is local or global. ### 3.8.3 Assignment Directive The *assignment* directive associates the value and type of expression with the symbol and creates a symbol table entry for the symbol. This directive constitutes a *definition* of the symbol and, therefore, must be the only definition of the symbol. ### 3.8.4 Data Generating Directives The data generating directives are used for allocating storage and loading values. ## Converting Files to the New Format ## 4.1 Introduction This chapter discusses how to convert existing SunOS 4.1 SPARC assembly files to the *SunOS 5.x* SPARC assembly file format. ## 4.2 Conversion Instructions - Remove the leading underscore (\_) from symbol names. The *Solaris 2.x* SPARCompilers do not prepend a leading underscore to symbol names in the users' programs as did the SPARCompilers that ran under SunOS 4.1. - Prefix local symbol names with a dot (.) . Local symbol names in the *SunOS 5.x* SPARC assembly language begin with a dot (.) so that they will not conflict with user programs' symbol names. - Change the usage of the pseudo-op .seg to .section, for example, change .seg data to .section .data. See Appendix A, for more information. **Note -** The above conversions can be automatically achieved by passing the -T option to the assembler. # 4.3 Examples Figure 4–1 shows how to convert an existing 4.1 file to the new format. The lines that are different in the new format are marked with change bars. Figure 4–1 Converting a 4.x File to the New Format # **Instruction-Set Mapping** The tables in this chapter describe the relationship between hardware instructions of the SPARC architecture, as defined in *The SPARC Architecture Manual* and the assembly language instruction set recognized by the *SunOS 5.x* SPARC assembler. - Section 5.1 "Table Notation " on page 41 - Section 5.2 "Integer Instructions" on page 43 - Section 5.3 "Floating-Point Instruction" on page 53 - Section 5.4 "Coprocessor Instructions" on page 56 - Section 5.5 "Synthetic Instructions" on page 56 The SPARC-V9 instruction set is described in Appendix E. ## 5.1 Table Notation Table 5–1 shows the table notation used in this chapter to describe the instruction set of the assembler. The following notations are commonly suffixed to assembler mnemonics (uppercase letters refer to SPARC architecture instruction names. TABLE 5-1 | Notations | Describes | Comment | |-----------------------------------------|-----------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------| | address | reg <sub>rs1</sub> + reg <sub>rs2</sub> | Address formed from register contents, immediate constant, or both. | | | reg <sub>rs1</sub> + const13 | | | | reg <sub>rs1</sub> const13 | | | | const13 + reg <sub>rs1</sub> | | | | const13 | | | asi | | Alternate address space identifier; an unsigned 8-bit value. It can be the result of the evaluation of a symbol expression. | | const13 | | A signed constant which fits in 13 bits. It can be the result of the evaluation of a symbol expression. | | const22 | | A constant which fits in 22 bits. It can be the result of the evaluation of a symbol expression. | | creg | %c0 %c31 | Coprocessor registers. | | freg | %f0 %f31 | Floating-point registers. | | imm7 | | A signed or unsigned constant that can be represented in 7 bits (it is in the range -64 127). It can be the result of the evaluation of a symbol expression. | | reg | %r0 %r31 | General purpose registers. | | | %g0 %g7 | Same as %r0 %r7 (Globals) | | | %00 %07 | Same as %r8 %r15 (Outs) | | | %10 %17 | Same as %r16 %r23 (Locals) | | | %i0 %i7 | Same as %r24 %r31 (Ins) | | reg <sub>rd</sub> | | Destination register. | | reg <sub>rs1</sub> , reg <sub>rs2</sub> | | Source register 1, source register 2. | TABLE 5-1 (continued) | Notations | Describes | Comment | |----------------------|------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------| | reg_or_imm | reg <sub>rs2</sub> , const13 | Value from either a single register, or an immediate constant. | | regaddr | reg <sub>rs1</sub> reg <sub>rs1</sub> + reg <sub>rs2</sub> | Address formed with register contents only. | | Software_trap_number | $reg_{rs1} + reg_{rs2}$ $reg_{rs1} + imm7$ $reg_{rs1} - imm7$ $uimm7$ $imm7 + reg_{rs1}$ | A value formed from register contents, immediate constant, or both. The resulting value must be in the range 0127, inclusive. | | uimm7 | | An unsigned constant that can be represented in 7 bits (it is in the range 0 127). It can be the result of the evaluation of a symbol expression. | ### 5.2 **Integer Instructions** The notations described in Table 5-2 are commonly suffixed to assembler mnemonics (uppercase letters for architecture instruction names). TABLE 5-2 | Notation | Description | |----------|---------------------------------------------| | a | Instructions that deal with alternate space | | b | Byte instructions | | С | Reference to coprocessor registers | | d | Doubleword instructions | TABLE 5–2 (continued) | Notation | Description | |----------|---------------------------------------| | f | Reference to floating-point registers | | h | Halfword instructions | | đ | Quadword instructions | | sr | Status register | Table 5–3 outlines the correspondence between SPARC hardware integer instructions and SPARC assembly language instructions. The syntax of individual instructions is designed so that a destination operand (if any), which may be either a register or a reference to a memory location, is always the last operand in a statement. #### Note - In Table 5-3, ■ Braces ({ }) indicate optional arguments. Braces are not literally coded. ■ Brackets ([])indicate indirection: the contents of the addressed memory location are being read from or written to. Brackets are coded literally in the assembly language. Note that the usage of brackets described in Chapter 2 differs from the usage of these brackets. ■ All Bicc and Bfcc instructions described may indicate that the annul bit is to be set by appending ",a" to the opcode mnemonic; for example, TABLE 5-3 | Opcode | Mnemonic | Argument List | Operation | Comments | |--------|----------|----------------------------------------------------|--------------------|----------| | ADD | add | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | Add | | | ADDcc | addcc | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | Add and modify icc | | <sup>&</sup>quot;bgeu,a label" TABLE 5-3 (continued) | Opcode | Mnemonic | Argument List | Operation | Comments | |--------|----------|----------------------------------------------------|-----------------------------------|---------------| | ADDX | addx | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | Add with carry | | | ADDXcc | addxcc | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | | | | AND | and | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | And | | | ANDcc | andcc | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | | | | ANDcc | andn | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | | | | ANDNcc | andcc | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | | | | BN | bn{,a} | label | Branch on integer condition codes | branch never | | BNE | bne{,a} | label | | synonym: bnz | | BE | be{,a} | label | | synonym: bz | | BG | bg{,a} | label | | | | BLE | ble{,a} | label | | | | BGE | bge{,a} | label | | | | BI | bl{,a} | label | | | | BGU | bgu{,a} | label | | | | BLEU | bleu{,a} | label | | | | BCC | bcc{,a} | label | | synonym: bgeu | | BCS | bcs{,a} | label | | synonym: blu | | BPOS | bpos{,a} | label | | | | BNEG | bneg{,a} | label | | | | BVC | bvc{,a} | label | | | | BVS | bvs{,a} | label | | | TABLE 5–3 (continued) | Opcode | Mnemonic | Argument List | Operation | Comments | |--------|-----------|---------------|---------------------------------------|--------------| | ВА | ba{,a} | label | | synonym: b | | CALL | call | label | Call subprogram | | | СВссс | cbn{,a} | label | Branch on coprocessor condition codes | branch never | | | cb3{,a} | label | condition codes | | | | cb2{,a} | label | | | | | cb23{,a} | label | | | | | cb1{,a} | label | | | | | cb13{,eo} | label | | | | | cb12{,a} | label | | | | | cb123{,a} | label | | | | | cb0{,a} | label | | | | | cb03{,a} | label | | | | | cb02{,a} | label | | | | | cb023{,a} | label | | | | | cb01{,a} | label | | | | | cb013{,a} | label | | | | | cb012{,a} | label | | | | | cba{,a} | label | | | TABLE 5-3 (continued) | Opcode | Mnemonic | Argument List | Operation | Comments | |--------|-----------|------------------------------|------------------------------------------|---------------| | FBN | fbn{,a} | label | Branch on floating-point condition codes | branch never | | FBU | fbu{,a} | label | | | | FBG | fbg{,a} | label | | | | FBUG | fbug{,a} | label | | | | FBL | fbl{,a} | label | | | | FBUL | fbul{,a} | label | | | | FBLG | fblg{,a} | label | | | | FBNE | fbne{,a} | label | | synonym: fbnz | | FBE | fbe{,a} | label | | synonym: fbz | | FBUE | fbue{,a} | label | | | | FBGE | fbge{,a} | label | | | | FBUGE | fbuge{,a} | label | | | | FBLE | fble{,a} | label | | | | FBULE | fbule{,a} | label | | | | FBO | fbo{,a} | label | | | | FBA | fba{,a} | label | | | | FLUSH | flush | address | Instruction cache flush | | | JMPL | jmpl | address, reg <sub>rd</sub> | Jump and link | | | LDSB | ldsb | [address], reg <sub>rd</sub> | Load signed byte | | | LDSH | ldsh | [address], reg <sub>rd</sub> | Load signed halfword | | | LDSTUB | ldstub | [address], reg <sub>rd</sub> | Load-store unsigned byte | | TABLE 5–3 (continued) | Opcode | Mnemonic | Argument List | Operation | Comments | |---------|----------|----------------------------------------------------|---------------------------------------|---------------------------------| | LDUB | ldub | [address], reg <sub>rd</sub> | Load unsigned byte | | | LDUH | lduh | [address], reg <sub>rd</sub> | Load unsigned halfword | | | LD | ld | [address], reg <sub>rd</sub> | Load word | | | LDD | ldd | [address], reg <sub>rd</sub> | Load double word | reg <sub>rd</sub> must be even | | LDF | ld | [address], freg <sub>rd</sub> | | | | LDFSR | ld | [address], %fsr | Load floating-point register | | | LDDF | ldd | [address], freg <sub>rd</sub> | Load double floating-point | freg <sub>rd</sub> must be even | | LDC | ld | [address], creg <sub>rd</sub> | Load coprocessor | | | LDCSR | ld | [address], %csr | Load double coprocessor | | | LDDC | ldd | [address], creg <sub>rd</sub> | | | | LDSBA | ldsba | [regaddr]asi, reg <sub>rd</sub> | Load signed byte from alternate space | | | LDSHA | ldsha | [regaddr]asi, reg <sub>rd</sub> | | | | LDUBA | lduba | [regaddr]asi, reg <sub>rd</sub> | | | | LDUHA | lduha | [regaddr]asi, reg <sub>rd</sub> | | | | LDA | lda | [regaddr]asi, reg <sub>rd</sub> | | | | LDDA | ldda | [regaddr]asi, reg <sub>rd</sub> | | reg <sub>rd</sub> must be even | | LDSTUBA | ldstuba | [regaddr]asi, reg <sub>rd</sub> | | | | MULScc | mulscc | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | Multiply step (and modify icc) | | | NOP | nop | | No operation | | TABLE 5-3 (continued) | Opcode | Mnemonic | Argument List | Operation | Comments | |---------|----------|----------------------------------------------------|--------------------------------|-----------------------------| | OR | or | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | Inclusive or | | | ORcc | orcc | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | | | | ORN | orn | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | | | | ORNcc | orncc | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | | | | RDASR | rd | %asrn <sub>rs1</sub> , reg <sub>rd</sub> | | | | RDY | rd | %y, reg <sub>rd</sub> | | See synthetic instructions. | | RDPSR | rd | %psr, reg <sub>rd</sub> | | See synthetic instructions. | | RDWIM | rd | %wim, reg <sub>rd</sub> | | See synthetic instructions. | | RDTBR | rd | %tbr, reg <sub>rd</sub> | | See synthetic instructions. | | RESTORE | restore | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | | See synthetic instructions. | | RETT | rett | address | Return from trap | | | SAVE | save | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | | See synthetic instructions. | | SDIV | sdiv | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | Signed divide | | | SDIVcc | sdivcc | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | Signed divide and modify | | | SMUL | smul | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | Signed multiply | | | SMULcc | smulcc | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | Signed multiply and modify icc | | | SETHI | sethi | const22, reg <sub>rd</sub> | Set high 22 bits of register | | TABLE 5–3 (continued) | Opcode | Mnemonic | Argument List | Operation | Comments | |--------|----------|----------------------------------------------------|--------------------------------------|------------------------------------| | | sethi | %hi(value), reg <sub>rd</sub> | | See synthetic instructions. | | SLL | sll | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | Shift left logical | | | SRL | srl | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | Shift right logical | | | SRA | sra | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | Shift right arithmetic | | | STB | stb | reg <sub>rd</sub> , [address] | Store byte | Synonyms: stub, stsb | | STH | sth | reg <sub>rd</sub> , [address] | Store half-word | Synonyms: stuh, stsh | | ST | st | reg <sub>rd</sub> , [address] | | | | STD | std | reg <sub>rd</sub> , [address] | | reg <sub>rd</sub> Must be<br>even | | STF | st | freg <sub>rd</sub> , [address] | | | | STDF | std | freg <sub>rd</sub> , [address] | | | | STFSR | st | %fsr, [address] | Store floating-point status register | freg <sub>rd</sub> Must be even | | STDFQ | std | %fq, [address] | Store double floating-point queue | | | STC | st | creg <sub>rd</sub> , [address] | Store coprocessor | creg <sub>rd</sub> Must be<br>even | | STDC | std | creg <sub>rd</sub> , [address] | | creg <sub>rd</sub> Must be<br>even | | STCSR | st | %csr, [address] | | | | STDCQ | std | %cq, [address] | Store double coprocessor | | TABLE 5-3 (continued) | Opcode | Mnemonic | Argument List | Operation | Comments | |--------|----------|----------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------|-----------------------------------| | STBA | stba | reg <sub>rd</sub> [regaddr]asi | Store byte into alternate space | Synonyms: stuba,<br>stsba | | STHA | stha | reg <sub>rd</sub> [regaddr]asi | | Synonyms: stuha,<br>stsha | | STA | sta | reg <sub>rd</sub> , [regaddr]asi | | | | STDA | stda | reg <sub>rd</sub> , [regaddr]asi | | reg <sub>rd</sub> Must be<br>even | | SUB | sub | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | Subtract | | | SUBcc | subcc | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | Subtract and modify icc | | | SUBX | subx | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | Subtract with carry | | | SUBXcc | subxcc | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | | | | SWAP | swap | [address], reg <sub>rd</sub> | Swap memory word | | | SWAPA | swapa | [regaddr]asi, reg <sub>rd</sub> | with register | | | Ticc | tn | software_trap_number | Trap on integer condition code | Trap never | | | tne | software_trap_number | Note: Trap numbers 16-31 are reserved for the user. Currently-defined trap numbers are those defined in /usr/include/sys/trap.h | Synonym: tnz | TABLE 5–3 (continued) | Opcode | Mnemonic | Argument List | Operation | Comments | |----------|----------|----------------------------------------------------|------------------------------------------------|--------------| | | te | software_trap_number | | Synonym: tz | | | tg | software_trap_number | | | | | tle | software_trap_number | | | | | tge | software_trap_number | | | | | tl | software_trap_number | | | | | tgu | software_trap_number | | | | | tleu | software_trap_number | | Synonym: tcc | | | tlu | software_trap_number | | Synonym: tcc | | | tgeu | software_trap_number | | | | | tpos | software_trap_number | | | | | tneg | software_trap_number | | | | | tvc | software_trap_number | | Synonym: t | | | tvs | software_trap_number | | | | | ta | software_trap_number | | | | TADDcc | taddcc | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | Tagged add and modify | | | TSUBcc | tsubcc | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | | | | TADDccTV | taddcctv | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | Tagged add and modify icc and trap on overflow | | | TSUBccTV | tsubcctv | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | | | | UDIV | udiv | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | Unsigned divide | | | UDIVcc | udivcc | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | Unsigned divide and modify icc | | | UMUL | umul | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | Unsigned multiply | | TABLE 5-3 (continued) | Opcode | Mnemonic | Argument List | Operation | Comments | |--------|----------|----------------------------------------------------|----------------------------------|----------------------------| | UMULcc | umulcc | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | Unsigned multiply and modify icc | | | UNIMP | unimp | const22 | Illegal instruction | | | WRASR | wr | reg_or_imm, %asrn <sub>rs1</sub> | | | | WRY | wr | reg <sub>rs1</sub> , reg_or_imm, %y | | See synthetic instructions | | WRPSR | wr | reg <sub>rs1</sub> , reg_or_imm, %psr | | See synthetic instructions | | WRWIM | wr | reg <sub>rs1</sub> , reg_or_imm, %wim | | See synthetic instructions | | WRTBR | wr | reg <sub>rs1</sub> , reg_or_imm, %tbr | | See synthetic instructions | | XNOR | xnor | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | Exclusive nor | | | XNORcc | xnorcc | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | | | | XOR | xor | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | Exclusive or | | | XORcc | xorcc | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | | | ### 5.3 Floating-Point Instruction Table 5-4 shows floating-point instructions. In cases where more than numeric type is involved, each instruction in a group is described; otherwise, only the first member of a group is described. TABLE 5-4 | SPARC | Mnemonic <sup>1</sup> | Argument List | Description | |--------|-----------------------|------------------------------------------|---------------------------| | FiTOs | fitos | freg <sub>rs2</sub> , freg <sub>rd</sub> | Convert integer to single | | FiTOd | fitod | freg <sub>rs2</sub> , freg <sub>rd</sub> | Convert integer to double | | FiTOq | fitoq | freg <sub>rs2</sub> , freg <sub>rd</sub> | Convert integer to quad | | FsTOi | fstoi | freg <sub>rs2</sub> , freg <sub>rd</sub> | Convert single to integer | | FdTOi | fdtoi | freg <sub>rs2</sub> , freg <sub>rd</sub> | Convert double to integer | | FqTOi | fqtoi | freg <sub>rs2</sub> , freg <sub>rd</sub> | Convert quad to integer | | FsTOd | fstod | freg <sub>rs2</sub> , freg <sub>rd</sub> | Convert single to double | | FsTOq | fstoq | freg <sub>rs2</sub> , freg <sub>rd</sub> | Convert single to quad | | FdTOs | fdtos | freg <sub>rs2</sub> , freg <sub>rd</sub> | Convert double to single | | FdTOq | fdtoq | freg <sub>rs2</sub> , freg <sub>rd</sub> | Convert double to quad | | FqTOd | fqtod | freg <sub>rs2</sub> , freg <sub>rd</sub> | Convert quad to double | | FqTOs | fqtos | freg <sub>rs2</sub> , freg <sub>rd</sub> | Convert quad to single | | FMOVs | fmovs | freg <sub>rs2</sub> , freg <sub>rd</sub> | Move | | FNEGs | fnegs | freg <sub>rs2</sub> , freg <sub>rd</sub> | Negate | | FABSs | fabss | freg <sub>rs2</sub> , freg <sub>rd</sub> | Absolute value | | FSQRTs | fsqrts | freg <sub>rs2</sub> , freg <sub>rd</sub> | Square root | | FSQRTd | fsqrtd | freg <sub>rs2</sub> , freg <sub>rd</sub> | | | FSQRTq | fsqrtq | freg <sub>rs2</sub> , freg <sub>rd</sub> | | TABLE 5-4 (continued) | SPARC | Mnemonic <sup>1</sup> | Argument List | Description | | |--------|-----------------------|----------------------------------------------------------------|--------------------------------------------|--| | FADDs | fadds | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | Add | | | FADDd | faddd | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | | | | FADDq | faddq | $freg_{rs1}$ , $freg_{rs2}$ , $freg_{rd}$ | | | | FSUBs | fsubs | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | Subtract | | | FSUBd | fsubd | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | | | | FSUBq | fsubq | $freg_{rs1}$ , $freg_{rs2}$ , $freg_{rd}$ | | | | FMULs | fmuls | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | Multiply | | | FMULd | fmuld | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | | | | FMULq | fmulq | $freg_{rs1}$ , $freg_{rs2}$ , $freg_{rd}$ | | | | FdMULq | fmulq | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | Multiply double to quad | | | FsMULd | fsmuld | $freg_{rs1}$ , $freg_{rs2}$ , $freg_{rd}$ | Multiply single to double | | | FDIVs | fdivs | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | Divide | | | FDIVd | fdivd | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | | | | FDIVq | fdivq | $freg_{rs1}$ , $freg_{rs2}$ , $freg_{rd}$ | | | | FCMPs | fcmps | freg <sub>rs1</sub> , freg <sub>rs2</sub> | Compare | | | FCMPd | fcmpd | freg <sub>rs1</sub> , freg <sub>rs2</sub> | | | | FCMPq | fcmpq | freg <sub>rs1</sub> , freg <sub>rs2</sub> | | | | FCMPEs | fcmpes | freg <sub>rs1</sub> , freg <sub>rs2</sub> | Compare, generate exception if not ordered | | | FCMPEd | fcmped | freg <sub>rs1</sub> , freg <sub>rs2</sub> | | | | FCMPEq | fcmpeq | freg <sub>rs1</sub> , freg <sub>rs2</sub> | | | $<sup>1. \</sup>quad \text{Types of Operands are denoted by the following lower-case letters:} \verb|integers singled doubleq quad|\\$ # 5.4 Coprocessor Instructions All *coprocessor-operate* (cpop*n*) instructions take all operands from and return all results to coprocessor registers. The data types supported by the coprocessor are coprocessor-dependent. Operand alignment is also coprocessor-dependent. Coprocessor-operate instructions are described in Table 5–5. If the EC (PSR\_enable\_coprocessor) field of the processor state register (PSR) is 0, or if a coprocessor is not present, a cpop*n* instruction causes a *cp\_disabled* trap. The conditions that cause a *cp\_exception* trap are coprocessor-dependent. TABLE 5-5 | SPARC | Mnemonic | Argument List | Name | Comments | |-------|----------|------------------------------------------------------------------|-----------------------|----------------| | CPop1 | cpop1 | opc, reg <sub>rs1</sub> , reg <sub>rs2</sub> , reg <sub>rd</sub> | Coprocessor operation | | | CPop2 | cpop2 | opc, reg <sub>rs1</sub> , reg <sub>rs2</sub> , reg <sub>rd</sub> | Coprocessor operation | May modify ccc | # 5.5 Synthetic Instructions Table 5-6 describes the mapping of synthetic instructions to hardware instructions. TABLE 5-6 | Synthetic Instruction | | Hardware Equivalent(s) | | Comment | |-----------------------|--------------------------------|------------------------|---------------------------------------------------|------------| | btst | reg_or_imm, reg <sub>rs1</sub> | andcc | reg <sub>rs1</sub> , reg_or_imm, %g0 | Bit test | | bset | reg_or_imm, reg <sub>rd</sub> | or | reg <sub>rd</sub> , reg_or_imm, reg <sub>rd</sub> | Bit set | | bclr | reg_or_imm, reg <sub>rd</sub> | andn | reg <sub>rd</sub> , reg_or_imm, reg <sub>rd</sub> | Bit clear | | btog | reg_or_imm, reg <sub>rd</sub> | xor | reg <sub>rd</sub> , reg_or_imm, reg <sub>rd</sub> | Bit toggle | TABLE 5-6 (continued) | Synthetic Instruction | | Hardware Equivalent(s) | | Comment | |-----------------------|------------------------------|------------------------|------------------------------------------------|----------------------------------| | call | reg_or_imm | jmpl | reg_or_imm, %07 | | | clr | reg <sub>rd</sub> | or | %g0, %g0, <i>reg</i> <sub>rd</sub> | Clear (zero) register | | clrb | [address] | stb | %g0, [address] | Clear byte | | clrh | [address] | st | %g0, [address] | Clear halfword | | clr | [address] | st | %g0, [address] | Clear word | | стр | reg, reg_or_imm | subcc | reg <sub>rs1</sub> , reg_or_imm, %g0 | Compare | | dec | $reg_{\mathrm{rd}}$ | sub | reg <sub>rd</sub> , 1, reg <sub>rd</sub> | Decrement by 1 | | dec | const13, reg <sub>rd</sub> | sub | reg <sub>rd</sub> , const13, reg <sub>rd</sub> | Decrement by const13 | | deccc | reg <sub>rd</sub> | subcc | reg <sub>rd</sub> , 1, reg <sub>rd</sub> | Decrement by 1 and set | | deccc | const13, reg <sub>rd</sub> | subcc | reg <sub>rd</sub> , const13, reg <sub>rd</sub> | Decrement by const13 and set icc | | inc | $reg_{\mathrm{rd}}$ | add | reg <sub>rd</sub> , 1, reg <sub>rd</sub> | Increment by 1 | | inc | const13, reg <sub>rd</sub> | add | reg <sub>rd</sub> , const13, reg <sub>rd</sub> | Increment by const13 | | inccc | $\mathit{reg}_{\mathrm{rd}}$ | addcc | reg <sub>rd</sub> , 1, reg <sub>rd</sub> | Increment by 1 and set | | inccc | const13, reg <sub>rd</sub> | addcc | reg <sub>rd</sub> , const13, reg <sub>rd</sub> | Increment by const13 and set icc | | jmp | address | jmpl | address, %g0 | | TABLE 5–6 (continued) | Synthetic Instruction | | Hardware Equi | valent(s) | Comment | |-----------------------|----------------------------------------|---------------|---------------------------------------------------|------------------------------------------------------------------------| | mov | reg_or_imm,reg <sub>rd</sub> | or | %g0, reg_or_imm, reg <sub>rd</sub> | | | mov | %y, reg <sub>rs1</sub> | rd | %y, reg <sub>rs1</sub> | | | mov | %psr, reg <sub>rs1</sub> | rd | %psr, reg <sub>rs1</sub> | | | mov | %wim, reg <sub>rs1</sub> | rd | %wim, reg <sub>rs1</sub> | | | mov | %tbr, reg <sub>rs1</sub> | rd | %tbr, reg <sub>rs1</sub> | | | mov | reg_or_imm, %y | wr | %g0, <b>reg_or_imm</b> ,%y | | | mov | reg_or_imm, %psr | wr | %g0, <i>reg_or_imm</i> ,%psr | | | mov | reg_or_imm, %wim | wr | %g0, <i>reg_or_imm,</i> %wim | | | mov | reg_or_imm, %tbr | wr | %g0, <b>reg_or_imm,</b> %tbr | | | not | reg <sub>rs1</sub> , reg <sub>rd</sub> | xnor | reg <sub>rs1</sub> , %g0, reg <sub>rd</sub> | One's complement | | not | ${ m reg}_{ m rd}$ | xnor | reg <sub>rd</sub> , %g0, <i>reg</i> <sub>rd</sub> | One's complement | | neg | reg <sub>rs1</sub> , reg <sub>rd</sub> | sub | %g0, reg <sub>rs2</sub> , reg <sub>rd</sub> | Two's complement | | neg | reg <sub>rd</sub> | sub | %g0, reg <sub>rd</sub> , reg <sub>rd</sub> | Two's complement | | restore | | restore | %g0, %g0, %g0 | Trivial restore | | save | | save | %g0, %g0, %g0 | Trivial save | | | | | | trivial save should only be used in supervisor code! | | set | value,reg <sub>rd</sub> | or | %g0, value, reg <sub>rd</sub> | if -4096 ≤ <i>value</i> ≤ 4095 | | | Tu Tu | | -14 | Do not use the set synthetic instruction in an instruction delay slot. | | set | value,reg <sub>rd</sub> | sethi | %hi <i>(value), reg<sub>rd</sub></i> | if ((value & 0x3ff) == 0) | TABLE 5-6 (continued) | Synthetic Instruction | | Hardware Equivalent(s) | | Comment | |-----------------------|--------------------------|------------------------|--------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------| | set | value, reg <sub>rd</sub> | sethi<br>or | %hi(value), reg <sub>rd</sub> ; reg <sub>rd</sub> ,<br>%lo(value), reg <sub>rd</sub> | otherwise Do not use the set synthetic instruction in an instruction delay slot. | | skipz | | bnz,a .+8 | | if z is set, ignores next instruction | | skipnz | | bz,a .+8 | | if $z$ is not set, ignores next instruction | | tst | reg | orcc | <i>reg</i> <sub>rs1</sub> , %g0, %g0 | test | ### 5.6 V8/V9 Natural Pseudo Instructions Table 5–7 describes the V8/V9 natural pseudo instructions that will help increase the portability of your assembly code from V8/V8 plus to V9.. TABLE 5-7 | | -xarch= | | | |---------------------|------------------------|------|--| | Pseudo Instructions | V8/V8plus <sup>1</sup> | V9 | | | ldn | ld | ldx | | | stn | st | stx | | | ldna | lda | ldxa | | | stna | sta | stxa | | | setn | set | setx | | TABLE 5–7 (continued) | | -xarch= | | | |---------------------|------------------------|--------|--| | Pseudo Instructions | V8/V8plus <sup>1</sup> | V9 | | | setnhi | sethi | setxhi | | | casn | cas | casx | | | slln | sll | sllx | | | srln | srl | srlx | | | sran | sra | srax | | | clrn | clr | clrx | | <sup>1.</sup> Indicates default setting **Note -** Depending on the value set for the -xarch option, the assembler substitutes the appropriate pseudo instruction. ## **Pseudo-Operations** The pseudo-operations listed in this appendix are supported by the SPARC assembler. # A.1 Alphabetized Listing with Descriptions ``` Turns off the effect of the preceding .noalias pseudo-op. (Compiler-generated only.) .align boundary Aligns the location counter on a boundary where ((''location counter'' mod boundary)==0); boundary may be any power of 2. .ascii string [, string"] Generates the given sequence(s) of ASCII characters. .asciz string [, string]* Generates the given sequence(s) of ASCII characters. This pseudo-op appends a null (zero) byte to each string. .byte 8bitval [, 8bitval]* Generates (a sequence of) initialized bytes in the current segment. ``` Provides a tentative definition of *symbol*. Size bytes are allocated for the object represented by *symbol*. - If the symbol is not defined in the input file and is declared to be *local* to the file, the symbol is allocated in *sect\_name* and its location is optionally aligned to a multiple of *alignment*. If *sect\_name* is not given, the symbol is allocated in the uninitialized data section (*bss*). Currently, only .bss is supported for the section name. (.data is not currently supported.) - If the symbol is not defined in the input file and is declared to be *global*, the SPARC link editor allocates storage for the symbol, depending on the definition of *symbol\_name* in other files. Global is the default binding for common symbols. - If the symbol is defined in the input file, the definition specifies the location of the symbol and the tentative definition is overridden. ``` .double Orfloatval [, Orfloatval]* ``` Generates (a sequence of) initialized double-precision floating-point values in the current segment. *floatval* is a string acceptable to atof(3); that is, an optional sign followed by a non-empty string of digits with optional decimal point and optional exponent. ``` .empty ``` Suppresses assembler complaints about the next instruction presence in a delay slot when used in the delay slot of a Control Transfer Instruction (CTI). Some instructions should not be in the delay slot of a CTI. See the *SPARC* Architecture Manual for details. ``` .file string ``` Creates a symbol table entry where *string* is the symbol name and STT\_FILE is the symbol table type. *string* specifies the name of the source file associated with the object file. ``` .global symbol [, symbol]* .globl symbol [, symbol]* ``` Declares each *symbol* in the list to be global; that is, each symbol is either defined externally or defined in the input file and accessible in other files; default bindings for the symbol are overridden. - A global symbol definition in one file will satisfy an undefined reference to the same global symbol in another file. - Multiple definitions of a defined global symbol is not allowed. If a defined global symbol has more than one definition, an error will occur. - A global psuedo-op oes not need to occur before a definition, or tentative definition, of the specified symbol. **Note -** This pseudo-op by itself does not define the symbol. ``` .half 16bitval [, 16bitval]* ``` Generates (a sequence of) initialized halfwords in the current segment. The location counter must already be aligned on a halfword boundary (use .align 2). ``` .ident string ``` Generates the null terminated string in a comment section. This operation is equivalent to: ``` .pushsection .comment .asciz string .popsection .local symbol [, symbol]* ``` Declares each *symbol* in the list to be local; that is, each symbol is defined in the input file and not accessible in other files; default bindings for the symbol are overridden. These symbols take precedence over weak and global symbols. Since local symbols are not accessible to other files, local symbols of the same name may exist in multiple files. **Note -** This pseudo-op by itself does not define the symbol. ``` .noalias %reg1, %reg2 ``` %reg1 and %reg2 will not alias each other (that is, point to the same destination) until a .alias pseudo-op is issued. (Compiler-generated only.) ``` .nonvolatile ``` Defines the end of a block of instruction. The instructions in the block may not be permuted. This pseudo-op has no effect if: - The block of instruction has been previously terminated by a Control Transfer Instruction (CTI) or a label - There is no preceding .volatile pseudo-op ``` .nword 64bitval [, 64bitval]* ``` If -xarch=v8/v8plus then assembler interprets the instruction as .word. If -xarch=v9 the assembler interprets the instruction as .xword. ``` .optim string ``` This pseudo-op changes the optimization level of a particular function. (Compiler-generated only.) ``` .popsection ``` Removes the top section from the section stack. The new section on the top of the stack becomes the current section. This pseudo-op and its corresponding .pushsection command allow you to switch back and forth between the named sections. ``` .proc n ``` Signals the beginning of a *procedure* (that is, a unit of optimization) to the peephole optimizer in the SPARC assembler; *n* specifies which registers will contain the return value upon return from the procedure. (Compiler-generated only.) ``` .pushsection sect_name [, attributes] ``` Moves the named section to the top of the section stack. This new top section then becomes the current section. This pseudo-op and its corresponding <code>.popsection</code> command allow you to switch back and forth between the named sections. ``` .quad Orfloatval [, Orfloatval]* ``` Generates (a sequence of) initialized quad-precision floating-point values in the current segment. *floatval* is a string acceptable to atof(3); that is, an optional sign followed by a non-empty string of digits with optional decimal point and optional exponent. **Note -** The .quad command currently generates quad-precision values with only double-precision significance. ``` .reserve symbol, size [, sect_name [, alignment]] ``` Defines *symbol*, and reserves *size* bytes of space for it in the *sect\_name*. This operation is equivalent to: ``` .pushsection sect_name .align alignment symbol: .skip size .popsection ``` If a section is not specified, space is reserved in the current segment. ``` .section section_name [, attributes] ``` Makes the specified section the current section. The assembler maintains a section stack which is manipulated by the section control directives. The current section is the section that is currently on top of the stack. This pseudo-op changes the top of the section stack. - If section\_name does not exist, a new section with the specified name and attributes is created. - If section\_name is a non-reserved section, attributes must be included the first time it is specified by the .section directive. See the sections Section 3.2.2 "Predefined User Sections" on page 30 and Section 3.2.3 "Predefined Non-User Sections" on page 32 in Chapter 3, for a detailed description of the reserved sections. See Table 3-2 in Chapter 3, for a detailed description of the section attribute flags. Attributes can be: ``` #write | #alloc | #execinstr .seg section_name ``` **Note** - This pseudo-op is currently supported for compatibility with existing SunOS 4.1 SPARC assembly language programs. This pseudo-op has been replaced by the .section pseudo-op. Changes the current section to one of the predefined user sections. The assembler will interpret the following SunOS 4.1 SPARC assembly directive: to be the same as the following *SunOS 5.x* SPARC assembly directive: ``` .seg text, .seg data, .seg datal, .seg bss, .section .text, .section .data, .section .data1, .section .bss. ``` Predefined user section names are changed in SunOS 5.x. ``` .single Orfloatval [, Orfloatval]* ``` Generates (a sequence of) initialized single-precision floating-point values in the current segment. Note - This operation does not align automatically. ``` .size symbol, expr ``` Declares the symbol size to be expr. expr must be an absolute expression. ``` .skip n ``` Increments the location counter by n, which allocates n bytes of empty space in the current segment. .stabn <various parameters> The pseudo-op is used by *Solaris 2.x* SPARCompilers only to pass debugging information to the symbolic debuggers. .stabs <various parameters> The pseudo-op is used by *Solaris 2.x* SPARCompilers only to pass debugging information to the symbolic debuggers. .type symbol, type Declares the type of symbol, where *type* can be: #object #function #no\_type See Table 3-6 in Chapter 3, for detailed information on symbols. .uahalf 16bitval [, 16bitval]\* Generates a (sequence of) 16-bit value(s). **Note -** This operation does not align automatically. .uaword 32bitval [, 32bitval]\* Generates a (sequence of) 32-bit value(s). **Note** - This operation does not align automatically. .version string Identifies the minimum assembler version necessary to assemble the input file. You can use this pseudo-op to ensure assembler-compiler compatibility. If *string* indicates a newer version of the assembler than this version of the assembler, a fatal error message is displayed and the SPARC assembler exits. .volatile Defines the beginning of a block of instruction. The instructions in the section may not be changed. The block of instruction should end at a <code>.nonvolatile</code> pseudo-op and should not contain any Control Transfer Instructions (CTI) or labels. The volatile block of instructions is terminated after the last instruction preceding a CTI or label. ``` .weak symbol [, symbol] ``` Declares each *symbol* in the list to be defined either externally, or in the input file and accessible to other files; default bindings of the symbol are overridden by this directive. Note the following: - A weak symbol definition in one file will satisfy an undefined reference to a global symbol of the same name in another file. - Unresolved weak symbols have a default value of zero; the link editor does not resolve these symbols. - If a weak symbol has the same name as a defined global symbol, the weak symbol is ignored and no error results. **Note -** This pseudo-op does not itself define the symbol. ``` .word 32bitval [, 32bitval]* ``` Generates (a sequence of) initialized words in the current segment. **Note -** This operation does not align automatically. ``` .xword 64bitval [, 64bitval]* ``` Generates (a sequence of) initialized 64-bit values in the current segment. Note - This operation does not align automatically. ``` .xstabs <various parameters> ``` The pseudo-op is used by Solaris 2.x SPARCompilers only to pass debugging information to the symbolic debuggers. ``` symbol =expr ``` Assigns the value of expr to symbol. # **Examples of Pseudo-Operations** This chapter shows some examples of ways to use various pseudo-ops. # B.1 Example 1 This example shows how to use the following pseudo-ops to specify the bindings of variables in C: ``` common, .global, .local, .weak ``` The following C definitions/declarations: ``` int fool = 1; #pragma weak foo2 = fool static int foo3; static int foo4 = 2; ``` can be translated into the following assembly code: #### CODE EXAMPLE B-1 ``` .pushsection ".data" .global foo1 ! int foo1 = 1 .align 4 foo1: .word 0x1 .type foo1,#object ! foo1 is of type data object, .size foo1,4 ! with size = 4 bytes .weak foo2 ! #pragma weak foo2 = foo1 foo2 = foo1 .local foo3 ! static int foo3 ``` ``` .common foo3,4,4 .align 4 ! static int foo4 = 2 foo4: .word 0x2 .type foo4,#object .size foo4,4 .popsection ``` # B.2 Example 2 This example shows how to use the pseudo-op .ident to generate a string in the .comment section of the object file for identification purposes. ``` .ident "acomp: (CDS) SPARCompilers 2.0 alpha4 12 Aug 1991" ``` # B.3 Example 3 The pseudo-ops shown in this example are .align, .global, .type, and .size. The following C subroutine: ``` int sum(a, b) int a, b; { return(a + b); } ``` can be translated into the following assembly code: ``` .section ".text" .global sum .align 4 sum: retl add %00,%01,%00 ! (a + b) is done in the ! delay slot of retl ``` ``` .type sum, #function ! sum is of type function .size sum,.-sum ! size of sum is the diff ! of current location ! counter and the initial ! definition of sum ``` ### **B.4** Example 4 The pseudo-ops shown in this example are .section, .ascii, and .align. The example calls the printf function to output the string "hello world". ``` .section ".data1" .align 4 .L16: "hello world\n\0" .ascii .section ".text" .global main main: save %sp,-96,%sp set .L16,%o0 call printf,1 nop restore ``` ### **B.5** Example 5 This example shows how to use the .volatile and .nonvolatile pseudo-ops to protect a section of handwritten asembly code from peephole optimization. ``` .volatile t 0x24 std %g2, [%o0] retl nop .nonvolatile ``` ## Using the Assembler Command Line This appendix is organized into the following secitons: - Section C.1 "Assembler Command Line" on page 73 - Section C.2 "Assembler Command Line Options" on page 74 - Section C.3 "Disassembling Object Code" on page 77 ## C.1 Assembler Command Line You invoke the assembler command line as follows: as [options] [inputfile] ... **Note -** The language drivers (such as cc and f77) invoke the assembler command line with the fbe command. You can use either the as or fbe command to invoke the assembler command line. The as command translates the assembly language source files, *inputfile*, into an executable object file, *objfile*. The SPARC assembler recognizes the filename argument *hyphen* (-) as the standard input. It accepts more than one file name on the command line. The input file is the concatenation of all the specified files. If an invalid option is given or the command line contains a syntax error, the SPARC assembler prints the error (including a synopsis of the command line syntax and options) to standard error output, and then terminates. The SPARC assembler supports macros, #include files, and symbolic substitution through use of the C preprocessor cpp. The assembler invokes the preprocessor before assembly begins if it has been specified from the command line as an option. (See the $\neg P$ option.) ## C.2 Assembler Command Line Options -b This option generates extra symbol table information for the source code browser. - If the as command line option -P is set, the cpp preprocessor also collects browser information. - If the as command line option -m is set, this option is ignored as the m4 macro processor does not generate browser data. For more information about the SPARCworks SourceBrowser, see the *Browsing Source Code* manual. -Dname -Dname=def When the -P option is in effect, these options are passed to the cpp preprocessor without interpretation by the as command; otherwise, they are ignored. -Ipath When the -P option is in effect, this option is passed to the cpp preprocessor without interpretation by the as command; otherwise, it is ignored. -K PIC This option generates position-independent code. This option has the same functionality as the -k option under the SunOS 4.1 SPARC assembler. **Note -** -K PIC and -K pic are equivalent. -L Saves all symbols, including temporary labels that are normally discarded to save space, in the ELF symbol table. -m This option runs m4 macro preprocessing on input. The m4 preprocessor is more powerful than the C preprocessor (invoked by the -P option), so it is more useful for complex preprocessing. See the m4 (1) man page for more information about the m4 macro-processor. Suppress all warnings while assembling. -o outfile Takes the next argument as the name of the output file to be produced. By default, the .s suffix, if present, is removed from the input file and the .o suffix is appended to form the ouput file name. -P Run cpp, the C preprocessor, on the files being assembled. The preprocessor is run separately on each input file, not on their concatenation. The preprocessor output is passed to the assembler. $-Q\{y|n\}$ This option produces the "assembler version" information in the comment section of the output object file if the y option is specified; if the n option is specified, the information is suppressed. -q This option causes the assembler to perform a quick assembly. Many error-checks are not performed when -q is specified. Note - This option disables many error checks. It is recommended that you do not use this option to assemble handwritten assembly language. -S[a|b|c|1|A|B|C|L] Produces a disassembly of the emitted code to the standard output. Adding each of the following characters to the -S option produces: - a disassembling with address - b disassembling with ".bof" - c disassembling with comments - 1 disassembling with line numbers Capital letters turn the switch off for the corresponding option. -s This option places all stabs in the ".stabs" section. By default, stabs are placed in "stabs.excl" sections, which are stripped out by the static linker 1d during final execution. When the -s option is used, stabs remain in the final executable because ".stab" sections are not stripped out by the static linker 1d. This is a migration option for SunOS 4.1 assembly files to be assembled on SunOS 5.x systems. With this option, the symbol names in SunOS 4.1 assembly files will be interpreted as SunOS 5.x symbol names. This option can be used in conjunction with the $\neg$ S option to convert SunOS 4.1 assembly files to their corresponding SunOS 5.x versions. #### -Uname When the -P option is in effect, this option is passed to the cpp preprocessor without interpretation by the as command; otherwise, it is ignored. -V This option writes the version information on the standard error output. #### -xarch=v7 This option instructs the assembler to accept instructions defined in the SPARC version 7 (V7) architecture. The resulting object code is in ELF format. #### -xarch=v8 This option instructs the assembler to accept instructions defined in the SPARC-V8 architecture. The resulting object code is in ELF format. The quad-precision floating-point instructions are allowed; however when the program is executed these instructions cause a hardware exception called "trap" (an illegal instruction trap). The kernel has the trap handler to emulate the quad percision floating-point arithmetic. Consequently, all quad percision arithmetic is performed by the emulator in the kernel. #### -xarch=v8a This option instructs the assembler to accept instructions defined in the SPARC-V8 architecture, less the fsmuld instruction. The resulting object code is in ELF format. The quad-precision floating-point instructions are allowed; however when the program is executed these instructions cause a hardware exception called "trap" (an illegal instruction trap). The kernel has the trap handler to emulate the quad percision floating-point arithmetic. Consequently, all quad percision arithmetic is performed by the emulator in the kernel. This is the default choice of the -xarch= options. #### -xarch=v8plus This option instructs the assembler to accept instructions defined in the SPARC-V9 architecture. The resulting object code is in ELF format. The quad-precision floating-point instructions are allowed; however when the program is executed these instructions cause a hardware exception called "trap" (an illegal instruction trap). The kernel has the trap handler to emulate the quad percision floating-point arithmetic. Consequently, all quad percision arithmetic is performed by the emulator in the kernel. It will not execute on a Solaris V8 system (a machine with a V8 processor). It will execute on a Solaris V8+ system. This combination is a SPARC 64-bit processor and a 32-bit OS. For more information regarding SPARC-V9 instructions, see Appendix E." #### -xarch=v8plusa This option instructs the assembler to accept instructions defined in the SPARC-V9 architecture, plus the instructions in the Visual Instruction Set (VIS). The resulting object code is in V8+ ELF format. It will not execute on a Solaris V8 system. It will execute on a Solaris V8+ system. For more information about VIS instructions, see the "UltraSPARC Programmer's Reference Manual" and the "UltraSPARC User's Guide." The quad-precision floating-point instructions are allowed; however when the program is executed these instructions cause a hardware exception called "trap" (an illegal instruction trap). The kernel has the trap handler to emulate the quad percision floating-point arithmetic. Consequently, all quad percision arithmetic is performed by the emulator in the kernel. #### -xarch=v9 This option limits instruction set to the SPARC-V9 architecture. The resulting . o object files are in 64-bit ELF format and can only be linked with other object files in the same format. The resulting executable can only be run on a 64-bit SPARC processor running 64-bit Solaris 2.7 with the 64-bit kernel. **Note -** This option is available only on Solaris 7. #### -xarch=v9a This option limits instruction set to the SPARC-V9 architecture, adding the Visual Instruction Set (VIS) and extensions specific to UltraSPARC processors. The resulting .o object files are in 64-bit ELF format and can only be run on a 64-bit SPARC processor running 64-bit Solaris 2.7 with the 64-bit kernel. **Note -** This option is available only on Solaris.7. #### C.3Disassembling Object Code The dis program is the object code disassembler for ELF. It produces an assembly language listing of the object file. For detailed information about this function, see the man page dis(1). ## An Example Language Program The following code shows an example C language program; the second example code shows the corresponding assembly code generated by SPARCompiler C 3.0.2 that runs on the *Solaris 2.x* operating environment. Comments have been added to the asembly code to show correspondence to the C code. The following C Program computes the first n Fibonacci numbers: #### CODE EXAMPLE D-1 C Program Example Source ``` /* a simple program computing the first n Fibonacci numbers */ extern unsigned * fibonacci(); #define MAX_FIB_REPRESENTABLE 49 /* compute the first n Fibonacci numbers */ unsigned * fibonacci(n) int n; static unsigned fib_array[MAX_FIB_REPRESENTABLE] = {0,1}; unsigned prev_number = 0; unsigned curr_number = 1; if (n >= MAX_FIB_REPRESENTABLE) { printf("Fibonacci(%d) \ cannot \ be \ represented \ in \ a \ 32 \ bit \ word \ ", \ n); exit(1); for (i = 2; i < n; i++) { fib_array[i] = prev_number + curr_number; prev_number = curr_number; curr_number = fib_array[i]; return(fib_array); ``` ``` main() { int n, i; unsigned * result; printf("Fibonacci(n):, please enter n:\n"); scanf("%d", &n); result = fibonacci(n); for (i = 1; i <= n; i++) printf("Fibonacci (%d) is %u\n", i, *result++); }</pre> ``` The C SPARCompiler generates the following assembler output for the Fibonacci number C source. Annotation has been added to help you understand the code. #### CODE EXAMPLE D-2 Assembler Output From C Source ``` ! a simple program computing the first n Fibonacci numbers, showing various pseudo-operations, sparc instructions, synthetic instructions pseudo-operations: .align, .ascii, .file, .global, .ident, .proc, .section, .size, .skip, .type, .word ! sparc instructions: add, bg, bge, bl, ble, ld, or, restore, save, sethi, st synthetic instructions: call, cmp, inc, mov, ret .file "fibonacci.c" ! the original source file name .section ".text" ! text section (executable instructions) .proc 79 ! subroutine fibonacci, it's return ! value will be in %i0 .global fibonacci ! fibonacci() can be referenced ! outside this file .align 4 ! align the beginning of this section ! to word boundary fibonacci: save %sp,-96,%sp ! create new stack frame and register ! window for this subroutine /* if (n >= MAX_FIB_REPRESENTABLE) { */ ! note, C style comment strings are ! also permitted cmp %i0,49 ! n >= MAX_FIB_REPRESENTABLE ? ! note, n, the 1st parameter to ! fibonacci(), is stored in %i0 upon ! entry bl .L77003 mov 0,%i2 ! initialization of variable ! prev_number is executed in the ! delay slot /* printf("Fibonacci(%d) cannot be represented in a 32 bits word\n", n); */ sethi %hi(.L20),%o0 ! if branch not taken, call printf(), or %o0,%lo(.L20),%o0 ! set up 1st, 2nd argument in %o0, %o1; ``` ``` call printf,2 ! the ",2" means there are 2 out mov %i0,%o1 ! registers used as arguments /* exit(1); */ call exit,1 mov 1.%00 .L77003: ! initialize variables before the loop /* for (i = 2; i < n; i++) { */ mov 1,%i4 ! curr_number = 1 mov 2,%i3 ! i = 2 cmp %i3,%i0 ! i <= n? bqe .L77006 ! if not, return sethi %hi(.L16+8),%o0 ! use %i5 to store fib_array[i] add %o0,%lo(.L16+8),%i5 .LY1: ! loop body /* fib_array[i] = prev_number + curr_number; */ add %i2,%i4,%i2 ! fib_array[i] = prev_number+curr_number st %i2,[%i5] /* prev_number = curr_number; */ mov %i4,%i2 ! prev_number = curr_number /* curr_number = fib_array[i]; */ ld [%i5],%i4 ! curr_number = fib_array[i] inc %i3 ! i++ cmp %i3,%i0 ! i <= n? bl .LY1 ! if yes, repeat loop inc 4.%i5 ! increment ptr to fib_array[] .L77006: /* return(fib_array); */ sethi %hi(.L16),%o0 ! return fib_array in %i0 add %00,%lo(.L16),%i0 ret. restore ! destroy stack frame and register ! window .type fibonacci, #function ! fibonacci() is of type function .size fibonacci,(.-fibonacci) ! size of function: ! current location counter minus ! beginning definition of function .proc 18 ! main program .global main .align 4 save %sp,-104,%sp ! create stack frame for main() /* printf("Fibonacci(n):, please input n:\n"); */ sethi %hi(.L31),%o0 ! call printf, with 1st arg in %o0 call printf,1 or %o0,%lo(.L31),%o0 /* scanf("%d", &n); */ sethi %hi(.L33),%o0 ! call scanf, with 1st arg, in %o0 or %00,%lo(.L33),%o0 ! move 2nd arg. to %o1, in delay slot call scanf,2 add %fp,-4,%o1 /* result = fibonacci(n); */ call fibonacci,1 ld [%fp-4],%o0 ! some initializations before the for- ! loop, put the variables in registers /* for (i = 1; i <= n; i++) */ ``` ``` mov 1,%i5 ! %i5 <-- i mov %o0,%i4 ! %i4 <-- result sethi %hi(.L38),%o0 ! %i2 <-- format string for printf add %o0,%lo(.L38),%i2 ld [%fp-4],%o0 ! test if (i <= n) ? cmp %i5,%o0! note, n is stored in [%fp-4] bg .LE27 nop .LY2: ! loop body /* printf("Fibonacci (%d) is %u\n", i, *result++); */ ld [%i4],%o2 ! call printf, with (*result) in %o2, ! i in %o1, format string in %o0 mov %i5,%o1 call printf,3 mov %i2,%o0 inc %i5 ! i++ ld [%fp-4],%o0 ! i <= n? cmp %i5,%o0 ble .LY2 inc 4,%i4 ! result++ .LE27: ret restore .type main, #function ! type and size of main .size main,(.-main) .section ".data" ! switch to data section ! (contains initialized data) .L16: /* static unsigned fib_array[MAX_FIB_REPRESENTABLE] = \{0,1\}; */ .align 4 ! initialization of first 2 elements .word 0 ! of fib_array[] .align 4 .word 1 .skip 188 .type .L16, #object ! storage allocation for the rest of ! fib_array[] .section ".datal" ! the ascii string data are entered ! into the .datal section; ! #alloc: memory would be allocated for this section during run time ! #write: the section contains data ! that is writeable during process execution .align 4 ! ascii strings used in the printf stmts .ascii "Fibonacci(%d) cannot be represented in a 32 bit w" .ascii "ord\n\0" .align 4 ! align the next ascii string to word ! boundary .ascii "Fibonacci(n):, please enter n:\n\0" .align 4 .L33: .ascii "%d\0" .align 4 .L38: ``` 82 ``` .ascii "Fibonacci (%d) is %u\n\0" .ident "acomp: (CDS) SPARCompilers 2.0 05 Jun 1991" ! an idenitfication string produced ! by the compiler to be entered into ! the .comment section ``` ## **SPARC-V9 Instruction Set** This appendix describes changes made to the SPARC instruction set due to the SPARC-V9 architecture. Application software for the 32-bit SPARC-V8 (Version8) architecture can execute, unchanged, on SPARC-V9 systems. This appendix is organized into the following sections: - Section E.1 "SPARC-V9 Changes" on page 85 - Section E.2 "SPARC-V9 Instruction Set Changes" on page 88 - Section E.3 "SPARC-V9 Instruction Set Mapping" on page 91 - Section E.4 "SPARC-V9 Floating-Point Instruction Set Mapping" on page 102 - Section E.5 "SPARC-V9 Synthetic Instruction-Set Mapping" on page 103 - Section E.6 "UlraSPARC and VIS Instruction Set Extensions" on page 106 ## E.1 SPARC-V9 Changes The SPARC-V9 architecture differs from SPARC-V8 architecture in the following areas, expanded below: registers, alternate space access, byte order, and instruction set. ### E.1.1 Registers These registers have been deleted: | PSR | Processor State Register | |-----|--------------------------| | TBR | Trap Base Register | | WIM | Window Invalid Mask | These registers have been widened from 32 to 64 bits: #### TABLE E-2 | Integer registers | | |---------------------|---------------------| | All state registers | FSR, PC, nPC, and Y | **Note -** FSR Floating-Point State Register: fcc1, fcc2, and fcc3 (added floating-point condition code) bits are added and the register widened to 64-bits. These SPARC-V9 registers are within a SPARC-V8 register field: #### TABLE E-3 | CCR | Condition Codes Register | |-----------|---------------------------| | CWP | Current Window Pointer | | PIL | Processor Interrupt Level | | TBA | Trap Base Address | | TT[MAXTL] | Trap Type | | VER | Version | These are registers that have been added. ASI Address Space Identifier **CANRESTORE** Restorable Windows **CANSAVE** Savable windows **CLEANWIN** Clean Windows **FPRS** Floating-point Register State **OTHERWIN** Other Windows **PSTATE Processor State** TICK Hardware clock tick-counter TL Trap Level TNPC[MAXTL] Trap Next Program Counter TPC[MAXTL] Trap Program Counter TSTATE[MAXTL] **Trap State** WSTATE Windows State Also, there are sixteen additional double-precision floating-point registers, f[32] .. f[62]. These registers overlap (and are aliased with) eight additional quad-precision floating-point registers, f[32] .. f[60] The SPARC-V9, CWP register is decremented during a RESTORE instruction, and incremented during a SAVE instruction. This is the opposite of PSR.CWP's behavior in SPARC-V8. This change has no effect on nonprivileged instructions. #### **Alternate Space Access** E.1.2 Load- and store-alternate instructions to one-half of the alternate spaces can now be included in user code. In SPARC-V9, loads and stores to ASIs $00_{16} \dots 7f_{16}$ are privileged; those to ASIs $80_{16}$ .. FF $_{16}$ are nonprivileged. In SPARC-V8, access to alternate address spaces is privileged. ## E.1.3 Byte Order SPARC-V9 supports both little- and big-endian byte orders for data accesses only; instruction accesses are always performed using big-endian byte order. In SPARC-V8, all data and instruction accesses are performed in big-endian byte order. ## E.2 SPARC-V9 Instruction Set Changes Application software written for the SPARC-V8 processor runs unchanged on a SPARC-V9 processor. ## E.2.1 Extended Instruction Definitions to Support the 64-bit Model #### TABLE E-5 | FCMP, FCMPE | Floating-Point Compare—can set any of the four floating-point condition codes. | |-----------------------|-----------------------------------------------------------------------------------------------------------------------| | LDFSR, STFSR | Load/Store FSR- only affect low-order 32 bits of FSR | | LDUW, LDUWA | Same as LD, LDA in SPARC-V8 | | RDASR/WRASR | Read/Write State Registers - access additional registers | | SAVE/RESTORE | | | SETHI | | | SRA, SRL, SLL, Shifts | Split into 32-bit and 64-bit versions | | Тсс | (was Ticc) Operates with either the 32-bit integer condition codes (icc), or the 64-bit integer condition codes (xcc) | All other arithmetic operations operate on 64-bit operands and produce 64-bit results. #### Added Instructions to Support 64 bits E.2.2 | TARIF F-6 | | | |-----------|--|--| | | | | | | | | | F[sdq]TOx | Convert floating point to 64-bit word | |-----------------------------|-----------------------------------------------------------| | FxTO[sdq] | Convert 64-bit word to floating point | | FMOV[dq] | Floating-Point Move, double and quad | | FNEG[dq] | Floating-point Negate, double and quad | | FABS[dq] | Floating-point Absolute Value, double and quad | | LDDFA, STDFA,<br>LDFA, STFA | Alternate address space forms of LDDF, STDF, LDF, and STF | | LDSW | Load a signed word | | LDSWA | Load a signed word from an alternate space | | LDX | Load an extended word | | LDXA | Load an extended word from an alternate space | | LDXFSR | Load all 64 bits of the FSR register | | STX | Store an extended word | | STXA | Store an extended word into an alternate space | | STXFSR | Store all 64 bits if the FSR register | ## Added Instructions to Support High-Performance System Implementation E.2.3 | BPcc | Branch on integer condition code with prediction | |------------------------|----------------------------------------------------------------------| | BPr | Branch on integer register contents with prediction | | CASA, CASXA | Compare and Swap from an alternate space | | FBPfcc | Branch on floating-point condition code with prediction | | FLUSHW | Flush windows | | FMOVcc | Move floating-point register if condition code is satisfied | | FMOVr | Move floating-point register if integer register satisfies condition | | LDQF(A),<br>STQF(A) | Load/Store Quad Floating-point (in an alternate space) | | MOVcc | Move integer register if condition code is satisfied | | MOVr | Move integer register if register contents satisfy condition | | MULX | Generic 64-bit multiply | | POPC | Population count | | PREFETCH,<br>PREFETCHA | Prefetch Data | | SDIVX, UDIVX | Signed and Unsigned 64-bit divide | ## E.2.4 Deleted Instructions | Coprocessor loads and stores | | |------------------------------|------------------------------------------------------------------------------------------------------------------------| | RDTBR and<br>WRTBR | TBR no longer exists. It is replaced by TBA, which can be read/written with RDPR/WRPR instructions | | RDWIM and<br>WRWIM | WIM no longer exists. WIM has been replaced by several register-window registers | | REPSR and<br>WRPSR | PSR no longer exists. It has been replaced by several separate registers that are read/written with other instructions | | RETT | Return from trap (replace by DONE/RETRY) | | STDFQ | Store Double from Floating-point Queue (replaced by the RDPR FQ instruction | #### E.2.5 Miscellaneous Instruction Changes | TΑ | BL | E. | E- | -9 | |----|----|----|----|----| |----|----|----|----|----| | IMPDEPn | (Changed) Implementation-dependent instructions (replace SPARC-V8 CPop instructions) | |---------|--------------------------------------------------------------------------------------| | MEMBAR | (Added) Memory barrier (memory synchronization support) | #### E.3 SPARC-V9 Instruction Set Mapping describe the SPARC-V9 instruction-set mapping. TABLE E-10 | Opcode | Mnemonic | Argument List | Operation | Comments | |--------|-----------|---------------------|----------------------------------|-------------------------| | ВРА | ba{,a} | %icc or %xcc, label | (Branch on cc with prediction) | 1 | | | {,pt ,pn} | | Branch always | | | BPN | bn{,a} | %icc or %xcc, label | Branch never | 0 | | | {,pt ,pn} | | | | | BPNE | bne{,a} | %icc or %xcc, label | Branch on not equal | not Z | | | {,pt ,pn} | | | | | BPE | be{ ,a} | %icc or %xcc, label | Branch on equal | Z | | | {,pt ,pn} | | | | | BPG | bg{,a} | %icc or %xcc, label | Branch on greater | not (Z or (N<br>xor V)) | | | {,pt ,pn} | | | | | BPLE | ble{,a} | %icc or %xcc, label | Branch on less or equal | Z or (N xor<br>V) | | | {,pt ,pn} | | | ŕ | | BPGE | bge{,a} | %icc or %xcc, label | Branch on greater or equal | not (N xor V) | | | {,pt ,pn} | | | | | BPL | bl{,a} | %icc or %xcc, label | Branch on less | N xor V | | | {,pt ,pn} | | | | | BPGU | bgu{,a} | %icc or %xcc, label | Branch on greater unsigned | not (C or Z) | | | {,pt ,pn} | | | | | BPLEU | bleu{,a} | %icc or %xcc, label | Branch on less or equal unsigned | C or Z | | | {,pt ,pn} | | | | TABLE E-10 (continued) | Opcode | Mnemonic | Argument List | Operation | Comments | |--------|-----------------------------------------|----------------------------|---------------------------------------------------------------|----------| | BPCC | bcc{,a} {,pt ,pn} | %icc or %xcc, label | Branch on carry clear<br>(greater than or equal,<br>unsigned) | not C | | BPCS | bcs{,a} {,pt ,pn} | %icc or %xcc, label | Branch on carry set<br>(less than, unsigned) | С | | BPPOS | bpos{,a} | %icc or %xcc, label | Branch on positive | not N | | BPNEG | <pre>{,pt ,pn} bneg{,a} {,pt ,pn}</pre> | %icc or %xcc, label | Branch on negative | N | | BPVC | bvc{,a} | %icc or %xcc, label | Branch on overflow clear | not V | | BPVS | {,pt ,pn} bvs{,a} | %icc or %xcc, label | Branch on overflow set | V | | | {,pt ,pn} | | | | | BRZ | brz{,a} {,pt ,pn} | reg <sub>rs1</sub> , label | Branch on register zero | Z | | BRLEZ | brlez{,a} {,pt ,pn} | reg <sub>rs1</sub> , label | Branch on register less<br>than or equal to zero | N or Z | | BRLZ | brlz{,a} | reg <sub>rs1</sub> , label | Branch on register less<br>than zero | N | | BRNZ | <pre>{,pt ,pn} brnz{,a} {,pt ,pn}</pre> | reg <sub>rs1</sub> , label | Branch on register not zero | not Z | TABLE E-10 (continued) | Opcode | Mnemonic | Argument List | Operation | Comments | |--------|---------------------|---------------------------------------------------------------------------------------------------------------|--------------------------------------------------------|--------------| | BRGZ | brgz{,a} {,pt ,pn} | reg <sub>rs1</sub> , label | Branch on register<br>greater than zero | not (N or Z) | | BRGEZ | brgez{,a} {,pt ,pn} | reg <sub>rs1</sub> , label | Branch on register<br>greater than or equal to<br>zero | not N | | CASA | casa<br>casa | $[reg_{\rm rs1}] imm\_asi, reg_{\rm rs2}, reg_{\rm rd}$ $[reg_{\rm rs1}] \% asi, reg_{\rm rs2}, reg_{\rm rd}$ | Compare and swap<br>word from alternate<br>space | | | CASXA | casxa<br>casxa | $[reg_{\rm rs1}] imm\_asi, reg_{\rm rs2}, reg_{\rm rd}$ $[reg_{\rm rs1}] \% asi, reg_{\rm rs2}, reg_{\rm rd}$ | Compare and swap extended from alternate space | | | FBPA | fba{,a} {,pt ,pn} | %fccn, label | (Branch on cc with<br>prediction)<br>Branch never | 1 | | FBPN | fbn{,a} {,pt ,pn} | %fccn, label | Branch always | 0 | | FBPU | fbu{,a} {,pt ,pn} | %fccn, label | Branch on unordered | U | | FBPG | fbg{,a} {,pt ,pn} | %fccn, label | Branch on greater | G | | FBPUG | fbug{,a} {,pt ,pn} | %fccn, label | Branch on unordered or greater | G or U | | FBPL | fbl{,a} {,pt ,pn} | %fccn, label | Branch on less | L | | FBPUL | fbul{,a} {,pt ,pn} | %fccn, label | Branch on unordered or less | | TABLE E-10 (continued) | Opcode | Mnemonic | Argument List | Operation | Comments | |--------|-----------|--------------------------------------------------------|-----------------------------------------|-------------| | FBPLG | fblg{,a} | %fccn, label | Branch on less or greater | L or G | | | {,pt ,pn} | | ground | | | FBPNE | fbne{,a} | %fccn, label | Branch on not equal | L or G or U | | | {,pt ,pn} | | | | | FBPE | fbe{,a} | %fccn, label | Branch on equal | Е | | | {,pt ,pn} | | | | | FBPUE | fbue{,a} | %fccn, label | Branch on unordered or equal | E or U | | | {,pt ,pn} | | equai | | | FBPGE | fbge{,a} | %fccn, label | Branch on greater or | E or G | | | {,pt ,pn} | | equal | | | FBPUGE | fbuge{,a} | %fccn, label | Branch on unordered or greater or equal | E or G or U | | | {,pt ,pn} | | greater or equal | | | FBPLE | fble{,a} | %fccn, label | Branch on less or equal | E or L | | | {,pt ,pn} | | | | | FBPULE | fbule{,a} | %fccn, label | Branch on unordered or less or equal | E or L or u | | | {,pt ,pn} | | less or equal | | | FBPO | fbo{,a} | %fccn, label | Branch on ordered | E or L or G | | | {,pt ,pn} | | | | | FLUSHW | flushw | | Flush register windows | | | FMOVA | fmov | %icc or %xcc, freg <sub>rs2</sub> , freg <sub>rd</sub> | (Move on integer cc) | 1 | | | {s,d,q}a | 135 14 | Move always | | TABLE E-10 (continued) | Opcode | Mnemonic | Argument List | Operation | Comments | |---------|------------|--------------------------------------------------------|-------------------------------------------|-------------------------| | FMOVN | fmov | %icc or %xcc, freg <sub>rs2</sub> , freg <sub>rd</sub> | Move never | 0 | | | {s,d,q}n | | | | | FMOVNE | fmov | %icc or %xcc, freg <sub>rs2</sub> , freg <sub>rd</sub> | Move if not equal | not Z | | | {s,d,q}ne | | | | | FMOVE | fmov | %icc or %xcc, freg <sub>rs2</sub> , freg <sub>rd</sub> | Move if equal | Z | | | {s,d,q}e | | | | | FMOVG | fmov | %icc or %xcc, freg <sub>rs2</sub> , freg <sub>rd</sub> | Move if greater | not (Z or (N<br>xor V)) | | | {s,d,q}g | | | XOI V)) | | FMOVLE | fmov | %icc or %xcc, freg <sub>rs2</sub> , freg <sub>rd</sub> | Move if less or equal | Z or (N xor<br>V) | | | {s,d,q}le | | | <b>V</b> ) | | FMOVGE | fmov | %icc or %xcc, freg <sub>rs2</sub> , freg <sub>rd</sub> | Move if greater or equal | not (N xor V) | | | {s,d,q}ge | | | | | FMOVL | fmov | %icc or %xcc, freg <sub>rs2</sub> , freg <sub>rd</sub> | Move if less | N xor V | | | {s,d,q}l | | | | | FMOVGU | fmov | %icc or %xcc, freg <sub>rs2</sub> , freg <sub>rd</sub> | Move if greater unsigned | not (C or Z) | | | {s,d,q}gu | | unsigned | | | FMOVLEU | fmov | %icc or %xcc, freg <sub>rs2</sub> , freg <sub>rd</sub> | Move if less or equal unsigned | C or Z | | | {s,d,q}leu | | unsigned | | | FMOVCC | fmov | %icc or %xcc, freg <sub>rs2</sub> , freg <sub>rd</sub> | Move if carry clear<br>(greater or equal, | not C | | | {s,d,q}cc | | unsigned) | | TABLE E-10 (continued) | Opcode | Mnemonic | Argument List | Operation | Comments | |----------|------------|---------------------------------------------------------------|---------------------------------------------|----------| | FMOVCS | fmov | %icc or %xcc, freg <sub>rs2</sub> , freg <sub>rd</sub> | Move if carry set (less than, unsigned) | С | | | {s,d,q}cs | | than, and given, | | | FMOVPOS | fmov | %icc or %xcc, freg <sub>rs2</sub> , freg <sub>rd</sub> | Move if positive | not N | | | {s,d,q}pos | | | | | FMOVNEG | fmov | %icc or %xcc, freg <sub>rs2</sub> , freg <sub>rd</sub> | Move if negative | N | | | {s,d,q}neg | | | | | FMOVVC | fmov | %icc or %xcc, freg <sub>rs2</sub> , freg <sub>rd</sub> | Move if overflow clear | not V | | | {s,d,q}vc | | | | | FMOVVS | fmov | %icc or %xcc, freg <sub>rs2</sub> , freg <sub>rd</sub> | Move if overflow set | V | | | {s,d,q}vs | | | | | FMOVRZ | fmovr | reg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | (Move f-p register on cc) | | | | {s,d,q}e | | Move if register zero | | | FMOVRLEZ | fmovr | reg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | Move if register less<br>than or equal zero | | | | {s,d,q}lz | | than or equal zero | | | FMOVRLZ | fmovr | $reg_{rs1}$ , $freg_{rs2}$ , $freg_{rd}$ | Move if register less<br>than zero | | | | {s,d,q}lz | | than zero | | | FMOVRNZ | fmovr | $reg_{rs1}$ , $freg_{rs2}$ , $freg_{rd}$ | Move if register not zero | | | FMOVRGZ | {s,d,q}ne | $reg_{rs1}$ , $freg_{rs2}$ , $freg_{rd}$ | Move if register greater | | | FMOVRGEZ | fmovr | $reg_{rs1}$ , $freg_{rs2}$ , $freg_{rd}$ | than zero Move if register greater | | | | {s,d,q}gz | | than or equal to zero | | | | fmovr | | | | | | {s,d,q}gez | | | | TABLE E-10 (continued) | Opcode | Mnemonic | Argument List | Operation | Comments | |----------|----------------|---------------------------------------------------------------|-----------------------------------------|-----------------------| | FMOVFA | fmov{s,d,q}a | %fccn, freg <sub>rs2</sub> ,freg <sub>rd</sub> | (Move on floating-point cc) | 1 | | FMOVFN | fmov{s,d,q}n | %fcc $n$ , $freg_{rs2}$ , $freg_{rd}$ | Move always | 0 | | FMOVFU | fmov{s,d,q}u | %fcc $n$ , freg $_{ m rs2}$ , freg $_{ m rd}$ | Move never | U | | | fmov{s,d,q}g | $$\fcon, freg_{rs2}, freg_{rd}$ | Move if unordered | G<br>G or U | | FMOVFG | | | Move if greater | L | | FMOVFUG | fmov{s,d,q}ug | %fcc $n$ , fre $g_{ m rs2}$ , fre $g_{ m rd}$ | Move if unordered or greater | L or U | | FMOVFL | $fmov{s,d,q}l$ | %fcc $n$ , fre $g_{ m rs2}$ , fre $g_{ m rd}$ | Move if less | L or G | | FMOVFUL | fmov{s,d,q}ul | %fcc $n$ , fre $g_{ m rs2}$ , fre $g_{ m rd}$ | Move if unordered or | L or G or U | | FMOVFLG | fmov{s,d,q}lg | ${\tt %fcc} n$ , ${\it freg}_{ m rs2}$ , ${\it freg}_{ m rd}$ | less | Е | | | fmov{s,d,q}ne | %fccn, freg <sub>rs2</sub> ,freg <sub>rd</sub> | Move if less or greater | E or U | | FMOVFNE | , =, | | Move if not equal | E or G | | FMOVFE | $fmov{s,d,q}e$ | %fcc $n$ , fre $g_{ m rs2}$ , fre $g_{ m rd}$ | Move if equal | E or G or U | | FMOVFUE | fmov{s,d,q}ue | %fcc $n$ , fre $g_{ m rs2}$ , fre $g_{ m rd}$ | Move if unordered or equal | E or L<br>E or L or u | | FMOVFGE | fmov{s,d,q}ge | %fcc $n$ , fre $g_{ m rs2}$ ,fre $g_{ m rd}$ | Move if greater or equal | E or L or G | | FMOVFUGE | fmov{s,d,q}uge | %fcc $n$ , freg $_{ m rs2}$ ,freg $_{ m rd}$ | Move if unordered or greater or equal | | | FMOVFLE | fmov{s,d,q}le | %fcc $n$ , fre $g_{rs2}$ , fre $g_{rd}$ | Move if less or equal | | | FMOVFULE | fmov{s,d,q}ule | %fccn, freg <sub>rs2</sub> ,freg <sub>rd</sub> | Move if unordered or less or equal | | | FMOVFO | fmov{s,d,q}o | %fcc $n$ , freg $_{ m rs2}$ ,freg $_{ m rd}$ | Move if ordered | | | LDSW | ldsw | [address], reg <sub>rd</sub> | Load a signed word | | | LDSWA | ldsw | [regaddr] imm_asi, reg <sub>rd</sub> | Load signed word from alternate space | | | LDX | ldx | [address], reg <sub>rd</sub> | Load extended word | | | LDXA | ldxa | [regaddr] imm_asi, reg <sub>rd</sub> | Load extended word from alternate space | | | | ldxa | [reg_plus_imm] %asi, reg <sub>rd</sub> | Load floating-point | | | LDXFSR | ldx | [address], %fsr | state register | | | MEMBAR | membar | membar_mask | Memory barrier | | TABLE E-10 (continued) | Opcode | Mnemonic | Argument List | Operation | Comments | |--------|----------|-----------------------------------------------|-----------------------------------|---------------| | MOVA | mova | %icc or %xcc, reg_or_imm11, reg <sub>rd</sub> | (Move integer register on cc) | 1 0 | | MOVN | movn | %icc or %xcc, reg_or_imm11, | Move always | not Z | | MOVNE | movne | $reg_{ m rd}$ | Move never | Z | | MOVE | move | %icc or %xcc, reg_or_imm11, | Move if not equal | not (Z or (N | | MOVE | move | $^{reg}_{rd}$ | Move if equal | xor V)) | | MOVG | movg | %icc or %xcc, reg_or_imm11, | Move if greater | Z or (N xor | | MOVLE | movle | reg <sub>rd</sub> | Move if less or equal | V) | | MOVGE | movge | %icc or %xcc, reg_or_imm11, reg <sub>rd</sub> | Move if greater or equal | not (N xor V) | | MOVGE | movge | %icc or %xcc, reg_or_imm11, | Move if less | N xor V | | MOVL | movl | reg <sub>rd</sub> | Move if greater | not (C or Z) | | MOVGU | movgu | %icc or %xcc, reg_or_imm11, | unsigned | C or Z | | MOVLEU | movleu | $reg_{ m rd}$ | Move if less or equal unsigned | not C | | | | %icc or %xcc, reg_or_imm11, | Move if carry clear | C<br>not N | | MOVCC | movcc | $^{reg}_{\mathrm{rd}}$ | (greater or equal, | not N<br>N | | MOVCS | movcs | %icc or %xcc, reg_or_imm11, | unsigned) Move if carry set (less | not V | | MOVPOS | movpos | reg <sub>rd</sub> | than, unsigned) | V | | | _ | %icc or %xcc, reg_or_imm11, reg <sub>rd</sub> | Move if positive | · | | MOVNEG | movneg | %icc or %xcc, reg_or_imm11, | Move if negative | | | MOVVC | movvc | reg <sub>rd</sub> | Move if overflow clear | | | MOVVS | movvs | %icc or %xcc, reg_or_imm11, reg <sub>rd</sub> | Move if overflow set | | | | | %icc or %xcc, reg_or_imm11, reg <sub>rd</sub> | | | | | | %icc or %xcc, reg_or_imm11, reg <sub>rd</sub> | | | | | | %icc or %xcc, reg_or_imm11, reg <sub>rd</sub> | | | | | | %icc or %xcc, reg_or_imm11, reg <sub>rd</sub> | | | TABLE E-10 (continued) | Opcode | Mnemonic | Argument List | Operation | Comments | |---------|----------|-----------------------------------------------------|------------------------------------------------|-----------------------| | MOVFA | mova | %fccn,reg_or_imm11,regrd | (Move on floating-point cc) | 1 | | MOVFN | movn | %fcc <i>n,reg_or_imm11</i> , reg <sub>rd</sub> | Move always | 0<br>U | | MOVFU | movu | %fccn,reg_or_imm11,reg_rd | Move never | G | | MOVFG | movg | %fccn,reg_or_imm11,reg_rd | Move if unordered Move if greater | G or U | | MOVFUG | movug | %fccn,reg_or_imm11,reg <sub>rd</sub> | Move if greater Move if unordered or | L | | MOVFL | movl | %fcc <i>n</i> , reg_or_imm11, reg <sub>rd</sub> | greater Move if less | L or U<br>L or G | | MOVFUL | movul | %fccn,reg_or_imm11,reg_rd | Move if unordered or | L or G or U | | MOVFLG | movlg | %fccn, reg_or_imm11, reg_rd | less | E E or U | | MOVFNE | | | Move if less or greater | E or G | | MOVFE | movne | %fcc <i>n</i> , reg_or_imm11, reg <sub>rd</sub> | Move if not equal | E or G or U | | MOVFE | move | %fcc <i>n</i> , reg_or_imm11, reg <sub>rd</sub> | Move if equal Move if unordered or | E or L<br>E or L or u | | MOVFUE | movue | %fccn,reg_or_imm11,reg_rd | equal | E or L or u | | MOVFGE | movge | %fccn,reg_or_imm11,reg_rd | Move if greater or equal | E OI E OI G | | MOVFUGE | movuge | %fccn,reg_or_imm11,reg_rd | Move if unordered or greater or equal | | | MOVFLE | movle | %fccn,reg_or_imm11,reg <sub>rd</sub> | Move if less or equal | | | MOVFULE | movule | %fccn, reg_or_imm11, reg_rd | Move if unordered or less or equal | | | MOVFO | movo | %fccn,reg_or_imm11,reg <sub>rd</sub> | Move if ordered | | | MOVRZ | movre | reg <sub>rs1</sub> , reg_or_imm10,reg <sub>rd</sub> | (Move register on register cc) | Z | | MOVRLEZ | movrlez | reg <sub>rs1</sub> , reg_or_imm10,reg <sub>rd</sub> | Move if register zero | N or Z | | MOVRLZ | movrlz | reg <sub>rs1</sub> , reg_or_imm10,reg <sub>rd</sub> | Move if register less<br>than or equal to zero | N<br>not Z | | MOVRNZ | movrnz | reg <sub>rs1</sub> , reg_or_imm10,reg <sub>rd</sub> | Move if register less | N nor Z | | MOVRGZ | movrgz | reg <sub>rs1</sub> , reg_or_imm10,reg <sub>rd</sub> | than zero | not N | | MOVRGEZ | movrgez | reg <sub>rs1</sub> , reg_or_imm10,reg <sub>rd</sub> | Move if register not zero | | | | | | Move if register greater than zero | | | | | | Move if register greater than or equal to zero | | TABLE E-10 (continued) | Opcode | Mnemonic | Argument List | Operation | Comments | |-----------------------|------------------------------------|-----------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------| | MULX | mulx | reg <sub>rs1</sub> , reg_or_imm,reg <sub>rd</sub> | (Generic 64-bit<br>Multiply) Multiply<br>(signed or unsigned) | See SDIVX<br>and UDIVX | | POPC | popc | reg_or_imm, reg <sub>rd</sub> | Population count | | | PREFETCH<br>PREFETCHA | prefetch<br>prefetcha<br>prefetcha | [address], prefetch_dcn [regaddr]<br>imm_asi, prefetch_fcn<br>[reg_plus_imm] %asi,<br>prefetch_fcn | Prefetch data Prefetch data from alternate space | See The<br>SPARC<br>architecture<br>manual,<br>version 9 | | SDIVX | sdivx | reg <sub>rs1</sub> , reg_or_imm,reg <sub>rd</sub> | (64-bit signed divide)<br>Signed Divide | See MULX<br>and UDIVX | | STX<br>STXA<br>STXFSR | stx<br>stxa<br>stxa<br>stx | reg <sub>rd</sub> , [address] reg <sub>rd</sub> , [address] imm_asi reg <sub>rd</sub> , [reg_plus_imm] %asi %fsr, [address] | Store extended word<br>Store extended word<br>into alternate space<br>Store floating-point<br>register (all 64-bits) | | | UDIVX | udivx | reg <sub>rs1</sub> , reg_or_imm, reg <sub>rd</sub> | (64-bit unsigned divide)<br>Unsigned divide | See MULX<br>and SDIVX | # E.4 SPARC-V9 Floating-Point Instruction Set Mapping SPARC-V9 floating-point instructions are shown in the following table. TABLE E-11 | SPARC | Mnemonic <sup>1</sup> | Argument List | Description | |-----------|-----------------------|------------------------------------------|------------------------------------------| | F[sdq]TOx | fstox | freg <sub>rs2</sub> , freg <sub>rd</sub> | Convert floating point to 64-bit integer | | | fdtox | freg <sub>rs2</sub> , freg <sub>rd</sub> | | | | fqtox | freg <sub>rs2</sub> , freg <sub>rd</sub> | | | | fstoi | freg <sub>rs2</sub> , freg <sub>rd</sub> | Convert floating-point to 32-bit integer | | | fdtoi | freg <sub>rs2</sub> , freg <sub>rd</sub> | | | | fqtoi | freg <sub>rs2</sub> , freg <sub>rd</sub> | | | FxT0[sdq] | fxtos | freg <sub>rs2</sub> , freg <sub>rd</sub> | Convert 64-bit integer to floating point | | | fxtod | freg <sub>rs2</sub> , freg <sub>rd</sub> | | | | fxtoq | freg <sub>rs2</sub> , freg <sub>rd</sub> | | | | fitos | freg <sub>rs2</sub> , freg <sub>rd</sub> | Convert 32-bit integer to floating point | | | fitod | freg <sub>rs2</sub> , freg <sub>rd</sub> | | | | fitoq | freg <sub>rs2</sub> , freg <sub>rd</sub> | | | FMOV[dq] | fmovd | freg <sub>rs2</sub> , freg <sub>rd</sub> | Move double | | | fmovq | freg <sub>rs2</sub> , freg <sub>rd</sub> | Move quad | | FNEG[dq] | fnegd | freg <sub>rs2</sub> , freg <sub>rd</sub> | Negate double | | | fnegq | freg <sub>rs2</sub> , freg <sub>rd</sub> | Negate quad | TABLE E-11 (continued) | SPARC | Mnemonic <sup>1</sup> | Argument List | Description | |----------|-----------------------|-------------------------------------------------------------------------------|---------------------------------------------------------------| | FABS[dq] | fabsd | freg <sub>rs2</sub> , freg <sub>rd</sub> | Absolute value double | | | fabsq | freg <sub>rs2</sub> , freg <sub>rd</sub> | Absolute value quad | | LDFA | lda | [regaddr] imm_asi, freg <sub>rd</sub> | Load floating-point register from alternate space | | LDDFA | lda | [reg_plus_imm] %asi, freg | Load double floating-point register | | LDQFA | ldda | [regaddr] imm_asi, freg <sub>rd</sub> [reg_plus_imm] %asi, freg <sub>rd</sub> | from alternate space. Load quad floating-point register from | | | ldda | [regaddr] imm_asi, freg <sub>rd</sub> | alternate space | | | ldqa | [reg_plus_imm] %asi, freg <sub>rd</sub> | | | | ldqa | | | | STFA | sta | freg <sub>rd</sub> , [regaddr] imm_asi | Store floating-point register to alternate space | | STDFA | sta | freg <sub>rd</sub> , [reg_plus_imm] %asi | Store double floating-point register to alternate space | | STOFA | stda | freg <sub>rd</sub> , [regaddr] imm_asi | Store quad floating-point register to | | ~ | stda | freg <sub>rd</sub> , [reg_plus_imm] %asi | alternate space | | | stqa | freg <sub>rd</sub> , [regaddr] imm_asi | | | | stqa | freg <sub>rd</sub> , [reg_plus_imm] %asi | | $<sup>1. \</sup>quad \text{Types of Operands are denoted by the following lower-case letters:} \\ \text{i } 32\text{-bit integers } \text{64-bit integers singled doubleq } \text{quad}$ ### **E.5** SPARC-V9 Synthetic Instruction-Set Mapping Here is a mapping of synthetic instructions to hardware equivalent instructions. TABLE E-12 | Synthetic Instru | ection | Hardware Equiv | valent(s) | Comment | |------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------| | cas<br>casl<br>casx<br>casxl | $ [reg_{rsl}], reg_{rs2}, reg_{rd} $ $ [reg_{rsl}], reg_{rs2}, reg_{rd} $ $ [reg_{rsl}], reg_{rs2}, reg_{rd} $ $ [reg_{rsl}], reg_{rs2}, reg_{rd} $ $ [reg_{rsl}], reg_{rs2}, reg_{rd} $ | casa<br>casa<br>casxa<br>casxa | $ \begin{aligned} &[reg_{rsl}] \text{ASI\_P, } reg_{rs2}, \\ ®_{rd} \\ &[reg_{rsl}] \text{ASI\_P\_L, } reg_{rs2}, \\ ®_{rd} \\ &[reg_{rsl}] \text{ASI\_P, } reg_{rs2}, \\ ®_{rd} \\ &[reg_{rsl}] \text{ASI\_P\_L, } reg_{rs2}, \\ ®_{rd} \end{aligned} $ | Compare & swap (cas) cas little-endian cas extended cas little-endian, extended | | clrx | [address] | stx | %g0, [address] | Clear extended word | | clruw<br>clruw | $reg_{rs1}$ , $reg_{rd}$ $reg_{rd}$ | srl<br>srl | $reg_{rs1}$ , %g0, $reg_{rd}$<br>$reg_{rd}$ , %g0, $reg_{rd}$ | Copy and clear upper<br>word<br>Clear upper word | | iprefetch | label | bn, pt | %xcc, label | Instruction prefetch, | | mov<br>mov | %y, reg <sub>rd</sub> %asrn, reg <sub>rd</sub> reg_or_imm, %asrn | rd<br>rd<br>wr | %y, reg <sub>rd</sub> %asrn, reg <sub>rd</sub> %g0, reg_or_imm, %asrn | | | ret<br>retl | | jmpl<br>jmpl | %i7+8, %g0<br>%o7+8, %g0 | Return from subroutine<br>Return from leaf<br>subroutine | | setn | value, r1, r2 | for -xarch=v9 same as setx value r1, r2 for -xarch=v8 same as set value r2 | | | | setnhi | value, r1, r2 | r1, r2 | same as setxhi <i>value</i> | | TABLE E-12 (continued) | Synthetic Instru | ction | Hardware Equiv | valent(s) | Comment | |------------------|-------------------------|----------------|------------------------------------------------------------------|----------------------------------------------------------------| | setuw | value,reg <sub>rd</sub> | sethi | %hi <i>(value), reg</i> rd | (value & $3FF_{16}$ )==0<br>when $0 \le \text{value} \le 4095$ | | | | or | %g0, <i>value</i> , <i>reg</i> rd | (otherwise) | | | | sethi | %hi <i>(value), reg</i> rd; | Do not use setuw in a DCTI delay slot. | | | | or | reg <sub>rd</sub> , %10(value), reg <sub>rd</sub> | DC11 delay slot. | | setsw | value,reg <sub>rd</sub> | sethi | %hi <i>(value), reg</i> rd | value>=0 and (value & 3FF <sub>16</sub> )==0 | | | | or | %g0, <i>value</i> , <i>reg</i> rd | -4096 ≤ value ≤ 4095 | | | | sethi | %hi <i>(value), reg</i> rd | if (value<0) and ((value<br>& 3FF)==0) | | | | sra | $\mathit{reg}_{\mathrm{rd}}$ , %g0, $\mathit{reg}_{\mathrm{rd}}$ | (otherwise, if value>=0) | | | | sethi | %hi <i>(value), reg</i> <sub>rd;</sub> | (otherwise, if value<0) | | | | or | reg <sub>rd</sub> , %10(value), reg <sub>rd</sub> | Do not use setsw in a CTI delay slot. | | | | sethi | %hi(value), reg <sub>rd</sub> ; | | | | | or | reg <sub>rd</sub> , %10(value), reg <sub>rd</sub> | | | | | sra | reg <sub>rd</sub> , %g0, reg <sub>rd</sub> | | | setx | value, r1, r2 | sethi | %hh(value), r1 | | | | | or | r1, %hm(value), r1 | | | | | sethi | %lm( <i>value</i> ), r2<br>r2, %lo( <i>value</i> ), r2 | | | | | or | r1, 32, r1 | | | | | sllx | r1, r2, r2 | | | | | or | | | TABLE E-12 (continued) | Synthetic Instruction | | Hardware Equivalent(s) | | Comment | |-----------------------|-------------------------------------------|------------------------|------------------------------------------------------------------------------------|----------------------------------------| | setxhi | value r1, r2 | sethi or sethi sllx or | %hh(value), r1<br>r1, %hm(value), r1<br>%lm(value), r2<br>r1, 32, r1<br>r1, r2, r2 | | | signx<br>signx | $reg_{ m rsl,} reg_{ m rd}$ $reg_{ m rd}$ | sra<br>sra | $reg_{rsl,}$ %g0, $reg_{rd}$ $reg_{rd,}$ %g0, $reg_{rd}$ | Sign-extend 32-bit<br>value to 64 bits | ## E.6 UlraSPARC and VIS Instruction Set Extensions This section describes extensions that require SPARC-V9. The extensions support enhanced graphics functionality and improved memory access efficiency. **Note -** SPARC-V9 instruction set extensions used in executables may not be portable to other SPARC-V9 systems. ## **E.6.1** Graphics Data Formats The overhead of converting to and from floating-point arithmetic is high, so the graphics instructions are optimized for short-integer arithmetic. Image components are 8 or 16 bits. Intermediate results are 16 or 32 bits. ## E.6.2 Eight-bit Format A 32-bit word contains pixels of four unsigned 8-bit integers. The integers represent image intensity values ( X, G, B, R). Support is provided for *band interleaved* images (store color components of a point), and band sequential images (store all values of one color component). #### E.6.3 **Fixed Data Formats** A 64-bit word contains four 16-bit signed fixed-point values. This is the fixed 16-bit data format. A 64-bit word contains two 8-bit signed fixed-point values. This is the fixed 32-bit data format. Enough precision and dynamic range (for filtering and simple image computations on pixel values) can be provided by an intermediate format of fixed data values. Pixel multiplication is used to convert from pixel data to fixed data. Pack instructions are used to convert from fixed data to pixel data (clip and truncate to an 8-bit unsigned value). The FPACKFIX instruction supports conversion from 32-bit fixed to 16-bit fixed. Rounding is done by adding one to the rounding bit position. You should use floating-point data to perform complex calculations needing more precision or dynamic range. #### **SHUTDOWN Instruction** E.6.4 All outstanding transactions are completed before the SHUTDOWN instruction completes. #### TABLE E-13 | SPARC | Mnemonic | Argument List | Description | |----------|----------|---------------|-----------------------------------| | SHUTDOWN | shutdown | | shutdown to enter power down mode | #### E.6.5**Graphics Status Register (GSR)** You use ASR 0x13 instructions RDASR and WRASR to access the Graphics Status Register. | SPARC | Mnemonic | Argument List | Description | |-------|----------|---------------------------------------|-------------| | RDASR | rdasr | %gsr, reg <sub>rd</sub> | read GSR | | WRASR | wrasr | reg <sub>rs1</sub> , reg_or_imm, %gsr | write GSR | ## **E.6.6** Graphics Instructions Unless otherwise specified, floating-point registers contain all instruction operands. There are 32 double-precision registers. Single-precision floating-point registers contain the pixel values, and double-precision floating-point registers contain the fixed values. The opcode space reserved for the Implementation-Dependent Instruction1 (IMPDEP1) instructions is where the graphics instruction set is mapped. Partitioned add/subtract instructions perform two 32-bit or four 16-bit partitioned adds or subtracts between the source operands corresponding fixed point values. TABLE E-15 | SPARC | Mnemonic | Argument List | Description | |----------|----------|----------------------------------------------------------------|--------------------------------------------| | FPADD16 | fpadd16 | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | four 16-bit add | | FPADD16S | fpadd16s | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | two 16-bit add<br>two 32-bit add | | FPADD32 | fpadd32 | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | one 32-bit add | | FPADD32S | fpadd32s | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | four 16-bit subtract | | FPSUB16 | fpsub16 | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | two 16-bit subtract<br>two 32-bit subtract | | FPSUB16S | fpsub16s | $freg_{rs1}$ , $freg_{rs2}$ , $freg_{rd}$ | one 32-bit subtract | | FPSUB32 | fpsub32 | $freg_{rs1}$ , $freg_{rs2}$ , $freg_{rd}$ | | | FPSUB32S | fpsub32s | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | | Pack instructions convert to a lower pixel or precision fixed format. TABLE E-16 | SPARC | Mnemonic | Argument List | Description | |----------|----------|----------------------------------------------------------------|---------------------------------------| | FPACK16 | fpack16 | freg <sub>rs2</sub> , freg <sub>rd</sub> | four 16-bit packs | | FPACK32 | fpack32 | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | two 32-bit packs<br>four 16-bit packs | | FPACKFIX | fpackfix | freg <sub>rs2</sub> , freg <sub>rd</sub> | four 16-bit expands | | FEXPAND | fexpand | freg <sub>rs2</sub> , freg <sub>rd</sub> | two 32-bit merges | | FPMERGE | fpmerge | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | | Partitioned multiply instructions have the following variations. TABLE E-17 | SPARC | Mnemonic | Argument List | Description | |-------------|-------------|----------------------------------------------------------------|----------------------------------------------------------| | FMUL8x16 | fmul8x16 | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | 8x16-bit partition | | FMUL8x16AU | fmul8x16au | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | 8x16-bit upper 💢 partition<br>8x16-bit lower 💢 partition | | FMUL8x16AL | fmul8x16al | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | upper 8x16-bit partition | | FMUL8SUx16 | fmul8sux16 | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | lower unsigned 8x16-bit | | FMUL8ULx16 | fmul8ulx16 | | upper 8x16-bit partition | | FMULD8SUx16 | fmuld8sux16 | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | lower unsigned 8x16-bit partition | | FMULD8ULx16 | fmuld8ulx16 | $freg_{rs1}$ , $freg_{rs2}$ , $freg_{rd}$ | | | | | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | | Alignment instructions have the following variations. | SPARC | Mnemonic | Argument List | Description | |---------------------|------------|----------------------------------------------------------------|---------------------------------------------------------------------| | ALIGNADDRESS | alignaddr | reg <sub>rs1</sub> , reg <sub>rs2</sub> , reg <sub>rd</sub> | find misaligned data access address | | ALIGNADDRESS_LITTLE | alignaddrl | reg <sub>rs1</sub> , reg <sub>rs2</sub> , reg <sub>rd</sub> | same as above, but little-endian do misaligned data, data alignment | | FALIGNDATA | faligndata | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | do misangned data, data angiment | Logical operate instructions perform one of sixteen 64-bit logical operations between rs1 and rs2 (in the standard 64-bit version). TABLE E-19 | SPARC | Mnemonic | Argument List | Description | |--------|----------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------| | FZERO | fzero | $\mathit{freg}_{\mathrm{rd}}$ | zero fill | | FZEROS | fzeros | freg <sub>rd</sub> | zero fill, single precision<br>one fill | | FONE | fone | $freg_{ m rd}$ | one fill, single precision | | FONES | fones | freg <sub>rd</sub> | copy src1 | | FSRC1 | fsrcl | freg <sub>rs1</sub> , freg <sub>rd</sub> | | | FSRC1S | fsrcls | freg <sub>rs1</sub> , freg <sub>rd</sub> | copy src1, single precision | | FSRC2 | fsrc2 | $\begin{aligned} & \textit{freg}_{\text{rs1}}, \ \textit{freg}_{\text{rd}} \\ & \textit{freg}_{\text{rs2}}, \ \textit{freg}_{\text{rd}} \\ & \textit{freg}_{\text{rs2}}, \ \textit{freg}_{\text{rd}} \end{aligned}$ | copy src2<br>copy src2, single precision | | FSRC2S | fsrc2s | freg <sub>rs2</sub> , freg <sub>rd</sub> | negate src1, 1's complement | | FNOT1 | fnot1 | freg <sub>rs1</sub> , freg <sub>rd</sub> | same as above, single precision | | FNOT1S | fnot1s | freg <sub>rs1</sub> , freg <sub>rd</sub> | | TABLE E-19 (continued) | SPARC | Mnemonic | Argument List | Description | |-----------|-----------|----------------------------------------------------------------|---------------------------------------------------------| | FNOT2 | fnot2 | freg <sub>rs2</sub> , freg <sub>rd</sub> | negate src2, 1's complement | | FNOT2S | fnot2s | freg <sub>rs2</sub> , freg <sub>rd</sub> | same as above, single precision logical OR | | FOR | for | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | logical OR, single precision | | FORS | fors | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | logical NOR | | FNOR | fnor | $freg_{rs1}$ , $freg_{rs2}$ , $freg_{rd}$ | | | FNORS | fnors | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | logical NOR, single precision | | FAND | fand | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | logical AND logical AND, single precision | | FANDS | fands | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | logical NAND | | FNAND | fnand | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | logical NAND, single precision | | FNANDS | fnands | $freg_{rs1}$ , $freg_{rs2}$ , $freg_{rd}$ | | | FXOR | fxor | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | logical XOR | | FXORS | fxors | $freg_{rs1}$ , $freg_{rs2}$ , $freg_{rd}$ | logical XOR, single precision logical XNOR | | FXNOR | fxnor | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | logical XNOR, single precision | | FXNORS | fxnors | $freg_{rs1}$ , $freg_{rs2}$ , $freg_{rd}$ | negated src1 OR src2 | | FORNOT1 | fornot1 | $freg_{rs1}$ , $freg_{rs2}$ , $freg_{rd}$ | | | FORNOT1S | fornot1s | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | same as above, single precision | | FORNOT2 | fornot2 | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | src1 OR negated src2<br>same as above, single precision | | FORNOT2S | fornot2s | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | negated src1 AND src2 | | FANDNOT1 | fandnot1 | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | | | FANDNOT1S | fandnot1s | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | same as above, single precision | | FANDNOT2 | fandnot2 | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | src1 AND negated src2 same as above, single precision | | FANDNOT2S | fandnot2s | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | 8 1 | Pixel compare instructions compare fixed-point values in rs1 and rs2 (two 32 bit or four 16 bit) TABLE E-20 | SPARC | Mnemonic | Argument List | Description | |----------|----------|---------------------------------------------------------------|--------------------------------------------------------------------------------| | FCMPGT16 | fcmpgt16 | freg <sub>rs1</sub> , freg <sub>rs2</sub> , reg <sub>rd</sub> | 4 16-bit compare, set rd if src1>src2 | | FCMPGT32 | fcmpgt32 | freg <sub>rs1</sub> , freg <sub>rs2</sub> , reg <sub>rd</sub> | 2 32-bit compare, set rd if src1>src2<br>4 16-bit compare, set rd if src1≤src2 | | FCMPLE16 | fcmple16 | freg <sub>rs1</sub> , freg <sub>rs2</sub> , reg <sub>rd</sub> | 2 32-bit compare, set rd if src1≤src2 | | FCMPLE32 | fcmple32 | $freg_{rs1}$ , $freg_{rs2}$ , $reg_{rd}$ | | | FCMPNE16 | fcmpne16 | freg <sub>rs1</sub> , freg <sub>rs2</sub> , reg <sub>rd</sub> | 4 16-bit compare, set rd if src1≠src2 | | FCMPNE32 | fcmpne32 | freg <sub>rs1</sub> , freg <sub>rs2</sub> , reg <sub>rd</sub> | 2 32-bit compare, set rd if src1≠src2<br>4 16-bit compare, set rd if src1=src2 | | FCMPEQ16 | fcmpeq16 | freg <sub>rs1</sub> , freg <sub>rs2</sub> , reg <sub>rd</sub> | 2 32-bit compare, set rd if src1=src2 | | FCMPEQ32 | fcmpeq32 | $freg_{rs1}$ , $freg_{rs2}$ , $reg_{rd}$ | | Edge handling instructions handle the boundary conditions for parallel pixel scan line loops. $\,$ TABLE E-21 | SPARC | Mnemonic | Argument List | Description | |---------|----------|-------------------------------------------------------------|----------------------------------------------------------------| | EDGE8 | edge8 | $reg_{rs1}$ , $reg_{rs2}$ , $reg_{rd}$ | 8 8-bit edge boundary processing | | EDGE8L | edge8l | $reg_{rs1}$ , $reg_{rs2}$ , $reg_{rd}$ | same as above, little-endian 4 16-bit edge boundary processing | | EDGE16 | edge16 | $reg_{rs1}$ , $reg_{rs2}$ , $reg_{rd}$ | | | EDGE16L | edge161 | reg <sub>rs1</sub> , reg <sub>rs2</sub> , reg <sub>rd</sub> | same as above, little-endian | | EDGE32 | edge32 | $reg_{rs1}$ , $reg_{rs2}$ , $reg_{rd}$ | 2 32-bit edge boundary processing same as above, little-endian | | EDGE32L | edge321 | $reg_{rs1}$ , $reg_{rs2}$ , $reg_{rd}$ | | Pixel component distance instructions are used for motion estimation in video compression algorithms. | SPARC | Mnemonic | Argument List | Description | |-------|----------|----------------------------------------------------------------|--------------------------------------| | PDIST | pdist | freg <sub>rs1</sub> , freg <sub>rs2</sub> , freg <sub>rd</sub> | 8 8-bit components, distance between | The three-dimensional array addressing instructions convert three- dimensional fixed-point addresses (in rs1) to a blocked-byte address. The result is stored in rd. #### TABLE E-23 | SPARC | Mnemonic | Argument List | Description | |---------|----------|-------------------------------------------------------------|------------------------------------------------------| | ARRAY8 | array8 | reg <sub>rs1</sub> , reg <sub>rs2</sub> , reg <sub>rd</sub> | convert 8-bit 3-D address to blocked<br>byte address | | ARRAY16 | array16 | reg <sub>rs1</sub> , reg <sub>rs2</sub> , reg <sub>rd</sub> | same as above, but 16-bit | | ARRAY32 | array32 | $reg_{rs1}$ , $reg_{rs2}$ , $reg_{rd}$ | same as above, but 32-bit | #### **Memory Access Instructions** E.6.7 These memory access instructions are part of the SPARC-V9 instruction set extensions. | SPARC | imm_asi | Argument List | Description | |-------|--------------|----------------------------------------------------------------------------------|-------------------------------------------------------------| | STDFA | ASI_PST8_P | stda freg <sub>rd</sub> , [freg <sub>rs1</sub> ]<br>reg <sub>mask,</sub> imm_asi | eight 8-bit conditional stores to:<br>primary address space | | STDFA | ASI_PST8_S | omask, – | secondary address space | | STDFA | ASI_PST8_PL | | primary address space, little endian | | STDFA | ASI_PST8_SL | | secondary address space, little endian | | STDFA | ASI_PST16_P | | four 16-bit conditional stores to: primary address space | | STDFA | ASI_PST16_S | | secondary address space | | STDFA | ASI_PST16_PL | | primary address space, little endian | | STDFA | ASI_PST16_SL | | secondary address space, little endian | | STDFA | ASI_PST32_P | | two 32-bit conditional stores to:<br>primary address space | | STDFA | ASI_PST32_S | | secondary address space | | STDFA | ASI_PST32_PL | | primary address space, little endian | | STDFA | ASI_PST32_SL | | secondary address space, little endian | $\mbox{\bf Note}$ - To select a partial store instruction, use one of the partial store ASIs with the STDA instruction. #### TABLE E-25 | SPARC | imm_asi | Argument List | Description | |----------------|-----------|------------------------------------------------------------------------------------------|----------------------------------------------------| | LDDFA<br>STDFA | ASI_FL8_P | ldda [reg_addr] imm_asi, freq <sub>rd</sub> stda freq <sub>rd</sub> , [reg_addr] imm_asi | 8-bit load/store from/to:<br>primary address space | | LDDFA<br>STDFA | ASI_FL8_S | ldda [reg_plus_imm] %asi, freq <sub>rd</sub><br>stda [reg_plus_imm] %asi | secondary address space | TABLE E-25 (continued) | SPARC | imm_asi | Argument List | Description | |-------|-------------|---------------|----------------------------------------| | LDDFA | ASI_FL8_PL | | primary address space, little endian | | STDFA | | | | | LDDFA | ASI_FL8_SL | | secondary address space, little endian | | STDFA | | | | | LDDFA | ASI_FL16_P | | 16-bit load/store from/to: | | STDFA | | | primary address space | | LDDFA | ASI_FL16_S | | secondary address space | | STDFA | | | | | LDDFA | ASI_FL16_PL | | primary address space, little endian | | STDFA | | | | | LDDFA | ASI_FL16_SL | | secondary address space, little | | STDFA | | | Circian | Note - To select a short floating-point load and store instruction, use one of the short ASIs with the LDDA and STDA instructions. TABLE E-26 | SPARC | imm_asi | Argument List | Description | |-------|------------------------|--------------------------------------------------------|------------------------------------------------------------------| | LDDA | ASI_NUCLEUS_QUAD_LDD | [reg_addr] imm_asi, reg <sub>rd</sub> | 128-bit atomic load | | LDDA | ASI_NUCLEUS_QUAD_LDD_L | [reg_plus_imm] %asi,<br>reg <sub>rd</sub> | 128-bit atomic load, little endian | | LDDFA | ASI_BLK_AIUP | | 64-byte block load/store from/to:<br>primary address space, user | | STDFA | | stda freq <sub>rd</sub> , [reg_addr]<br>imm_asi | privilege | | LDDFA | ASI_BLK_AIUS | ldda [reg_plus_imm]<br>%asi, <i>freq</i> <sub>rd</sub> | secondary address space, user privilege. | | STDFA | | stda freg <sub>rd</sub> ,<br>[reg_plus_imm] %asi | | | LDDFA | ASI_BLK_AIUPL | | primary address space, user<br>privilege, little endian | | STDFA | | | privilege, inde cridian | | LDDFA | ASI_BLK_AIUSL | | secondary address space, user<br>privilege little endian | | STDFA | | | | | LDDFA | ASI_BLK_P | | primary address space | | STDFA | | | | | LDDFA | ASI_BLK_S | | secondary address space | | STDFA | | | | | LDDFA | ASI_BLK_PL | | primary address space, little endian | | STDFA | | | | | LDDFA | ASI_BLK_SL | | secondary address space, little endian | | STDFA | | | | TABLE E-26 (continued) | SPARC | imm_asi | Argument List | Description | |-------|------------------|---------------|-------------------------------------------------------| | LDDFA | ASI_BLK_COMMIT_P | | 64-byte block commit store to primary address space | | STDFA | | | | | LDDFA | ASI_BLK_COMMIT_S | | 64-byte block commit store to secondary address space | | STDFA | | | , | Note - To select a block load and store instruction, use one of the block transfer ASIs with the LDDA and STDA instructions.