SPSS Statistics Data File Format Family (.sav), formerly known as SPSS System File Format.

The SPSS Statistics File Format is a proprietary binary format, developed and maintained as the native format for the SPSS statistical software application. SPSS, which originally stood for "Statistical Package for the Social Sciences," is a widely used statistical software system, first released in 1968. SPSS has been owned by IBM since 2009 and is now known as IBM SPSS Statistics. When an SPSS Statistics data file is saved from SPSS, the file extension .sav is used. There is no official public specification. Unofficial documentation is available from the GNU PSPP project as Appendix B: System File Format.

The .sav format can use a variety of character encodings and a variety of representations for integers and floating-point numbers. The GNU PSPP documentation states, "System files may use most character encodings based on an 8-bit unit." This includes ASCII, EBCDIC, and for more recent files, UTF-8. Unicode has been supported for character data in the SPSS application since version 16 (released in late 2007). The first 3 bytes of an SPSS_sav file indicate the character encoding by using the encoding to represent "$FL". Thus, hex "24 46 4c" indicates ASCII and hex "5b c6 d3" indicates EBCDIC. Integer data may be big-endian or little-endian. Floating-point data may nominally be in IEEE 754, IBM, or VAX encodings. The endianness of an SPSS_sav file can be determined from one or more of the integer values in the file header record. In some cases, a more explicit indication of character encoding and numeric format can be found in specific tagged "records." For record types and associated tags, see File organization below. The GNU PSPP documentation states, "The best way to determine the specific character encoding in use is to consult the character encoding record, if present, and failing that the character_code in the machine integer info record" (which, despite the name given to the record by the GNU PSPP team, has indicators for character and floating-point encodings, not just for integer encoding).

File organization: The information in an SPSS_sav file is divided into logical sections: a header, a sequence of tagged "records" comprising a "dictionary" for the file, followed by the data itself. A dictionary record consists of a numeric (32-bit integer) tag identifying the type of record, followed by a defined sequence of string or numeric values. The first 4 bytes of the file represent the string "$FL2" or "$FL3" in the character encoding used for the file. The final "3" indicates that the data in the file is compressed using ZLIB. See Notes below for more on compression options for the SPSS_sav_family formats.
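As a rough illustration of the header checks described in this entry (the "$FL" prefix, the final "2" or "3", and endianness from a header integer), here is a minimal Python sketch. The function name sniff_sav_header and the keys it returns are hypothetical. The assumption that a 32-bit "layout code" equal to 2 or 3 sits at byte offset 64 comes from the unofficial GNU PSPP description of the header and is not stated in the text above, so treat that part with particular caution.

```python
import struct

# "$FL" in ASCII (hex 24 46 4c) and in EBCDIC (hex 5b c6 d3), as given in the text.
ASCII_PREFIX = bytes([0x24, 0x46, 0x4C])
EBCDIC_PREFIX = bytes([0x5B, 0xC6, 0xD3])

# The fourth byte is the digit "2" or "3" in the file's own encoding.
ASCII_DIGITS = {0x32: "2", 0x33: "3"}
EBCDIC_DIGITS = {0xF2: "2", 0xF3: "3"}

# Assumption (from the unofficial PSPP notes): a 32-bit integer that is
# normally 2, occasionally 3, follows the 4-byte magic and a 60-byte
# product name, i.e. it starts at byte offset 64.
LAYOUT_CODE_OFFSET = 64


def sniff_sav_header(header: bytes) -> dict:
    """Guess character-set family, ZLIB flag, and byte order from a header."""
    if len(header) < LAYOUT_CODE_OFFSET + 4:
        raise ValueError("need at least 68 bytes of header")

    prefix = header[:3]
    if prefix == ASCII_PREFIX:
        family, digits = "ascii-compatible", ASCII_DIGITS
    elif prefix == EBCDIC_PREFIX:
        family, digits = "ebcdic", EBCDIC_DIGITS
    else:
        raise ValueError("first 3 bytes are not '$FL' in ASCII or EBCDIC")

    digit = digits.get(header[3])
    if digit is None:
        raise ValueError("fourth byte is neither '2' nor '3'")

    # Read the same 4 bytes both ways; only one reading should give 2 or 3.
    raw = header[LAYOUT_CODE_OFFSET:LAYOUT_CODE_OFFSET + 4]
    as_little = struct.unpack("<i", raw)[0]
    as_big = struct.unpack(">i", raw)[0]
    if as_little in (2, 3):
        byte_order = "little"
    elif as_big in (2, 3):
        byte_order = "big"
    else:
        byte_order = "unknown"

    return {
        "charset_family": family,
        "zlib_compressed": digit == "3",
        "byte_order": byte_order,
    }
```

In practice one would pass the first 68 bytes of a file, for example the result of open(path, "rb").read(68). The "ascii-compatible" label only says the prefix matched ASCII bytes; the file may still be UTF-8 or another 8-bit encoding, which the character encoding record would have to settle.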