System Columns
PostgreSQL 9.3.16 Documentation | ||||
---|---|---|---|---|
Prev | Up | Chapter 5. Data Definition | Next |
Every table has several system columns that are implicitly defined by the system. Therefore, these names cannot be used as names of user-defined columns. (Note that these restrictions are separate from whether the name is a key word or not; quoting a name will not allow you to escape these restrictions.) You do not really need to be concerned about these columns; just know they exist.
- oid
-
The object identifier (object ID) of a row. This column is only present if the table was created using WITH OIDS , or if the default_with_oids configuration variable was set at the time. This column is of type oid (same name as the column); see Section 8.18 for more information about the type.
- tableoid
-
The OID of the table containing this row. This column is particularly handy for queries that select from inheritance hierarchies (see Section 5.8 ), since without it, it's difficult to tell which individual table a row came from. The tableoid can be joined against the oid column of pg_class to obtain the table name.
- xmin
-
The identity (transaction ID) of the inserting transaction for this row version. (A row version is an individual state of a row; each update of a row creates a new row version for the same logical row.)
- cmin
-
The command identifier (starting at zero) within the inserting transaction.
- xmax
-
The identity (transaction ID) of the deleting transaction, or zero for an undeleted row version. It is possible for this column to be nonzero in a visible row version. That usually indicates that the deleting transaction hasn't committed yet, or that an attempted deletion was rolled back.
- cmax
-
The command identifier within the deleting transaction, or zero.
- ctid
-
The physical location of the row version within its table. Note that although the ctid can be used to locate the row version very quickly, a row's ctid will change if it is updated or moved by VACUUM FULL . Therefore ctid is useless as a long-term row identifier. The OID, or even better a user-defined serial number, should be used to identify logical rows.
OIDs are 32-bit quantities and are assigned from a single cluster-wide counter. In a large or long-lived database, it is possible for the counter to wrap around. Hence, it is bad practice to assume that OIDs are unique, unless you take steps to ensure that this is the case. If you need to identify the rows in a table, using a sequence generator is strongly recommended. However, OIDs can be used as well, provided that a few additional precautions are taken:
-
A unique constraint should be created on the OID column of each table for which the OID will be used to identify rows. When such a unique constraint (or unique index) exists, the system takes care not to generate an OID matching an already-existing row. (Of course, this is only possible if the table contains fewer than 2 32 (4 billion) rows, and in practice the table size had better be much less than that, or performance might suffer.)
-
OIDs should never be assumed to be unique across tables; use the combination of tableoid and row OID if you need a database-wide identifier.
-
Of course, the tables in question must be created WITH OIDS . As of PostgreSQL 8.1, WITHOUT OIDS is the default.
Transaction identifiers are also 32-bit quantities. In a long-lived database it is possible for transaction IDs to wrap around. This is not a fatal problem given appropriate maintenance procedures; see Chapter 23 for details. It is unwise, however, to depend on the uniqueness of transaction IDs over the long term (more than one billion transactions).
Command identifiers are also 32-bit quantities. This creates a hard limit of 2 32 (4 billion) SQL commands within a single transaction. In practice this limit is not a problem - note that the limit is on the number of SQL commands, not the number of rows processed. Also, as of PostgreSQL 8.3, only commands that actually modify the database contents will consume a command identifier.