The Serializable Isolation Level

[ See the index for the whole series ]

Much production T-SQL code is written with the implicit assumption that the underlying data will not change during execution. As we saw in the previous article in this series, this is an unsafe assumption because data and index entries can move around underneath us, even during the execution of a single statement.

Where the T-SQL programmer is aware of the kinds of correctness and data integrity issues that can arise due to concurrent data modifications by other processes, the solution most commonly offered is to wrap the vulnerable statements in a transaction. It is not clear how the same sort of reasoning would be applied to the single-statement case, which is already wrapped in an auto-commit transaction by default.

Leaving that aside for a second, the idea of protecting an important area of T-SQL code with a transaction seems to be based on a misunderstanding of the protections offered by the ACID transaction properties. The important element of that acronym for the present discussion is the Isolation property. The idea is that using a transaction automatically provides complete isolation from the effects of other concurrent activities.

The truth of the matter is that transactions below SERIALIZABLE only provide a degree of isolation, which depends on the currently effective transaction isolation level. To understand what all this means for our everyday TSQL coding practices, we will first take a detailed look at the serializable isolation level.

Serializable Isolation

Serializable is the most isolated of the standard transaction isolation levels. It is also the default isolation level specified by the SQL standard, though SQL Server (like most commercial database systems) differs from the standard in this respect. The default isolation level in SQL Server is read committed, a lower isolation level that we will explore later in the series.

The definition of the serializable isolation level in the SQL-92 standard contains the following text (emphasis mine):

A serializable execution is defined to be an execution of the operations of concurrently executing SQL-transactions that produces the same effect as some serial execution of those same SQL-transactions. A serial execution is one in which each SQL-transaction executes to completion before the next SQL-transaction begins.

There is an important distinction to be made here between truly serialized execution (where each transaction actually runs exclusively to completion before the next one starts) and serializable isolation, where transactions are only required to have the same effects as if they were executed serially (in some unspecified order).

To put it another way, a real database system is allowed to physically overlap the execution of serializable transactions in time (thereby increasing concurrency) so long as the effects of those transactions still correspond to some possible order of serial execution. In other words, serializable transactions are potentially serializable rather than being actually serialized.

Logically Serializable Transactions

Leave aside all physical considerations (like locking) for a moment, and think only about the logical processing of two concurrent serializable transactions.

Consider a table that contains a large number of rows, five of which happen to satisfy some interesting query predicate. A serializable transaction T₁ starts counting the number of rows in the table that match this predicate. Some time after T₁ begins, but before it commits, a second serializable transaction T₂ starts. Transaction T₂ adds four new rows that also satisfy the query predicate to the table, and commits. The diagram below shows the time sequence of events:

Time sequence

The question is, how many rows should the query in serializable transaction T₁ count? Remember we are thinking purely about the logical requirements here, so avoid thinking about which locks might be taken and so on.

The two transactions physically overlap in time, which is fine. Serializable isolation only requires that the results of these two transactions correspond to some possible serial execution. There are clearly two possibilities for a logical serial schedule of transactions T₁ and T₂:

Possible serial schedules

Using the first possible serial schedule (T₁ then T₂) the T₁ counting query would see five rows, because the second transaction does not start until the first one completes. Using the second possible logical schedule, the T₁ query would count nine rows, because the four-row insert logically completed before the counting transaction began.

Both answers are logically correct under serializable isolation. In addition, no other answer is possible (so transaction T₁ could not count seven rows, for example). Which of the two possible results is actually observed depends on precise timing and a number of implementation details specific to the database engine in use.

Note that we are not concluding that the transactions are actually somehow reordered in time. The physical execution is free to overlap as shown in the first diagram, so long as the database engine ensures the results reflect what would have happened had they executed in one of the two possible serial sequences.

Serializable and the Concurrency Phenomena

In addition to logical serialization, the SQL standard also mentions that a transaction operating at the serializable isolation level must not experience certain concurrency phenomena. It must not read uncommitted data (no dirty reads); and once data has been read, a repeat of the same operation must return exactly the same set of data (repeatable reads with no phantoms).

The standard makes a point of saying that those concurrency phenomena are excluded at the serializable isolation level as a direct consequence of requiring the transaction to be logically serializable. In other words, the serializability requirement is sufficient on its own to avoid the dirty read, non-repeatable read, and phantom concurrency phenomena. By contrast, avoiding the three concurrency phenomena alone is not sufficient to guarantee serializability, as we will see shortly.

Intuitively, serializable transactions avoid all concurrency-related phenomena because they are required to act as if they had executed in complete isolation. In that sense, the serializable transaction isolation level matches the common expectations of T-SQL programmers quite closely.

Serializable Implementations

SQL Server happens to use a locking implementation of the serializable isolation level, where physical locks are acquired and held to the end of the transaction (hence the deprecated table hint HOLDLOCK as a synonym for SERIALIZABLE).

This strategy is not quite enough to provide a technical guarantee of full serializability, because new or changed data could appear in a range of rows previously processed by the transaction. This concurrency phenomenon is known as a phantom, and can result in effects which could not have occurred in any serial schedule.

To ensure protection against the phantom concurrency phenomenon, locks taken by SQL Server at the serializable isolation level may also incorporate key-range locking to prevent new or changed rows from appearing between previously-examined index key values. Range locks are not always acquired under the serializable isolation level; all we can say in general is that SQL Server always acquires sufficient locks to meet the logical requirements of the serializable isolation level. In fact, locking implementations quite often acquire more, and stricter, locks than are really needed to guarantee serializability, but I digress.

Locking is just one of the possible physical implementations of the serializable isolation level. We should be careful to mentally separate the specific behaviours of the SQL Server locking implementation from the logical definition of serializable.

As an example of an alternative physical strategy, see the PostgreSQL implementation of serializable snapshot isolation, though this is just one alternative. Each different physical implementation has its own strengths and weaknesses of course. As an aside, note that Oracle still does not provide a fully compliant implementation of the serializable isolation level. It has an isolation level named serializable, but it does not truly guarantee that transactions will execute according to some possible serial schedule. Oracle instead provides snapshot isolation when serializable is requested, in much the same way PostgreSQL did before serializable snapshot isolation (SSI) was implemented.

Snapshot isolation does not prevent concurrency anomalies like write skew, which is not possible under truly serializable isolation. If you are interested, you can find examples of write skew and other concurrency effects allowed by snapshot isolation at the SSI link above. We will also discuss the SQL Server implementation of snapshot isolation level later in the series.

A point-in-time view?

One reason I have spent time talking about the differences between logical serializability and physically serialized execution is that it is otherwise easy to infer guarantees that might not actually exist. For example, if you think of serializable transactions as actually executing one after the other, you might infer that a serializable transaction will necessarily see the database as it existed at the start of the transaction, providing a point-in-time view.

In fact, this is an implementation-specific detail. Recall the previous example, where serializable transaction T₁ might legitimately count five or nine rows. If a count of nine is returned, the first transaction clearly sees rows that did not exist at the moment the transaction started. This result is possible in SQL Server but not in PostgreSQL SSI, though both implementations comply with the logical behaviours specified for the serializable isolation level.

In SQL Server, serializable transactions do not necessarily see the data as it existed at the start of the transaction. Rather, the details of the SQL Server implementation mean that a serializable transaction sees the latest committed data, as of the moment that the data was first locked for access. In addition, the set of latest-committed data ultimately read is guaranteed not to change its membership before the transaction ends.

Next Time

The next part in this series examines the repeatable read isolation level, which provides weaker transaction isolation guarantees than serializable.

[ See the index for the whole series ]

11 thoughts on “The Serializable Isolation Level”

tobi says:

April 10, 2014 at 8:42 AM

Very, very good post about what serializable means. Two points:

1. Another example for a physical implementation strategy would be Hekaton serializable transactions which do not lock at all. They rather validate at commit time whether serializability was upheld. If not, the transaction is aborted.
2. Making a transaction serializable always removes all concurrency concerns because it provides the illusion of a single-threaded access to the server.
Piers Williams says:

May 3, 2014 at 5:25 AM

I'm no Oracle expert, but their doco appears to contradict you here. They prevent snapshot write-skew by detecting and chucking what amounts to an optimistic concurrency violation:

Oracle generates an error when a serializable transaction tries to update or delete data modified by a transaction that commits after the serializable transaction began

http://docs.oracle.com/cd/B19306_01/server.102/b14220/consist.htm
Sure it prevents it at the expense of the transaction but that's the general case with optimistic approaches (including SQL 2014 in-memory). Point is the write-skew data never hits the database. Or am I missing something?
Paul White says:

May 4, 2014 at 2:48 AM

Hi tobi,

Yes, the Hekaton implementation is another good example of an alternative physical implementation of true serializability. Thanks!

Paul
Paul White says:

May 4, 2014 at 4:20 AM

Hi Piers,

That describes a write-write conflict under snapshot isolation, which is not the same thing as write skew. Write skew occurs when two concurrent transactions make disjoint updates (so there is no write-write conflict) based on reads that overlap what the other transaction is writing.

The SSI article I linked to has several examples of write skew, and you can find a SQL Server example using the common black and white marbles example here.

Perhaps the simplest possible illustration is:
```
create table a ( x int );
create table b ( x int );

-- Session 1 at serializable isolation
insert into a select count(*) from b

-- Session 2 at serializable isolation, concurrently
insert into b select count(*) from a
```
Under Oracle's serializable implementation, this results in both tables containing a zero; a result that is clearly impossible under any serial schedule. In a serial schedule, one of the inserts must complete first, and the other transaction would then count that row. Notice also there is no write-write conflict here, because the two statements update different tables.

More about the Oracle behaviour.

Paul
Nobody says:

September 12, 2014 at 4:14 PM

Extremely good post about what serializable means. I've seen a lot of articles about isolation that can't see the forest for the trees.
Mike West says:

December 18, 2014 at 9:18 PM

Paul is one of my favorite authors. This is another post that simply reinforces why he is one of my favorite authors.
wxfg says:

May 29, 2015 at 5:07 AM

To ensure protection against the phantom concurrency phenomenon, locks taken by SQL Server at the serializable isolation level may also incorporate key-range locking to prevent new or changed rows from appearing between previously-examined index key values.

according to this, SQL server will prevent T2 inserting these 4 rows.

so, why

If a count of nine is returned, the first transaction clearly sees rows that did not exist at the moment the transaction started. This result is possible in SQL Server
Paul White says:

May 29, 2015 at 11:20 AM

It depends on timing. T2 can insert rows after T1 has started, but before T1 has locked that range. Locks are acquired on demand, not all at once before reading begins.
valentin says:

December 29, 2016 at 1:35 PM

> you might infer that a serializable transaction will necessarily see the database as it existed at the start of the transaction, providing a point-in-time view. In fact, this is true only in Snapshot isolation. SQL Server violates this serializibility assumption.

I tried to understand how letting the updates of unread items can violate the point-in-time view (aka snapshot) and fail. Suppose you have started the transaction. Then, another Transaction updates X and Y while you read A. You then read X. Both look at a point-in-time snapshots. If postgress observes oldA+oldX, you say that it is a point-in-time snapshot. But, if you see oldA+newX does not mean that you have time-skewed snapshot. It is a valid snapshot also because newA=oldA and you see newA+newX — the DB more-up-to-date DB time 2.

In both cases the intermediate transaction that updates X and Y is serialized first then comes your transaction. Optimistic Postress just fails because it observes stale data whereas you can proceed because your point-in-time view is up-to-date. Since newA=oldA, point-in-time view is not violated.
Kenny says:

February 8, 2019 at 2:43 PM

Great Article. Do we know when will SQL server pick a range lock over a shared lock. I did quick test using a equality operator and a between for the same row. eg. Where id = 1 or id between 1 and 1. One will use shared lock and the later will use a range lock.
Paul White says:

February 9, 2019 at 5:27 PM

It sounds like your simple query was auto-parameterized so BETWEEN 1 AND 1 was replaced with e.g. id >= @1 AND id <= @2.

Otherwise it would have been simplified to id = 1. Try adding AND 1 = 1 to your query to disable autoparam. Generally, range locks are taken for inequality predicates.

Comments are closed.