New Bit Manipulation Functions in SQL Server 2022

The need to manipulate data at the bit level with bitwise operations isn’t common in T-SQL, but you might stumble into such a need in some specialized scenarios. Some implementations store a set of flags (yes/no, on/off, true/false) in a single integer or binary-typed column, where each bit represents a different flag. One example is using a bitwise representation of a set of user/role permissions. Another example is using a bitwise representation of a set of settings turned on or off in a given environment. Even SQL Server stores some flag-based data using bitwise representation. Examples include the following:

The set_options and required_cursor_options cached plan attributes, which you obtain using the sys.dm_exec_plan_attributes DMF
Querying the current session’s state of session options with the @@OPTIONS function
Setting the server configuration option “user options” using bitwise representation
The COLUMNS_UPDATED trigger-related function, which indicates which columns were inserted or updated
The GROUPING_ID function, which constructs a bitwise representation of the current grouping set

Bit manipulation may involve checking whether a certain flag is on or off, setting a certain flag on or off, checking which flags are on, counting how many flags are on, shifting bits left or right, and more.

Prior to SQL Server 2022, T-SQL had minimal support for bit manipulation. It had support for the following operators:

You could achieve some of the aforementioned bit manipulation needs using a combination of these operators and some mathematics. But the available manipulation was limited and often resulted in complicated logic.

SQL Server 2022 introduces a set of new bit manipulation functions and operators designed to enhance your bit manipulation abilities, and it also makes some of the tasks just easier and more intuitive to achieve. The new functions and operators are as follows:

I’ll provide a brief description of the functions. For the full technical details, I provided links to the official documentation. In this article, I want to focus more on how these functions improve and simplify tasks that were harder to achieve in the past.

GET_BIT and SET_BIT

The GET_BIT function extracts a bit from an input value in a specified bit offset. This function has the following syntax:

GET_BIT( inputval, bitoffset )

The input value can be of an integer or a binary non-large object type. The bit offset is 0-based, representing the offset of the bit from right to left. So you specify 0 for the first (rightmost) bit, 1 for the second, …, 31 for the thirty-second, and so on. The returned value is of the BIT type. It’s 1 if the requested bit is set and 0 if it’s not set.

For example, the ARITHABORT session option is represented by bit offset 6 in the bitmap returned by the @@options function. Use the following code to check the status of this option in your session:

SELECT GET_BIT( @@OPTIONS, 6 );

The ARITHABORT option was enabled in the session where I ran this code, so I got the following output:

-----
1

The SET_BIT function returns a manipulated input value, where the bit in the specified bit offset is set to 1 by default or to the specified bit state (1 or 0) if indicated. This function has the following syntax:

SET_BIT( inputval, bitoffset[, bitstate] )

The supported types for inputval and bitoffset are the same as for the GET_BIT function. The bitstate input can be of a bit or integer type, but it must be 1 or 0.

For example, the following code returns both the current state of @@OPTIONS as well as a manipulated state with ARITHABORT set to off:

SELECT @@OPTIONS AS cursession, SET_BIT( @@OPTIONS, 6, 0 ) AS cursession_ARITHABORT_off;

I get the following out:

cursession  cursession_ARITHABORT_off
----------- -------------------------
5496        5432

In case it wasn’t obvious, it’s important to note using the SET_BIT function doesn’t change the input value in any way; rather, it just returns a manipulated value based on the input.

BIT_COUNT

The BIT_COUNT function counts the number of bits set in the input value. This function has the following syntax:

BIT_COUNT( inputval )

The input value can be of an integer or a binary non-large object type. The output is of a BIGINT type.

For example, the following code counts the number of session options set in the current session:

SELECT BIT_COUNT( @@OPTIONS );

This code generates the following output in my session:

--------------------
7

LEFT_SHIFT and RIGHT_SHIFT

The LEFT_SHIFT function, also available as the << operator, shifts the bits of the input value the requested number of bits to the left, filling vacant bits with 0s. The function and operator have the following syntax:

LEFT_SHIFT( inputval, numbits )
inputval << numbits

Like with the other bit manipulation functions, the input value can be of an integer or a non-large-object binary type. The output type is the same as the input. The numbits input is of an integer type.

The RIGHT_SHIFT function, also available as the >> operator, shifts the bits of the input value the requested number of bits to the right, filling vacant bits with 0s. Note this is what’s known as a logical right shift as opposed to another kind called arithmetic right shift, where vacant bits replicate the original value of the most significant bit (MSB). T-SQL doesn’t support an arithmetic right shift. The function and operator have the following syntax:

RIGHT_SHIFT( inputval, numbits )
inputval >> numbits

With both functions, bits that overspill (original numbits leftmost bits in a left shift and original numbits rightmost bits in a right shift) are discarded. If you provide a negative integer as numbits, SQL Server will convert it to the counterpart function with the positive value. So x >> -1 is converted to x << 1.

The following example shifts the bits in the input value 10 one bit to the right and one bit to the left:

SELECT 10 >> 1 AS rightshift, 10 << 1 AS leftshift;

This code generates the following output:

rightshift  leftshift
----------- -----------
5           20

The bitwise representation of the INT-value 10 is as follows:

0000 0000 0000 0000 0000 0000 0000 1010

Shifting one bit to the right results in the following:

0000 0000 0000 0000 0000 0000 0000 0101

This is the integer value 5.

Shifting one bit to the left results in the following:

0000 0000 0000 0000 0000 0000 0001 0100

This is the integer value 20.

Logical bit shifting is used in some computing environments as an efficient low-level way to achieve division or multiplication of unsigned integers by powers of two. For instance, right shifting val by n bits achieves val/2^n, and left shifting val by n bits achieves val*2^n.

Obviously, you won’t use bit shifting in T-SQL for such purposes. More likely use cases would be shifting existing bitwise representations of sequences of flags due to the introduction of new flags and the need to use the rightmost bits for the new flags. For instance, suppose you store user and role permission information as a sequence of bits in a BINARY(25) column called perms in a table called dbo.UserRolePerms. The column has a capacity to represent up to 200 permissions, but currently, the 80 rightmost bits (offsets 0 through 79) are used to represent 80 permissions. At some point in the application’s life, 10 new permissions are introduced. For whatever reason, a decision is made to use the 10 rightmost bits (offset 0 through 9) for the new permissions. This means you need to shift the existing permission info 10 bits to the left. The shifting part can be achieved with the following update:

UPDATE dbo.UserRolePerms
SET perms = perms << 10;

Handling Bit Manipulation Tasks Is Now Easier and More Intuitive

To get a real appreciation for the benefits of the new bit manipulation functions, all you need to do is try and handle similar tasks without them.

Extracting the State of a Bit in a Given Offset or Setting It

Given an input value @val and bit offset @offset, write an expression designed to extract the state of the requested bit without the new functions. The technique you use to address this task depends on the data type of @val and on whether certain assurances are provided.

The TINYINT datatype in SQL Server is an unsigned integer, and SMALLINT, INT, and BIGINT are signed integers represented using the two’s complement notation. If @val is of the TINYINT type—or of the SMALLINT, INT, or BIGINT types with the assurance the leftmost bit isn’t used—you can extract the bit value (a value with only the specified bit offset set to the same state like in the input @val) with the expression @val & POWER(2, @offset) and the bit state (1 or 0) with the expression SIGN(@val & POWER(2, @offset)). Here’s an example:

DECLARE @val AS INT = 5496, @offset AS INT = 6;

SELECT
  @val & POWER(2, @offset) AS bitval,
  SIGN(@val & POWER(2, @offset)) AS bitstate;

This code returns the following output on my system:

bitval      bitstate
----------- -----------
64          1

But what if you’re dealing with an implementation using a signed integer type for @val and the leftmost bit? Both of the following computations are simply incorrect for the leftmost bit, not to speak of the fact the code fails with an overflow error:

DECLARE @val AS INT = 5496, @offset AS INT = 31;

SELECT
  @val & POWER(2, @offset) AS bitval,
  SIGN(@val & POWER(2, @offset)) AS bitstate;

Msg 232, Level 16, State 3, Line 3
Arithmetic overflow error for type int, value = 2147483648.000000.

Yes, you could add logic to handle the special case differently, but this adds complexity to your code.

And what about a binary input? What if @val is of a BINARY(25) type? Now, coming up with a solution is far more complicated (try it!).

Using the new features, it’s a piece of cake. To get the bit state, use the GET_BIT function with the inputs @val and @offset with no special manipulation. Here’s an example with an INT-typed value as input:

DECLARE @val AS INT = 5496, @offset AS INT = 31;

SELECT GET_BIT( @val, @offset );

This code generates the following output, indicating the bit in the specified offset is off:

-----
0

Here’s an example with a BINARY(25) input:

DECLARE @val AS BINARY(25) = 0x9B3B9D5AD3084AB1A522082491B8519C008AB4AACB7064B037, 
        @offset AS INT = 100;

SELECT GET_BIT( @val, @offset );

This code generates the following output, indicating the bit in the specified offset is on:

-----
1

To get the bit value, you can use the SET_BIT function, with the first input with all bits zeroed, the second input being @@offset, and the third input being the state returned by GET_BIT. Here’s an example with an integer input:

DECLARE @val AS INT = 5496, @offset AS INT = 6;

SELECT SET_BIT(0, @offset, GET_BIT( @val, @offset ));

This code returns the following output:

-----------
64

And here’s an example with a BINARY(25) input:

DECLARE @val AS BINARY(25) = 0x9B3B9D5AD3084AB1A522082491B8519C008AB4AACB7064B037, 
        @offset AS INT = 100;

SELECT SET_BIT(CAST(0x AS BINARY(25)), @offset, GET_BIT( @val, @offset ));

This code generates the following output:

----------------------------------------------------
0x00000000000000000000000010000000000000000000000000

You will naturally face similar complexities when trying to set a bit in a requested position. Try it with the special cases and binary inputs! Instead, starting with SQL Server 2022 you simply use the SET_BIT function.

Counting Bits

How about counting how many bits are set in an input value? Again, if it’s a signed integer and you have an assurance the leftmost bit isn’t used, you can use a table of numbers, a bitwise AND (&) operator, and a COUNT aggregate.

Here’s code to create the GetNums function, which returns a sequence of integers in the requested range:

CREATE FUNCTION dbo.GetNums(@low AS BIGINT = 1, @high AS BIGINT)
RETURNS TABLE
AS
RETURN
WITH
  L0 AS ( SELECT 1 AS c 
          FROM (VALUES(1),(1),(1),(1),(1),(1),(1),(1),
                      (1),(1),(1),(1),(1),(1),(1),(1)) AS D(c) ),
  L1 AS ( SELECT 1 AS c FROM L0 AS A CROSS JOIN L0 AS B ),
  L2 AS ( SELECT 1 AS c FROM L1 AS A CROSS JOIN L1 AS B ),
  L3 AS ( SELECT 1 AS c FROM L2 AS A CROSS JOIN L2 AS B ),
  Nums AS ( SELECT ROW_NUMBER() OVER(ORDER BY (SELECT NULL)) AS rownum
            FROM L3 )
SELECT TOP(@high - @low + 1)
   rownum AS rn,
   @high + 1 - rownum AS op,
   @low - 1 + rownum AS n
FROM Nums
ORDER BY rownum;
GO

And here’s code to count the bits set in an INT-typed input stored in @val, assuming the leftmost bit isn’t used:

DECLARE @val AS INT = 5496;

SELECT COUNT(*) AS bitcount
FROM dbo.GetNums(0, 30) AS N
WHERE @val & POWER(2, N.n) <> 0;

This code generates the following output:

bitcount
-----------
7

Clearly, this isn’t a cheap calculation. Moreover, if you do use the leftmost bit or the input is a binary-typed one, the solution becomes more complex (try it!). With the new features in SQL Server 2022, you simply use the BIT_COUNT function:

DECLARE @intval AS INT = 5496, 
        @binval AS BINARY(25) = 0x9B3B9D5AD3084AB1A522082491B8519C008AB4AACB7064B037;

SELECT BIT_COUNT(@intval) AS cntintval, BIT_COUNT(@binval) AS cntbinval;

This code generates the following output:

cntintval            cntbinval
-------------------- --------------------
7                    85

Bit Shifting

How about bit shifting? Suppose you’re given the inputs @val and @offset and you need to right shift or left shift the bits in @val based on the offset in @offset.

For right shifting, if @val is a nonnegative value of a signed int type such as INT, you can use the expression @val / POWER(2, @offset).

For left shifting, if @val is a nonnegative value of a signed int type such as INT and you have an assurance the @offset + 1 leftmost bits are currently unused (are 0), you can use the expression @val * POWER(2, @offset). In all other cases, including when the above assurances aren’t provided or when @val is of a binary type, you need to create more complex solutions (try it!).

With the new functions and operators, you simply provide the inputs with no manipulation:

DECLARE @val AS INT = -1, @offset AS INT = 1;

SELECT @val >> @offset AS rightshift, @val << @offset AS leftshift;

This code generates the following output:

rightshift  leftshift
----------- -----------
2147483647  -2

The INT-typed value -1 has the following bitwise representation:

1111 1111 1111 1111 1111 1111 1111 1111

Right shifting by one bit results in the following:

0111 1111 1111 1111 1111 1111 1111 1111

This is the integer value 2147483647.

Left shifting by one bit results in the following:

1111 1111 1111 1111 1111 1111 1111 1110

This is the integer value -2.

Conclusion

Bit manipulation isn’t a common need in T-SQL, but it’s relevant for some built-in features in SQL Server and could be relevant in some user-defined implementations. Prior to SQL Server 2022, T-SQL had limited bit manipulation capabilities, requiring you to write complex code with mathematical acrobatics in some cases. SQL Server 2022 introduces five new bit manipulation functions and two alternative operators, enabling simpler and more intuitive manipulation.

What’s still missing in T-SQL in this space are aggregate bitwise functions performing aggregate bitwise AND, OR, and XOR operations. Oracle, for instance, supports such functions. As an example use case for such functions, suppose a user is a member of several roles and the permissions of each role are stored using a bitwise representation similar to the example I used in the article. The effective permissions of the user are the aggregate bitwise OR of the permission bitmaps of all roles the user is a member of.

At any rate, it’s good to see how T-SQL keeps evolving and adding more functionality.

Happy bitwising!

6 thoughts on “New Bit Manipulation Functions in SQL Server 2022”

Shawn says:

September 22, 2022 at 11:44 PM

This generation of software devs is weak.

Ian says:

September 23, 2022 at 9:07 PM

When I saw Matt Parker's video "Can you find: five five-letter words with twenty-five unique letters?" (https://www.youtube.com/watch?v=_-AfhLQfb6w) I thought it would be fun to give it a go with bitmaps in T-SQL on SQL Server 2022. I mean, it couldn't be any slower than Matt's solution with its 30 day runtime!
I was pretty happy with the result (1-3 seconds depending on the initial word list), although I definitely would have appreciated bitwise aggregate functions. And I ended up using BIT_COUNT() to find the least significant set bit (see, for example, https://en.wikipedia.org/wiki/Find_first_set#Properties_and_relations).

DROP TABLE IF EXISTS #W
CREATE TABLE #W (w char(5) NOT NULL PRIMARY KEY)

DECLARE @list int
SET @list = 1
IF @list = 1
BEGIN
  -- Use Wordle words extracted in Feb 2022:
  BULK INSERT #W FROM 'c:\temp\wordle-guesses.txt'
  BULK INSERT #W FROM 'c:\temp\wordle-answers.txt' -- guess list does not include answers 
END 
ELSE BEGIN
  -- Use larger word list linked in description on Matt's video:
  DROP TABLE IF EXISTS #W0

  CREATE TABLE #W0 (w varchar(200) NOT NULL)
  BULK INSERT #W0 FROM 'c:\temp\words_alpha.txt'

  INSERT INTO #W
  SELECT *
  FROM #W0
  WHERE LEN(w) = 5
END

GO

DROP TABLE IF EXISTS #DistinctLetterWords

;WITH WLB AS (
    -- break words into letters, and represent each letter as a bit (index 0 to 25):
    SELECT W.w, SET_BIT(0, ASCII(SUBSTRING(W.w, N.value, 1)) - ASCII('a'), 1) AS b
    FROM #W AS W
    CROSS JOIN GENERATE_SERIES (1, 5) AS N
  ), WB AS (
    -- combine the bits (sadly, SQL Server does not have bitwise aggregate functions)
    SELECT w, SUM(b) AS b
    FROM WLB
    GROUP BY w
  ), WDB AS (
    -- distinct-letter words with alphabetic-order bit encoding: only keep words with 5 different letters
    SELECT w, ISNULL(b, 0) AS b
    FROM WB
    WHERE BIT_COUNT(b) = 5
  ), FBM AS (
    -- mapping of alphabetic to use-frequency order (counting distinct bit patterns):
    SELECT WLB.b AS from_bit, SET_BIT(0, ROW_NUMBER() OVER (ORDER BY COUNT(DISTINCT WDB.b)) - 1) AS to_bit
    FROM WLB
    INNER JOIN WDB
      ON WLB.w = WDB.w
    GROUP BY WLB.b
  ), WLFB AS (
    -- combine the bits, keeping only distinct-letter words as previously determined:
    SELECT WLB.w, FBM.to_bit AS b
    FROM WLB
    INNER JOIN FBM
      ON WLB.b = FBM.from_bit
    INNER JOIN WDB
      ON WLB.w = WDB.w
  ), WDFB AS (
    -- combine remapped bits (which are already guaranteed not to overlap)
    SELECT w, ISNULL(SUM(b), 0) AS b
    FROM WLFB
    GROUP BY w
  )
SELECT *
INTO #DistinctLetterWords
FROM WDFB

ALTER TABLE #DistinctLetterWords ADD PRIMARY KEY (b, w)

-- distinct bit patterns of distinct-letter words (using least-to-most frequent bit ordering)
-- annotate and index by least frequent used bit position 
DROP TABLE IF EXISTS #B

SELECT ISNULL(BIT_COUNT(a ^ (a - 1) - 1), 0) AS first, a INTO #B FROM ( SELECT DISTINCT b AS a FROM #DistinctLetterWords ) AS B

ALTER TABLE #B ADD PRIMARY KEY (first, a)

DROP TABLE IF EXISTS #A

-- accumulate distinct bits; start by choosing the unused bit as generation 0:
SELECT 0 AS gen, A.acc, A.acc AS a
INTO #A
FROM (
    SELECT ISNULL(SET_BIT(0, C.value, 1), 1) AS acc
    FROM GENERATE_SERIES (0, 25) AS C
  ) AS A

ALTER TABLE #A ADD PRIMARY KEY (gen, acc, a)

DECLARE @gen int
SET @gen = 1

WHILE @gen <= 5
BEGIN
  -- accumulate new compatible bit patterns that use the first (least frequent) unused bit:
  INSERT INTO #A
  SELECT @gen AS gen, A.acc | B.a AS acc, B.a
  FROM #B AS B
  INNER JOIN (
      SELECT acc, BIT_COUNT(~acc ^ (~acc - 1)) - 1 AS next
      FROM ( SELECT DISTINCT acc FROM #A WHERE gen = @gen - 1 ) AS B
    ) AS A
    ON A.acc & B.a = 0
   AND B.first = A.next

  SET @gen = @gen + 1
END

-- trace back to get words for each bit pattern added:
SELECT WA.w AS w1, WB.w AS w2, WC.w AS w3, WD.w AS w4, WE.w AS w5 
FROM #A AS A5 INNER JOIN #A AS A4 ON A5.acc - A5.a = A4.acc 
INNER JOIN #A AS A3 ON A4.acc - A4.a = A3.acc 
INNER JOIN #A AS A2 ON A3.acc - A3.a = A2.acc 
INNER JOIN #A AS A1 ON A2.acc - A2.a = A1.acc 
INNER JOIN #DistinctLetterWords AS WA ON A1.a = WA.b 
INNER JOIN #DistinctLetterWords AS WB ON A2.a = WB.b 
INNER JOIN #DistinctLetterWords AS WC ON A3.a = WC.b 
INNER JOIN #DistinctLetterWords AS WD ON A4.a = WD.b 
INNER JOIN #DistinctLetterWords AS WE ON A5.a = WE.b 
WHERE A5.gen = 5
  AND A4.gen = 4
  AND A3.gen = 3
  AND A2.gen = 2
  AND A1.gen = 1

Itzik Ben-Gan says:

October 25, 2022 at 3:14 PM

Ian, thanks for sharing the puzzle and your solution. I absolutely love it!
It is indeed an interesting example for a use case for bit manipulation functions in T-SQL. I also wish we had an aggregate bitwise or function to simplify accumulating set bits.
I do recommend other readers to carefully look at Ian's code. I loved the use of bitwise operators to identify the first set bit and the first unset bit, as well as the idea to rank letters by use frequency.
Alexander Perea Gomez says:

March 17, 2023 at 11:35 PM

Excellent!!
Itzik Ben-Gan says:

March 23, 2023 at 5:39 PM

Thanks Alexander!
Lukas says:

December 13, 2023 at 2:02 PM

Thanks for the helpful article. For those interested in the topic, I also recommend reading this article (https://blog.devart.com/bit-manipulation-functions-in-sql-server.html) about bit manipulation functions.

This site uses Akismet to reduce spam. Learn how your comment data is processed.

New Bit Manipulation Functions in SQL Server 2022

GET_BIT and SET_BIT

BIT_COUNT

LEFT_SHIFT and RIGHT_SHIFT

Handling Bit Manipulation Tasks Is Now Easier and More Intuitive

Extracting the State of a Bit in a Given Offset or Setting It

Counting Bits

Bit Shifting

Conclusion

6 thoughts on “New Bit Manipulation Functions in SQL Server 2022”

Leave a Reply