Islands T-SQL Challenge

April 13, 2022 by Itzik Ben-Gan in SQL Performance, T-SQL Queries | 16 Comments

Recently, I was introduced to a new islands challenge by my friend Erland Sommarskog. It’s based on a question from a public database forum. The challenge itself isn’t complicated to handle when using well-known techniques, which primarily employ window functions. However, these techniques require explicit sorting despite the presence of a supporting index. This affects the scalability and response time of the solutions. Fond of challenges, I set out to find a solution to minimize the number of explicit Sort operators in the plan, or better yet, eliminate the need for those altogether. And I found such a solution.

I’ll start by presenting a generalized form of the challenge. I’ll then show two solutions based on known techniques, followed by the new solution. Finally, I’ll compare the performance of the different solutions.

I recommend you try to find a solution before implementing mine.

The challenge

I’ll present a generalized form of Erland’s original islands challenge.

Use the following code to create a table called T1 and populate it with a small set of sample data:

SET NOCOUNT ON;

USE tempdb;

DROP TABLE IF EXISTS dbo.T1

CREATE TABLE dbo.T1
(
  grp VARCHAR(10) NOT NULL,
  ord INT NOT NULL,
  val VARCHAR(10) NOT NULL,
  CONSTRAINT PK_T1 PRIMARY KEY(grp, ord)
);
GO

INSERT INTO dbo.T1(grp, ord, val) VALUES
  ('Group A', 1002, 'Y'),
  ('Group A', 1003, 'Y'),
  ('Group A', 1005, 'Y'),
  ('Group A', 1007, 'N'),
  ('Group A', 1011, 'N'),
  ('Group A', 1013, 'N'),
  ('Group A', 1017, 'Y'),
  ('Group A', 1019, 'Y'),
  ('Group A', 1023, 'N'),
  ('Group A', 1029, 'N'),
  ('Group B', 1001, 'X'),
  ('Group B', 1002, 'X'),
  ('Group B', 1003, 'Z'),
  ('Group B', 1005, 'Z'),
  ('Group B', 1008, 'Z'),
  ('Group B', 1013, 'Z'),
  ('Group B', 1021, 'Y'),
  ('Group B', 1034, 'Y');

The challenge is as follows:

Assuming partitioning based on the column grp and ordering based on the column ord, compute sequential row numbers starting with 1 within each consecutive group of rows with the same value in the val column. Following is the desired result for the given small set of sample data:

grp      ord   val  seqno
-------- ----- ---- ------
Group A  1002  Y    1
Group A  1003  Y    2
Group A  1005  Y    3
Group A  1007  N    1
Group A  1011  N    2
Group A  1013  N    3
Group A  1017  Y    1
Group A  1019  Y    2
Group A  1023  N    1
Group A  1029  N    2
Group B  1001  X    1
Group B  1002  X    2
Group B  1003  Z    1
Group B  1005  Z    2
Group B  1008  Z    3
Group B  1013  Z    4
Group B  1021  Y    1
Group B  1034  Y    2

Note the definition of the primary key constraint based on the composite key (grp, ord), which results in a clustered index based on the same key. This index can potentially support window functions partitioned by grp and ordered by ord.

To test the performance of your solution, you’ll need to populate the table with larger volumes of sample data. Use the following code to create the helper function GetNums:

CREATE FUNCTION dbo.GetNums(@low AS BIGINT = 1, @high AS BIGINT)
  RETURNS TABLE
AS
RETURN
  WITH
    L0 AS ( SELECT 1 AS c 
            FROM (VALUES(1),(1),(1),(1),(1),(1),(1),(1),
                        (1),(1),(1),(1),(1),(1),(1),(1)) AS D(c) ),
    L1 AS ( SELECT 1 AS c FROM L0 AS A CROSS JOIN L0 AS B ),
    L2 AS ( SELECT 1 AS c FROM L1 AS A CROSS JOIN L1 AS B ),
    L3 AS ( SELECT 1 AS c FROM L2 AS A CROSS JOIN L2 AS B ),
    Nums AS ( SELECT ROW_NUMBER() OVER(ORDER BY (SELECT NULL)) AS rownum
              FROM L3 )
  SELECT TOP(@high - @low + 1)
     rownum AS rn,
     @high + 1 - rownum AS op,
     @low - 1 + rownum AS n
  FROM Nums
  ORDER BY rownum;
GO

Use the following code to populate T1, after changing the parameters representing the number of groups, number of rows per group and number of distinct values as you wish:

DECLARE 
  @numgroups AS INT = 1000,
  @rowspergroup AS INT = 10000, -- test with 1000 to 10000 here
  @uniquevalues AS INT = 5;

ALTER TABLE dbo.T1 DROP CONSTRAINT PK_T1;

TRUNCATE TABLE dbo.T1;

INSERT INTO dbo.T1 WITH(TABLOCK) (grp, ord, val)
  SELECT
    CAST(G.n AS VARCHAR(10)) AS grp,
    CAST(R.n AS INT) AS ord,
    CAST(ABS(CHECKSUM(NEWID())) % @uniquevalues + 1 AS VARCHAR(10)) AS val
  FROM dbo.GetNums(1, @numgroups) AS G
    CROSS JOIN dbo.GetNums(1, @rowspergroup) AS R;
    
ALTER TABLE dbo.T1 ADD CONSTRAINT PK_T1 PRIMARY KEY CLUSTERED(grp, ord);

In my performance tests, I populated the table with 1,000 groups, between 1,000 and 10,000 rows per group (so 1M to 10M rows), and 5 distinct values. I used a SELECT INTO statement to write the result into a temporary table.

My test machine has four logical CPUs, running SQL Server^® 2019 Enterprise.

I’ll assume you’re using an environment designed to support batch mode on row store either directly, e.g., using SQL Server 2019 Enterprise edition like mine, or indirectly, by creating a dummy columnstore index on the table.

Remember, extra points if you manage to come up with an efficient solution without explicit sorting in the plan. Good luck!

Is a Sort operator needed in the optimization of window functions?

Before I cover solutions, a bit of optimization background so what you’ll see in the query plans later will make more sense.

The most common techniques for solving islands tasks such as ours involve using some combination of aggregate and/or ranking window functions. SQL Server can process such window functions using either a series of older row mode operators (Segment, Sequence Project, Segment, Window Spool, Stream Aggregate) or the newer and usually more efficient batch mode Window Aggregate operator.

In both cases, the operators handling the window function’s calculation need to ingest the data ordered by the window partitioning and ordering elements. If you don’t have a supporting index, naturally SQL Server will need to introduce a Sort operator in the plan. For instance, if you have multiple window functions in your solution with more than one unique combination of partitioning and ordering, you’re bound to have explicit sorting in the plan. But what if you have only one unique combination of partitioning and ordering and a supporting index?

The older row mode method can rely on preordered data ingested from an index without the need for an explicit Sort operator in both serial and parallel modes. The newer batch mode operator eliminates much of the inefficiencies of the older row mode optimization and has the inherent benefits of batch mode processing. However, its current parallel implementation requires an intermediary batch mode parallel Sort operator even when a supporting index is present. Only its serial implementation can rely on index order without a Sort operator. This is all to say when the optimizer needs to choose a strategy to handle your window function, assuming you have a supporting index, it will generally be one of the following four options:

Row mode, serial, no sorting
Row mode, parallel, no sorting
Batch mode, serial, no sorting
Batch mode, parallel, sorting

Whichever one results in the lowest plan cost will be chosen, assuming of course prerequisites for parallelism and batch mode are met when considering those. Normally, for the optimizer to justify a parallel plan, the parallelism benefits need to outweigh the extra work like thread synchronization. With option 4 above, the parallelism benefits need to outweigh the usual extra work involved with parallelism, plus the extra sort.

While experimenting with different solutions to our challenge, I had cases where the optimizer chose option 2 above. It chose it despite the fact that the row mode method involves a few inefficiencies because the benefits in parallelism and no sorting resulted in a plan with a lower cost than the alternatives. In some of those cases, forcing a serial plan with a hint resulted in option 3 above, and in better performance.

With this background in mind, let’s look at solutions. I’ll start with two solutions relying on known techniques for islands tasks that cannot escape explicit sorting in the plan.

Solution based on known technique 1

Following is the first solution to our challenge, which is based on a technique that has been known for a while:

WITH C AS
(
  SELECT *,
    ROW_NUMBER() OVER(PARTITION BY grp ORDER BY ord) -
      ROW_NUMBER() OVER(PARTITION BY grp, val ORDER BY ord) AS island
  FROM dbo.T1
)
SELECT grp, ord, val, 
  ROW_NUMBER() OVER(PARTITION BY grp, val, island ORDER BY ord) AS seqno
FROM C;

I’ll refer to it as Known Solution 1.

The CTE called C is based on a query that computes two row numbers. The first (I’ll refer to it as rn1) is partitioned by grp and ordered by ord. The second (I’ll refer to it as rn2) is partitioned by grp and val and ordered by ord. Since rn1 has gaps between different groups of the same val and rn2 doesn’t, the difference between rn1 and rn2 (column called island) is a unique constant value for all rows with the same grp and val values. Following is the result of the inner query, including the results of the two-row number computations, which aren’t returned as separate columns:

grp      ord   val  rn1  rn2  island
-------- ----- ---- ---- ---- -------
Group A  1002  Y    1    1    0
Group A  1003  Y    2    2    0
Group A  1005  Y    3    3    0
Group A  1007  N    4    1    3
Group A  1011  N    5    2    3
Group A  1013  N    6    3    3
Group A  1017  Y    7    4    3
Group A  1019  Y    8    5    3
Group A  1023  N    9    4    5
Group A  1029  N    10   5    5
Group B  1001  X    1    1    0
Group B  1002  X    2    2    0
Group B  1003  Z    3    1    2
Group B  1005  Z    4    2    2
Group B  1008  Z    5    3    2
Group B  1013  Z    6    4    2
Group B  1021  Y    7    1    6
Group B  1034  Y    8    2    6

What’s left for the outer query to do is to compute the result column seqno using the ROW_NUMBER function, partitioned by grp, val, and island, and ordered by ord, generating the desired result.

Note you can get the same island value for different val values within the same partition, such as in the output above. This is why it’s important to use grp, val, and island as the window partitioning elements and not grp and island alone.

Similarly, if you’re dealing with an islands task requiring you to group the data by the island and compute group aggregates, you would group the rows by grp, val, and island. But this isn’t the case with our challenge. Here you were tasked with just computing row numbers independently for each island.

Figure 1 has the default plan that I got for this solution on my machine after populating the table with 10M rows.

Figure 1: Parallel plan for Known Solution 1

The computation of rn1 can rely on index order. So, the optimizer chose the no sort + parallel row mode strategy for this one (first Segment and Sequence Project operators in the plan). Since the computations of both rn2 and seqno use their own unique combinations of partitioning and ordering elements, explicit sorting is unavoidable for those irrespective of the strategy used. So, the optimizer chose the sort + parallel batch mode strategy for both. This plan involves two explicit Sort operators.

In my performance test, it took this solution 3.68 seconds to complete against 1M rows and 43.1 seconds against 10M rows.

As mentioned, I tested all solutions also by forcing a serial plan (with a MAXDOP 1 hint). The serial plan for this solution is shown in Figure 1.

Figure 2: Serial plan for Known Solution 1

As expected, this time also the computation of rn1 uses the batch mode strategy without a preceding Sort operator, but the plan still has two Sort operators for the subsequent row number computations. The serial plan performed worse than the parallel one on my machine with all input sizes I tested, taking 4.54 seconds to complete with 1M rows and 61.5 seconds with 10M rows.

Solution based on known technique 2

The second solution I’ll present is based on a brilliant technique for island detection I learned from Paul White a few years ago. Following is the complete solution code based on this technique:

WITH C1 AS
(
  SELECT *,
    CASE
      WHEN val = LAG(val) OVER(PARTITION BY grp ORDER BY ord) THEN 0
      ELSE 1
    END AS isstart
  FROM dbo.T1
),
C2 AS
(
  SELECT *,
    SUM(isstart) OVER(PARTITION BY grp ORDER BY ord ROWS UNBOUNDED PRECEDING) AS island
  FROM C1
)
SELECT grp, ord, val, 
  ROW_NUMBER() OVER(PARTITION BY grp, island ORDER BY ord) AS seqno
FROM C2;

I’ll refer to this solution as Known Solution 2.

The query defining the CTE C1 computes uses a CASE expression and the LAG window function (partitioned by grp and ordered by ord) to compute a result column called isstart. This column has the value 0 when the current val value is the same as the previous and 1 otherwise. In other words, it’s 1 when the row is the first in an island and 0 otherwise.

Following is the output of the query defining C1:

grp      ord   val  isstart
-------- ----- ---- --------
Group A  1002  Y    1
Group A  1003  Y    0
Group A  1005  Y    0
Group A  1007  N    1
Group A  1011  N    0
Group A  1013  N    0
Group A  1017  Y    1
Group A  1019  Y    0
Group A  1023  N    1
Group A  1029  N    0
Group B  1001  X    1
Group B  1002  X    0
Group B  1003  Z    1
Group B  1005  Z    0
Group B  1008  Z    0
Group B  1013  Z    0
Group B  1021  Y    1
Group B  1034  Y    0

The magic as far as island detection is concerned happens in the CTE C2. The query defining it computes an island identifier using the SUM window function (also partitioned by grp and ordered by ord) applied to the isstart column. The result column with the island identifier is called island. Within each partition, you get 1 for the first island, 2 for the second island, and so on. So, the combination of columns grp and island is an island identifier, which you can use in islands tasks that involve grouping by island when relevant.

Following is the output of the query defining C2:

grp      ord   val  isstart  island
-------- ----- ---- -------- -------
Group A  1002  Y    1        1
Group A  1003  Y    0        1
Group A  1005  Y    0        1
Group A  1007  N    1        2
Group A  1011  N    0        2
Group A  1013  N    0        2
Group A  1017  Y    1        3
Group A  1019  Y    0        3
Group A  1023  N    1        4
Group A  1029  N    0        4
Group B  1001  X    1        1
Group B  1002  X    0        1
Group B  1003  Z    1        2
Group B  1005  Z    0        2
Group B  1008  Z    0        2
Group B  1013  Z    0        2
Group B  1021  Y    1        3
Group B  1034  Y    0        3

Lastly, the outer query computes the desired result column seqno with a ROW_NUMBER function, partitioned by grp and island, and ordered by ord. Notice this combination of partitioning and ordering elements is different from the one used by the previous window functions. Whereas the computation of the first two window functions can potentially rely on index order, the last can’t.

Figure 3 has the default plan that I got for this solution.

Figure 3: Parallel plan for Known Solution 2

As you can see in the plan, the computation of the first two window functions uses the no sort + parallel row mode strategy, and the computation of the last uses the sort + parallel batch mode strategy.

The run times I got for this solution ranged from 2.57 seconds against 1M rows to 46.2 seconds against 10M rows.

When forcing serial processing, I got the plan shown in Figure 4.

Figure 4: Serial plan for Known Solution 2

As expected, this time all window function computations rely on the batch mode strategy. The first two without preceding sorting, and the last with. Both the parallel plan and the serial one involved one explicit Sort operator. The serial plan performed better than the parallel plan on my machine with the input sizes I tested. The run times I got for the forced serial plan ranged from 1.75 seconds against 1M rows to 21.7 seconds against 10M rows.

Solution based on new technique

When Erland introduced this challenge in a private forum, people were skeptical of the possibility of solving it with a query that had been optimized with a plan without explicit sorting. That’s all I needed to hear to motivate me. So, here’s what I came up with:

WITH C1 AS
(
  SELECT *,
    ROW_NUMBER() OVER(PARTITION BY grp ORDER BY ord) AS rn,
    LAG(val) OVER(PARTITION BY grp ORDER BY ord) AS prev 
  FROM dbo.T1
),
C2 AS
(
  SELECT *,
    MAX(CASE WHEN val = prev THEN NULL ELSE rn END)
      OVER(PARTITION BY grp ORDER BY ord ROWS UNBOUNDED PRECEDING) AS firstrn
  FROM C1
)
SELECT grp, ord, val, rn - firstrn + 1 AS seqno
FROM C2;

The solution uses three window functions: LAG, ROW_NUMBER and MAX. The important thing here is all three are based on grp partitioning and ord ordering, which is aligned with the supporting index key order.

The query defining the CTE C1 uses the ROW_NUMBER function to compute row numbers (rn column), and the LAG function to return the previous val value (prev column).

Here’s the output of the query defining C1:

grp      ord   val  rn  prev
-------- ----- ---- --- -----
Group A  1002  Y    1   NULL
Group A  1003  Y    2   Y
Group A  1005  Y    3   Y
Group A  1007  N    4   Y
Group A  1011  N    5   N
Group A  1013  N    6   N
Group A  1017  Y    7   N
Group A  1019  Y    8   Y
Group A  1023  N    9   Y
Group A  1029  N    10  N
Group B  1001  X    1   NULL
Group B  1002  X    2   X
Group B  1003  Z    3   X
Group B  1005  Z    4   Z
Group B  1008  Z    5   Z
Group B  1013  Z    6   Z
Group B  1021  Y    7   Z
Group B  1034  Y    8   Y

Notice when val and prev are the same, it’s not the first row in the island, otherwise it is.

Based on this logic, the query defining the CTE C2 uses a CASE expression that returns rn when the row is the first in an island and NULL otherwise. The code then applies the MAX window function to the result of the CASE expression, returning the first rn of the island (firstrn column).

Here’s the output of the query defining C2, including the output of the CASE expression:

grp      ord   val  rn  prev  CASE  firstrn
-------- ----- ---- --- ----- ----- --------
Group A  1002  Y    1   NULL  1     1
Group A  1003  Y    2   Y     NULL  1
Group A  1005  Y    3   Y     NULL  1
Group A  1007  N    4   Y     4     4
Group A  1011  N    5   N     NULL  4
Group A  1013  N    6   N     NULL  4
Group A  1017  Y    7   N     7     7
Group A  1019  Y    8   Y     NULL  7
Group A  1023  N    9   Y     9     9
Group A  1029  N    10  N     NULL  9
Group B  1001  X    1   NULL  1     1
Group B  1002  X    2   X     NULL  1
Group B  1003  Z    3   X     3     3
Group B  1005  Z    4   Z     NULL  3
Group B  1008  Z    5   Z     NULL  3
Group B  1013  Z    6   Z     NULL  3
Group B  1021  Y    7   Z     7     7
Group B  1034  Y    8   Y     NULL  7

What’s left for the outer query is to compute the desired result column seqno as rn minus firstrn plus 1.

Figure 5 has the default parallel plan I got for this solution when testing it using a SELECT INTO statement, writing the result into a temporary table.

Figure 5: Parallel plan for a new solution

There are no explicit sort operators in this plan. However, all three window functions are computed using the no sort + parallel row mode strategy, so we’re missing the benefits of batch processing. The run times I got for this solution with the parallel plan ranged from 2.47 seconds against 1M rows and 41.4 against 10M rows.

Here, there’s quite high likelihood for a serial plan with batch processing to do significantly better, especially when the machine doesn’t have many CPUs. Recall I’m testing my solutions against a machine with 4 logical CPUs. Figure 6 has the plan I got for this solution when forcing serial processing.

Figure 6: Serial plan for a new solution

All three window functions use the no sort + serial batch mode strategy. And the results are quite impressive. This solution run times ranged from 0.5 seconds against 1M rows and 5.49 seconds against 10M rows. What’s also curious about this solution is when testing it as a normal SELECT statement (with results discarded) as opposed to a SELECT INTO statement, SQL Server chose the serial plan by default. With the other two solutions, I got a parallel plan by default both with SELECT and with SELECT INTO.

See the next section for the complete performance test results.

Performance comparison

Here’s the code I used to test the three solutions, of course uncommenting the MAXDOP 1 hint to test the serial plans:

-- Test Known Solution 1

DROP TABLE IF EXISTS #Result;

WITH C AS
(
  SELECT *,
    ROW_NUMBER() OVER(PARTITION BY grp ORDER BY ord) -
      ROW_NUMBER() OVER(PARTITION BY grp, val ORDER BY ord) AS island
  FROM dbo.T1
)
SELECT grp, ord, val, 
  ROW_NUMBER() OVER(PARTITION BY grp, val, island ORDER BY ord) AS seqno
INTO #Result
FROM C
/* OPTION(MAXDOP 1) */; -- uncomment for serial test
GO

-- Test Known Solution 2

DROP TABLE IF EXISTS #Result;

WITH C1 AS
(
  SELECT *,
    CASE
      WHEN val = LAG(val) OVER(PARTITION BY grp ORDER BY ord) THEN 0
      ELSE 1
    END AS isstart
  FROM dbo.T1
),
C2 AS
(
  SELECT *,
    SUM(isstart) OVER(PARTITION BY grp ORDER BY ord ROWS UNBOUNDED PRECEDING) AS island
  FROM C1
)
SELECT grp, ord, val, 
  ROW_NUMBER() OVER(PARTITION BY grp, island ORDER BY ord) AS seqno
INTO #Result
FROM C2
/* OPTION(MAXDOP 1) */;
GO

-- Test New Solution

DROP TABLE IF EXISTS #Result;

WITH C1 AS
(
  SELECT *,
    ROW_NUMBER() OVER(PARTITION BY grp ORDER BY ord) AS rn,
    LAG(val) OVER(PARTITION BY grp ORDER BY ord) AS prev 
  FROM dbo.T1
),
C2 AS
(
  SELECT *,
    MAX(CASE WHEN val = prev THEN NULL ELSE rn END)
      OVER(PARTITION BY grp ORDER BY ord ROWS UNBOUNDED PRECEDING) AS firstrn
  FROM C1
)
SELECT grp, ord, val, rn - firstrn + 1 AS seqno
INTO #Result
FROM C2
/* OPTION(MAXDOP 1) */;

Figure 7 has the run times of both the parallel and serial plans for all solutions against different input sizes.

Figure 7: Performance comparison

The new solution, using serial mode, is the clear winner. It has great performance, linear scaling, and immediate response time.

Conclusion

Islands tasks are quite common in real life. Many of them involve both identifying islands and grouping the data by the island. Erland’s islands challenge, which was the focus of this article, is a bit more unique because it doesn’t involve grouping but instead sequencing each island’s rows with row numbers.

I presented two solutions based on known techniques for identifying islands. The problem with both is they result in plans involving explicit sorting, which negatively affects the performance, scalability, and response time of the solutions. I also presented a new technique that resulted in a plan with no sorting at all. The serial plan for this solution, which uses a no sort + serial batch mode strategy, has excellent performance, linear scaling, and immediate response time. It’s unfortunate, at least for now, we can’t have a no sort + parallel batch mode strategy for handling window functions.

16 thoughts on “Islands T-SQL Challenge”

Kamil Kosno says:

April 14, 2022 at 5:34 AM

Hi Itzik, before reading the rest of the article I came up with the following solution, there are no sort operators in the plan:

with 
regroup as
(
select grp, ord, val, 
case when lag(val) over(partition by grp order by ord) = val then 0 else 
count(*) over(partition by grp order by ord rows between unbounded preceding and current row) - 1  
end as grp_start,
row_number() over(partition by grp order by ord) as rn
from dbo.t1
)
select 
grp, ord, val, rn - max(grp_start) over(partition by grp order by ord rows between unbounded preceding and current row) as rn 
from regroup;

Now it's time to continue reading :-)

Sababa says:

April 14, 2022 at 6:51 AM

I know it's not close to solutions presented here…

drop table if exists #tmp
drop table if exists #r

/*
	need the RN so that can identity the prev/next row
*/
SELECT 
	g1.grp, 
	g1.val,
	g1.ord,
	row_number() over(partition by g1.grp order by g1.ord) as rn
into #tmp
FROM dbo.T1 AS g1


--create nonclustered index idx_tmp_1 on #tmp
--(grp,ord)
--create nonclustered index idx_tmp_2 on #tmp
--(grp,val,rn)

			SELECT
				grp,val, gap_Starts as gap_Starts_Ends
			--into #r
			FROM
			(
				SELECT 
					g1.grp, 
					g1.val,
					g1.ord as gap_Starts
				FROM 
					#tmp as g1 
					WHERE NOT EXISTS (SELECT 1
									  FROM #tmp as g2 
									  WHERE (
												g1.grp=g2.grp
												AND g2.val = g1.Val
												AND g2.rn =  g1.rn - 1
											)
									  )
							/* don't have a prec ;
								nu avem precedent  */
			)as s

			UNION ALL 
			
			 SELECT 
					g1.grp,
					g1.val,
					g1.ord as gap_Ends
			 FROM #tmp as g1
			 WHERE NOT EXISTS ( SELECT 1
									  FROM #tmp as g2
									  WHERE (
											g1.grp=g2.grp
											AND g2.val = g1.Val
											AND g2.rn = g1.rn + 1
											)
									  
							 )
						/* don't have a next
							nu avem urmatorul*/
			--order by 1,3

Nathan Peterson says:

April 14, 2022 at 2:01 PM

Hi Itzik, great work! It looks like an optimal solution.
Here is a refactor, using isstart from KS2, for the sake of (my and possibly others') understanding:

;WITH C1 AS 
( --Get Row # in Group, and whether row is First row in an island
	SELECT *,
		ROW_NUMBER() OVER(PARTITION BY grp ORDER BY ord) AS rn,
	CASE
		WHEN val = LAG(val) OVER(PARTITION BY grp ORDER BY ord) THEN 0
		ELSE 1
	END AS isstart												
	FROM dbo.T1
)
,C2 AS 
( --Get Row # in Group of Most recent starter 
	SELECT *,
		MAX(isstart * rn)										
		OVER(PARTITION BY grp ORDER BY ord ROWS UNBOUNDED PRECEDING) AS srn
	FROM C1
)
--Difference between starter and current row is position within island (1-based). 	
SELECT grp, ord, val, rn - srn + 1 AS seqno					
FROM C2;

Peter Larsson says:

April 14, 2022 at 7:50 PM

Strange… I tested all solutions and Itzik's NS Ser ran in 54 seconds on my laptop.
However, keeping it simple and using some technique from the Buyer/Seller challenge here is my suggestion (that runs in 6 seconds in the 10 million row sample table.

WITH ctePrevious(grp, ord, val, prev)
AS (
        SELECT  grp,
                ord,
                val,
                LAG(val) OVER (PARTITION BY grp ORDER BY ord) AS prev 
        FROM    dbo.T1
), cteSequence(grp, ord, val, start)
AS
(
        SELECT  grp,
                ord,
                val,
                MAX(CASE WHEN val = prev THEN NULL ELSE ord END) OVER (PARTITION BY grp ORDER BY ord ROWS UNBOUNDED PRECEDING) AS start
        FROM    ctePrevious
)
SELECT  grp,
        ord,
        val,
        ord - start + 1 AS seqno
INTO    #Result2
FROM    cteSequence
OPTION  (MAXDOP 1); -- Force a ordered scan

Itzik Ben-Gan says:

April 14, 2022 at 9:55 PM

Kamil, nice solution!

Indeed, there are no sorts in the plan–neither in the serial one nor in the parallel one. Compared to NS ser, yours (Kamil ser) has one more Window Aggregate operator in the plan. Compared to NS par, yours (Kamil par) has one more sequence of row mode operators ending with a Stream Aggregate operator computing a window aggregate.
Here are the numbers that I get for your solution against 10M rows:

Kamil ser: 5.63 sec
Kamil par: 53.89 sec

That's compared to:

NS ser: 5.49 sec
NS par: 41.4 sec

Itzik Ben-Gan says:

April 14, 2022 at 10:03 PM

Sababa,

Your solution doesn't produce the desired result. Here's what it returns for the small set of sample data:

grp        val        gap_Starts_Ends
---------- ---------- ---------------
Group A    Y          1002
Group A    N          1007
Group A    Y          1017
Group A    N          1023
Group B    X          1001
Group B    Z          1003
Group B    Y          1021
Group A    Y          1005
Group A    N          1013
Group A    Y          1019
Group A    N          1029
Group B    X          1002
Group B    Z          1013
Group B    Y          1034

As a reminder, here's the expected result:

grp      ord   val  seqno
-------- ----- ---- ------
Group A  1002  Y    1
Group A  1003  Y    2
Group A  1005  Y    3
Group A  1007  N    1
Group A  1011  N    2
Group A  1013  N    3
Group A  1017  Y    1
Group A  1019  Y    2
Group A  1023  N    1
Group A  1029  N    2
Group B  1001  X    1
Group B  1002  X    2
Group B  1003  Z    1
Group B  1005  Z    2
Group B  1008  Z    3
Group B  1013  Z    4
Group B  1021  Y    1
Group B  1034  Y    2

Perhaps you sent an incomplete state of your solution?

Itzik Ben-Gan says:

April 14, 2022 at 10:12 PM

Nathan, I love the refactor! I do think it helps in understanding the solution better. Thanks for this.
It produces identical plans to NS, and the same performance.

Itzik Ben-Gan says:

April 14, 2022 at 10:18 PM

Hi Peter,

Just retested the solutions with 10M rows in the input table on my machine and I'm consistently getting ~ 5.5 sec for NS ser, as well as for the refactor Nathan sent.
Are you getting the same plan that I'm getting? Would love to get input from others as well.

Regarding your solution, it has a bug. Here's the result it produces for the small set of sample data:

grp        ord         val        seqno
---------- ----------- ---------- -----------
Group A    1002        Y          1
Group A    1003        Y          2
Group A    1005        Y          4
Group A    1007        N          1
Group A    1011        N          5
Group A    1013        N          7
Group A    1017        Y          1
Group A    1019        Y          3
Group A    1023        N          1
Group A    1029        N          7
Group B    1001        X          1
Group B    1002        X          2
Group B    1003        Z          1
Group B    1005        Z          3
Group B    1008        Z          6
Group B    1013        Z          11
Group B    1021        Y          1
Group B    1034        Y          14

As a reminder, this is the desired result (sequential row numbers starting with 1 within each island):

grp      ord   val  seqno
-------- ----- ---- ------
Group A  1002  Y    1
Group A  1003  Y    2
Group A  1005  Y    3
Group A  1007  N    1
Group A  1011  N    2
Group A  1013  N    3
Group A  1017  Y    1
Group A  1019  Y    2
Group A  1023  N    1
Group A  1029  N    2
Group B  1001  X    1
Group B  1002  X    2
Group B  1003  Z    1
Group B  1005  Z    2
Group B  1008  Z    3
Group B  1013  Z    4
Group B  1021  Y    1
Group B  1034  Y    2

Kamil Kosno says:

April 15, 2022 at 4:11 AM

Thanks Itzik.
The train of thought at the time after trials with previous approaches left me with that combination and I stopped considering improvements, but the count should be simply row number.
If I rewrite it like below it is pretty much the same logic, but, after reading seems like my solution was nothing but what was already in the article although the steps applied (i.e. first calculating lag etc.) in the article lead to a better performance.
```
with 
regroup as
(
select grp, ord, val, 
case when lag(val) over(partition by grp order by ord) = val then 0 else 
row_number() over(partition by grp order by ord) - 1  
end as grp_start,
row_number() over(partition by grp order by ord) as rn
from dbo.t1
)
select 
grp, ord, val, rn - max(grp_start) over(partition by grp order by ord rows between unbounded preceding and current row) as rn 
from regroup;
```
Nonetheless, interesting how close it is, though should be no surprise since I've learnt everything about SQL querying from your books :-)
Kamil Kosno says:

April 15, 2022 at 4:31 AM

Hi Itzik, I am not sure how to reply to the comment, but I wanted to give you feedback you mentioned when replying to Peter earlier. When I run your solution with maxdop option it consistently runs in slightly over 3 seconds (~3.2) on my desktop. Without that option it takes 20.9 seconds.
For comparison, my solution with count takes ~3.3 secs with maxdop 1 and ~30secs without this option.
My setup is Intel(R) Core(TM) i9-9900K CPU @ 3.60GHz 3.60 GHz, 32.0 GB of RAM and SSD drive. SQL Server version: 2019 (RTM) – 15.0.2000.5 (X64) Sep 24 2019 13:48:23 , Developer Edition (64-bit) on Windows 10 Pro 10.0 (Build 22000: ) (Hypervisor)
Itzik Ben-Gan says:

April 15, 2022 at 5:20 AM

Hi Kamil,

Yes, they are pretty similar in logic, and like you say, once you convert the COUNT to a ROW_NUMBER, they're the same. :)

Regarding the performance numbers.
My machine is 5-6 years old. The setup is:
CPU: Intel Core i7-6600U 2.60GHz 2 Cores, 4 Logical Processors
Mem: 16GB RAM
Disk: Crucial MX200 SSD
OS: Windows 11 Pro
SQL: SQL Server 2019 Enterprise
NS ser runs on mine for ~5.5 sec, which seems reasonable for a 10M-row input given the fact that there are no sorts in the plan.

Thanks for sharing the run times on your machine. I expected the solution to run even faster on newer, stronger machines, like yours. So the ~3.2 sec run time seems logical. Thanks for confirming!
Douglas Osborne says:

April 17, 2022 at 6:24 PM

I love watching smart people being smart
Charlieface says:

May 22, 2022 at 5:35 PM

What annoys me the most about a lot of the gaps-and-islands problems, is that SQL Server actually has a "Merge Interval" operator which it uses internally in some situations. This would massively simplify gaps-and-islands, while allowing for no-sort solutions.

But there is no syntax to use this operator. So instead we need to muck around with row-numbering, lag/lead and whatnot.

Why can't we do

GROUP BY
grp,
MERGE BY (val) WITHIN GROUP (ORDER BY ord)

or what about

ROW_NUMBER() OVER (PARTITION BY grp, MERGE BY val ORDER BY ord)

or something like that
Itzik Ben-Gan says:

May 22, 2022 at 7:36 PM

Hi Charlie,

It would have been really cool if we could tap into the optimizer’s merge interval, as well as merge join concatenation capabilities, by request.
Mark Edgar says:

January 21, 2023 at 3:16 AM

To me, gaps and island problems look simple, almost trivial, at first glance and then becoming maddening once I try to get them to work. I'm amazed at how succinct and clever the best solutions are – great work! My challenge will be adapting these approaches to date-based islands where rows with start and end dates may overlap and might or might not have the same start and end dates within an island.
Itzik Ben-Gan says:

January 21, 2023 at 6:35 PM

Thanks Mark!

This site uses Akismet to reduce spam. Learn how your comment data is processed.