Thứ Hai, 27 tháng 1, 2014

SQL SERVER – Primary Key and NonClustered Index in Simple Words

There are two major place where I focus 
1) If there are change in features – I re-blog about it with additional details or 
2) If I have not provided complete information six years ago, I try to fill up the gap now. Well, just like everything my knowledge and writing skills have evolved. 
November 2006 was when I started to learn more about SQL and have been only 4 months in the product, I was still exploring various subjects. there are a few more details one should learn when it is about Primary Key and Clustered Index.
Here is the common misconception prevailing in the industry.
Primary Key has to be Clustered Index. 
In reality the statement should be corrected as follows:
Primary Key can be Clustered or Non-clustered but it is a common best practice to create a Primary Key as Clustered Index. 
Well, now we have corrected the statement let us understand a bit more in detail. Primary Key should be uniquely identifying column of the table and it should be NOT NULL. A good (most of the time) candidate of the clustered index key also uniquely identifies column and NOT NULL (most of the time). Well, that means it is a good idea to create a Primary Key Clustered so it solve both the problems together. Keeping these facts in mind SQL Server automatically creates Clustered Index on the Primary Key when the table is created.Developers often do not specify which column should have clustered index so by default Primary Key becomes Clustered Index. This practice is now extremely common and lots of people have forgotten that Primary Key and Clustered Index is two different things. They can be same column but they do not have to be.
Well here are four examples we will see where we will learn the behavior of SQL Server when it is about Primary Key and Clustered Index.
  • Scenario 1 : Primary Key will default to Clustered Index
  • Scenario 2: Primary Key is defined as a Non-clustered Index
  • Scenario 3: Primary Key defaults to Non-Clustered Index with another column defined as a Clustered Index
  • Scenario 4: Primary Key defaults to Clustered Index with other index defaults to Non-clustered index
Now let us see each of the scenarios in detail.

Scenario 1 : Primary Key will default to Clustered Index

In this case we will create only Primary Key and when we check the kind of index created on the table we will notice that it has created clustered index automatically over it.
-- Case 1 Primary Key Defaults to Clustered IndexUSE TempDB
GO
-- Create tableCREATE TABLE TestTable(ID INT NOT NULL PRIMARY KEY,Col1 INT NOT NULL)GO-- Check IndexesSELECT OBJECT_NAME(OBJECT_IDTableObject,[name] IndexName,[Type_Desc]FROM sys.indexesWHERE OBJECT_NAME(OBJECT_ID'TestTable'GO-- Clean upDROP TABLE TestTable
GO

Scenario 2: Primary Key is defined as a Non-clustered Index

In this case we will explicitly defined Primary Key as a non-clustered index and it will create it as a non-clustered index. It proves that Primary Key can be non-clustered index.
-- Case 2 Primary Key Non-clustered IndexUSE TempDB
GO
-- Create tableCREATE TABLE TestTable(ID INT NOT NULL PRIMARY KEY NONCLUSTERED,Col1 INT NOT NULL)GO-- Check IndexesSELECT OBJECT_NAME(OBJECT_IDTableObject,[name] IndexName,[Type_Desc]FROM sys.indexesWHERE OBJECT_NAME(OBJECT_ID'TestTable'GO-- Clean upDROP TABLE TestTable
GO

Scenario 3: Primary Key defaults to Non-Clustered Index with another column defined as a Clustered Index

In this case we will create clustered index on another column, SQL Server will automatically create a Primary Key as a non-clustered index as clustered index is specified on another column.
-- Case 3 Primary Key Defaults to Non-clustered IndexUSE TempDB
GO
-- Create tableCREATE TABLE TestTable(ID INT NOT NULL PRIMARY KEY,Col1 INT NOT NULL UNIQUE CLUSTERED)GO-- Check IndexesSELECT OBJECT_NAME(OBJECT_IDTableObject,[name] IndexName,[Type_Desc]FROM sys.indexesWHERE OBJECT_NAME(OBJECT_ID'TestTable'GO-- Clean upDROP TABLE TestTable
GO

Scenario 4: Primary Key defaults to Clustered Index with other index defaults to Non-clustered index

In this case we will create two indexes on the both the tables but we will not specify the type of the index on the columns. When we check the results we will notice that Primary Key is automatically defaulted to Clustered Index and another column as a Non-clustered index.
-- Case 4 Primary Key and DefaultsUSE TempDB
GO
-- Create tableCREATE TABLE TestTable(ID INT NOT NULL PRIMARY KEY,Col1 INT NOT NULL UNIQUE)GO-- Check IndexesSELECT OBJECT_NAME(OBJECT_IDTableObject,[name] IndexName,[Type_Desc]FROM sys.indexesWHERE OBJECT_NAME(OBJECT_ID'TestTable'GO-- Clean upDROP TABLE TestTable
GO
I think above examples clarifies if there are any confused related to Primary and Clustered Index.
Now here is the question I often get asked what can be the reason for creating Primary Key and Clustered Index Key on different columns. Well, there are many scenarios when this can be true. It is possible that you have column SSN which you want to create as a Primary Key but do not want make it as a clustered index key because you have uniquely increasing identity column which best suites your need for that table (again this is just an example – you can argue exactly the opposite way as well). You are welcome to continue the discussion on this subject in the comments field or a dedicated blog post I wrote about it years ago over here. There are few really good comments there – I think that blog post is now a gold mine to understand this concept.

Không có nhận xét nào: