Sachin Arora's Blog: The Clustering factor

Tuesday, November 28, 2006

The Clustering factor

The clustering factor is a number that represents how scattered the rows of a table are relative to the order of the entries in an index.

In simple terms, it is the number of “block switches” made while reading the whole table through an index.

Figure: Bad clustering factor

The diagram above shows how scattered the rows of the table are. The first index entry (from the left of the index) points to the first data block, and the second index entry points to the second data block. So during an index range scan or a full index scan, Oracle has to switch between data blocks and revisit the same block more than once, because the rows are scattered. The number of these switches is what is termed the “clustering factor”.

Figure: Good clustering factor

The image above represents a “good CF”. During an index range scan, Oracle rarely has to jump to another data block, because most consecutive index entries point to the same data block. This significantly reduces the cost of your SELECT statements.

The clustering factor is stored in the data dictionary and can be viewed in dba_indexes (or user_indexes).

SQL> create table sac as select * from all_objects;

Table created.

SQL> create index obj_id_indx on sac(object_id);

Index created.

SQL> select clustering_factor from user_indexes where index_name='OBJ_ID_INDX';

CLUSTERING_FACTOR
-----------------
545

SQL> select count(*) from sac;

COUNT(*)
----------
38956

SQL> select blocks from user_segments where segment_name='OBJ_ID_INDX';

BLOCKS
----------
96

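The example above shows that Oracle would have to jump between table blocks 545 times to return all 38956 rows if the whole table were read through the index. Note that the last query reports the blocks of the index segment (OBJ_ID_INDX), not of the table; to compare the clustering factor with the size of the table, look up the blocks of the SAC segment instead.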
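The block-switch count can also be reproduced by hand, which makes the definition concrete. The query below is a minimal sketch, assuming the sac table and its object_id index from the example: it walks the rows in index order (key, then rowid) and counts each time the table block changes. The column aliases are illustrative only. The result should come out close to the stored CLUSTERING_FACTOR, though the dictionary value depends on when and how statistics were gathered.

```sql
-- Walk the rows of SAC in the order of the OBJ_ID_INDX index
-- (object_id, then rowid) and count every change of table block.
select count(*) as block_switches
from  (select dbms_rowid.rowid_relative_fno(rowid) as file_no,
              dbms_rowid.rowid_block_number(rowid) as block_no,
              lag(dbms_rowid.rowid_relative_fno(rowid))
                  over (order by object_id, rowid) as prev_file_no,
              lag(dbms_rowid.rowid_block_number(rowid))
                  over (order by object_id, rowid) as prev_block_no
       from   sac
       where  object_id is not null)  -- rows with a NULL key are not in the index
where  prev_block_no is null          -- the first row counts as one switch
   or  block_no <> prev_block_no
   or  file_no  <> prev_file_no;
```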

Note:
- A good CF is equal (or close) to the number of blocks in the table.

- A bad CF is equal (or close) to the number of rows in the table.
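To judge where an index falls between these two extremes, the three numbers can be put side by side. This is a sketch that assumes the sac table and OBJ_ID_INDX index from the example, and it gathers fresh statistics first so that BLOCKS and NUM_ROWS in user_tables are populated.

```sql
-- Refresh optimizer statistics for the table and its indexes.
exec dbms_stats.gather_table_stats(user, 'SAC', cascade => true)

-- Compare the clustering factor with the table's block and row counts.
select i.index_name,
       i.clustering_factor,
       t.blocks   as table_blocks,
       t.num_rows as table_rows
from   user_indexes i
       join user_tables t on t.table_name = i.table_name
where  i.index_name = 'OBJ_ID_INDX';
```

A clustering factor near table_blocks indicates well-ordered rows; one near table_rows indicates scattered rows.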

Myth:
- Rebuilding the index can improve the CF. It cannot: a rebuild reorganises only the index, while the rows stay in the same table blocks.
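One way to check this on your own system is to rebuild the index and read the clustering factor again. The sketch below assumes the OBJ_ID_INDX index from the example; since the table rows are not moved, the value would be expected to stay the same as long as the table data has not changed in the meantime.

```sql
-- Rebuild only the index; the table rows are not moved.
alter index obj_id_indx rebuild;

-- Re-read the clustering factor after the rebuild.
select clustering_factor
from   user_indexes
where  index_name = 'OBJ_ID_INDX';
```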

How, then, can the CF be improved?

- To improve the CF, it is the table that must be rebuilt (and reordered).
- If the table has multiple indexes, careful consideration must be given to which index the table should be ordered by.
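As a rough illustration of reordering, the table can be copied with its rows sorted on the indexed column and the two clustering factors compared. This is only a sketch: the names sac_sorted and obj_id_sorted_indx are made up for the example, and on a real system you would also have to deal with constraints, grants, other indexes and the switch-over from the old table to the new one.

```sql
-- Copy the table with the rows physically stored in object_id order.
create table sac_sorted as
  select * from sac order by object_id;

create index obj_id_sorted_indx on sac_sorted(object_id);

exec dbms_stats.gather_table_stats(user, 'SAC_SORTED', cascade => true)

-- Compare the clustering factor of the original and the reordered copy.
select index_name, clustering_factor
from   user_indexes
where  index_name in ('OBJ_ID_INDX', 'OBJ_ID_SORTED_INDX');
```

Ordering the copy by one column helps only the index on that column; an index on a different column may end up with a worse clustering factor than before.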

Important point: The above is my interpretation of the subject after reading Jonathan Lewis's book on the optimizer.

18 comments:

Anonymous, May 22, 2008 at 9:02:00 PM GMT+5:30
Very nice way of explaining the clustering factor.

Anonymous, July 28, 2008 at 8:47:00 PM GMT+5:30
Explained in simple words... Thanks.

Anonymous, March 13, 2009 at 12:52:00 PM GMT+5:30
Thanks for explaining in a crystal clear way.

Kumar Madduri, May 4, 2009 at 11:19:00 AM GMT+5:30
Good and simple explanation.

Thank you
- Kumar

Anonymous, May 20, 2009 at 3:52:00 PM GMT+5:30
Hi,
What if the table has more than one index? By which column should the table be reordered?
Thanks.

Sachin, May 20, 2009 at 3:56:00 PM GMT+5:30
Thanks for your visit!

It is a judgment call. You need to know which columns are indexed and how heavily each index is used. Rebuilding and reordering the table for one index will not necessarily hurt the CF of another index. But if it does, you need to choose among the possible solutions depending on how much each column is used in SQL statements, their impact and so on.

Unknown, May 22, 2009 at 3:13:00 AM GMT+5:30
Very nice article to understand CF.

Thanks a lot.
--Ashwin

Unknown, June 11, 2009 at 12:41:00 PM GMT+5:30
Very good article in simple words. Now I have a clear idea about CF.

Prasad Karlekar, August 31, 2009 at 12:19:00 PM GMT+5:30
Really good explanation. Thanks a lot! And keep blogging.

Anonymous, June 14, 2010 at 6:26:00 AM GMT+5:30
Good explanation. Keep it up. Thank you.

Anonymous, July 25, 2010 at 2:31:00 AM GMT+5:30
Sachin, your explanation is clear and easy to understand. Thanks for the info.

-Sai

Unknown, August 1, 2010 at 2:46:00 AM GMT+5:30
Great article. Keep it up.

Anonymous, September 21, 2010 at 4:14:00 PM GMT+5:30
Good job Sachin!! Very helpful.

Anonymous, March 26, 2011 at 1:38:00 PM GMT+5:30
Very nice and simple explanation.
Nice blogs from you, Sachin.

Anonymous, October 26, 2013 at 1:35:00 PM GMT+5:30
Good one

sridhartempalle, April 7, 2014 at 11:06:00 AM GMT+5:30
Superb explanation in simple words; it helped me understand the topic very well. This was the third time I had tried to read about it, and here I understood it at the first attempt.

Anonymous, May 31, 2014 at 12:21:00 AM GMT+5:30
Very good explanation

Thank you
Anoosha