Microsoft Business Intelligence (Data Tools)|SQL - Queries Tuning and Optimization Techniques

Friday, May 6, 2016

SQL - Queries Tuning and Optimization Techniques

In SQL, it is very difficult to write complex SQL queries involving joins across many (at least 3-4) tables and involving several nested conditions because a SQL statement, once it reaches a certain level of complexity, is basically a little program in and of itself.

A database index is a data structure that improves the speed of operations on a database table. Indexes can be created using one or more columns of a database table, providing the basis for both rapid random look ups and efficient access of ordered records. Indexing is incredibly important when working with large tables, however, occasionally smaller tables should be indexed, if they are expected to grow.

Try to consistently indent and don't be afraid to use multiple lines. You don't have to write it all at once. Complex queries can sometimes just be a collection of simple queries. You need to follow some basic guidelines and Take the time to think these through such as-

List all of the columns that are to be returned

List all of the columns that are used in the WHERE clause

List all of the columns used in the JOINs (if applicable)

List all the tables used in JOINs (if applicable)

Get the correct records selected first

Save the complex calculations for last

If you do use a Common Table Expression (CTE), be aware that the query only persists until the next query is run, so in some cases where you are using the CTE in multiple queries, it might be better for performance to use a temp table.

Once you have the above information organized into this easy-to-comprehend form, it is much easier to identify those columns that could potentially make use of indexes when executed.

It makes a big difference to really understand how the data is combined, selected, filtered, and output. Here, query Optimization tricks comes into the picture to increase the performance of the program or software. There are a lot of guideline points to tune your query which do work as the boost of the query performance. These guideline points are mentioned below:

SET NOCOUNT ON at the beginning of each stored procedure you write. This statement should be included in every stored procedure, trigger, etc. that you write.

The SQL query becomes faster if you use the actual columns names in SELECT statement instead of than '*'.

HAVING clause is used to filter the rows after all the rows are selected if you are using aggregation functions. It is just like a filter. Do not use HAVING clause for any other purposes.

It is the best practice to avoid sub queries in your SQL statement and try to minimize the number of subquery block in your query if possible.

Use operator EXISTS, IN and table joins appropriately in your query. The reason is- Usually IN has the slowest performance

IN is efficient when most of the filter criteria are in the sub-query.

EXISTS is efficient when most of the filter criteria is in the main query.

Use EXISTS instead of DISTINCT when using joins which involves tables having one-to-many relationship.

Be careful while using conditions in WHERE clause.

To write queries which provide efficient performance follow the general SQL standard rules.

Use single case for all SQL verbs

Begin all SQL verbs on a new line

Separate all words with a single space

Right or left aligning verbs within the initial SQL verb

Indexes have the advantages as well as disadvantages as given below-

Do not automatically add indexes on a table because it seems like the right thing to do. Only add indexes if you know that they will be used by the queries run against the table.

Indexes should be measured on all columns that are frequently used in WHERE, ORDER BY, GROUP BY, TOP and DISTINCT clauses.

Do not add more indexes on your OLTP tables to minimize the overhead that occurs with indexes during data modifications.

Drop all those indexes that are not used by the Query Optimizer, generally.

If possible, try to create indexes on columns that have integer values instead of characters. Integer values use less overhead than character values.

To provide up-to-date statistics, the query optimizer needs to make smart query optimization decisions. You will generally want to leave the "Auto Update Statistics" database option on. This helps to ensure that the optimizer statistics are valid, ensuring that queries are properly optimized when they are run.

If you want to boost the performance of a query that includes an AND operator in the WHERE clause, consider the following:

Of the search criteria in the WHERE clause, at least one of them should be based on a highly selective column that has an index.

If at least one of the search criteria in the WHERE clause is not highly selective, consider adding indexes to all of the columns referenced in the WHERE clause.

If none of the columns in the WHERE clause are selective enough to use an index on their own, consider creating a covering index for this query.

Queries that include either the DISTINCT or the GROUP BY clauses can be optimized by including appropriate indexes. Any of the following indexing strategies can be used:

Include a covering, non-clustered index (covering the appropriate columns) of the DISTINCT or the GROUP BY clauses.

Include a clustered index on the columns in the GROUP BY clause.

Include a clustered index on the columns found in the SELECT clause.

Adding appropriate indexes to queries that include DISTINCT or GROUP BY is most important for those queries that run often.

When you need to execute a string of Transact-SQL, you should use the sp_executesql stored procedure instead of the EXECUTE statement.

When calling a stored procedure from your application, it is important that you call it using its qualified name.

Use stored procedures instead of views because they offer better performance and don't include code, variable or parameters that don't do anything.

If possible, avoid using SQL Server cursors. They generally use a lot of SQL Server resources and reduce the performance and scalability of your applications.

Instead of using temporary tables, consider using a derived table instead. A derived table is the result of using a SELECT statement in the FROM clause of an existing SELECT statement. By using derived tables instead of temporary tables, you can reduce I/O and often boost your application's performance.

Don't use the NVARCHAR or NCHAR data types unless you need to store 16-bit character (Unicode) data. They take up twice as much space as VARCHAR or CHAR data types, increasing server I/O and wasting unnecessary space in your buffer cache.

If you use the CONVERT function to convert a value to a variable length data type such as VARCHAR, always specify the length of the variable data type. If you do not, SQL Server assumes a default length of 30.

If you are creating a column that you know will be subject to many sorts, consider making the column integer-based and not character-based. This is because SQL Server can sort integer data much faster than character data.

Don't use ORDER BY in your SELECT statements unless you really need to, as it adds a lot of extra overhead. For example, perhaps it may be more efficient to sort the data at the client than at the server.

Don't return more data (both rows and columns) from SQL Server than you need to the client or middle-tier and then further reduce the data to the data you really need at the client or middle-tier. This wastes SQL Server resources and network bandwidth.

Conclusion

To tune our SQL queries, understanding our database does play the most important role. In SQL, typically each table column has an associated data type. Text, Integer, Varchar, Date, and more, are typically available types for developers to choose from. When writing SQL statements, make sure you choose the proper data type for the column. Sometimes it's easier to break up sub groups into their own select statement. To write a query, we need to know about the actual need of the query and scope of the query also.

Mukesh Singh

Workings as Technical Lead in Saviance Technologies on MSBI (SSRS, SSIS, SSAS and T-SQL with SQL Server 2005/2008 R2 / 2012 and SharePoint Server 2013, ERP Business application, Macola, ASP.net, C# and Web Services). Apart of this, SSRS integration with SharePoint Server 2013 and PowerShell.

32 comments:

Brajesh "BABU"July 19, 2016 at 5:44 AM
AWESOME
ReplyDelete
Replies
UnknownSeptember 2, 2016 at 1:18 AM
nice
ReplyDelete
Replies
UnknownSeptember 18, 2016 at 5:47 PM
Thank you for shared.
ReplyDelete
Replies
GrantedOctober 13, 2016 at 7:09 AM
Can you prove this:
The SQL query becomes faster if you use the actual columns names in SELECT statement instead of than '*'.
ReplyDelete
Replies
UnknownJanuary 24, 2017 at 10:59 AM
There are a lot of presumptions in these tips which are not always correct. For example an IN statement is as fast as a exists when the column is not nullable.
ReplyDelete
Replies
UnknownJanuary 29, 2017 at 6:09 AM
I have found that complex queries always involve multiple tables. To get the query correct, I always write a sub query against each table that gives me exactly the data I need, either to return or to select. Sometimes I even define these as views so they can be independently verified and used in multiple places. Then I combine the subqueries/views into a straightforward join query that gives me the data I need. I never worry about performance until a correct result is achieved. I review the query execution plan if needed to see if additional indices are needed.

I find the authors guidelines misleading because they do not include the use of views or the use of query execution plans and I find both of these to be essential to writing correct, performance queries.
ReplyDelete
Replies
Ram Babu SinghFebruary 18, 2017 at 2:27 PM
Very good article
ReplyDelete
Replies
Hamid SiamakiFebruary 23, 2017 at 6:19 PM
Many of the tips above is prescribing one solution for all, which is not correct.

Many of the tip can be proven to be wrong from query to query.

Optimization tips must be based on Execution Plan, not on Query. The tips above are based on Query.
ReplyDelete
Replies
Richard H. BlackApril 29, 2019 at 4:00 AM
Very useful post. This is my first time i visit here. I found so many interesting stuff in your blog especially its discussion. Really its great article. Keep it up. SEO optimalisatie
ReplyDelete
Replies
HalesMay 1, 2019 at 10:15 AM
You know your projects stand out of the herd. There is something special about them. It seems to me all of them are really brilliant! SEO
ReplyDelete
Replies
HalesMay 1, 2019 at 10:15 AM
You know your projects stand out of the herd. There is something special about them. It seems to me all of them are really brilliant! SEO
ReplyDelete
Replies
Riya RajMay 19, 2019 at 10:39 PM
Thanks for sharing such a most informative blog.... Waiting for the new updates...
Software Testing Course in Coimbatore
Software testing Training in coimbatore
best software testing training institute in coimbatore
software testing course in coimbatore with placement
Java Training in Bangalore
Java Training Institutes in Bangalore
AWS Training in Bangalore
data analytics courses in bangalore
Android Training in Coimbatore
ReplyDelete
Replies
marksonSeptember 16, 2019 at 2:38 AM
To inquire about the enormous measure of information we need an expert and all around experienced Data Scientist, to end up one of the specialists among the Data Scientist swarm, you should upgrade your range of abilities in huge information in a propelled manner.ExcelR Data Science Courses
ReplyDelete
Replies
olivermirSeptember 22, 2019 at 3:16 AM
Extraordinary review, I am a major adherent to remarking on online journals to advise the blog scholars realize that they've added something advantageous to the internet!. SEO in New York
ReplyDelete
Replies
datasciencecourseFebruary 6, 2020 at 8:38 PM
Great post i must say and thanks for the information. Education is definitely a sticky subject. However, is still among the leading topics of our time. I appreciate your post and look forward to more.

digital marketing course

For more info :
ExcelR - Data Science, Data Analytics, Business Analytics Course Training in Mumbai

304, 3rd Floor, Pratibha Building. Three Petrol pump, Opposite Manas Tower, LBS Rd, Pakhdi, Thane West, Thane, Maharashtra 400602
18002122120
ReplyDelete
Replies
HarryFebruary 19, 2020 at 10:07 PM
Amazing knowledge and I like to share this kind of information with my friends and hope they like it they why I do new york towing
ReplyDelete
Replies
HarryApril 23, 2020 at 4:40 AM
Gangaur Realtech is a professionally managed organisation specializing in real estate services where integrated services are provided by professionals to its clients seeking increased value by owning, occupying or investing in real estate. mobile cleaner light
ReplyDelete
Replies
HarryApril 27, 2020 at 2:14 AM
I don’t think many of websites provide this type of information. Bridal ring
ReplyDelete
Replies
FairyApril 28, 2020 at 12:06 AM
A debt of gratitude is in order for the blog entry amigo! Keep them coming... Magic mirror hire South west
ReplyDelete
Replies
FairyApril 28, 2020 at 12:06 AM
A debt of gratitude is in order for the blog entry amigo! Keep them coming... Magic mirror hire South west
ReplyDelete
Replies
Robert HudsonApril 30, 2020 at 1:10 PM
I read a article under the same title some time ago, but this articles quality is much, much better. How you do this.. day trading
ReplyDelete
Replies
FairyMay 5, 2020 at 2:21 AM
It is extremely nice to see the greatest details presented in an easy and understanding manner. Women's clothes
ReplyDelete
Replies
subhaJune 9, 2020 at 2:44 AM
Thanks for this wonderful blog. keep update more information about this.nice blog.
Ai & Artificial Intelligence Course in Chennai
PHP Training in Chennai
Ethical Hacking Course in Chennai Blue Prism Training in Chennai
UiPath Training in Chennai
ReplyDelete
Replies
hankstruckingJune 25, 2020 at 12:01 AM
Awesome article. Very interesting to read this newsletter.I would like to thanks for the efforts you had made for scripting this tremendous article. This text resolved my all queries...topsoil supplier |
ReplyDelete
Replies
hankstruckingJune 25, 2020 at 12:02 AM
great work thanks for sharing your experience..
topsoil services |
topsoil supplier |
ReplyDelete
Replies
Muhammad HassanAugust 14, 2020 at 3:31 AM
You make so many great points here that I read your article a couple of times. Your views are in accordance with my own for the most part. This is great content for your readers. https://cbtemailextractor.com/
ReplyDelete
Replies
rthwerAugust 29, 2020 at 5:00 AM
Thanks for taking the time to discuss this, I feel strongly about it and love learning more on this topic. If possible, as you gain expertise, would you mind updating your blog with extra information? It is extremely helpful for me. blog comment
ReplyDelete
Replies
Richard H. BlackAugust 31, 2020 at 11:12 AM
We have sell some products of different custom boxes.it is very useful and very low price please visits this site thanks and please share this post with your friends. Backlinks
ReplyDelete
Replies
Muhammad HassanSeptember 27, 2020 at 4:56 AM
When you use a genuine service, you will be able to provide instructions, share materials and choose the formatting style. Human Rights
ReplyDelete
Replies
HuongkvApril 12, 2021 at 1:13 AM
Mua vé tại đại lý vé máy bay Aivivu, tham khảo

mua ve may bay di my

vé máy bay hà nội hồ chí minh bamboo

vé máy bay đà nẵng đi hà nội

giá vé đi nha trang máy bay

vé máy bay đi quy nhơn bamboo

taxi sân bay nội bài

mua combo vinpearl phú quốc
ReplyDelete
Replies

Add comment

Friday, May 6, 2016

SQL - Queries Tuning and Optimization Techniques

32 comments:

Popular Posts