T-SQL PARTITION BY clause in SQL Server

The SQL Server PARTITION BY clause is a powerful feature that enables developers to split data into partitions and perform calculations or aggregations on each partition. This clause is commonly used with functions such as ROW_NUMBER(), RANK(), and DENSE_RANK().

Syntax

The syntax of the PARTITION BY clause is as follows:

SELECT column1, column2, ..., columnN, aggregate_function(columnX) OVER ( PARTITION BY partition_column1, partition_column2, ..., partition_columnN ORDER BY ordering_column1, ordering_column2, ..., ordering_columnN ) FROM table_name;

In this syntax, partition_column1, partition_column2, ..., partition_columnN specifies the columns by which the data will be partitioned. The aggregate_function(columnX) can be any SQL aggregate function such as COUNT(), SUM(), AVG(), etc., that is applied on the partitioned data.

The ORDER BY clause is optional and can be used to specify the order of the data within each partition.

Example

Let's consider an example to understand the usage of the PARTITION BY clause better. Suppose we have a table employees with the following data:

employee_id	department	salary
1	HR	50000
2	HR	60000
3	IT	70000
4	IT	80000
5	IT	90000
6	Sales	40000
7	Sales	55000
8	Sales	65000

Now, if we want to find the rank of each employee within their respective department based on their salary, we can use the following query:

SELECT employee_id, department, salary, RANK() OVER (PARTITION BY department ORDER BY salary DESC) as department_rank FROM employees;

This query will partition the data by the department column and then rank the employees within each department based on their salary in descending order.

The result of this query would be:

employee_id	department	salary	department_rank
2	HR	60000	1
1	HR	50000	2
5	IT	90000	1
4	IT	80000	2
3	IT	70000	3
8	Sales	65000	1
7	Sales	55000	2
6	Sales	40000	3

As we can see, the employees have been ranked based on their salary within their respective departments, which is exactly what we wanted to achieve.

In conclusion, the PARTITION BY clause is a powerful feature that can help developers partition data and perform complex calculations or aggregations on each partition. Its flexibility and ease of use make it an essential tool for any developer working with SQL Server.