What is the GROUP BY clause used for in SQL?

Updated on December 2, 2025

1 min read

The GROUP BY clause in SQL is used to group rows that have the same values in one or more columns and then perform aggregate operations (like SUM, COUNT, AVG, MAX, MIN) on each group. Essentially, it helps summarize data and generate meaningful insights from large datasets.

For example, consider a sales table:

sale_id	region	amount
1	East	100
2	West	150
3	East	200
4	West	50

If we want to calculate the total sales per region, we use:

🗃️

filename.sql

SELECT region, SUM(amount) AS total_sales
FROM sales
GROUP BY region;

Result:

region	total_sales
East	300
West	200

In a real-world scenario, I used GROUP BY to create monthly revenue reports. For example, grouping sales by month and product_category allowed management to quickly see which categories performed best each month.

A challenge I’ve faced is that columns in the SELECT clause must either be in the GROUP BY clause or be aggregated. Forgetting this often results in errors like “column must appear in the GROUP BY clause or be used in an aggregate function.” To resolve it, I ensured every non-aggregated column is included in GROUP BY.

Limitations:

GROUP BY can be performance-heavy on very large datasets, especially when multiple columns are grouped. Indexing helps but doesn’t eliminate the overhead completely.
It cannot be used for row-level filtering; for that, you need WHERE before grouping or HAVING after aggregation.

Alternatives or enhancements:

Use HAVING clause with GROUP BY to filter groups based on aggregate values:

🗃️

filename.sql

SELECT region, SUM(amount) AS total_sales
FROM sales
GROUP BY region
HAVING SUM(amount) > 250;

For complex analytics, use window functions (like SUM() OVER PARTITION BY) instead of GROUP BY when you need both grouped and detailed row-level data.

In summary, GROUP BY is essential for aggregation and summarization, making it a key tool in reporting, analytics, and data-driven decision-making.

Power BI Essentials

SQL

Data Analysis Expression (DAX)

Dart & Flutter

Python Programming

Business Analyst

App Developer

What is the GROUP BY clause used for in SQL?