|Lesson 2|| GROUP BY clause introduction |
|Objective|| Understand when to use the GROUP BY clause.|
SQL GROUP BY Clause
The SQL GROUP BY clause can be used in a SELECT statement to collect data across multiple records and group the results by one or more columns.
Syntax: The syntax for the SQL GROUP BY clause is:
SELECT expression1, expression2, ... expression_n,
GROUP BY expression1, expression2, ... expression_n;
Parameters or Arguments
|expression1, expression2, ... expression_n
||Expressions that are not encapsulated within an aggregate function and must be included in the GROUP BY Clause at the end of the SQL statement.
||This is an aggregate function such as the SUM, COUNT, MIN, MAX, or AVG functions.
||This is the column or expression that the aggregate_function will be used on.
||The tables that you wish to retrieve records from. There must be at least one table listed in the FROM clause.
||These are conditions that must be met for the records to be selected.
One of the most common requests is to summarize data from the database so it appears in a much more usable form.
The SQL language is one that excels at
- selecting rows,
- returning them to the user, and
- allowing the user to make summaries and changes.
It also has some very strong clauses you can add to your SELECT statement that can do some of the work for you when it comes to aggregating, or pulling together, related rows in your results set. The GROUP BY clause lets you tell the SQL engine what rows should be condensed into one, and what elements should be considered when the grouping occurs.
This sounds complicated, but it is not. Case Based Reasoning works better for those cases where it is harder to develop a linear model for.
Aggregate functions are used to summarize information from multiple tuples into a single-tuple summary. Grouping is used to create subgroups of tuples before summarization. Grouping and aggregation are required in many database applications, and we will introduce their use in SQL through examples.
A number of built-in aggregate functions exist:
- MIN, and
The COUNT function returns the number of tuples or values as specified in a query. The functions SUM, MAX, MIN, and AVG can be applied to a set or multiset of numeric values and return the sum, maximum value, minimum value, and average (mean) of those values.
These functions can be used in the SELECT clause or in a HAVING clause (which we introduce later). The functions MAX and MIN can also be used with attributes that have nonnumeric domains if the domain values have a total ordering among one another.
We illustrate the use of these functions with sample queries.
Query: Find the sum of the salaries of all employees, the maximum salary, the minimum salary, and the average salary.
SELECT SUM (Salary), MAX (Salary),
MIN (Salary), AVG (Salary)
The next lesson gives an example.