Set operations allow the results of multiple queries to be combined into a single result set. Set operators include UNION
, INTERSECT
, and EXCEPT
.
In SQL the UNION
clause combines the results of two SQL queries into a single table of all matching rows. The two queries must result in the same number of columns and compatible data types in order to unite. Any duplicate records are automatically removed unless UNION ALL
is used.
UNION
can be useful in data warehouse applications where tables aren't perfectly normalized. A simple example would be a database having tables sales2005
and sales2006
that have identical structures but are separated because of performance considerations. A UNION
query could combine results from both tables.
Note that UNION
does not guarantee the order of rows. Rows from the second operand may appear before, after, or mixed with rows from the first operand. In situations where a specific order is desired, ORDER BY
must be used.
Note that UNION ALL
may be much faster than plain UNION
.
Given these two tables:
Executing this statement:
yields this result set, though the order of the rows can vary because no ORDER BY
clause was supplied:
Note that there are two rows for Joe because those rows are distinct across their columns. There is only one row for Alex because those rows are not distinct for both columns.
UNION ALL
gives different results, because it will not eliminate duplicates. Executing this statement:
would give these results, again allowing variance for the lack of an ORDER BY
statement:
The discussion of full outer joins also has an example that uses UNION
.
The SQL INTERSECT
operator takes the results of two queries and returns only rows that appear in both result sets. For purposes of duplicate removal the INTERSECT
operator does not distinguish between NULLs
. The INTERSECT
operator removes duplicate rows from the final result set. The INTERSECT ALL
operator does not remove duplicate rows from the final result set.