Lesson 4	Deficiencies of rule-based SQL optimization
Objective	List the disadvantages of the rule-based optimizer.

Disadvantages of rule-based SQL optimization

As we discussed, there are cases where the rule-based optimizer fails. This is most common in cases where a table has many indexes and the rule-based optimizer fails to choose the best index to service a query. This generally happens because the rule-based optimizer is not aware of the number of distinct values in an index and the distribution of values within the index. Let us illustrate this point with an example shown in the following series of images

Circle	Label / Color	Quantity	Description
🔴 Red	`active`	50,000	All active employees
🟡 Yellow	`retired`	50,000	All retired employees
🔵 Blue	`other departments`	95,000	Employees in non-finance departments
🟢 Green	`finance`	5,000	Employees in the finance department
🟣 Overlap	`retired ∩ finance`	100	Retired employees who were in finance

SELECT
    Emp_name
FROM
    employee
WHERE
    department = 'FINANCE'
AND
    status = 3 /*+ Retired employee */;

✅ Extracted SQL Code from Image:
🔍 Analysis:
This query retrieves the names of employees who:

Belong to the FINANCE department
Have a status = 3, which (according to the comment) means retired

Note: The comment `/*+ Retired employee */` is *not* an optimizer hint; it is just a regular inline comment. If this were meant as a hint, it would require proper syntax like `/*+ INDEX(emp_status_idx) */`.
⚠️ Potential Optimization Concern:
Based on the previous diagram:

Only 100 of 100,000 employees meet both conditions.
If Oracle evaluates department = 'FINANCE' first (5,000 rows), and only then filters status = 3 (yields just 100), it's better than starting from all rows.
But ideally, the most selective predicate (status = 3) should be applied first, especially if there's an index on status.

SELECT STATEMENT
  SORT AGGREGATE
    SELECT BY ROWID EMPLOYEE
      NON-UNIQUE INDEX NON-SELECTIVE RANGE SCAN
        status_ix(status)

📘 Transcribed Commentary Text:

When you EXPLAIN this SQL with the rule-based optimizer, you'll see this plan.
The rule-based optimizer is choosing to scan through all 50,000 retired employees looking for the 100 that belong to the Finance department.
Obviously, the rule-based optimizer has made a less than optimal choice of indexes.

🔍 Analysis:

The plan shows that Oracle is using the `status_ix` index on the `status` column.
- This index access is labeled as:
- NON-UNIQUE INDEX
- NON-SELECTIVE RANGE SCAN
This means:
- The `status` column (with 50% of employees marked `retired`) is not selective.
- So the optimizer is scanning 50,000 rows for `status = 3` and then filtering by `department = 'FINANCE'`, resulting in just 100 matches.
This is inefficient compared to using an index on `department` (which would access only 5,000 rows).

💡 Key Optimization Lesson:

Rule-Based Optimizer (RBO) lacks the intelligence to choose the most selective index.
This example demonstrates why Oracle deprecated RBO—it does not consider data distribution (cardinality).
A Cost-Based Optimizer (CBO) would likely choose the department index first due to its higher selectivity (5% vs. 50%).

Optimizer	Index Used	Rows Scanned	Selectivity	Efficiency
Rule-Based	`status_ix(status)`	50,000 (retired)	50%	❌ Less efficient
Cost-Based	`dept_ix(department)`	5,000 (finance)	5%	✅ More efficient

What can we do about the problem outlined in the series of images above?

Possible Solutions

There are several remedies:

Add an index hint:
```
SELECT /*+ INDEX dept_ix */
```
Add a cost-based hint:
```
 
SELECT /*+ ALL_ROWS */
```
Invalidate the STATUS index.

Invalidating an index is a common trick that is used when the rule-based optimizer makes a bad choice of indexes.

Disabling unwanted indexes

Unwanted indexes can be disabled by mixing data type on the index. For example, we know that the status field is a numeric index, so we could concatenate a null character value to the predicate to invalidate it. Oracle will see the data type mismatch, and bypass that index. In the example below, we have concatenated a null character “” to the status predicate:

SELECT
Emp_name
FROM
employee
WHERE
department = ‘FINANCE’
AND
status = 3||0;

Now let us look at tricks for repositioning items in the FROM clause to improve the speed of rule-based SQL queries.