Oracle Optimization
Lecture 1

Introduction
Introduction into architecture of modern servers

Introduction

You should be very careful in optimization and the main criteria is "not do any harm" rather then "achieve spectacular improvements".

Remember:

The best performance first of all means avoiding blunders in installation and configuration. In a way it comes from the unnecessary work you don't do.
Much depends on the level of qualification of a particular DBA or system administrator.
The higher the level qualification is the more probably that actions taken will have positive, not negative effect.
Excessive zeal is another danger. Usually it backfires. The key here is measurement of performance. Good ideas without measurement of performance in optimization space are often very bad ideas.

The best performance first of all means avoiding blunders in installation and configuration. In a way it comes from the unnecessary work you don't do.

Much depends on the level of qualification of a particular DBA or system administrator.

The higher the level qualification is the more probably that actions taken will have positive, not negative effect.

Excessive zeal is another danger. Usually it backfires. The key here is measurement of performance. Good ideas without measurement of performance in optimization space are often turn to be useless or even bad ideas.

There is no free lunch and the more optimized system is, the more specialized for a particular application it became; as a result any changes in application can disproportionally affect performance. We can categorized the effects of performance tuning (aka optimization) in two categories:

Performance impact. Performance impact or benefit explains the level of potential performance gain by performing a specific tuning action:
- Low: Single digits improvement
- Medium: Between 10% to 33% improvement
- High: 33% to 100% improvement
- Dramatic: Over 100% improvement. several time or order of magnitude performance improvement
Risk. There is no free lunch and each tuning action entails some risk including the risk that some tuning actions may actually degrade performance. The tuning risks can be categorized as:
- Zero This is not expected to cause any problems.
- Low Safe change to make
- Medium You need to check that the game is worth the candles
- High This can cause problems or even reduce performance in some cases different from those you are optimizing. You are normally not expected to use these options since involve a large risk to stability and stability is more important factor then performance. However, they may be used if:
  - You have tried everything else and performance is still inadequate.
  - You fully understand what this tuning option do.
  - You run the benchmark and see such a dramatic improvement of performance that justifies the risk.

There are five major areas of optimization:

Hardware optimization (there is a difference between using 15K RPM disks and dual channel controller with battery backup and 7200RPM disks with primitive controller. The most basic optimization is selection of the system that has higher transaction benchmark (TCP, for example TCP-C). Solid state disk can make dramatic improvement if the database is mainly used for reading. SAN are necessary for large databases (not only from performance standpoint, but also backup and recovery standpoint).
Operating system based optimization (generally requires careful measurements, for example using D-trace on Solaris)
Database engine level optimization (for example Oracle is highly tunable and has multiple performance measurement tools)
Database schema and SQL-level optimization
Application level optimization . Here the most dramatic improvements are possible; that's why open source applications can generally beat closed-source application; but understanding the application on the level that makes possible its tuning is often limited by the brainpower available; this is especially true for complex, off-the-shelf applications)

As we go down the list we generally can get higher and higher returns on the efforts. But risks also generally increase. Also not all application are open source so application level optimization if often limited to the contains of the particular implementation.

The level of optimization available to the organization usually are dependent of the qualification of the staff. The higher the qualification of staff the more levels of optimization are available and the more is the potential return.

Here is a relevant quote from Oracle database Performance Tuning FAQ - Oracle FAQ

Consider the following areas for tuning. The order in which steps are listed needs to be maintained to prevent tuning side effects. For example, it is no good increasing the buffer cache if you can reduce I/O by rewriting a SQL statement.

Database Design (if it's not too late): Poor system performance usually results from a poor database design. One should generally normalize to the 3NF. Selective denormalization can provide valuable performance improvements. When designing, always keep the "data access path" in mind. Also look at proper data partitioning, data replication, aggregation tables for decision support systems, etc.

Application Tuning: Experience showed that approximately 80% of all Oracle system performance problems are resolved by coding optimal SQL. Also consider proper scheduling of batch tasks after peak working hours.

Memory Tuning: Properly size your database buffers (shared_pool, buffer cache, log buffer, etc) by looking at your wait events, buffer hit ratios, system swapping and paging, etc. You may also want to pin large objects into memory to prevent frequent reloads.

Disk I/O Tuning: Database files needs to be properly sized and placed to provide maximum disk subsystem throughput. Also look for frequent disk sorts, full table scans, missing indexes, row chaining, data fragmentation, etc.

Eliminate Database Contention: Study database locks, latches and wait events carefully and eliminate where possible.

Tune the Operating System: Monitor and tune operating system CPU, I/O and memory utilization. For more information, read the related Oracle FAQ dealing with your specific operating system.

Database sizes

Small databases: up to 16G
Medium databases: up to 64G
Large databases: up to several petabytes

[edit]

What tools/utilities does Oracle provide to assist with performance tuning?

Oracle provide the following tools/ utilities to assist with performance monitoring and tuning:

ADDM (Automated Database Diagnostics Monitor) introduced in Oracle 10g

TKProf

Statspack

Oracle Enterprise Manager - Tuning Pack (cost option)

Old UTLBSTAT.SQL and UTLESTAT.SQL - Begin and end stats monitoring

[edit]

When is cost based optimization triggered?

It's important to have statistics on all tables for the CBO (Cost Based Optimizer) to work correctly. If one table involved in a statement does not have statistics, and optimizer dynamic sampling isn't performed, Oracle has to revert to rule-based optimization for that statement. So you really want for all tables to have statistics right away; it won't help much to just have the larger tables analyzed.

Generally, the CBO can change the execution plan when you:

Change statistics of objects by doing an ANALYZE;

Change some initialization parameters (for example: hash_join_enabled, sort_area_size, db_file_multiblock_read_count).

[edit]

How can one optimize %XYZ% queries?

It is possible to improve %XYZ% (wildcard search) queries by forcing the optimizer to scan all the entries from the index instead of the table. This can be done by specifying hints.

If the index is physically smaller than the table (which is usually the case) it will take less time to scan the entire index than to scan the entire table.

[edit]

Where can one find I/O statistics per table?

The STATSPACK and UTLESTAT reports show I/O per tablespace. However, they do not show which tables in the tablespace has the most I/O operations.

The $ORACLE_HOME/rdbms/admin/catio.sql script creates a sample_io procedure and table to gather the required information. After executing the procedure, one can do a simple SELECT * FROM io_per_object; to extract the required information.

For more details, look at the header comments in the catio.sql script.

[edit]

My query was fine last week and now it is slow. Why?

The likely cause of this is because the execution plan has changed. Generate a current explain plan of the offending query and compare it to a previous one that was taken when the query was performing well. Usually the previous plan is not available.

Some factors that can cause a plan to change are:

Which tables are currently analyzed? Were they previously analyzed? (ie. Was the query using RBO and now CBO?)

Has OPTIMIZER_MODE been changed in INIT<SID>.ORA?

Has the DEGREE of parallelism been defined/changed on any table?

Have the tables been re-analyzed? Were the tables analyzed using estimate or compute? If estimate, what percentage was used?

Have the statistics changed?

Has the SPFILE/ INIT<SID>.ORA parameter DB_FILE_MULTIBLOCK_READ_COUNT been changed?

Has the INIT<SID>.ORA parameter SORT_AREA_SIZE been changed?

Have any other INIT<SID>.ORA parameters been changed?

What do you think the plan should be? Run the query with hints to see if this produces the required performance.

[edit]

Does Oracle use my index or not?

One can use the index monitoring feature to check if indexes are used by an application or not. When the MONITORING USAGE property is set for an index, one can query the v$object_usage to see if the index is being used or not. Here is an example:
SQL> CREATE TABLE t1 (c1 NUMBER);
Table created.

SQL> CREATE INDEX t1_idx ON t1(c1);
Index created.

SQL> ALTER INDEX t1_idx MONITORING USAGE;
Index altered.

SQL>
SQL> SELECT table_name, index_name, monitoring, used FROM v$object_usage;
TABLE_NAME                     INDEX_NAME                     MON USE
------------------------------ ------------------------------ --- ---
T1                             T1_IDX                         YES NO

SQL> SELECT * FROM t1 WHERE c1 = 1;
no rows selected

SQL> SELECT table_name, index_name, monitoring, used FROM v$object_usage;
TABLE_NAME                     INDEX_NAME                     MON USE
------------------------------ ------------------------------ --- ---
T1                             T1_IDX                         YES YES
To reset the values in the v$object_usage view, disable index monitoring and re-enable it:
ALTER INDEX indexname NOMONITORING USAGE;
ALTER INDEX indexname MONITORING   USAGE;
[edit]

Why is Oracle not using the damn index?

This problem normally only arises when the query plan is being generated by the Cost Based Optimizer (CBO). The usual cause is because the CBO calculates that executing a Full Table Scan would be faster than accessing the table via the index. Fundamental things that can be checked are:

USER_TAB_COLUMNS.NUM_DISTINCT - This column defines the number of distinct values the column holds.

USER_TABLES.NUM_ROWS - If NUM_DISTINCT = NUM_ROWS then using an index would be preferable to doing a FULL TABLE SCAN. As the NUM_DISTINCT decreases, the cost of using an index increase thereby making the index less desirable.

USER_INDEXES.CLUSTERING_FACTOR - This defines how ordered the rows are in the index. If CLUSTERING_FACTOR approaches the number of blocks in the table, the rows are ordered. If it approaches the number of rows in the table, the rows are randomly ordered. In such a case, it is unlikely that index entries in the same leaf block will point to rows in the same data blocks.

Decrease the INIT<SID>.ORA parameter DB_FILE_MULTIBLOCK_READ_COUNT - A higher value will make the cost of a FULL TABLE SCAN cheaper.

Remember that you MUST supply the leading column of an index, for the index to be used (unless you use a FAST FULL SCAN or SKIP SCANNING).

There are many other factors that affect the cost, but sometimes the above can help to show why an index is not being used by the CBO. If from checking the above you still feel that the query should be using an index, try specifying an index hint. Obtain an explain plan of the query either using TKPROF with TIMED_STATISTICS, so that one can see the CPU utilization, or with AUTOTRACE to see the statistics. Compare this to the explain plan when not using an index.

[edit]

When should one rebuild an index?

You can run the ANALYZE INDEX <index> VALIDATE STRUCTURE command on the affected indexes - each invocation of this command creates a single row in the INDEX_STATS view. This row is overwritten by the next ANALYZE INDEX command, so copy the contents of the view into a local table after each ANALYZE. The 'badness' of the index can then be judged by the ratio of 'DEL_LF_ROWS' to 'LF_ROWS'.

[edit]

How does one tune Oracle Wait event XYZ?

Here are some of the wait events from V$SESSION_WAIT and V$SYSTEM_EVENT views:

db file sequential read: Tune SQL to do less I/O. Make sure all objects are analyzed. Redistribute I/O across disks.

buffer busy waits: Increase DB_CACHE_SIZE (DB_BLOCK_BUFFERS prior to 9i)/ Analyze contention from SYS.V$BH

log buffer space: Increase LOG_BUFFER parameter or move log files to faster disks

[edit]

What is the difference between DBFile Sequential and Scattered Reads?

Both "db file sequential read" and "db file scattered read" events signify time waited for I/O read requests to complete. Time is reported in 100's of a second for Oracle 8i releases and below, and 1000's of a second for Oracle 9i and above. Most people confuse these events with each other as they think of how data is read from disk. Instead they should think of how data is read into the SGA buffer cache.

db file sequential read:

A sequential read operation reads data into contiguous memory (usually a single-block read with p3=1, but can be multiple blocks). Single block I/Os are usually the result of using indexes. This event is also used for rebuilding the controlfile and reading datafile headers (P2=1). In general, this event is indicative of disk contention on index reads.

db file scattered read:

Similar to db file sequential reads, except that the session is reading multiple data blocks and scatters them into different discontinuous buffers in the SGA. This statistic is NORMALLY indicating disk contention on full table scans. Rarely, data from full table scans could be fitted into a contiguous buffer area, these waits would then show up as sequential reads instead of scattered reads.

The following query shows average wait time for sequential versus scattered reads:
prompt "AVERAGE WAIT TIME FOR READ REQUESTS"
select a.average_wait "SEQ READ", b.average_wait "SCAT READ"
from   sys.v_$system_event a, sys.v_$system_event b
where  a.event = 'db file sequential read'
and    b.event = 'db file scattered read';

Database engines requirements for key subsystems of modern servers

The database server's primary function is to store, search, retrieve, and update data from disk. Examples of Database engines include IBM DB2, Microsoft SQL Server, and Oracle. Due to the high number of random I/O requests that database servers are required to do and the computation intensive activities that occur, the potential areas that have the most impact on performance are:

Memory subsystem
Disk
Processor
Network

A balanced system is especially important, for example, if adding additional CPUs, consider upgrading other subsystems such as increasing memory and ensuring that disk resources are adequate. The key subsystems that influence database performance in the servers are:

Processor and cache. In the majority of server installations, the CPU is, in fact, over-specified while the other subsystems are under-specified. It is only specific applications that are truly CPU intensive that take advantage of the full power of today's multi-core and 64-bit processors. Cache, while strictly part of the memory subsystem, is physically packaged with the processor these days. The CPU and cache are coupled together tightly and run at full or half the speed of the processor.
Processing power can be an important factor for database servers because some database queries and update operations require intensive CPU time. The database replication process also requires considerable amounts of CPU cycles.

Database servers are multi-threaded applications. So, SMP-capable systems provide improved performance scaling to 16-way and beyond. L2 cache size is also important due to the high hit ratio-the proportion of memory requests that fill from the much faster cache instead of from memory. For example, SQL Server's L2 cache hit ratio approaches 90%.
PCI bus. Servers use the PCI bus (PCI-X and PCI Express) for various cards. High-end servers now have multiple PCI buses and many more PCI slots than they used to. Advances in the PCI bus include the PCI Express (PCI-E) 1X to 16X technologies, which provide greater throughput and connectivity options. Connecting to the CPU and cache is the PCI chipset. This set of components governs the connections between the PCI bus and the processor and memory subsystems. The PCI chipset is carefully matched and tuned to the processors and memory to ensure the maximum performance of the system.
Memory subsystem. Critical to a server's performance is memory as memory is used a cashe for data during the oprations. If the server does not have sufficient memory, paging occurs, which results in excessive disk I/O, which in turn generates additional latencies. Sufficient memory is required for both the operating system and the database engine. You need to consider this when sizing database servers. Without enough memory installed, the system will perform poorly because the operating system will swap data to disk when it needs to make room for other data in memory. Even with sufficient memory, most database servers will perform large amounts of disk I/O to bring data records into memory and flush modified data to disk. The disk subsystem needs to be well designed to distribute this workload on multiple disks in order not to be a potential bottleneck.

Memory speed is very important (the faster the better). Current X86 servers can use 667MHz DIMMs. New memory technologies include the Fully Buffered DIMMs (FBD), providing higher capacity and bandwidth, improved flexibility and memory mirroring.
Disk subsystem. The disk subsystem is often critical to a server's performance. RAID is commonly employed in server configurations. With most database applications, more drives equals greater performance. The configuration of the RAID arrays also can make a significant difference in the performance characteristics. It is also important to keep your logs files on different disks to your database. It is important that disk used the top rotating speed available (15K RPM). If this is a budget server consider using a pair of mirrored drives of higher speed (15K RPM) for indexes. Even when using SAN devices for storage, you need to pay particular attention to Fibre channel network and SAN configuration to ensure that the storage environment does not place constraints on the server.
Network subsystem. 1 Gbps network adapters are now commonplace in servers. New 10 Gbps network are now available to provide the necessary bandwidth for high-throughput applications. Moreover, new technologies such as TCP Offload Engine help improve performance. The networking subsystem tends to be the least important component on an application or database server because the amount of data returned to the client is a small subset of the total database. The network can be important, however, if the application and the database are on separate servers.

There are also several subsystems that doesn't affect server performance. one is the video subsystem which in a server is relatively insignificant.

=== End of lecture ===

Oracle Optimization Lecture 1

Introduction

Database sizes

What tools/utilities does Oracle provide to assist with performance tuning?

When is cost based optimization triggered?

How can one optimize %XYZ% queries?

Where can one find I/O statistics per table?

My query was fine last week and now it is slow. Why?

Does Oracle use my index or not?

Why is Oracle not using the damn index?

When should one rebuild an index?

How does one tune Oracle Wait event XYZ?

What is the difference between DBFile Sequential and Scattered Reads?

Database engines requirements for key subsystems of modern servers

Oracle Optimization
Lecture 1