Incremental statistics, are you doing it right?

When Oracle 11gR1 was released, an improvement was introduced that allows to drastically reduce the time it takes to calculate the statistics of partitioned tables: incremental statistics.

With each new version of Oracle Server this functionality has been improved, but due to compatibility issues it does not apply by default to the tables we already had configured prior to each upgrade, so it is likely that you currently have tables that make use of its full potential and also some that do not.

Is this your case? Read on and find out!

The requirements

First, in order to make use of incremental statistics, some parameters must be set for the partitioned table:

PropertyValue
incrementaltrue
publishtrue
estimate_percentdbms_stats.auto_sample_size
granularityauto

For this we must make calls to DBMS_STATS.set_table_prefs, to modify each of the properties.

The Magic

Instead of having to process all partitions for the statistics calculation, statistics are now calculated for each partition, and these are aggregated in order to have the global statistics of the table.

To achieve this, a new element is used: SYNOPSIS, which is a summary of the statistical data for each partition, such as the number of distinct values in the columns (NDV).

This data is stored in 2 tables: wri$_optstat_synopsis_head$ y wri$_optstat_synopsis$.

The problem

While everything works fine as it is, the initial algorithm—called ADAPTIVE SAMPLING—has the side effect that millions of rows will be recorded in the table wri$_optstat_synopsis$, and may become the first consumer of space in the tablespace. SYSAUX.

Depending on the number of partitions, the number of rows, the number of columns, etc., this table can occupy tens or even hundreds of gigabytes!

SQL> @?/rdbms/admin/awrinfo.sql

********************************************************
(1b) SYSAUX occupants space usage (v$sysaux_occupants)
********************************************************
|
| Occupant Name        Schema Name               Space Usage
| -------------------- -------------------- ----------------
| SM/OPTSTAT           SYS                       25,317.4 MB   <<<-------
| SM/AWR               SYS                          420.1 MB
| SM/ADVISOR           SYS                          351.2 MB

The solution

It was with Oracle 12.2 that Oracle decided to introduce a new algorithm: HYPERLOGLOG, which offers a shorter time for the processing of the SYNOPSES, but what is even better: it is no longer necessary to use the table wri$_optstat_synopsis$, and with it all the millions of rows and tens of gigabytes to store them.

This new algorithm will only be used with new implementations of incremental statistics for partitioned tables. All tables that you already had configured before upgrading to Oracle 12.2 will continue to use the previous algorithm, with all its disadvantages.

Most likely you will want to start using the new algorithm right now, and it turns out that it is not complicated at all, but it will require a recalculation of the statistics, and that already implies time and resource usage, so a good planning is advisable.

Based on my experience, I recommend you follow these steps and you will achieve this with the least possible impact:

  1. If you have no problem rebuilding the statistics all at once for the entire table and its partitions:

Set properties

BEGIN
  DBMS_STATS.set_table_prefs( 
    ownname => 'SH', 
    tabname => 'SALES', 
    pname   => 'APPROXIMATE_NDV_ALGORITHM', 
    pvalue  => 'HYPERLOGLOG'
  );
  
  DBMS_STATS.set_table_prefs( 
    ownname => 'SH', 
    tabname => 'SALES', 
    pname   => 'INCREMENTAL_STALENESS', 
    pvalue  => 'NULL'
  );
END;
/   

Be careful to use ‘NULL’ and not NULL.

1

2

Delete synopses

BEGIN
  DBMS_STATS.delete_table_stats(
    ownname => 'SH', 
    tabname => 'SALES', 
    stat_category => 'SYNOPSES'
  );
END;
/

Regather statistics

BEGIN
  DBMS_STATS.gather_table_stats (
    ownname => 'SH',
    tabname => 'SALES'
  );
END;
/

3

  1. If your thing is to take it easy and rebuild stats gradually:

Set properties

BEGIN
  DBMS_STATS.set_table_prefs( 
    ownname => 'SH', 
    tabname => 'SALES', 
    pname   => 'APPROXIMATE_NDV_ALGORITHM', 
    pvalue  => 'HYPERLOGLOG'
  );
  
  DBMS_STATS.set_table_prefs( 
    ownname => 'SH', 
    tabname => 'SALES', 
    pname   => 'INCREMENTAL_STALENESS', 
    pvalue  => 'ALLOW_MIXED_FORMAT'
  );
END;
/   

ALLOW_MIDEX_FORMAT allows the simultaneous existence of partitions with SYNAPSES created with the algorithm ADAPTIVE SAMPLING and others created with the algorithm HYPERLOGLOG.

1

2

Regather stats partition by partition

BEGIN
  FOR part IN ( SELECT partition_name 
                  FROM dba_tab_partitions 
                 WHERE table_owner = 'SH' 
                   AND table_name = 'SALES' 
                 ORDER BY 
                       partition_name )
  LOOP
    DBMS_STATS.delete_table_stats(
      ownname => 'SH', 
      tabname => 'SALES', 
      partname => part.partition_name, 
      stat_category => 'SYNOPSES'
    );

    DBMS_STATS.gather_table_stats(
      ownname     => 'SH',
      tabname     => 'SALES',
      partname    => part.partition_name, 
      granularity => 'PARTITION'
    );
  END LOOP;
END
/

Here I run the recalculation partition by partition, but you could process a group of partitions today, another one tomorrow and so on, until there is no partition left without stats regathering.

Regather global stats

BEGIN
  DBMS_STATS.gather_table_stats (
    ownname => 'SH',
    tabname => 'SALES'
  );
END;
/

3

Conclusions

The introduction of incremental statistics for partitioned tables has allowed us to have up-to-date statistics with low resource consumption.

Over time, improvements have been made to the algorithms used for incremental statistics, which by default are applied to new objects.

Do not lose time and check the situation of your partitioned tables, and if so, plan the rebuild of their statistics with the new HYPERLOGLOG algorithm, and let us know how it went.

To complement the above, I recommend that you read the following notes:

1417133.1How To Collect Statistics On Partitioned Table
814710.1SYSAUX Tablespace Fills Rapidly due to Increased WRI$_OPTSTAT_SYNOPSIS$ Size
2916829.1How to rebuild Incremental Statistics with HYPERLOG

Did you find this article interesting, did you have any doubts, do you want to suggest a topic to cover, leave me your comments or contact me me right now!

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Related Posts

Hide column contents
Implement the masking of column contents, dynamically and without having to modify your applications.
Register ACFS file system fails
Learn how to register an ACFS filesystem after migrating the Disk Group containing it to a new Oracle Cluster.
if there is an unusable index when analyze is executed, expect ORA-1502
How an error caused by running analyze to a table with unusable indexes is solved with regular expressions and dbms_stats.

Need Help?

Fill in these details and I will be in touch as soon as possible.