Analyze Tables Component

Analyze Tables Component

This component allows you to analyze a list of tables in a single operation. Entire tables are analyzed, not individual columns. 'Analyze' in this context refers to the Amazon Redshift ANALYZE statement that rebuilds statistical metadata for a given table and allows for more optimized queries. You may wish to do this after modifying some tables earlier in the orchestration flow that had existing data in them, but the changes may have modified the table statistics considerably and you wish to analyze the data distribution in order to optimize queries on those tables.



Note: Analyze is often not required at all when a table is truncated and reloaded which is preferred to DML updates.

Note: External Tables cannot be analyzed.

Properties

Property Setting Description
Name Text The descriptive name for the component.
Schema Select Select the table schema. The special value, [Environment Default] will use the schema defined in the environment. For more information on using multiple schemas, see this article.
Tables to Analyze Select One or more tables to Analyze.

Strategy

Generates a single Analyze statement, listing the chosen tables.

Example

In this example, we have placed an Analyze Tables component at the end of a long Orchestration job that has resulted in many changes to airport and plane data tables.

The properties of the Analyze Tables component are shown below.

Here we wish to use the default environment and we can select the 'Tables to Analyze' via the dialog shown below. Tables moved to the right-hand column are the ones that will be analyzed.

When this job is run, it will analyze the three tables we have selected and build new metadata. Queries run against these tables can then be properly optimized.