Data Summary Functions

Frequency Distribution

Required Parameters:

  • Target Collection
  • Target Data Point
  • Number of Classes
  • Result Name

Description:

The Frequency Distribution action segregates values in the Target Data Point into a selected Number of Classes, calculates the following Data Points, and saves them as a new Working Data collection.

  • Class : String
  • Count : Int32
  • Percent Total : Double
  • Lower Limit : Double

How to Perform a Frequency Distribution
Step Description

1

Select the Working Data collection that contains the target Data Point.

Note - We will be using the Product Sales Sample Dataset as our Target Collection.

product sales sample dataset

2

Select the Target Data Point.

3

Enter a Number of Classes.

4

Enter a Result Name.

frequency-distribution-how-to-1

5

Click OK.

Running the action...

frequency-distribution-how-to-2

Describe Data Function

Required Parameters:

  • Target Collection
  • Result Collection

Description:

The Describe Data Function action creates a new Working Data collection that contains the following measurements for each numeric Data Point in the Target Collection:

  • Count
  • Sum
  • Min
  • Max
  • Range
  • Mean
  • Median
  • Mode
  • FirstQTL
  • ThirdQTL
  • IQR
  • VAR
  • STD
  • SumError
  • SumErrSQD
  • Skewness

How to Run the Describe Data Function
Step Description

1

Select the Working Data collection that contains the target Data Points.

Note - We will be using the Product Sales Sample Dataset as our Target Collection.

product sales sample dataset

2

Enter a name for the new Result Collection.

describe-data-how-to-1

3

Click OK.

Running the action...

describe-data-how-to-2

Data Quality Check (!!!)

Required Parameters:

  • Working Data
  • Data Point Name (at least 1)

Description:

The Remove Data Points action allows the user to remove one or more Data Points from a Working Data collection. The user must first have a Working Data set in their workflow.

How to Perform a Data Quality Check
Step Description

1

Select the Working Data collection that contains the Data Point(s) you wish to remove.

2

Check the Data Point(s) you wish to remove.

Checking Select All will check all Data Points contained within the Working Data collection.

3

Click Add arrow to push the Data Point(s) to the right side.

4

Click OK.

Profile Blank Data

Required Parameters:

  • Target Collection
  • Result Collection

Description:

The Profile Blank Data action creates a new Working Data collection that contains the following information for each Data Point in the Target Collection:

  • Missing or Null : Decimal
  • Count of Items : Decimal
  • Percent of Total : Decimal

How to Create a Blank Data Profile
Step Description

1

Select the Working Data collection that contains the target Data Points.

Note - We will be using the Product Sales Sample Dataset as our Target Collection.

product sales sample dataset

2

Enter a name for the new Result Collection.

profile-blank-data-how-to-1

3

Click OK.

Running the action...

profile-blank-data-how-to-2

Language Summary

Required Parameters:

  • Working Data
  • Operation
  • Set N
  • Data Point (at least 1)
  • Result

Description:

The Language Summary action computes the count of each N-Gram contained within one or more Data Points and stores this information in a new Working Data collection

How to Perform an N-Gram Count
Step Description

1

Select the Working Data collection that contains the target Data Points.

Note - We will be using the Sectors Industries Sample Dataset as our Target Collection.

sector industries sample dataset

2

Set the Operation to Compute N Gram Counts.

3

Set a value for N.

4

Select a Data Point and click the green (+) to add the values within that Data Point to the text corpus of the action.

language-summary-how-to-1

5

Enter a name for the Result collection.

6

Click OK.

Running the action...

language-summary-how-to-2

List Data Points

Required Parameters:

  • Target Collection
  • Result Collection

Description:

The List Data Points action creates a new Working Data collection that contains the following information for each Data Point in the Target Collection:

  • Data Point Name : String
  • Data Point Type : String
  • Collection Name : String

How to List the Data Points in a Data Collection
Step Description

1

Select the Working Data collection that contains the target Data Points.

Note - We will be using the Product Sales Sample Dataset as our Target Collection.

product sales sample dataset

2

Enter a name for the new Result Collection.

list-data-points-how-to-1

3

Click OK.

Running the action...

list-data-points-how-to-2

Contains Collection

Required Parameters:

  • Collection
  • Result Variable

Description:

The Contains Collection action determines if the specified Working Data collection is loaded in the Working Data Container and stores the result as a Boolean Variable.

How to Determine if a Data Collection Exists
Step Description

1

Select the Contains Collection Operation.

2

Enter the name of a Working Data collection to search for.

Note - We will be using the Product Sales Sample Dataset as our Target Collection.

product sales sample dataset

3

Enter a name for the Result Variable.

contains-collection-how-to-1

4

Click OK.

Running the action...

contains-collection-how-to-2

Contains HyperCube

Required Parameters:

  • Collection
  • Result Variable

Description:

The Contains HyperCube action determines if the specified HyperCube is loaded in the Working Data Container and stores the result as a Boolean Variable.

How to Determine if a HyperCube Exists
Step Description

1

Select the Contains HyperCube Operation.

2

Enter the name of a HyperCube to search for.

3

Enter a name for the Result Variable.

4

Click OK.

Contains Data Point

Required Parameters:

  • Collection
  • Data Point
  • Result Variable

Description:

The Contains Data Point action determines if the specified Data Point exists in a Working Data collection and stores the result as a Boolean Variable.

How to Determine if a Data Collection Contains a Data Point
Step Description

1

Select the Contains Data Point Operation.

2

Enter the name of a Working Data collection to search in.

3

Enter the name of a Data Point to search for.

4

Enter a name for the Result Variable.

5

Click OK.

Contains Variable

Required Parameters:

  • Data Point
  • Result Variable

Description:

The Contains Variable action determines if the specified Variable exists and stores the result as a Boolean Variable.

How to Determine if a Variable Exists
Step Description

1

Select the Contains Variable Operation.

2

Enter the name of a Variable to search for (in the Data Point field).

3

Enter a name for the Result Variable.

4

Click OK.

List Data Points For Type

Required Parameters:

  • Collection Name
  • Result Collection Name
  • Selected Type(s) (at least 1)

Description:

The List Data Points For Type action creates a new Working Data collection that lists the Data Points of Selected Type(s) contained in a specified Working Data collection.

How to List the Data Points of a Selected Type Contained in a Data Collection
Step Description

1

Select the List Data Points For Type Operation.

2

Select the Working Data collection to search in.

Note - We will be using the Product Sales Sample Dataset as our Target Collection.

product sales sample dataset

3

Enter a name for the Result Collection.

4

Select a Data Point Type and click the green (+) to add the Selected Type to the action.

list-data-points-for-type-how-to-1

5

Click OK.

Running the action...

list-data-points-for-type-how-to-2

List HyperCube Grouping Keys

Required Parameters:

  • Collection
  • Result Collection

Description:

The List HyperCube Grouping Keys action creates a new Working Data collection that lists the Grouping Keys of a HyperCube.

Note - The Grouping Keys of a HyperCube are the columns displayed when viewing the HyperCube.

How to List the Grouping Keys of a HyperCube
Step Description

1

Select the List HyperCube Grouping Keys Operation.

2

Select the HyperCube to search in.

Note - We will be using the Product Sales HyperCube which was created from the Product Sales Sample Dataset using the following dimensions:

  • Segment
  • Country
  • Product

product sales sample dataset
product-sales-hypercube

3

Enter a name for the Result Collection.

list-hypercube-grouping-keys-how-to-1

4

Click OK.

Running the action...

list-hypercube-grouping-keys-how-to-2

Get Data Point Info

Required Parameters:

  • Collection
  • Data Point
  • Result Collection

Description:

The Get Data Point Info action returns the Data Point Info of a specified Data Point contained in a Working Data collection.

How to Get the Info of a Data Point
Step Description

1

Select the Get Data Point Info Operation.

2

Select the Working Data collection that contains the targeted Data Point.

Note - We will be using the Product Sales Sample Dataset as our Target Collection.

product sales sample dataset

3

Select the targeted Data Point.

4

Enter a name for the Result Collection.

get-data-point-info-how-to-1

5

Click OK.

Running the action...

get-data-point-info-how-to-2

Bin Data Items

Required Parameters:

  • Target Collection
  • Bin Interval OR Var
  • Result Data Point

Description:

The Bin Data Items action creates a new Data Point that serves to bin, or group, a selected interval of rows within a Working Data collection.

How to Bin Items in a Data Collection
Step Description

1

Select a Working Data collection.

Note - We will be using the Product Sales Sample Dataset as our Target Collection.

product sales sample dataset

2

Enter a discrete Bin Interval.

OR

Check Var and select a dynamic Variable.

3

Enter a name for the new Bin Data Point.

bin-data-items-how-to-1

4

Click OK.

Running the action...

bin-data-items-how-to-2

Correlation Analysis

Required Parameters:

  • Collection Name
  • Response Data Point
  • Result Collection

Description:

The Correlation Analysis action calculates the Correlation and Slope between the Reponse Data Point and every other numeric Data Point in the target Working Data collection.

How to Perform Correlation Analysis on a Data Point
Step Description

1

Set Collection Name to the Working Data collection that contains the reponse Data Point.

Note - We will be using the Product Sales Sample Dataset as our Collection Name.

product sales sample dataset

2

Select the Reponse Data Point.

3

Enter a name for the Result Collection.

correlation-analysis-how-to-1

4

Click OK.

Running the action...

correlation-analysis-how-to-2