Professional Documents
Culture Documents
INDEX
1.3. Activation
In order to use InnerSoft STATS Full Version, user must purchase a license before activating the product.
To purchase InnerSoft STATS Full Version, you must first contact with us by email
innersoft@itspanish.org
innersoft@gmail.com
You can also call us (Spain number phone):
There are different payment methods that you can find at:
http://isstats.itspanish.org/
Once product has been paid, you can activate it. Just enter Help/Activate ISSTATS and follow instructions.
For a successful activation process, InnerSoft requires permissions to read/write files in its installation
folder, usually: 'C:\Program Files\ISSTATS\'.
So may you have to login to Windows as Administrator or have to modify User Account Control in order
to activate successfully InnerSoft STATS.
1.4. License
Registering data is chained to a PC Computer. When you pay a license, you can run InnerSoft STATS in
3 different computers. Thus you can ask for 3 Activation Codes.
You will not lose the license when formatting the hard drives or changing Windows S.O., BUT if you
make changes of hardware the activation code may become invalid.
Extra Activation Codes (more than 3) for same user have an additional cost.
1.4.1. Upgrades
You will receive the Activation Codes for the product upgrades at no cost during one year. If you do not
renew the license after one year, will not lose the license of the product version that you purchased, BUT
will not receive new Activation Codes for the new software versions.
2. Introduction to ISSTATS
When you start ISSTATS, a new, empty project is opened for you. You will see three windows:
Main window
o Project Manager window
o Worksheet window
o Output window
Controlling Windows
You can open and minimize ISSTATS windows just as you would other windows in your operating
system. You can also control ISSTATS windows with commands on the Window menu.
ISSTATS Environment
The project contains:
A Worksheet that contains your data. You can NOT have multiple worksheets in one project. Your data
will be displayed as columns. There is a Data window for your worksheet in the project. You can enter
and edit data directly in the Data window.
A Project Manager, which contains:
When you open a project file, the worksheet that were inside that project when you last saved is available
to you. When you save a project, the worksheet is saved within that project file.
The output results must be saved apart from the project file.
ISSTATS does not support multiple Worksheets or multiple Output documents. If you create a new table
or document, remember to save the previous data before it disappears.
You can save a project or worksheet in XML or Binary file format. The advantages and disadvantages of
using each format are:
5
XML
Binary
Thus XML compressed in zip is the better way of storing large amounts of data.
Actually, undo/redo history has no limits. A limit will be implemented in a future. If you feel that
ISSTATS is consuming too much memory in your PC, you should save your project, restart the
application and open your project again. This will clear the undo/redo history.
Please, remember this is an early version.
Exit
Exit you from ISSTATS. ISSTATS will prompt you to save the changes before closing the application.
General Settings
Most of the menus have a tab for general settings. The options are:
Chart Width. Determines the width of the entire chart image in pixels.
Chart Height. Determines the height of the entire chart image in pixels.
Opacity. To use an opaque color, set Opacity to 255. To create a semitransparent color, set
Opacity to any value from 1 through 254 (0=Total Transparency; 255=Total Opacity). You must
set some grade of transparency when using multi-series charts based on superposed areas: Radar,
Range, Area, SplineRange, SplineArea
Border Width for Lines. Determines the border width, in pixels, of the chart. Use if for line-based
charts as Candlestick, FastLine, Line, PointAndFigure, Spline, StepLine, Stock,
ThreeLineBreak
Use 3D Style. Check to display the chart area using 3D style.
Is Right Angle Axes. Gets or sets a flag that determines whether a chart area is displayed using an
isometric projection. Check to display the chart axes at right angles. Uncheck to display the chart
area using an isometric projection.
Y Axis is Started From Zero. Indicates whether the minimum value of the axis will be
automatically set to zero if all data point values are positive. If there are negative data point
values, the minimum value of the data points will be used. Check if the minimum value of the axis
is automatically set to zero when all data point values are positive.
Y Axis Scale Break Style. A scale break is a stripe drawn across the plotting area of a chart to
denote a break in continuity between the high and low values on a value axis. Use a scale break to
display two distinct ranges in the same chart area. You cannot specify where to place a scale break
on your chart. The chart uses its own calculations based on the values in your dataset to determine
9
whether there is sufficient separation between data ranges to draw a scale break on the value axis
(y-axis).
Label Y Values. Check to display the data point value as labels.
Show Legend. Check to display the chart legend.
There is also a common tab to set the Title and Footnote of the chart. Click Titles Tab to define lines of
text to be placed at the top or bottom of the plot. None of these fields is required. You may choose to
leave any or all of them blank.
Group of Cases
From the menus choose: Graphs > Group of Cases
Area
Bar
Bubble
Column
Doughnut
10
FastLine
FastPoint
Funnel
Line
Pie
Point
Pyramid
Radar
Spline
SplineArea
StackedBar
StackedBar100
StackedColumn
StackedColumn100
Number of cases
% of cases
Cumulative N
Cumulative %
You can arrange groups in ascending or descending order of the categories values. You can also arrange
groups in ascending or descending order of the group values.
Display missing as a group. This option is available only when you use a categorical variable to define
groups for a new chart. If selected, each system-missing value for the grouping variable will appear as a
separate group in the chart. If not, cases with system-missing or user-missing values for the grouping
variable are excluded from the chart.
Categorical-Quantitative
From the menus choose: Graphs > Categorical-Quantitative
11
Mean of values
Median of values
Mode of values
Number of values
Sum of values
Standard deviation
Variance
Minimum value
Maximum value
Cumulative sum
Area
Bar
Bubble
Column
Doughnut
FastLine
FastPoint
12
Funnel
Line
Pie
Point
Pyramid
Radar
Spline
SplineArea
StackedBar
StackedBar100. Not available for Cumulative %.
StackedColumn
StackedColumn100. Not available for Cumulative %.
StepLine
You can arrange groups in ascending or descending order of the factor variable values. You can also
arrange groups in ascending or descending order of the dependent variable values.
Display missing as a group. This option is available only when you use a categorical variable to define
groups for a new chart. If selected, each system-missing value for the grouping variable will appear as a
separate group in the chart. If not, cases with system-missing or user-missing values for the grouping
variable are excluded from the chart.
Two Categorical
From the menus choose: Graphs > Two Categorical
1. Select at least one variable for the category axis and move it into the Independent Variable (x)
field. This variable may be numeric or string.
2. Select at least one variable and move it into the Dependent Variable (Y) field. Each group will
consist of one bar/slice/stack/lines for each value of the Dependent variable at every level of the
Independent variable.
13
Number of cases.
% Grand Total. Percentages are based on the total number of cases or responses in the chart.
% by X. Percentages are based on the total number of cases in the x-axis category. Adding up the
graphic elements in the category equals 100%.
% by Y. Percentages are based on the total number of cases in the y-axis category. Adding up the
graphic elements in the category equals 100%.
Cumulative N. The cumulative number of cases with particular x-axis values. Each graphic
element shows the number of cases with particular x-axis values plus the total number of cases for
every graphic element drawn previously on the same axis.
Cumulative %. The cumulative percentage of cases with particular x-axis values. Each graphic
element shows the percentage of cases with particular x-axis values plus the total percentage of
cases for every graphic element drawn previously on the same axis.
14
FastLine
FastPoint
Line
Point
Radar. Set some grade of transparency to see all different series.
Spline
SplineArea. Set some grade of transparency to see all different series.
StackedArea.
StackedArea100. Not available for statistics % by Y and Cumulative %.
StackedBar.
StackedBar100. Not available for statistics % by Y and Cumulative %.
StackedColumn.
StackedColumn100. Not available for statistics % by Y and Cumulative %.
StepLine.
Display missing as a group. This option is available only when you use a categorical variable to define
groups for a new chart. If selected, each system-missing value for the grouping variable will appear as a
separate group in the chart. If not, cases with system-missing or user-missing values for the grouping
variable are excluded from the chart.
Graph Sequences
From the menus choose: Graphs > Sequences
1. Select at least one variable for the y- axis and move it into the Sequence Variable (Y) field. This
variable must be numeric.
2. Optionally, you can select a variable as Label X Variable. This variable may be numeric or string.
15
Checking One chart per variable, each variable produces a chart. Checking One Sequence per
variable, all variables are drawn in a unique chart; each variable produces a series or sequence.
Area
Bar
Column
FastLine
FastPoint
Kagi
Line
Point
Radar
Renko
Spline
SplineArea
StepLine
ThreeLineBreak
17
RangeBar. Requires High and Low variables. If Open and Close variables are also defined, it will
draw two different range series: High-Low series, and Open-Close series.
RangeColumn. Requires High and Low variables. If Open and Close variables are also defined, it
will draw two different range series: High-Low series, and Open-Close series.
SplineRange. Requires High and Low variables. If Open and Close variables are also defined, it
will draw two different range series: High-Low series, and Open-Close series.
Stock. Requires High, Low, Open and Close variables.
Scatter Plot
From the menus choose: Graphs > Scatter Plot
1. You must select at least one variable for the y axis and at least one variable for the x axis. These
variables must be numeric.
18
2. You can select a variable and move it into the Set Markers By field. Each value of this variable is
marked by a different symbol on the scatterplot. This variable may be numeric or string.
3. You can select a numeric or a string variable and move it into the Label Cases By field. You can
label points on the plot with this variable.
a. If selected, the value labels (or values if no labels are defined) of this variable are used as
point labels.
b. If you do not select a variable to label cases by, other options can be used to label points.
i. No Label
ii. Case Number
iii. Coordinates
iv. Y value
19
About
Shows ISSTATS version and contact information.
20
Default folder
Owner
Description
You can access some other menus using the tree at the left.
4.2. Settings
Set the Output Decimal Places in this menu. The default value is 3.
4.3. Variables
This menu shows the following properties of each variable:
Name
Column Id
Count
Missing
Type
o Numeric
o Text
21
Scale of measure
o Nominal
o Ordinal
o Scale
o Unknown
Description
Values
Clicking in the Values button, you can access the Value Labels menu.
You can assign descriptive value labels for each value of a variable. This process is particularly useful if
your data file uses numeric codes to represent non-numeric categories (for example, codes of 1 and 2 for
Man and Woman). Value labels are saved with the data file. You do not need to redefine value labels each
time you open a data file.
22
New Worksheet
Creating a new worksheet adds an empty worksheet to the current project. An empty Data window will
appear.
Open Worksheet
Copies data from a file into the current project. When you open a file, you copy the contents of the file
into the current project. Any changes you make to the file while in the project will not affect the original
file.
Save Worksheet
Saves the current worksheet in the same file and format as the file name displayed in the window title. If
the current worksheet is untitled, File > Save Worksheet brings up a dialog box where you can specify a
file name and location in which to save your data. See File > Save Worksheet As for a description of this
dialog box.
Choose File > Save Worksheet As to rename your worksheet or save it to a new location.
Save Worksheet As
Saves the worksheet data in a file. Use this command if you want to rename your worksheet or save it to a
new location.
Dialog box items
23
Then, displays a window from which the user can specify some importing options.
Choose the Excel Worksheet you want ISSTAS to open from WorkSheet to Import drop-down list.
Check the columns of the selected worksheet that you want to import from Columns to Import check box
list. The first cell value of each column is shown to help you identify them.
Click Read variable names from the first row of data if the first row of the spreadsheet contains column
headings.
24
Exporting to Excel has some limitations. As worksheet limit size in Excel 2010 is 65,536 rows, ISSTATS
will divide the data collection in different sheets of 65530 rows everyone. If you are going to export 15
million values, the application will create 23 different Excel worksheets with 65530 rows each one. As
columns limit in Excel 2010 is 256 columns, ISSTATS will not export the data of more than 256
variables.
Erases the contents of the selected cells, without moving other cells. This menu command is available
when at least one cell is selected in the Data window. To delete selected cells, see Delete Cells.
In a numeric column, ISSTATS inserts system missing values (*) in a cleared cell.
Delete Cells
If you are deleting cells, other cells automatically shift up. If you click a cell and then press DELETE,
ISSTATS deletes the cell contents.
Select All Cells
Click this button to select all cells on the worksheet.
Copy Columns
When you copy various columns, alternated selected columns will be joined together in a continuous
selection once you paste or insert them (as Excel does).
Paste Columns
On pasting, empty selected columns will not modify any existing variables. Not empty selected columns
will replace any existing variables.
Insert Columns
You can insert columns that previously copied in the worksheet.
To insert a single column, select the column in the column immediately to the right of where you want to
insert the new column. To insert multiple columns, select the columns immediately to the right of where
you want to insert columns.
If you insert columns, other columns automatically shift to the right.
On inserting, empty columns (no variable defined) will add new empty columns (no variable defined).
Clear Columns Content
Erases the contents of the selected column, without moving other columns. This menu command is
available when at least one column is selected in the Data window. To delete selected columns, see Delete
Columns.
The cleared cells remain as blank cells on the worksheet.
Delete Columns
If you are deleting columns, other columns automatically shift to the left.
Select All Columns
The Select All Columns command will create a new selection that includes every column with data.
As rows have not any structural meaning in the table, there is no copy/cut/paste/insert options for rows. If
you want to copy all cells of a row just select the rows and press COPY CELLS button.
26
If variables contain value labels, you can control the display of variable labels and data values clicking on
button View Labels.
27
28
Select All
The Select All command will create a new selection that includes everything on the document.
29
7. Frequency Tables
Overview
The frequency of a particular observation is the number of times the observation occurs in the data. The
distribution of a variable is the pattern of frequencies of the observation.
Frequency distribution tables can be used for both categorical and numeric variables. Use numeric codes
or strings to code categorical variables (nominal or ordinal level measurements).
30
8. Descriptive Statistics
Overview
Produces descriptive statistics for each column. The data columns must be numeric and contain at least
one numeric value. The dialog box allows to choose the statistics that you wish to display.
Available Variables list shows numeric data columns containing at least one no-missing value.
Dialog box items
Mean: Choose to display the arithmetic mean.
Sample Variance: Choose to display the unbiased variance of the data. Estimates population variance
based on a sample. If your data represents the entire population, then compute the variance by using Total
Variance.
Sample Standard Deviation: Choose to display the standard deviation of the data. Estimates population
standard deviation based on a sample. If your data represents the entire population, then compute the Std.
Deviation by using Total Std. Deviation.
Sample Coefficient of variation: Choose to display the coefficient of variation.
Sample Skewness: Choose to display the skewness value. Estimates population skewness based on a
sample. If your data represents the entire population, then compute the skewness by using Total
Skewness.
Sample Kurtosis: Choose to display the kurtosis value. Estimates population kurtosis based on a sample.
If your data represents the entire population, then compute the kurtosis by using Total Kurtosis.
31
Total Variance: Choose to display the variance of the data. Calculates variance based on the entire
population.
Total Standard deviation: Choose to display the standard deviation of the data. Calculates Std. Deviation
based on the entire population.
Total Coefficient of variation: Choose to display the coefficient of variation. Calculates coefficient of
variation based on the entire population.
Total Skewness: Choose to display the skewness value. Calculates skewness based on the entire
population.
Total Kurtosis: Choose to display the kurtosis value. Calculates kurtosis based on the entire population.
SEM: Choose to display the standard error of the mean.
Sum: Choose to display the data sum.
Minimum: Choose to display the data minimum.
Maximum: Choose to display the data maximum.
Range: Choose to display the data range. Data Range is the difference between the maximum and
minimum.
Quartiles: Choose to display the first quartile, the median and the third quartile.
Interquartile range: Choose to display the difference between the first and third quartiles.
Deciles: Choose to display the nine values that divide the sorted data into ten equal parts.
Percentiles: To request a percentile:
Enter the desired value in the box placed in Percentiles group box. For example, if you wanted the
7th percentile, you would enter a 7 in the box.
Click the add button to add the percentile to the list of requested percentiles.
Repeat Step 1 and 2 to add additional percentiles as desired.
If you need to delete a percentile, select it in the list and click the remove button.
Mode: Choose to display the mode and the number of times it occurs. If multiple modes exist, Minitab
displays the smallest modes, up to a total of four, along with their frequency.
Sum of squares: Choose to display the sum of the squared data values. This is the uncorrected sums of
squares, without first subtracting the mean.
MSSD: Choose to display half the Mean of Successive Squared Differences.
N nonmissing: Choose to display the number of nonmissing column entries.
N missing: Choose to display the number of missing column entries.
N total: Choose to display the total (nonmissing and missing) number of column entries.
32
CutPoints: Divide the data into a number of equal groups. For example, to create deciles, you would
enter 10 in the box .Enter 3 to divide the data into tertiles.
Check statistics
Check None: Choose to clear all check boxes and then individually check the statistics to display.
Check All: Choose to check all boxes. You can uncheck statistics as needed.
33
9. Crosstabs
34
For further details, see Methods and Formulas Help: Pearson Chi Square Test.
One sided and two sided Fishers Exact Test (only for 2-by-2 square tables)
For further details, see Methods and Formulas Help: Fishers Exact Test.
McNemar asymptotic
Edwards Continuity Correction
McNemar Exact Binomial
Mid-P McNemar Test
McNemar-Bowker Test
For further details, see Methods and Formulas Help: McNemars Test.
35
Odds Ratio
Relative Risk
Attributable risk
Relative Attributable Risk
Number Needed to Harm
Attributable Risk per Unit
Etiologic Fraction
Risk Estimate statistics are only computed for a 2-by-2 square table without empty cells
For further details, see Methods and Formulas Help: Risk Test.
For further details, see Methods and Formulas Help: Cohen's Kappa Test.
Phi Coefficient
Contingency Coefficient
Standardized Contingency Coefficient
Cramer's V
Tschuprow's T
Symmetric Lambda
Asymmetric Lambda (Row variable as dependent)
Asymmetric Lambda (Column variable as dependent)
36
Gamma
Sommers d
Kendalls tau-b
Kendalls tau-c
37
Overview
In the box Test Type, choose the test to perform:
Z Test
T Test
Variance Test
Hypothesis Test
To perform a hypothesis test, check the box Perform Hypothesis Test and choose the alternative
hypothesis of the test:
Less than: Perform a level test of H0: 0 against the one-sided alternative H1: < 0
Not Equal: Perform a level test of H0: = 0 against the two-sided alternative H1: 0
Greater than: Perform a level test of H0: 0 against the one-sided alternative H1: > 0
Significance level of the test () derive from the 1- value set in the Confidence Level text box.
If you choose a lower-tailed hypothesis test, an upper confidence bound will be constructed. If you
choose an upper-tailed hypothesis test, a lower confidence bound will be constructed.
Available Variables list shows numeric data columns containing at least one no-missing value.
10.1. Z Test
Use 1-Sample Z to compute a confidence interval or perform a hypothesis test of the mean when the
standard deviation of the population is known. The samples should come from a normal population if n
is low; if however n>30 the distribution of the population does not have to be normal.
38
Samples in columns: Choose if you have entered raw data in columns. Enter the columns
containing the sample data in the list Computing Variables. Move these variables from Available
Variables list to Computing Variables list using Add and Remove buttons. Enter the value for the
population standard deviation in the text box. Entering multiple columns, ISSTATS performs
separate one-sample analyses on each column.
Summarized data: Choose if you have summary values for the sample size and mean.
o Sample size: Enter the value for the sample size.
o Mean: Enter the value for the sample mean.
o Population Standard deviation: Enter the value for the population standard deviation.
Confidence level: Enter the level of confidence desired. Enter any number between 0 and 1. Entering 0,9
will result in a 90% confidence interval. The default is 0,95 = 95%.
Perform hypothesis test: Check to perform the hypothesis test.
10.2. T Test
Performs a one-sample t-test or t-confidence interval for the mean.
Use T Test for one sample to compute a confidence interval and perform a hypothesis test of the mean
when the population standard deviation, , is unknown. Use this test when samples come from a normal
population or n > 30.
Dialog box items
Data: Choose the origin of the data.
Samples in columns: Choose if you have entered raw data in columns. Enter the columns
containing the sample data in the list Computing Variables. Move these variables from Available
Variables list to Computing Variables list using Add and Remove buttons. Entering multiple
columns, ISSTATS performs separate one-sample analyses on each column
Summarized data: Choose if you have summary values for the sample size and mean.
o Sample size: Enter the value for the sample size.
o Mean: Enter the value for the sample mean.
o Sample Standard deviation: Enter the value for the sample standard deviation.
39
Confidence level: Enter the level of confidence desired. Enter any number between 0 and 1. Entering 0,9
will result in a 90% confidence interval. The default is 0,95 = 95%.
Perform hypothesis test: Check to perform the hypothesis test.
Samples in columns: Choose if you have entered raw data in columns. Enter the columns
containing the sample data in the list Computing Variables. Move these variables from Available
Variables list to Computing Variables list using Add and Remove buttons. Entering multiple
columns, ISSTATS performs separate one-sample analyses on each column.
Summarized data: Choose if you have summary values for the sample size and variance.
o Sample size: Enter the value for the sample size.
o Sample Variance: Enter the value for the sample variance.
Confidence level: Enter the level of confidence desired. Enter any number between 0 and 1. Entering 0,9
will result in a 90% confidence interval. The default is 0,95 = 95%.
Perform hypothesis test: Check to perform the hypothesis test.
40
2-Sample t-Test
Paired t-Test
2 Variances F-Test
Hypothesis Test
To perform a hypothesis test, check the box Perform Hypothesis Test and choose the Alternative
Hypothesis of the test:
Less than: Perform a level test of H0: d against the one-sided alternative H1: < d
Not Equal: Perform a level test of H0: = d against the two-sided alternative H1: d
Greater than: Perform a level test of H0: d against the one-sided alternative H1: > d
Significance level of the test () derive from the 1- value set in the Confidence Level text box.
If you choose a lower-tailed hypothesis test, an upper confidence bound will be constructed. If you
choose an upper-tailed hypothesis test, a lower confidence bound will be constructed.
Available Variables list shows numerical and text columns containing at least one no-missing value.
Text columns may be used as Subscripts.
Samples in one column: Choose if the sample data are in a single column, differentiated by
subscript values (group codes) in a second column.
o Computing Variables: Enter the columns containing the sample data in the list
Computing Variables. Move these variables from Available Variables list to Computing
Variables list using Add / Remove buttons.
o Grouping Variable: Enter the column containing the sample subscripts to define the
groups. It may be a numerical or text column.
o Group 1/Group 2: Enter the subscripts that define both groups.
o Automatic Grouping: Check to let the application define automatically both groups. It
will identify the first two different subscripts in the grouping variable.
Samples in different columns: Choose if the data of the two samples are in separate columns.
o First Sample: Enter the column containing one sample from Available Variables list.
o Second Sample: Enter the column containing the other sample from Available Variables
list.
Summarized data: Choose if you have summary values for the sample size, mean, and variance
for each sample.
o First Sample
Sample size 1: Enter the sample size for the first sample.
Mean 1: Enter the value for the mean of the first sample.
Sample Variance 1: Enter the value for the variance of the first sample.
o Second Sample
Sample size 2: Enter the sample size for the second sample.
Mean 2: Enter the value for the mean of the second sample.
Sample Variance 2: Enter the value for the variance of the second sample.
Assume Population equal variances: Check to assume that the populations have equal variances.
Confidence level: Enter the level of confidence desired. Enter any number between 0 and 1. Entering 0,9
will result in a 90% confidence interval. The default is 0,95 = 95%.
Perform hypothesis test: Check to perform the hypothesis test.
Hypothesized mean: Enter the hypothesized difference between the two population means 1-2.
Alternative hypothesis: Choose the alternative hypothesis of the test.
A difference of 0 suggests in the null hypothesis the equality between mean populations; H0: 1-2 = 0
against an alternative H1: 1-2 0
42
A difference equal or less than 0 suggest in the null hypothesis that the first mean is equal or less than the
second; H0: 1-2 0 against an alternative H1: 1-2 > 0
A difference equal or greater than 0 suggest in the null hypothesis that the first mean is equal or greater
than the second; H0: 1-2 0 against an alternative H1: 1-2 < 0
Optionally, test ratios other than 0 (equality) can be specified. A difference of 2 suggests in the null
hypothesis the first mean is the second mean plus 2.
Sample in columns: Choose if you have entered raw data in two columns.
o First sample: Enter the column containing the first sample from Available Variables list.
o Second sample: Enter the column containing the second sample from Available Variables
list.
Pairs must have two numerical values. Pairs that have a missing data in any of the
members are ignored.
Summarized data: Choose if you have summary values for the sample size, mean, and variance
of the difference.
o Sample size: Enter the value for the sample size.
o Mean of Differences: Enter the value for the mean of differences .
o Variance of Differences: Enter the value for the variance of differences s2d.
Confidence level: Enter the level of confidence desired. Enter any number between 0 and 1. Entering 0,9
will result in a 90% confidence interval. The default is 0,95 = 95%.
Perform hypothesis test: Check to perform the hypothesis test.
Hypothesized mean: Enter the hypothesized population mean of the paired differences d.
Alternative hypothesis: Choose the alternative hypothesis of the test.
43
Samples in one column: Choose if you have entered data into a single column with a second
column of subscripts that identify the samples.
o Computing Variables: Enter the columns containing the sample data in the list
Computing Variables. Move these variables from Available Variables list to Computing
Variables list using Add and Remove buttons. Entering multiple columns, ISSTATS
performs separate analyses on each column.
o Grouping Variable: Enter the column containing the sample subscripts to define the
groups. It may be a numerical or text column.
o Group 1/Group 2: Enter the subscripts that define both groups.
o Automatic Grouping: Check to let the application define automatically both groups. It
will identify the first two different subscripts in the grouping variable.
Samples in different columns: Choose if you have entered the data for the two samples into
separate columns.
o First Sample: Enter the column that contains the data for the first sample from Available
Variables list.
o Second Sample: Enter the column that contains the data for the second sample from
Available Variables list.
Summarized data: Choose if you have summary values for the sample sizes and variances.
o First Sample
Sample size 1: Enter the sample size for the first sample.
Sample Variance 1: Enter the variance for the first sample.
o Second Sample
Sample size 2: Enter the sample size for the second sample.
Sample Variance 2: Enter the variance for the second sample.
Confidence level: Enter the level of confidence desired. Enter any number between 0 and 1. Entering 0,9
will result in a 90% confidence interval. The default is 0,95 = 95%.
Perform hypothesis test: Check to perform the hypothesis test.
Hypothesized ratio: Enter the hypothesized ratio between two populations variances 21/22.
Alternative hypothesis: Choose the alternative hypothesis of the test.
A ratio of 1 suggests in the null hypothesis the equality between variance populations; H0: 21/22 = 1
against an alternative H1: 21/22 1.
44
A ratio equal or less than 1 suggest in the null hypothesis that the first variance is equal or less than the
second; H0: 21/22 1 against an alternative H1: 21/22 > 1.
A ratio equal or greater than 1 suggest in the null hypothesis that the first variance is equal or greater than
the second; H0: 21/22 1 against an alternative H1: 21/22 < 1.
Optionally, test ratios other than 1 (equality) can be specified. A ratio of 2 suggests the first variance is
double the second variance.
45
Performs a one-way analysis of variance. You can also perform multiple comparisons. The response
variable must be numeric. The factor level column (grouping variable) may be numeric or text.
Dialog box items
Data: Choose the origin of the data.
Groups in 1 Column
o Computing Variables: Enter the column or columns containing the response. ISSTATS
performs a ANOVA test for each of these columns.
o Grouping Variable: Enter the column containing the factor levels.
Scheffe
Tukey HSD: Tukey's Honestly Significant Difference test.
Sidak
46
For further details, see Methods and Formulas Help: ANOVA Multiple Comparisons.
Include Brown-Forsythe Test for equality of Means
Use to test the equality of means when distribution violates the assumption of equal variances.
The Brown-Forsythe test cannot be computed if all groups have zero variance. To take part into Brown
Forsythe test, a group must have at least two elements. In the situation that some groups have zero
standard deviations, the statistic can be computed but the approximation may not work.
For further details, see Methods and Formulas Help: BrownForsythe Test for equality of means.
Include Welchs Test for equality of Means
Use to test the equality of means when distribution violates the assumption of equal variances.
To take part into Welchs test, a group must have non zero variance. Moreover, sample sizes of a group
have to be greater than or equal to 2.
For further details, see Methods and Formulas Help: Welchs Test for equality of means.
Remark: ISSTATS does not cancel a test if any of the groups does not fulfill the conditions. It simply
rejects the group. To take part into an ANOVA test, a group must have at least one element. To take part
into a Welchs Test, a group must have at least two elements; moreover, the group must have a non-zero
variance. Thus, ISSTATS may use 4 of the groups to perform ANOVA Test but only 2 of the groups to
perform a Welchs Test. It depends on the number of elements of each group. You should read the
Descriptive Statistics information to check the Rejected Groups and also should read the Total Groups
info of each test to check the number of groups being used in that test.
Outputs
Outputs include:
Inter is also referred as Between Groups or Between Treatments. Intra is also referred as Within Groups
or Error Term.
47
R-squared represents the percentage of variation in a response variable that is explained by its relationship
with one predictor variable.
2 =
= 1
R-squared adjusted is a version of r-squared that has been adjusted for the number of predictors in the
model. R-squared tends to overestimate the strength of the association, especially when there are more
than one independent variables.
2 = 1
48
These tests are used to test if k samples are from populations with equal variances. The response variable
has to be numeric. The factor level column (grouping variable) may be numeric or text.
Dialog box items
In the box Test Type, choose the test to perform:
Levene's Test
BrownForsythe Test for equality of variances
Bartlett's Test
For further details, see Methods and Formulas Help: Homoscedasticity Tests.
Remark: ISSTATS does not cancel a test if any of the groups does not fulfill the conditions. It simply
rejects the group. To take part into a Levene or BrownForsythe test, a group must have at least one
element. To take part into a Bartlett's Test, a group must have at least two elements; moreover, the group
must have a non-zero variance. Thus, ISSTATS may use 4 of the groups to perform Levene's Test but
only 2 of the groups to perform a Bartlett's Test. It depends on the number of elements of each group.
You should read the Descriptive Statistics information to check the Rejected Groups and also should read
the Total Groups info of each test to check the number of groups being used in that test.
49
Outputs
Outputs include:
For further details, see Methods and Formulas Help: Bivariate Correlation Tests.
Remark: All tests use pairwise deletion mode, thus the statistical procedure uses cases that contain some
missing data. A case may contain 3 variables: VAR1, VAR2, and VAR3. A case may have a missing
value for VAR1, but this does not prevent some statistical procedures from using the same case to analyze
variables VAR2 and VAR3. Pairwise deletion allows you to use more of your data. However, each
computed statistic may be based on a different subset of cases.
50
Once you add the columns, input the position on each asset. Set the Holding Period (trading days) and the
Confidence Level (1-).
Outputs
51
For each asset, Position, One-day Expected Return, Standard Deviation or Volatility, One-day
Value at Risk, Total Value at Risk for n Trading Days, One-day Expected Shortfall.
Covariance matrix
One-day Portfolio Expected Return
Portfolio Variance
Portfolio Standard Deviation or Portfolio Volatility
One-day Value at Risk
Holding Period in Trading Days
Total Value at Risk for n Trading Days
One-day Portfolio Expected Shortfall
For each asset, Marginal Value at Risk, Component Value at Risk, Incremental Value at Risk.
Remark: There is no a common definition of VaR. Sometimes VaR is assumed to be the Portfolio
Volatility as expected return is supposed to be zero. ISSTATS follows the method described on J.P.
Morgan webpage, thus it does NOT consider VaR as Portfolio Volatility. Please, check the equations at
Methods and Formulas Help: Parametric Value at Risk.
52
ISSTAS follows the J.P. Morgan RiskMetrics approach to estimating and forecasting volatility by
exponentially weighted moving average model (EWMA).
Inputs
Choose the columns for daily return rates of each asset in a portfolio. Each of these columns must have a
list of Continuously Compounded Return for an asset on a daily basis. Do not enter the return rates in
percent values (%); use per unit values.
Order is the time order of the data series:
Ascending: the first data point the value in the first row- corresponds to the earliest date.
Descending: the first data point the value in the first row- corresponds to the latest or most
recent date.
Lambda is the smoothing parameter or decay factor used for the exponential-weighting scheme. A default
value of 0.94 is shown.
Set the forecast time/horizon (t). If you set 1 as forecast time, application computes the forecast for the
next day, given the information up to and including the last day of the series.
Outputs
53
For each asset, Position, One-day Expected Return, Volatility by EWMA method, One-day Value
at Risk, Total Value at Risk for n Trading Days.
One-day Covariance matrix by EWMA method.
Correlation matrix by EWMA method.
One-day Portfolio Expected Return
Portfolio Variance.
Portfolio Volatility.
One-day Value at Risk
Holding Period in Trading Days
Total Value at Risk for n Trading Days
For each asset, Marginal Value at Risk, Component Value at Risk, Incremental Value at Risk.
Remarks
The time series is supposed homogeneous or equally spaced. Missing values are removed from the
series.
The daily returns should be computed as Continuously Compounded Return: ln(Si/Si-1).
The time series should have more than 50 values.
When computing EWMA volatility, ISSTATS assumes that the time series has an average equal
to zero; but it does not assumes the average is zero when computing the VaR (VaR = + z).
ISSTATS does not use the recursive formula for EWMA; hence it does not compute any initial
variance.
Please, check the method and assumptions at Methods and Formulas Help: Exponentially Weighted
Moving Average (EWMA) Forecast.
54
2.
3.
Select the variables required for the formula and move them into the fields. Each formula have
its own requirements. The fields are: Daily High Price, Daily Low Price, Daily Close Price,
Daily Volume and Daily Price.
Select the formula you want to apply from the drop-down list.
Set the parameters for the formula
The accumulation distribution formula calculates a cumulative total of prices and volumes.
An increase in the accumulation distribution index indicates a probable price increase. A decrease in the
accumulation distribution index indicates a probable price decrease.
55
Input Values
This formula takes four input variables.
Output Value
This formula outputs one variable.
The average true range formula records the maximum values of the following three differences, and
calculates the moving average of the resulting data series:
The average true range indicator is a good measure of commitment. A high value often indicates market
bottom due to panic sell. A low value often indicates market top.
Parameters
This formula takes one optional parameter.
Period: Period for calculating the moving average of the true range values. The default value is 14.
Input Values
This formula takes one input variable.
56
Output Value
This formula outputs one variable.
The Bollinger Bands formula calculates the standard deviation above and below a simple moving average
of the data. Since standard deviation is a measure of volatility, a large standard deviation indicates a
volatile market, and a smaller standard deviation indicates a calmer market.
Parameters
This formula takes two required parameters.
Period: Period for calculating the moving average for the Bollinger Bands.
StdDev: The number of standard deviations for calculating the upper and lower bands.
Input Values
This formula takes one input variable.
Price: The price for which the Bollinger Bands are calculated.
Output Value
This formula outputs two variables.
The Chaikin Oscillator formula is useful for monitoring volume flow in a market. It applies the
Accumulation Distribution Formula on the input, calculates the exponential moving average of the result
for a short period and a long period, and then outputs the difference between the two.
Parameters
This formula takes two optional parameters.
Period Short: Period for calculating the short period exponential moving average. The default
value is 3.
Period Long: Period for calculating the long period exponential moving average. The default
value is 10.
Input Values
This formula takes four input variables.
Output Value
This formula outputs one variable.
The commodity channel index formula calculates the mean deviation of the daily average price from the
moving average. A value above 100 indicates that the commodity is overbought, and a value below -100
indicates that the commodity is oversold.
Parameters
This formula takes one optional parameter.
Period: Period for calculating the commodity channel index. The default value is 10.
Input Values
This formula takes three input variables.
Output Value
This formula outputs one variable.
The detrended price oscillator formula calculates the difference between the daily price and the moving
average. This is useful for identifying cycles and overbought and oversold price levels.
Parameters
This formula takes one required parameter.
Period: Period for calculating the moving average for the detrended price oscillator index.
Input Values
This formula takes one input variable.
Price: Price for which the detrended price oscillator index is calculated.
Output Value
This formula outputs one variable.
The ease of movement formula uses the close price and volume to measure the strength of the price trend.
A value close to zero indicates that prices are not moving easily, while a high positive value indicates that
prices are going up easily and a high negative value indicates that prices are going down easily.
59
Input Values
This formula takes three input variables.
Output Value
This formula outputs one variable.
Envelopes Formula
The envelopes formula calculates "envelopes" above and below a moving average using a specified
percentage as the shift. The envelopes indicator is used to create signals for buying and selling. You can
specify the percentage the formula uses to calculate the envelopes.
Parameters
This formula takes two required parameters.
Input Values
This formula takes one input variable.
Output Value
This formula outputs two variables.
Forecasting Formula
60
The forecasting formula attempts to fit the historical data to a regression function and forecast future
values of the data best on the best fit.
Parameters
This formula takes four optional parameters.
Regression Type: Indicate one of these regression types: Constant, Linear, Quadratic, Cubic,
Quartic, Exponential, Logarithmic, Power.
Period: Forecasting period. The formula predicts data for this period of days into the future. The
default value is half of the series' length.
Approx. Error: Whether to output the approximation error. If unchecked, output error series
contain no data for the corresponding historical data.
Forecast. Error: Whether to output the forecasting error. If unchecked, output error series contain
the approximation error for all predicted data points if ApproxError is checked.
Input Values
This formula takes one input variable.
Output Value
This formula outputs three variables.
The mass index formula predicts trend reversals by calculating the range between high and low prices for
each period. A bulge in the index line signals a possible trend reversal. You can use a 9-day Exponential
Moving Average Formula to determine whether the bulge is a buy or sell signal.
Parameters
This formula takes two parameters.
61
Input Values
This formula takes two input variables.
Output Value
This formula outputs one variable.
The money flow formula compares upward changes and downward changes of volume-weighted typical
prices. It can be used to identify market tops and bottoms.
Parameters
This formula takes one required parameter.
Input Values
This formula takes four input variables.
Output Value
This formula outputs one variable.
62
The moving average convergence/divergence (MACD) formula compares a short period moving average
and a long period moving average of prices. MACD is used with a 9-day exponential moving average as a
signal that identifies buying or selling moments.
Parameters
This formula takes two optional parameters.
Short Period: Period for calculating the short period moving average. The default value is 12.
Long Period: Period for calculating the long period moving average. The default value is 26.
Input Values
This formula takes one input variable.
Output Value
This formula outputs one variable.
MACDIndicator: Distance between the short and long exponential moving averages.
The exponential moving average formula is a moving average of data that gives more weight to the more
recent data in the period and less weight to the older data in the period. This formula produces a moving
average that follows the market trend much more quickly than the Weighted Moving Average Formula.
This formula smoothes a data series. This makes analyzing volatile data easier.
Parameters
This formula takes one required parameter.
63
Input Values
This formula takes one input variable.
Output Value
This formula outputs one variable.
The simple moving average formula takes the average of data over a period of time, and "moves" the
period across the data series one data point at a time. This formula smoothes a data series and makes
analyzing volatile data easier.
Parameters
This formula takes one required parameter.
Input Values
This formula takes one input variable.
Output Value
This formula outputs one variable.
64
The triangular moving average formula takes a simple moving average of data and applies a simple
moving average on the first moving average. It is a lagging indicator, and is always behind the price. The
triangular moving average gives the most weight to the middle portion of the data.
Parameters
This formula takes one required parameter.
Input Values
This formula takes one input variable.
Output Value
This formula outputs one variable.
The triple exponential moving average formula is useful for eliminating short and insignificant cycles in
the data. It smoothes the data three times using the Exponential Moving Average Formula, and then
calculates the rate of change in the moving average based on the result for the previous day.
Parameters
This formula takes one required parameter.
Period: Period for calculating the exponential moving average for the triple exponential moving
average indicator.
Input Values
This formula takes one input variable.
Price: Price for which the triple exponential moving average indicator is calculated.
65
Output Value
This formula outputs one variable.
The weighted moving average formula is a moving average of data that gives more weight to the more
recent data in the period and less weight to the older data in the period. This formula smoothes a data
series. This makes analyzing volatile data easier.
Parameters
This formula takes one required parameter.
Input Values
This formula takes one input variable.
Output Value
This formula outputs two variables.
The negative volume index formula helps identify a bull market. When the negative volume index is
above its moving average there is higher probability for a bull market. The probability for a bull market is
much lower when the negative volume index is below its moving average.
This formula should be used together with the Positive Volume Index Formula.
66
Parameters
This formula takes one required parameter.
Input Values
This formula takes two input variables.
Output Value
This formula outputs one variable.
The on balance volume formula measures positive and negative volume flows.
Input Values
This formula takes two input variables.
Output Value
This formula outputs one variable.
Performance Formula
67
The performance formula calculates the rate of price change compared with historical data. It differs from
the Rate of Change Formula in that it calculates rate of change against the first available data. The output
is a percentage.
This formula can also be used to calculate the rate of volume change.
Input Values
This formula takes one input variable.
Price: Price for which the performance indicator is calculated. It can be any other data for which
you wish to calculate the performance indicator, such as volume.
Output Value
This formula outputs one variable.
The positive volume index formula helps identify a bear market. When the positive volume index is
below its moving average there is higher probability for a bear market. The probability for a bear market
is much lower when the positive volume index is above its moving average.
This formula should be used together with the Negative Volume Index Formula.
Parameters
This formula takes one required parameter.
Input Values
This formula takes two input variables.
Output Value
This formula outputs one variable.
The median price formula calculates the average value of daily high and low prices.
Input Values
This formula takes two input variables.
Output Value
This formula outputs one variable.
The typical price formula calculates the average value of daily high, low, and close prices.
Input Values
This formula takes three input variables.
Output Value
This formula outputs one variable.
69
The weighted close formula calculates the average value of daily prices, but gives more weight to the
close price.
Input Values
This formula takes three input variables.
Output Value
This formula outputs one variable.
The price volume trend formula calculates a cumulative volume total using relative changes of the close
price. A bullish divergence between the price volume trend indicator and the price indicates that the
market is at the bottom. A bearish divergence between the price volume trend indicator and the price
indicates that the market is at the top.
Input Values
This formula takes two input variables.
Output Value
This formula outputs one variable.
70
The rate of change formula calculates the rate of price change compared with historical data. It differs
from the Performance Formula in that it calculates the rate of change against a period of days prior to the
current price. The output is a percentage.
This formula can also be used to calculate the rate of volume change.
Parameters
This formula takes one parameter.
Period: Number of days prior to the current day. The formula uses the data from that day as a
reference when calculating the current rate of change. The default value is 10.
Input Values
This formula takes one input variable.
Price: Price for which the rate of change is calculated. It can be any other data for which you wish
to calculate the rate of change, such as volume.
Output Value
This formula outputs one variable.
The relative strength index formula is a momentum oscillator formula that compares upward movements
of the close price with downward movements, and outputs values from 0 to 100. A value close to 100
indicates that the price is about to move downward, and a value close to 0 indicates that the price is about
to move upward.
71
Parameters
This formula takes one parameter.
Period: Period for calculating the relative strength index. The default value is 10.
Input Values
This formula takes one input variable.
Price: The closing price for which the relative strength index is calculated.
Output Value
This formula outputs one variable.
The standard deviation formula is used to indicate volatility. It calculates the difference between values
like the close price and their moving average. A higher standard deviation indicates higher volatility.
Parameters
This formula takes one required parameter.
Period: Period for calculating the moving average for the standard deviation.
Input Values
This formula takes one input variable.
Output Value
This formula outputs one variable.
72
The stochastic indicator formula calculates the simple stochastic indicator (%K) and the smoothed
stochastic indicator (%D). %D is a moving average of %K. The output is a percentage. A value more than
80% indicates that the current price is close to the price high, and a value less than 20% indicates that the
current price is close to the price low.
Parameters
This formula takes two parameters.
Input Values
This formula takes three input variables.
Output Value
This formula outputs two variables.
The Volatility Chaikins formula calculates the exponential moving average of the difference between
daily high and low prices, then calculates the rate of change of the exponential moving average.
Parameters
This formula takes two parameters.
73
Period EMA: Period for calculating the exponential moving average of the difference between the
high and low prices. The default value is 10.
Period ROC: Period for calculating the rate of change. The default value is 10.
Input Values
This formula takes two input variables.
Output Value
This formula outputs one variable.
The volume oscillator formula measures the difference between a short period moving average of volume
and a long period moving average of volume. A positive value indicates a strong trend, and a negative
value indicates a weak trend.
Parameters
This formula takes three parameters.
Period Short: Period for calculating the short period moving average. The default value is 5.
Period Long: Period for calculating the long period moving average. The default value is 10.
Use Percentage: Whether to output the difference in percentage. When set to false, the formula
outputs the difference as a point. The default value is true.
Input Values
This formula takes one input variable.
Output Value
This formula outputs one variable.
74
William's %R Formula
The William's %R formula is a momentum indicator, and is used to measure over-bought or oversold
levels. This indicator is very similar to the stochastic %K indicator, except that the Williams %R formula
calculates a negative value between 0 and -100 and does not smooth the data.
Parameters
This formula takes one parameter.
Period: Period for calculating the Williams %R indicator. The default value is 14.
Input Values
This formula takes three input variables.
Output Value
This formula outputs one variable.
75
76