Measure | Data Types | Description |
---|---|---|
Freq Table | Binary, Nominal, Ordinal | Provides a table of the frequencies of different categories present in the data. |
Mean | Discrete, Continuous | Reports the average or the mean value of the data within that group |
Median | Ordinal, Discrete, Continuous | Reports the median value of the data. The median value is the central value of the data when the data is put in either increasing or decreasing order. For ordinal data, we need to categories to be numeric in order to report the median value. |
Mode | All Types | Reports the most frequently occurring category for the binary, nominal and ordinal data and the most frequently occurring data value for discrete and continuous data types within a group. |
Trimmed Mean | Discrete, Continuous | Reports the mean value after subtracting the bottom 5% and the top 5% of the data points. Basically, it is like reporting the mean value after possibly removing the outliers. |
Min | Ordinal, Discrete, Continuous | Reports the minimum value observed in the data. Note that for Ordinal data type, this value is reported only if the categories are numeric. If text values are entered in the category column, then no minimum values are reported. |
Max | Ordinal, Discrete, Continuous | Reports the maximum value observed in the data. Note that for Ordinal data type, this value is reported only if the categories are numeric. If text values are entered in the category column, then no maximum values are reported. |
Range | Ordinal, Discrete, Continuous | Reports the range values (max - min) observed in the data. Note that for Ordinal data type, this value is reported only if the categories are numeric. If text values are entered in the category column, then no range values are reported. |
Stdev | Discrete, Continuous | Reports the sample standard deviation value for the given data set within a group. Note that this is the sample standard deviation and not the population standard deviation. |
Variance | Discrete, Continuous | Reports the variance value which is the square of the standard deviation for the given data within a group. Note that this is the variance of the sample and not the population. |
Q1 | Ordinal, Discrete, Continuous | Reports the first quartile of the data values for discrete and continous. This value is reported for ordinal data only if the category values are numeric. The first quartile means that 25% of the data points are less than Q1 and 75% of the data points are greater than Q1. |
Q3 | Ordinal, Discrete, Continuous | Reports the third quartile of the data values for discrete and continous. This value is reported for ordinal data only if the category values are numeric. The third quartile means that 75% of the data points are less than Q3 and 25% of the data points are greater than Q3. |
IQR | Ordinal, Discrete, Continuous | Reports the difference between Q3 and Q1 values. This value is reported for ordinal data only if the category values are numeric. |
Sum | Discrete, Continuous | Reports the sum of all the data values within a group. |
SSQ | Discrete, Continuous | Reports the sum of squares of all the data values within a group. Note that the mean value is not subtracted from the data to calculate the Sum of Squares values. |
CV | Discrete, Continuous | Reports the coefficient of variation for the given set of data within a group. The coefficient of variation is the ration of the standard deviation to the mean value. It reports the amount of variation in the data. If CV < 1, then we can consider the data as having low variation and if CV > 1, then the data has high variation. |
Skewness | Continuous | Reports a measure of the Skewness of the data. This measure can be used to check if the distribution is symmetric. If skewness is 0, then the distribution is symmetric. If the skewness is negative then left tail is longer and if the skewness is positive then the right tail is longer. |
Kurtosis | Continuous | Reports a measure of the shape of the distribution. If the Kurtosis is close to 3, the shape of the distribution is similar to normal distribution. If Kurtosis is less than 3, then there are fewer outliers and if the Kurtosis is greater than 3, there could be more number of outliers. |
Normality | Continuous | Reports the P value of the normality test to check if the given data is normally distributed. If P value is less than alpha, we conclude that the data is not normally distributed. |
CI | Continuous | Reports the confidence interval of the mean value. The confidence interval reports the likelihood of where the population mean is expected to lie and is based on the variation in the data and the confidence level required in the analysis. |
1 | Data Type:
Click on the data type to specify the type of input data. The type of statistics you can compute
and the graphs you can display depend on the type of data you have. Following are the available data types for this option:
| ||||||||||||
2 | Notes Type:
Specify the type of notes output you would like to generate from this analysis. The available options are:
| ||||||||||||
3 | Statistics: To select the statistics that you want to display for each analysis select the appropriate checkbox next to the statistic you are interested in. Depending on your data type only those metrics that are suitable for that data type can be selected. For example, for binary data you cannot calculate the mean value. For ordinal data, you can select a set of different metrics to calculate only if the categories are numeric. For example, if you are measuring the survey responses on a Likert scale with Disagree, Neutral, Agree. If you enter these as the categories then you can't calculate the median value but if you transpose the data to numeric values (like 1, 2, 3) then you can report the statistics such as the median. For discrete and continuous data, the data column must be numeric. | ||||||||||||
4 | Help Button: Click on this button to open the help file. | ||||||||||||
5 | Cancel Button: You can click on this button to cancel any changes you have made in the Analysis Setup and exit this form. No outputs are computed or updated. | ||||||||||||
6 | OK Button: You can click on this button to save any changes you have made in the Analysis Setup and recompute all the outputs (notes and graphs section). Note that the outputs are updated only if you have all the required data entered on the worksheet. If there are missing inputs, then you will need to enter the missing information and then click on Compute Outputs to generate the analysis results. |
1 | Search Data: The available data displays all the columns of data that are available for analysis. You can use the search bar to filter this list and to speed up finding the right data to use for analysis. Enter a few characters in the search field and the software will filter and display the filtered data in the Available Data box. | ||||||||
2 | Available Data: The available data box contains the list of data available for analysis. If your workbook does not have any data in tabular format, this box will display "No Data Found". The information displayed in this box includes the row number, whether the data is Numeric (N) or Text (T), and the name of the column variable. Note that the software displays data from all the tables in the current workbook. Even though data within the same table have unique column names, columns across different tables can have similar names. Hence, it is important that you not only specify the column name but also the table name. | ||||||||
3 | Add or View Data: Click on this button either to add more data into your workbook for analysis or to view more details about the data listed in the available data box. When you click on this button, it opens up the Data Editor dialog box where you can import more data into your workbook, or you can switch from the list view to a table view to see the individual data values for each column. | ||||||||
4 | Required Data: The code for the required data specifies what data can be specified for that box. An example code is N: 2-4. If the code starts with an N, then you will need to select only numeric columns. If the code starts with a T, then you can select both numeric and text columns. The numbers to the right of the colon specify the min-max values. For example, if the min-max values are 2-4, then you need to select a minimum of 2 columns of data and a maximum of 4 columns of data in this box. If the minimum value is 0, then no data is required to be specified for this box. | ||||||||
5 | Select Button: Click on this button to select the data for analysis. Any data you select for the analysis is moved to the right. To select a column, click on the columns in the Available Databox to highlight them and then click on the Select Button. A second method to select the data is to double click on the columns in the list of Available Data. Finally, you can also drag and drop the columns you are interested in by holding down the select columns using your left mouse key and dragging and dropping them in one of the boxes on the right. | ||||||||
6 | Selected Data:
If the right amount of data columns has been specified, the list box header will be displayed in the black color. If sufficient data has not been specified, then the list box header will be displayed in the red color. Note that you can double-click on any of the columns in this box to remove them from the box. The data you specify for this analysis depends on the options you have specified in the Setup tab.
| ||||||||
7 | View Selection: Click on this button to view the data you have specified for this analysis. The data can be viewed either in a tablular format or in a graphical summary. |
1 | Title: The system will automatically pick a title for your chart. However, if you would like to override that with your own title you can specify a title for your chart here. Note that this input is optional. |
2 | Sub Title: The system will automatically pick a subtitle for your chart. However, if you would like to override that with your own subtitle you can specify a subtitle for your chart here. Note that this input is optional. |
3 | X Label: The system will automatically pick a label for the x-axis. However, if you would like to override that with your own label for the x-axis you can specify a different label here. Note that this input is optional. |
4 | Y Label: The system will automatically pick a label for the y-axis. However, if you would like to override that with your own label for the y-axis you can specify a different label here. Note that this input is optional. |
5 | X Axis: The system will automatically pick a scale for the x-axis. However, if you would like to override that with your values for the x-axis, you can specify them here. The format for this input is to specify the minimum, increment, and maximum values separated by a semi-colon. For example, if you specify 10;20 then the minimum x-axis scale is set at 10 and the maximum x-axis scale is set at 20. If you specify 10;2;20, then, in addition to minimum and maximum values, the x-axis increment is set at 2. Note that this input is currently disabled and you will not be able to change this setting. |
6 | Y Axis: The system will automatically pick a scale for the y-axis. However, if you would like to override that with your values for the y-axis, you can specify them here. The format for this input is to specify the minimum, increment, and maximum values separated by a semi-colon. For example, if you specify 10;20 then the minimum y-axis scale is set as 10, and the maximum y-axis scale is set as 20. If you specify 10;2;20, then, in addition to minimum and maximum values, the y-axis increment is set at 2. Note that this input is optional. |
7 | Horizontal Lines: If you want to add a few extra horizontal reference lines on top of your chart you can specify the values here. The format for this input is numeric values separated by semi-colon. For example, if you specify 12;15 then two horizontal lines are plotted at Y = 12 and Y = 15 respectively. Note that this input is optional. |
8 | Vertical Lines: If you want to add a few extra vertical reference lines on top of your chart you can specify the values here. The format for this input is numeric values separated by semi-colon. For example, if you specify 2;5 then two vertical lines are plotted at X = 2 and X = 5 respectively. Note that this input is optional. |
1 | Notes: The notes section shows a summary of the inputs, specifically the number of rows of data, any skipped rows, and the number of distinct groups. In the analysis results area, the analysis lists the measures that you have selected to report such as Mean, Median, Min and Max values, Q1, Q3, IQR etc. |
2 | Graphs: For binary, nominal and ordinal data, a bar chart of the data is shown and for discrete and continuous data, a histogram of the data is shown in the graphical output box. A normal curve is superimposed on the histogram. |