Detailed List of
All Features

PRIMER 8 Lite

PERMANOVA+

Create a PERMANOVA design NEW UPDATED

Create a PERMANOVA design for an unlimited number of factors. Factors can be fixed, random, finite or correspond to a repeatedly measured subject or whole-plot from a split-plot design. Each factor can be specified as being nested within one or more other factors, or is treated as crossed. Build a priori contrasts among individual levels or groups of levels of any factor. Specify, remove, pool or change the order of fit for any/all individual terms from a list of all terms that are available (implied by your design), including all potential interactions among factors. Include one or more quantitative covariables in the model (via separate worksheet with matching sample labels). Treat sets of covariates in groups (specified by an indicator) to enable cyclical/seasonal (sin/cos) or other spatio-temporal (lat/long) models. Allow for heterogeneity of within-group dispersions, specifying the factor (or combination of factors) across which heterogeneity lies. Choose a partitioning based on Type I, Type II or Type III SS for unbalanced designs.

PERMANOVA analysis NEW

Analyse multivariate data in response to complex multi-factor experimental/sampling designs on the basis of a chosen resemblance matrix using permutational multivariate (or univariate) analysis of variance (PERMANOVA). Correct F-tests are constructed carefully by reference to expectations of mean squares (EMS) for the specified design. P-values are obtained using permutation algorithms for each term in the model using the correct permutable units and construction of appropriate residuals to account for other factors/terms in the model. Choose to do one of the following: the main (overall) tests (output in a PERMANOVA table) for all terms in the model; pair-wise tests to compare groups for specific factors, either a main effect or tests within levels of other factors to follow up significant interactions; or output a residual distance matrix after fitting the full model specified in the design file. Choose the number of permutations and to permute raw data, reduced-model residuals or full-model residuals. Optionally output Monte Carlo tests (useful for cases with few possible permutations) and/or histograms of values of the F statistic under permutation. PERMANOVA done on the basis of a Euclidean distance matrix calculated from one variable yields a permutational univariate ANOVA.

Allow for heterogeneous dispersions NEW
Factors with finite numbers of levels (in-between fixed and random) NEW
Repeated measures designs NEW
Split-plot and split-split plot (etc.) designs NEW
Group covariates (using an indicator) for spatio-temporal modeling NEW
Residual dissimilarity matrix / Residual plot NEW

Generate a residual dissimilarity matrix after fitting a model using either PERMANOVA or DISTLM. Remove the effects of unwanted, nuisance or known-to-be-important factors or predictor variables. Visualise potential pattern/relationships with other/minor/remaining factors or covariables by doing ordination(s)of residual dissimilarities.

PCO NEW

Principal Coordinate Analysis (PCO), an eigenvalue/eigenvector decomposition of the Gower-transformed dissimilarity matrix, showing % variation explained, with options to output: an ordination plot, PCO scores to a worksheet, scree plot, and further diagnostics such as Shepard diagrams and stress claculations for the 2D or 3D PCO ordination (for comparison with MDS outputs).

DISTLM NEW

Fit a distance/dissimilarity-based linear model on the basis of a resemblance measure of choice (DISTLM), a form of multivariate multiple regression. Predictor variables can be quantitative, ordinal or binary but must be numeric and are provided in a separate worksheet (with matching sample names). Specify the variables to be included in the model and the ordering of the fit, or perform model selection by choosing a selection procedure (forward, backward, step-wise, or ‘best’) and a selection criterion (R^2, Adjusted R^2, AIC, AICc, or BIC). Do marginal tests (each variable alone) or sequential conditional tests, with p-values obtained using permutation of residuals under a reduced model and choose the number of permutations. Optionally group sets of predictor variables together using an indicator (e.g., to handle factors as binary codes or to fit 2D+ spatial or cyclical models). Optionally output a dbRDA plot associated with the final model fit, predictor variable correlations and/or a residual distance/dissimilarity matrix.

CAP NEW

Canonical analysis of principal coordinates (CAP). Find linear combinations of unit-scaled PCO axes from a resemblance matrix that: (i) best discriminate groups (provided as a factor), (ii) best discriminate positions along a single gradient, or (iii) correlate best with linear combinations of some other set of variables (X). Do diagnostics to assess the appropriate number of PCO axes to include in the model (m) , with the option to output diagnostic plots. Specify m explicitly, or choose m to maximise leave-one-out % allocation success to groups (cross-validation) or to minimise the leave-one-out residual sum-of-squares for the CAP model. Optionally add new samples and predict their positions on CAP axes / group membership / gradient position. Optionally do a permutation test and/or output CAP scores to a worksheet, including predicted values/classifications for new samples. When based on a Euclidean distance measure, the CAP method corresponds to classical discriminant analysis (for groups) or canonical correlation analysis (for gradients), but with all tests by robust permutation.

Centroid Plot - Main effects plot NEW

From a resemblance matrix, create an ordination showing the centroids for main effects (i.e., the centroids for each level of every factor) in the space of the chosen resemblance measure. Can be done for a single factor or for all factors specified in a PERMANOVA Design worksheet. Specify the ordination method (mMDS, tmMDS or nMDS). Optionally include the overall centroid and/or replicates in the plot as well. Distances among all points in the plot are also output as a new resemblance matrix.

Centroid Plot - Interaction plot NEW

From a resemblance matrix, create an ordination of centroids corresponding to all combinations of factor levels (cells) in a two-way or three-way design, given a PERMANOVA design worksheet, in the space of a chosen resemblance measure. Specify the ordination method (mMDS, tmMDS or nMDS). For a three-way design, the colours of symbols identify the levels of the first factor; the shapes of symbols identify the levels of the second factor, and the labels identify the levels of the third factor. For a two-way design, combined colours+symbols identify the levels of the first factor, and labels identify the levels of the second factor. Distances among all centroids in the plot are also output as a new resemblance matrix.

Control Chart - classical NEW

Classical multivariate control chart, using Hotelling’s T^2 criterion. From a data sheet, discern if a new sample point (e.g., in a monitoring context) is ‘in-control’ or ‘out-of-control’, by comparison with a reference set of previous (‘in-control’) observations. Choose the number of initial control-chart samples and the type of chart (progressive, fixed baseline or moving window). Specify the alpha-level and type of upper control-chart limit (non-parametric or parametric). Robust non-parametric limits are obtained from quantiles of a permutation distribution. Choose to apply shrinkage for high-dimensional systems. Produce a control-chart plot and a factor that identifies in-control/out-of-control samples.

Control Chart - dissimilarity-based NEW

From a resemblance matrix, implement a dissimilarity-based multivariate control chart. This utilises Hotelling’s T^2 criterion, but on a set of ordination axes that represent the information in the dissimilarity matrix. In addition to all of the options and features of the classical multivariate control chart, choose the ordination method and number of axes that shall be used to represent the resemblance marix for analysis (PCO, mMDS, or nMDS) and limit the ordination to a dimensionality that is fixed by the user, chosen to capture a certain percentage of the variation (PCO), or to achieve a certain matrix correlation with the original resemblance matrix (e.g., rho > 0.99).

Fixed or random factors
Interaction terms
Nested factors/hierarchical designs
User-specified contrasts
Mixed models
Randomized blocks
Latin squares
Test one or more covariates (ANCOVA)
Test interactions of covariates with factors/other terms
Unbalanced designs (Type I, Type II or Type III SS)
Remove or Pool terms in the model
Change the order of the terms/model fit
PERMDISP

Test the null hypothesis of no differences in multivariate dispersion (beta diversity) among groups based on a chosen resemblance matrix. Provide a factor identifying the groups to be compared, choose to obtain p-values from tables or using permutation of residuals and specify the number of permutations. Optionally output pair-wise tests as well, or individual deviation values of sample units to their own group centroid (and/or their means) to a worksheet.

dbRDA

Dissimilarity-based redundancy analysis (dbRDA) produces ordination axes to show the fitted variation from a DISTLM. This can be implemented as an option from the DISTLM routine for a given model specified there, or can be achieved directly from a specified predictor variable worksheet (with matching sample names). Options include provision of dbRDA scores to a worksheet or output of a residual distance matrix. The default plot shows the predictor variables as vectors on the dbRDA ordination. dbRDA on the basis of a Euclidean distance matrix produces an RDA.

Distances among centroids

Calculate distances among centroids corresponding to levels of a user-supplied factor. The factor can optionally be constructed as combinations of factors for multi-way designs (using Edit > Combine). The result is a resemblance matrix among centroids that is ready for ordination/plotting.

Workspace

File

Open, Close, Save, Rename, Delete, Print, Exit any file within a PRIMER workspace, or the entire workspace file itself (*.pwk), with all of its component elements: data sheets, resemblance matrices, results files, graphics, and/or notes.

View

Toggle on or off from View: the Explorer tree, the Tool bar, the Status bar, or (for the Python add-on) the script Console window.

Explorer tree

Keep track of all analyses and workflows in a single workspace. Use the Explorer tree to navigate files, analyses, graphics and results.

Notes

Add a Note anywhere in the Explorer tree. Keep track of ideas, trials, insights and lines of thought right inside the PRIMER workspace itself. Notes can contain text or graphics – anything you can copy and paste. Perfect for sharing analyses with mentors/students/collaborators.

Window

Arrange the Windows in the PRIMER workspace: Cascade, Tile Horizontally, Tile Vertically, Arrange Icons, or Close All Windows.

MultiPlot

Arrange multiple plots within a single Graphic. Specify the graphics to include and arrange them horizontally or vertically, specifying the number of columns or rows. Optionally suppress showing individual titles, sub-titles and/or keys. Customise the font scale and/or the symbol sizes.

Undo workspace

Reverse deletions, name changes, etc. on the Explorer tree.

Data Import and Editing

Rename Levels NEW

Rename the Levels of Factors/Indicators

Create a New…

Create a new: Workspace, Sample data sheet, Variable information file, Note, or Multi plot.

Data Import; Open…

Open/Import 3-column or rectangular Data matrices, triangular Resemblance matrices or Image files. Can read in: any PRIMER file format (version 4, 5, 6, 7 or 8); Text files (*.txt, *.csv), Excel files (*.xls, *.xlsx, *.xlsb, *.xlsm); Rich text files (*.rtf); or Image files (*.jpg, *.jpeg, *.png, *.bmp, *.tif, *.gif, *.emf).

Edit Data sheets

Highlight, Cut, Copy, Paste or Delete individual values; Highlight, Add, Insert, Delete, Move or Sort rows or columns; Edit Labels on rows or columns (Variable/Sample names).

Undo (data)

Changes to data entries can be (multiply) reversed.

Edit Properties (Data sheet)

Edit Data Properties, including Data type (abundance, biomass, environmental or unknown/other), specify Samples as either Columns or Rows, and add/remove a Description (meta-data).

Edit Properties (Resemblance matrix)

Edit Resemblance matrix Properties, including Resemblance type (similarity, dissimilarity, distance, distance^2, correlation, ANOSIM R-statistic, or rank), and add/remove a Description (meta-data).

Edit Properties (Variable Information sheet)

Edit Variable Information sheet Properties, including Data type (taxa, traits, scores, or unknown/other), and add/remove a Description (meta-data).

Edit Factors/Indicators

Add, Duplicate, Combine, Rename Delete or Import Factors/Indicators associated with Data or Resemblance matrices. Optionally choose to Fill Values or Patterns to auto-complete manual entry of new Factor/Indicator levels. Edit the Levels of Factors/Indicators.

Tools (Data sheets)

Create Ordered Groups NEW

Create ordered groups from a continuous variable. These can be: (i) a given number of groups with equal sample sizes or at equally-space intervals; (ii) a given number of groups obtained by minimising the within-group sum-of-squares (SS) or mean-absolute-deviation (MAD), or the sum of average rank distances within groups; (iii) specified by user-entered breaks or quantiles; (iv) specified by a choosing a common within-group sample size.

Split Data NEW

Split a Data sheet into multiple sheets according to group identified by either a Factor or a Variable.

Summary Stats NEW

Summarise Variables or Samples (either overall or separately within groups specified by a Factor or Indicator) with any combination of the following statistics: average, median, sum, minimum, maximum, quantiles (uer-specified), range, inter-quartile range (IQR), standard deviation, variance, sample size, standard error, symmetry, skewness, kurtosis, number of zeros, number of singletons, number of doubletons, number of non-zeros (frequency), smallest number above a user-specified threshold value, largest number below a user-specified threshold.

Aggregate

Aggregate data according to a taxonomic or other hierarchy (e.g., to analyse at the genus, family, or other level of your choice), via an Aggregation file (Variable Information sheet).

Average

Average data across Samples according to the levels of a Factor, or across Variables according to the levels of an Indicator

Check (Data sheet)

Check Data matrices for: missing values, duplicate samples, all-zero samples, estimated values, negative values, duplicate variables, and/or all-zero variables.

Duplicate

Duplicate an item (data sheet, resemblance matrix, graphic, etc.) in the PRIMER workspace

Expand Samples

Expand a Data sheet so it can be matched with one that is sampled more finely.

Merge

Merge two data sheets together, with separate join and merge options for Samples and Variables.

Missing

Replace missing values with zeros or with an estimated value, using an expectation-maximisation (EM) algorithm, which assumes a multivariate normal distribution model for the data.

Rank Variables

Rank each variable individually (largest numerical value has rank = 1).

Sum

Sum data across Samples according to the levels of a Factor, or across Variables according to the levels of an Indicator

Transpose

Transpose (swaps columns and rows) of a data sheet to a create a new one. The properties of thematrix do not change (i.e, sample/variable orientation is also switched to correspond to the new orientation of the matrix).

Tools (Resemblance matrix)

Model Matrix

Create a Resemblance matrix (e.g., as a preamble to a RELATE test) corresponding to a model of: Seriation (with a numeric Factor specifying distances), Cyclicity (with a numeric Factor specifying fractional positions along a circle), Euclidean 2D spatial distances (two Factors are used to specify the position of the sample in a 2D plane), or Unordered groups (with a Factor specifying the groups).

Average

Average Resemblance values across all samples within groups identified by a Factor.

Check (Resemblance matrix)

Check Resemblance matrices for: undefined values, out of bounds values, or duplicate labels.

Dissim

Convert Similarities to Dissimilarities (and vice versa)

Duplicate

Duplicate a Resemblance matrix in the PRIMER workspace

Expand

Expand a Resemblance matrix so it can be matched with one that is sampled more finely.

Rank distance

Rank the values in a resemblance matrix (highest similarity / smallest dissimilarity has rank = 1).

Transform

Transform highlighted values according to a user-specified mathematical expression (using a format that is the same as the BASIC computer language).

Unravel

Sequentially ‘unwind’ a resemblance matrix into a single column of values.

Tools (Variable Information sheet)

Tree

Display hierarchical (e.g., taxonomic) data as a cascading tree of entries.

Check (Variable Information sheet)

Check Variable Information sheet for: missing values, duplicate species, or inconsistent taxa (incorrect nesting in the hierarchy)

Duplicate

Duplicate a Variable Information sheet in the PRIMER workspace

Pre-treatment (Data sheet)

Standardisations NEW

Standardise Samples (or Variables) by dividing each value in a given Sample (or Variable) by the total sum or the maximum value of that Sample (or Variable). Further options include: standardising separately wtihin levels of an Indicator/Factor; expressing the results as either direct or cumulative percentages or proportions.

Transformations(overall)

Transform all of the Variables in a Data sheet to one of: none, square root, fourth root, log(x+1) or presence/absence.

Transformations(individual)

Transform highlighted variables in a Data sheet according to a user-specified mathematical expression (using a format that is the same as the BASIC computer language)

Cumulate Samples

Cumulatively sum the values of each sample. The order of cumulation is either the same as the variable order in the Data sheet or is specified by a numeric Indicator.

Weight Variables

Each of the Variables in a Data sheet is weighted by values given by a numeric indicator.

Dispersion Weighting

Down-weight species (or taxa), based on their degree of clustering (estimated variance/mean ratio) as per a Generalised Poisson model of over-dispersed counts.

Variability Weighting

Variables are individually down-weighted depending on their variability according to: pooled standard deviation, averaged standard deviation, averaged range or averaged inter-quartile range.

Normalise Variable

Values for each Variable have their mean subtracted and are divided by their standard deviation.

Analyse (Data sheet)

PCA NEW

Principal Component Analysis (PCA), an eigenvalue/eigenvector decomposition of the covariance matrix (so normalised data should be input), showing % variation explained, with options to output: an ordination plot, PCA scores to a worksheet, scree plot, and further diagnostics such as Shepard diagrams and stress claculations for the 2D or 3D PCA ordination (for comparison with MDS outputs). The PCA ordination plot, by default, also shows base variables as labeled vectors (eigenvectors within a unit circle).

Wilcoxon signed-rank test NEW

Univariate non-parametric Wilcoxon signed rank test to compare two levels of a (first) factor in a paired design, based on the ranks of paired differences. A (second) factor identifying pairs must also be provided and the alternative hypothesis (two-tailed or one-tailed) must be specified. A p-value is achieved using permutations of values across the two levels of the first factor, separately within each pair. Options include: to perform the test separately within levels of another (third) factor; to output values of the test-statistic under permutation to a worksheet and/or histogram; and to output box plot(s) of paired differences.

Mann-Whitney U test NEW

Univariate non-parametric Mann-Whitney U test to compare two levels of a factor, based on the ranks of the combined set of sample values (both groups). The alternative hypothesis (two-tailed or one-tailed) must be specified. A p-value is achieved using permutations of values across the two levels. Options include: to perform the test separately within levels of another (second) factor; to output values of the test-statistic under permutation to a worksheet and/or histogram; and/or to output box plot(s) to visually compare the two groups.

Kruskal-Wallis test NEW

Univariate non-parametric Kruskal-Wallis test to compare two or more levels (groups) of a factor, based on the ranks of the combined set of sample values (all groups). A p-value is achieved using permutations of values across the groups. Options include: to perform pairwise comparisons of groups and (optionally) output results to a worksheet; to perform the test separately within levels of another (second) factor; to output values of the test-statistic under permutation to a worksheet and/or histogram; and/or to output box plot(s) to visually compare the groups.

Kolmogorov-Smirnov test NEW

Non-paramertric Kolmogorov-Smirnov test to compare two distributions of a continuous variable (two-sample test). Rejection of the null hypothesis indicates that the shape/position of the two distributions differ from one another in some way (location, dispersion, skewness, etc.). A p-value is achieved using permutations of values across the groups. Options include: to perform the test separately within levels of another (second) factor; to output values of the test-statistic under permutation to a worksheet and/or histogram; and/or to output cumulative empirical distribution plot(s) to visually compare the groups.

Test of association NEW

Non-parametric bivariate test of association between two variables obtained from the same set of sampling units. The alternative hypothesis must be specified: (two-tailed or one-tailed with expectation of either a positive or a negative association). A permutation test (randomizing the ordering of the samples for one variable while the ordering for the other variable stays constant), is used to calculate a p-value. The test can be based on one of the following measures: Pearson correlation, Spearman rank correlation, Weighted Spearman rank correlation, Kendall’s tau, or the Index of Association. Options include: to perform the test separately within levels of a specified factor; to output values of the test-statistic under permutation to a worksheet and/or histogram; and/or to output scatter plot(s) to visually examine patterns.

Calculate Resemblances

Obtain a Resemblance matrix by calculating measures of association (similarity, correlation, dissimilarity or distance) between all pairs of Samples or Variables in a Data sheet on the basis of 49+ resemblance measures: Bray-Curtis, Euclidean, Index of Association, Jaccard, Sørensen, Hellinger, Mod. Gower, Canberra, Minkowski, Binomial deviance, Chi-squared, Czekanowksi, Manhattan, Pearson, Spearman, Kendall, and more. Provide a taxonomic hierarchy or dissimilariy matrix among variables to calculate Taxonomic (Gamma+), Functional or Phylogenetic (Theta+) resemblances. Option to add a dummy variable (present in all samples) to calculate the adjusted Bray-Curtis (or other) measure. Resemblance calculations will all now operate in the presence of missing data entries.

Caswell's neutral model

Calculate Caswell’s V statistic under ‘neutral model’ assumptions.

Diversity measures (DIVERSE)

Calculate a suite of diversity indices for species-by-sample data. Options include: total species (S), total individuals (N), Margalef’s species richness, Pielou’s evenness (J’), Brillouin (H), Fisher’s alpha, Shannon (H’), Simpson’s evenness/dominance, Hill’s diversity numbers, rarefaction (estimated number of species in samples of user-specified sizes), taxonomic diversity (delta), taxonomic distinctness (delta*), average taxonomic distinctness (AvTD, Delta+), total taxonomic distinctness (TTD), variation in taxonomic distinctness (VarTD, Lambda+), average phylogenetic diversity (Phi+), Total phylogenetic diversity (PD).

Compare dominance curves (DOMDIS)

Calculate differences between k-dominance plot curves for all pairs of samples, with an option to log-transform the species rank axis.

SIMPER

Similarity Percentages (SIMPER) to assess the contributions of individual species (or taxa) to Bray-Curtis similarity, or variable contributions to Euclidean distance. Can be done for one-way or two-way crossed designs. Calculates the average contribution of a species to all pair-wise similarities within a group and (optionally) average contributions of a species to all pair-wise dissimilarities between samples occuring in different groups (separately for each pair of groups).

SIMPROF

Similarity Profile (SIMPROF) tests for cluster-type structures in samples or correlation/association-type structures in variables for a set of data. Sample (or Variable) similarities are ordered and plotted against their ranks to produce a profile. This profile for the data set is compared with the mean profile calculated from a distribution of profiles obtained under permutation, using the sum of absolute differences (the ‘pi’ statistic) to create a test of the null hypothesis of ‘No structure’.

Analyse (Resemblance matrix)

Cluster analysis - agglomerative method (CLUSTER)

Hierarchical agglomerative cluster analysis with a choice of cluster mode (single linkage, complete linkage, group average or flexible beta linkage). Options include the ability to: identify clusters with ‘significant structure’ remaining within them via SIMPROF tests performed at every node; create a factor based on the SIMPROF results; and/or to create a cophenetic distance matrix. Dendrogram plots in the output can be further customised. Specify factors or indicators for labels and/or symbols. Change the orientation of the plot (up, right, down or left). Rotate the leaves of the dendrogram (click horizontal lines). Simplify by collapsing leaves within any branch (click vertical lines). Use Zoom-in/Zoom-out tools to focus on particular parts of a dendrogram. Show a ‘slice’ through the dendrogram at a particular similarity/dissimilarity value and create a factor from this.

Cluster analysis - divisive method (UNCTREE)

Divide a set of multivariate observations (species or samples) into groups so as to maximise the ANOSIM R-statistic using binary top-down splits to yield a divisive hierarchical tree. Nominate the minimum group size and/or split size. Optionally identify clusters with ‘significant structure’ remaining within them via SIMPROF tests performed at every node; create a factor based on the SIMPROF results; and display vertical positions of nodes using either equally spaced (A%) or split quality (B%) criteria.

Constrained divisive clustering (LINKTREE)

Non-parametric classification based on divisive top-down binary splits chosen to maximise the ANOSIM R statistic (like UNCTREE) while also being constrained by one or more other variables in a secondary set of data (e.g., spatial/environmental). More specifically, the constraints require that a split can only be made that simultaneously corresponds to a split in the rank-ordered values of one more more variables in the other (secondary) data set. Optionally identify clusters with ‘significant structure’ remaining within them via SIMPROF tests performed at every node; create a factor based on the SIMPROF results; display vertical positions of nodes using either equally spaced (A%) or split quality (B%) criteria; and replace variable names in the information pane of the plot with the levels of an indicator or factor.

Non-parametric k-means (kRCLUSTER)

Divide a set of multivariate observations (species or samples) into a given number (k) of clusters/groups to maximise a given criterion – either the ANOSIM R statistic or the average rank within groups – using an iterative technique. Chose a single value for k, or perform the analysis for a range of k values. For the latter the lowest k is chosen so that a SIMPROF test shows no significant structure remains within the clusters. Output is a factor identifying the k groups.

Non-metric MDS (nMDS) ordination

Non-metric multi-dimensional scaling (create a configuration of inter-point distances to match rank-order dissimilarities) using an iterative algorithm from multiple initial random configurations to identify a global lowest-stress solution. Choose: the number of dimensions for the configuration(s) (2 to 3, or user-specified range of dimensions) and the number of restarts (random initial random configurations). There are options to choose the Kruskal fit scheme (1 or 2), animate the configuration plot, output Shepard diagrams and/or scree plots, send configuration coordinates to a worksheet, or ‘fix’ a potential ‘collapsed’ configuration by ‘mixing’ in a specified proportion of a metric MDS solution in the stress equation.

Metric MDS (mMDS) ordination

Metric multi-dimensional scaling (create a configuration of inter-point distances to match dissimilarities linearly) using an iterative algorithm from multiple initial random configurations to identify a global lowest-stress solution. Choose: the number of dimensions for the configuration(s) (2 to 3, or user-specified range of dimensions), and the number of restarts (random initial random configurations). There are options to animate the configuration plot, output Shepard diagrams and/or scree plots, or send configuration coordinates to a worksheet.

Threshold metric MDS (tmMDS) ordination

Threshold metric multi-dimensional scaling (create a configuration of inter-point distances to match dissimilarities linearly, but relax the intercept to a non-zero value) using an iterative algorithm from multiple initial random configurations to identify a global lowest-stress solution. Choose: the number of dimensions for the configuration(s) (2 to 3, or user-specified range of dimensions), and the number of restarts (random initial random configurations). There are options to animate the configuration plot, output Shepard diagrams and/or scree plots, or send configuration coordinates to a worksheet.

Combined MDS

Minimises an equal mixture of stress functions from two nMDS ordinations, which has potential application to produce a consensus view of among-sample relationships for two sets of variables which cannot be merged into a single matrix – perhaps needing different resemblance measures (e.g. biotic and abiotic; motile organism counts and colonial species areas, etc.).

Analysis of Similarities (ANOSIM)

Analysis of similarities (ANOSIM), a non-parametric test using the R statistic to rigorously test the null hypothesis of no differences among groups of multivariate samples (levels of a factor) in any specified design up to three factors having crossed and/or nested relationships among them. P-values are all achieved using rigorous permutation methods, including appropriate constraints or choice of permutable units for any given factor within the context of the full design. Choose a maximum number of permutations. For multiway tests with no replciation, choose the correlation method (Spearman, Weighted Spearman or Kendall). Optionally output pairwise tests values of R and/or their associated significance levels to worksheet(s), histogram plots or worksheets showing the distribution of test-statistic values under permutation.

BEST - BIOENV

Select a set of variables (from a ‘Fitted data’ worksheet) that will generate a distance (or dissimilarity) matrix (based on a chosen resemblance measure) that ‘best matches’ the patterns of relationships among samples in a given (target) resemblance matrix. This is an exhaustive search. Often used to find (say) environmental (or other) variables that, taken together, match patterns in biotic (species) dissimilarities. Choose the matrix correlation method (Spearman, weighted Spearman, Pearson, or Kendall). Optionally, keep fitted data variables together in groups specified by an indicator, do the analysis within levels of a factor, or do a permutation test for a significant ‘match’ that also carefully accommodates the ‘search’ beaing done through the fitted variables (computer intensive). Examine a chosen number of best overall results as well as the best results for a given number of fitted variables.

BEST - BVSTEP

Select a set of variables (from a ‘Fitted data’ worksheet) that will generate a distance (or dissimilarity) matrix (based on a chosen resemblance measure) that ‘best matches’ the patterns of relationships among samples in a given (target) resemblance matrix. This is not an exhaustive search, but rather begins with either a fixed set of variables or a specified number of random trial variables, followed by a stepwise (forward/backward) selection method to improve the fit. Useful for identifying indicator and/or redundant species sets. Choose a stopping criterion (e.g., rho > 0.95) and tolerance (e.g., delta rho < 0.001) for the fit; all other options are similar to BIOENV.

MVDISP

Calculate an index of multivariate dispersion, based on the rank dissimilarities among replicates within groups.

RELATE

Calculate a matrix correlation (using Spearman’s rho, weighted Spearman’s rho, Pearson’s correlation or Kendall’s tau) and test the null hypothesis of no significant association (no relationship in multivariate patterns) between two resemblance matrices. The second resemblance matrix can be a user-generated resemblance matrix or a model matrix (e.g., for trend or cyclicity). P-values are obtained via permutation of sampling units. Equivalent to a Mantel test when Pearson’s measure of matrix correlation is used. Choose the maximum number of permutations. Optionally perform the test within levels of a factor. Optionally output a histogram or a worksheet showing the distribution of values of the test-statistic under permutation.

2STAGE

Calculate the matrix correlation (using Spearman’s rho, weighted Spearman’s rho, Pearson’s correlation or Kendall’s tau) between all pairs of two or more resemblance matrices. The resulting matrix correlations are output into a new resemblance matrix, whose sample names correpond to the original resemblance matrices’ names. An MDS of the resulting matrix is a ‘second-stage’ MDS. Another option is to calculate resemblances between subsets of a chosen resemblance matrix, by nominating an ‘Outer’ and an ‘Inner’ factor. The levels of the ‘Outer’ factor split the whole resemblance matrix into the smaller ones which are to be compared. Its levels will be used to label the entries in the output resemblance matrix. The levels of the ‘Inner’ factor identifies corresponding samples in the sub-matrices.

TAXDTEST

Test the null hypothesis that a species list from one locality (or time) has the same taxonomic distinctness (AvTD, delta+) and/or variation in taxonomic distinctness (VarTD, lambda+) structure as a ‘master’ list (e.g., of all species in that biogeographic region) from which it is drawn. Achieved by random draws of species from the master list for a given number of species (S). Choose to plot a histogram (one value of S), funnel (one or more samples and a user-specified range of S values, but for just AvTD or VarTD only) or ellipse plot (AvTD and VarTD plotted together bivariately for a user-specified range of S values) to show the distribution of values obtained under random draws, and showing sample values within this context. Choose to weight the random selection of species by their abundances (frequencies) in a master frequency worksheet. Choose the significance level for contours in funnels and ellipses.

Bootstrap Averages

Ordination of bootstrap averages and confidence regions for groups of samples from a resemblance matrix. First, a metric MDS of sufficient dimension (m) to represent the resemblance matrix is produced first. Second, a set of n_boot random bootstrap re-samples within each of several (g) groups (specified by a factor) are calculated in that metric MDS space. Third, averages are calculated from the bootstrap samples in each group, yielding g x n_boot bootstrap averages. Ordination of all g x n_boot bootstrap averages can be done using metric, threshold-metric or non-metric MDS. Options include: choice to specify m directly or automate the choice using matrix correlation (i.e., choose m yielding rho greater than a user-specified value); output m-dimensional data to a worksheet (replicates, bootstrap averages and/or group averages); display bootstrap averages, group averages and/or bootstrap confidence regions at 50%, 80% or 95% significance levels.

Plots

Box Plot NEW

Create a Box-and-whisker plot of a univariate variable to compare the distirbutions of observations across different groups (identified by a Factor). Optionally change the opacity, colours, and degree of saturation of the colours.

Dot Plot NEW

Visualise the empirical distribution of a univariate set of data. Choose a factor to compare distributions of samples for different groups. Plot a dot for every data point at its appropriate location on the number line (y axis). Observations that have the same value are ‘stacked’ alongside one another at that same position and a horizontal line for the median is also shown. Optionally change the opacity of the dot symbol, the colour, and/or the saturation of the colour.

Empirical Distribution Plot NEW

Plot the cumulative empirical distribution (expressed as percentages or proportions) as a line for either Variables or Samples, with the option to split these into multiple distributions (lines on the plot) by specifying a Factor or Indicator, respecitvely.

Line Plot NEW

Plot the values of variables as lines (across all samples), or values for samples as lines (across all variables). Optionally draw separate plots for groups specified by an Indicator (for variable lines) or a Factor (for sample lines).

Violin Plot NEW

Create a Violin plot to show an empirical probability density function of a univariate variable for each of several groups (identified by a Factor), obtained by a non-parametric kernel density estimate (kde), which is mirrored horizontally. Also shows the median and inter-quartile range. Optionally change the bandwidth (Silverman’s rule of thumb or customised values, separately for each group), change the method of rescaling (none, area, width or count), set upper/lower cut-off bounds for truncated variables types (e.g. a lower cut-off of zero for biomass data), or customise the opacity, colour and/or saturation of the colours used in the plot.

Bar Plot

Individual or stacked bar plots showing the values of variables across samples in 2D or 3D.

Dominance Plot

Ranked Species abundance (or biomass) plot. For each sample, species are ranked in decreasing order of abundance. Their relative abundance (i.e. percentage of the total abundance in the sample) is plotted against the increasing rank (x axis), the latter on a log scale. Choose to plot relative abundance, cumulative relative abundance (a k-dominance plot) or partial dominance.

Abundance-Biomass Comparison (ABC) Curve

Put dominance curves for abundance and biomass, separately calculated, onto the same plot. Calculate the W statistic (difference in the two curves) as a measure of perturbation.

Draftsman Plot

Plot bivariate scatter plots for p variables in a p x (p-1)/2 array. Examine distributional patterns and patterns of correlation structure across the variables and calculate correlations for every pair of variables.

Geometric Class Plot

Plot the number of species (y axis) that fall into a set of geometric (x 2) abundance classes (x axis). Each line on the plot gives the number (or %) of species represented in the sample by a single individual (class 1), 2-3 individuals (class 2), 4-7 individuals (class 3), 8-15 individuals etc. Also called Species Abundance Distribution (SAD) curves.

Histogram Plot

Plot Histograms for each variable in a Data sheet. Optionally change the colours of the bins or boundaries, or change the bin size.

Means Plot

Plot means and 95% confidence intervals of a univariate variable for different levels of a Factor (groups). Optionally join those means with a line and/or use a common variance estimator to produce the confidence intervals.

Scatter Plot

Plot a 2D bivariate scatter plot of two chosen variables, or a 3D tr-variate scatter plot of three chosen variables, with the option to preserve the aspect ratio.

Shade Plot

Create a shade plot (heatmap) of the data matrix (Variables are rows and Samples are columns, or vice versa), with a grey-scale (from white-to-black) showing increasing values of abundance/biomass in a mosaic. Change from grey-scale to a single colour, two colours, or a spectrum. Specify the maximum, the number of levels to appear in the key and optionally reverse the palette. Show in 2D or as bars in 3D. Alter the relative size of dendrgrams (if included), draw sample and/or variable constraint group boundaries.

Shade Plot - Reorder

Change the ordering of samples and/or variables in a shade plot. You can keep the original/worksheet order, order by a numeric factor/indicator, order by some other chosen variable, order to maximise the degree of seriation (where you also choose the number of restarts for optimising the seriation), order according to nearest neighbour, or by reference to a resmblance matrix. Add additional constraints to the re-ordering, such as to maintain groups or the structure of a dendrogram from a cluster analysis of the samples/variables. Note that separate re-ordering options and/or constraints are chosen and applied for samples and variables.

Species-Accumulation Plot

Plots the cumulative species count against sample number and (optionally) outputs several different estimators of total richness based on the plot (S, Chao 1, Chao 2, Jacknife 1, Jacknife 2, Bootstrap, Michaelis-Menton, Ugland-Gray-Ellingsen). Choose to keep samples in their original order, or specify an order via selecting samples. A permutation option is also offered, where different (permuted) sample orders are taken and the resultant curves are averaged.

Surface Plot

A 3D plot of samples by variables with the height and colour gradient corresponding to data values. Optionally change the opacity of the surface.

Wizards

Basic multivariate analysis

Run a suite of basic multivariate analysis methods with user-specified settings on a set of data: (1) Pre-treatment (standardisation/transformation); (2) Resemblance; (3) one-way ANOSIM (user-specified factor); (3) CLUSTER analysis (optionally with SIMPROF); (4) Non-metric MDS; and (5) SIMPER.

Matrix display

Create a shade plot (heatmap) of the data, enabling a direct visualisation of the data values, where ordering of samples and variables are each separately re-arranged to maximise a pattern of seriation along each axis of the plot, but constrained by a cluster analysis of the samples (based on Bray-CUrtis) and a (separate) cluster analysis of the variables (based on the Index of Association). Samples may also be retained in groups based on the levels of a chosen factor. Optionally do an overall transformation and/or reduce the species set to retain only a chosen number of ‘important’ variables (based on % contribution to any one sample).

Coherence plots

Display plots of variables which are functionally similar via the following automated steps: (1) CLUSTER analysis on the variables; (2) SIMPROF tests (at a chosen alpha-level and number of permutations) to identify maximal groups of variables with no structure (i.e., ‘coherent variables’); (3) line plots of the coherent variable sets. For biotic data (type = abundance or biomass), there are options to standardise the variables, choose the resemblance measure (e.g., Index of Association) and reduce the species set to ‘important’ species (based on % contribution to any one sample). For environmental/other data, there are options to scale variables via normalisation or ranking, choose the resemblance measure (e.g., Pearson).

Graph Options

General

Choose to plot additional Information and History as an extra text pane. Choose the line width, line pattern, and overall font scale for History panes and text.