Quality Assurance and Quality Control Estimates
for the Production Ageing of Northwest Atlantic Species

Data Presentation

For each species, a table is given summarizing the results of all tests which have been conducted. Within this table, the source of samples for each test is listed first, and is linked to more detailed results for each test. If the test samples were from a specific part of the species' range, the stock area is shown; if no stock area is listed, the test was for all management areas combined. Detailed results for each test include an age-bias plot, an age-frequency table, and a summary of the test results for each production (or reference) age, in addition to the measures shown in the species table.

Visual inspection of the age-bias plots and age-frequency tables will reveal the presence or absence of bias, though these are not quantitative. The Bowker's test of symmetry can be useful in quantifying bias when the sample size is large and variability is high.

Variability is measured via both percent agreement and the mean coefficient of variation (CV). These measures are inflated when a bias is present, and will not accurately reflect variability when there is a bias. Variability levels are related to various factors inherent in the samples, including the fish species, the age reader’s experience, and the structure used for age determination, as some species/structures are easier to age than others.

Statistical Measures

The following measures are used to characterize the results of tests of ageing consistency at the Fishery Biology Program at the Northeast Fisheries Science Center:

Coefficient of Variation (CV)

The mean coefficient of variation (CV, Campana et al. 1995, Chang 1982) is a relatively robust approach to quantifying agreement in fish ages. It yields results which are easier to compare between species and structures. Also, the contribution each fish makes to the CV is relative to the average age assigned to that fish; i.e., a 2-year error in ageing a young fish would increase the measure more than would a 2-year error in an older fish, as the percentage change in age is greater for younger ages.

The CV is based on the differences between the mean age and each given age for each fish, and then these values are averaged over the entire sample set. When two ages are assigned to each fish, the CV is calculated as follows:

where Xij is the ith age for the jth fish, Xj is the mean age of the jth fish, and N is the sample size.

Campana (2001) indicates that many ageing laboratories around the world view CVs under 5% to be acceptable among species of moderate longevity and ageing complexity. His description applies to most of the species considered here.

Percent Agreement

The Fishery Biology Program has used this measure since the group’s inception, and considers levels of over 80% to be adequate. It is calculated based on the percentage of ages agreed upon relative to the total number aged:

For this measure, an error in ageing a young fish changes the measure by the same amount as would a similar error for an old fish. Therefore, this statistic is harder to compare between samples sets with different age distributions or across species.

Age-Bias Plot

The age-bias plot shows the average test age attained during the intra-reader precision test versus each production age (or, for accuracy tests, the reference age), with a 95% confidence interval. In inter-reader tests, one reader's age is chosen to be on each axis; unless one reader is a trainee, the selection of which set of ages in on the x-axis is not intended to indicate that one person's ages are expected to be more reliable.

These plots are easy to interpret, and can reveal bias patterns which other displays and statistics may not (Campana et al. 1995). The diagonal line in these plots indicates 1:1 agreement. The average test age should be near this line, and the 95% confidence interval should bracket it. If all the average ages are above or below this line, or the average ages diverge from this line at either low or high ages, this indicates a systematic bias.

Age-Frequency Table

The age-frequency matrix shows the numbers of samples at each age for both the test age, across the top, and the production (or reference) age, on the left. The grey boxes along the main diagonal of the matrix indicate the number of samples for which both ages are in agreement; fewer samples falling outside these boxes indicates better consistency. Numbers above this diagonal indicate fish which were given a higher age during the test, while numbers below this were given a lower test age; greater distance from the main diagonal indicates poorer agreement. Totals (at the right & bottom) indicate the age distribution within the test for both set of ages.

Bowker's Test of Symmetry

For both types of precision test, a Bowker’s test (Hoenig et al. 1995, Bowker 1948) was used to test for departures from symmetry within the age-frequency table. Such asymmetries indicate the presence of a bias, although the test has low sensitivity when few disagreements exist. Where ages differ from one another, the Bowker's test compares values on the age-frequency table which represent symmetric errors, such as the paired ages (3,4) and (4,3). If all such values are dissimilar, the test will return a significant P value.

This test statistic is calculated as a chi-square variable, as follows:

where m is the maximum age in the data set, and nij is the number of fish in the ith row and jth column (Hoenig et al. 1995, Bowker 1948). The value of the degrees of freedom is equal to the number of non-zero nij-nji comparisons in this calculation, to a maximum of m(m-1)/2.

Because this test compares only ages which differ from each other, it is less informative where few differences exist. Therefore, this test was not applied to cases where the percent agreement was 90% or above. Tests of symmetry also are not conducted for accuracy tests, as the error is assumed to be entirely within the test age, and therefore would be shown in the age-bias plot.

Results Summary

This table shows a breakdown of the test results for each production (or reference) age. It gives the total number at each age, the number agreed upon during the test, the percentage of agreements at that age, and the average test age. The average test age is the same as what is shown in the age-bias plot, and the number of samples agreed upon is shown in the main diagonal of the age-frequency matrix. Again, for inter-reader precision tests, one person's age is chosen to be the basis for the other's results; aside from training exercises, this is not intended to indicate that either set of ages is expected to be more reliable.


Return to the Species List

Return to the FBP Homepage

www.nefsc.noaa.gov
Search
Link Disclaimer
webMASTER
Privacy Policy
(Modified May. 07 2009)