|Advanced Mill Database|
Advanced Mill Database Creation
Edward Souza, SWQL, February 22, 2011
To summarize cultivar milling and baking data into a reference for use in adjusting other data sets of experimental lines.
Each group germplasm from breeders that is evaluated by the laboratory should have check cultivars included in the group. As part of the validation process of the data, those checks should be compared to prior performance of the check in quality evaluations. Environmentally adjusted scores are created for each line evaluated in the laboratory. This is calculated based on the difference between an historical average of the cultivar’s performance and the observed value within the trial. The Advanced Mill Database is the tabular form of historical averages for cultivars evaluated at the laboratory using the modified Quadrumat flour mill.
Compilation of Data
Databases of previous year’s performance are compiled and stored on the SWQL server in the folder marked ‘Quality Scores’. Begin with the most recent and add to it all new advanced groups evaluated in the laboratory since the last compilation. The dataset as of 2/22/2011 bridges evaluations with the current and earlier sugar-snap cookie method. Samples evaluated with the earlier method are noted in a method column with ‘Old’. All current data should be marked as ‘New’ for the method column as we are using the revised sugar-snap cookie method exclusively, since 2009. Edit previous entries of new cultivar releases in the past year for consistent naming. Review all naming for consistency as company names can and do change each year.
Analysis of Data
Due to the unbalanced nature of the data, we can only generate approximate means.
Milling and baking data analysis uses the models in the templates sheets marked ‘Adjustment Factor’ sheet to calculate the observed scores for the checks. The observed scores are compared to the historical averages for the checks in the Advanced Mill Database, and the difference is used to calculate the bias for adjusting the score of all the experimental lines to accommodate the environmental influences on the trial.
Annotation of the Set
The data sheets should note averages of all entries and standard errors for a specified number of observations. The raw data used to generate the means should be annotated for date of last entry and any missing or unusual information in the group. Backups should be made on the local computer and server. All technicians should be informed of the conversion to the new database. In the database and in files a date should be noted for conversion to the new database. Each group processed should be marked at the bottom with the version of the database used for analysis.