Figure 2
(a) Dendrogram for cluster analysis of the 14 cryocooled thaumatin data sets introduced in §3.1. (b) Rmeas for random combinations of the 14 data sets introduced in §3.1. Calculations are shown for groups of two, three, four and so on, up to all 14 data sets. The broken line runs through the medians for all groups, while the full lines bound the inter-quartile range, i.e. the dots falling below the lower line and the dots falling above the upper line together represent 50% of all values. Optimally selected groups of data sets can be taken to be those with Rmeas below the lower full line; these are among the 25% of best-performing groups. (c) The broken and full lines in this plot replicate those in (b). The empty circles correspond to the values of Rmeas for all merged data sets found in the dendrogram in (a). Ten of the 13 fall below the lower inter-quartile range line, a region that contains only the best-performing 25% of random groups. The selective power provided by cluster analysis is thus evident.
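The comparison drawn in panels (b) and (c) can be illustrated with a minimal sketch. It assumes that Rmeas has already been computed elsewhere for each random group and for each dendrogram cluster. The function names `quartile_bands` and `count_below_lower_quartile`, the input layout and all numerical values are hypothetical and are not taken from the article. The inclusive quartile convention is also an assumption, since the caption does not state how the quartiles were computed.

```python
from statistics import quantiles


def quartile_bands(rmeas_by_size):
    """Map group size -> (lower quartile, median, upper quartile) of Rmeas.

    rmeas_by_size: dict of group size -> list of Rmeas values, one value
    per random combination of that many data sets (hypothetical layout).
    """
    bands = {}
    for size, values in rmeas_by_size.items():
        if len(values) == 1:
            # Only one combination exists (e.g. all data sets merged),
            # so the three statistics coincide.
            bands[size] = (values[0], values[0], values[0])
        else:
            q1, q2, q3 = quantiles(values, n=4, method="inclusive")
            bands[size] = (q1, q2, q3)
    return bands


def count_below_lower_quartile(cluster_rmeas, bands):
    """Count clusters whose Rmeas lies below the lower quartile line.

    cluster_rmeas: list of (group size, Rmeas) pairs, one per merged
    node of the dendrogram.
    """
    return sum(1 for size, r in cluster_rmeas if r < bands[size][0])


# Made-up numbers for illustration only; not data from the article.
random_groups = {
    2: [0.10, 0.12, 0.14, 0.16, 0.18],
    3: [0.11, 0.13, 0.15, 0.17, 0.19],
}
clusters = [(2, 0.11), (3, 0.16)]

bands = quartile_bands(random_groups)
n_below = count_below_lower_quartile(clusters, bands)
print(f"{n_below} of {len(clusters)} clusters fall below the lower quartile line")
```

With real data, the returned count divided by the number of dendrogram nodes would correspond to the proportion quoted in the caption (ten of 13).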

Journal logoBIOLOGICAL
ISSN: 1399-0047