Page images
PDF
EPUB

tendency to omit more children than he adds because of the difficulty of getting complete family records, is of the utmost importance.

Increasing the number of measures has here no beneficial influence. In certain cases increasing the number of observers may, namely, when the constant error of one observer is offset by the constant error in the opposite direction of another observer. If, that is, there is an error of prejudice or tendency constant for any one observer, but varying in direction by chance among a group of observers, what is a constant error for one becomes a variable error for a group, and is no longer a source of misleading, but only of lessened reliability. For instance, if any one person, even an expert judge, should rank 100 men in order for morality or efficiency or intellect, the results would probably have a constant error due to the undue weight he would put upon certain evidence; but if we took the median of the rankings given by ten or twelve expert judges, the error would in the main be only a chance error, for the prejudice of one would offset the prejudice of another.

The sources of constant errors in mental measurements are so numerous and so specialized for different kinds of facts that it is impossible to forearm the student against them here. Skill in avoiding them is due to capacity and watchfulness far more than to knowledge of any formal rules. It is, however, practically wise to test any result which may be affected by some constant error by using different methods of measurement, and to examine the means of selecting cases for measurement with the utmost care. The tendency to bias or to blunder is much more likely to make one select unfair cases than to make one measure them unfairly.

There is also a source of error which is perhaps in strictness an error in inference, but which from another point of view may be regarded as an error in measurement and so as relevant to the topics of this book. In measuring, say the spelling ability of a number of individuals whom we wish to compare, we assume that the achievement of each is a measure of the spelling ability of each. But A and B may have been seated where they did not hear the words pronounced so well as did C and D. E and F may have had headaches, while G and H were cheerful and bright. There exist errors due in the first example to outer physical conditions and in the second to inner or psychological conditions. To compare A, B, C, etc., in

spelling ability, every extrinsic condition influencing that ability should be alike for all. Otherwise we are led into errors, which may be called errors of inferring an ability in abstracto from its manifestation under particular conditions, or of measuring a fact with a constant error of condition. It will be simpler to treat separately errors due to physical conditions and errors due to mental conditions.

Errors due to physical conditions can be prevented by making the conditions identical, or turned into relatively harmless variable errors by measuring each individual a number of times under conditions chosen at random. It would seem at first sight best to make conditions identical wherever practicable. This rule probably does hold for physical measurements, but there are certain disadvantages in this procedure in mental measurements. Too much artificiality and restraint in conditions often lead to an unusual and perturbed state of mind in the person measured, such that the thing one measures is likely to be a thing which would never occur in the ordinary course of the person's life. Measuring precisely a fact which you do not want is worse than measuring inexactly the fact you do want.

For instance, measurements of spelling under the unequal conditions of a schoolroom would, in spite of them, be better than measurements from 10-year-olds made to stand one at a time in the soundproof room of a laboratory with head exactly 50 centimeters from a phonograph which pronounced the words for them to spell. The last method would give identity of physical conditions, but would measure insensibility to strange surroundings and treatment and ability to attend to and interpret the phonograph's noises perhaps more than it would spelling ability.

Errors due to mental conditions can not be prevented with surety by making the conditions identical, for it is not in the power of the observer to control the mental conditions of the person measured. The best that can be done is to avoid any probable cause of difference in them and to take the subjects' reports as to what their mental conditions are. But mental conditions vary greatly even despite the apparent absence of causes for difference; and the reports of mental condition from untrained self-observers must be vague, subject to constant errors and always from a personal standard of comparison incommensurate with that of any other individual. Though A says, 'I am tired,' and B says, 'I am not,' their feelings of fatigue may

be equal. We do not take untrained individuals' opinions as facts elsewhere in science, and have no right to do so here. The more reliable procedure would be to eliminate the influence of the variability of inner conditions by random choice from among them rather than to pretend to eliminate the variation itself.

It is also a fair question whether the attempt to make all the mental conditions except the one to be measured alike in the persons to be compared, does not commonly result in so much unnaturalness of the sort against which protest was made a page back, as to do more harm than good. Attempted restriction of mental conditions surely disturbs any one even more than restriction of physical conditions.

Success in eliminating disturbing conditions is not attainable as a result of knowledge of any fixed rules, but only through a happy ingenuity in devising experiments, arranging observations and selecting data. We can, however, be careful, after securing the best measurements that we can, to distinguish sharply between the actual measurement of the fact under certain conditions, on the one hand, and on the other the inferences that we may be tempted to make about the fact in general or apart from those particular conditions. It is not undesirable to make inferences, but it is highly undesirable to confuse them with measurements or to leave them without critical scrutiny.

Much more might well be said with regard to the sources of error prevalent in studies of human nature, but the proper bounds of an introduction, not to the logic or general method of the mental sciences, but only to their statistical problems, have already been passed.

Weighting Results.

Different sources of information concerning any one quantity may give it differing amounts, and these sources may be of unequal reliability. It is, then, desirable to allow more weight to the more trustworthy sources in deciding what amount is the most probable for the quantity. For instance, if an expert in physical anthropology measured A's head and scored his cephalic index .81, while an ordinary person scored it .80, we should choose the .81 rather than the .80, and, if we allowed something for each judgment, would perhaps take 80.8 as the figure, counting the anthropologist's result four times. No care in weighting sources will do so much service as the

elimination of constant errors; and ideally no source with a constant error unallowed for should have any place in determining a result. Any source may deserve weight because of either numerical or qualitative strength. Its numerical strength is as the square root of the number of cases whose study it represents. Weighting for quality is bound in practice to be largely arbitrary, but this is not a great misfortune, for the result will rarely be altered appreciably by such differences in the system of weighting as reasonably competent students would make. For instance, A, B and C with the same general problem use different methods and get as a certain correlation coefficient .60, .50 and .48 respectively. Suppose that we weight these sources 1, 1, and 1; 4, 4 and 5; 3, 4 and 5; and finally 4, 3 and 5. We have then, as the probable true coefficient, .5267, .5231, .5167 or .5250. Bowley gives a rule that is satisfactory for most cases that occur in practice, namely, to give your attention to eliminating constant errors and not to manipulating weights.* If results are weighted it is always well to give them in their unweighted form as well and leave the opportunity open for any critic to weight them as he judges proper.

* 'In calculating averages give all your care to making the items free from bias and leave the weights to take care of themselves.' 'Elements of Statistics,' p. 118.

CONCLUSION.

CHAPTER XIII.

REFERENCES FOR FURTHER STUDY.

I TRUST that the reader has been impressed by now with the fact that the theory of mental measurements is no display of mathematical pedantry or subtle juggling with figures, but on the contrary is simple common sense. The chief lessons of this book are in fact simple applications of the most elementary logic. They may be summed up in the form of warnings against certain fallacies common in the quantitative treatment of mental facts, viz.:

1. Accepting guessed equality or mere verbal likeness in place of real equality.

2. Using quantities on a scale without consideration of the meaning of the scale's zero point.

3. Dealing carelessly with totals the constitution of which is unknown.

4. Using an average to represent a series of individual measures regardless of their distribution.

5. Estimating a total series from individual measures numerically insufficient or so selected as to actually misrepresent it.

6. Estimating differences by ambiguous measures.

7. Using a difference between or change in averages to represent a series of individual differences or changes. (7 is essentially the same fallacy as 4.)

If the reader has been rendered immune to these errors, has acquired facility and confidence in the manipulation of measurements, and has learned to discard guess work and crude arithmetic in favor of accurate and modern methods of measuring facts and relationships, the purpose of this book has been amply fulfilled.

It is desirable that the student who has been thus introduced to statistical methods should proceed to study samples of their concrete application to problems in the mental sciences and, in case he has the necessary mathematical interest and training, that he should study the abstract properties of different types of distribution, the derivation of statistical formulæ, the mathematical theory of correlation and other

« PreviousContinue »