*** Welcome to piglix ***

Test construction strategies


Test construction strategies are the various ways that items in a psychological measure are created and decided upon. They are most often associated with personality tests, but can also be applied to other psychological constructs such as mood or psychopathology. There are three commonly used general strategies: Inductive, Deductive, and Empirical. Scales created today will often incorporate elements of all three methods.

Also known as itemetric or internal consistency methods. The inductive method begins by constructing a wide variety of items with little or no relation to an established theory or previous measure. The group of items is then answered by a large number of participants and analyzed using various statistical methods, such as exploratory factor analysis or principal component analysis. These methods allow researchers to analyze natural relationships among the questions and then label components of the scale based on how the questions group together. The Five Factor Model of personality was developed using this method.

Advantages of this method include the opportunity to discover previously unidentified or unexpected relationships between items or constructs. It also may allow for the development of subtle items that prevent test takers from knowing what is being measured and may represent the actual structure of a construct better than a pre-developed theory. Criticisms include a vulnerability to finding item relationships that do not apply to a broader population, difficulty identifying what may be measured in each component because of confusing item relationships, or constructs that were not fully addressed by the originally created questions.

Also known as rational, intuitive, or deductive method. The deductive method begins by developing a theory for the construct of interest. This may include the use of a previously established theory. After this, items are created that are believed to measure each facet of the construct of interest. After item creation, initial items are selected or eliminated based upon which will result in the strongest internal validity for each scale.

Advantages of this method include clearly defined and face valid questions for each measure. Measures are also more likely to apply across populations. Additionally, it requires less statistical methodology for initial development, and will often outperform other methods while requiring fewer items. However, the construct of interest must be well understood to create a thorough measure, and it may be difficult to prevent or determine if individuals are faking on the measure.


...
Wikipedia

...