Assessment Instrument Review - 10
Assessment Instrument Review One
Name of Instrument
Woodcock Reading Mastery Tests- 3rd Edition (WRMT-3)
Publication Date
2011
Purpose of Instrument
Instrument is used to assess academic achievement in reading (Brunsman, 2014). The WRMT-3 uses scores in nine areas including letter identification, phonological awareness, rapid automatic naming, word identification, word attack, listening, word, and passage comprehension and oral reading fluency (Pierangelo & Giuliani, 2017). These scores produce another additional four cluster scores regarding the students readiness, basic skills, reading comprehension, and total reading skills (Brunsman, 2014). It is administered individually or in small groups in test format (Brunsman, 2014).
Age or Target Population
The WRMT-3 can be used with students from grades three through twelve (Brunsman, 2014).
Reliability
Split-half reliability scores fell between .64 and .97, with an average reliability score between .85 and .95 (Brunsman, 2014). Test-retest reliability was determined based on 155 students and ranged from .52 to .97 (Brunsman, 2014).
Validity
Evidence of construct validity was found for reading achievement (Brunsman, 2014). Patterns were found to increase as the age of test takers increased (Brunsman, 2014).
Norm Group Population
The WRMT-3 was normed during 2009-2010 using 5000 individuals ranging from 4 and ½ years to 79 years 11 months (Brunsman, 2014). The group was representative of the United States for variables such as geographical location, gender, ethnicity, socioeconomic status and the presence of disabilities (Brunsman, 2014).
Kinds of Scores
Scores for the WRMT-3 are available in raw and standard scores, percentiles, and grade and age equivalents so that the student’s reading ability can be compared to those of his or her peers (Brunsman, 2014).
Cost of Instrument
The total cost for both forms of the kit with the administration manual included is $683.70, with lower pricing for individual forms (Brunsman, 2014).
Cost of Protocols
The cost for the administration protocol is $165 (Brunsman, 2014).
List All Personnel Using Instrument
The WRMT-3 should only be administered by trained professionals such as educational diagnosticians (Pierangelo & Giuliani, 2017).
Assessment Information Use
This instrument is used to screen and assess reading achievement (Brunsman, 2014). The information gained from the WRMT-3 can provide guidance for IEP development as well as decisions regarding educational placement (Brunsman, 2014).
Justification and Recommendation for School Use
I would recommend the use of the WRMT-3 in my district for multiple reasons. The first reason is because the demographic makeup of my district tends to have many students who struggle in their reading abilities, so this test could help in assessing and identifying deficits. Additionally, this instrument was found to have good reliability and adequate validity so I believe it to be a viable option for assessing reading achievement (Brunsman, 2014).
References
Brunsman, B. (2014). Woodcock reading mastery tests-Third edition. Mental Measurements Yearbook, 19. Retrieved from https://eds-b-ebscohost-com.libproxy.lamar.edu/ehost/detail/detail?vid=3&sid=81ce81ba-a760-4248-9d37-163dc65f5c45%40sessionmgr104&bdata=JmxvZ2luLmFzcCZzaXRlPWVob3N0LWxpdmU%3d#AN=test.3351&db=mmt
Pierangelo, R., & Giuliani, G. A. (2017). Assessment in special education: A practical approach (5th ed.). Boston, MA: Pearson.
Assessment Instrument Review Two
Name of Instrument
Kaufman Tests of Educational Achievement-3rd Edition (KTEA-3)
Publication Date
The KTEA-3 was first published in 1985, with its most recent third edition being published in 2014 (Mackler, 2017).
Purpose of Instrument
This instrument is a comprehensive test of academic achievement (Pierangelo & Giuliani, 2017). The KTEA-3 is comprised of 14 subtests that generate scores for five composite areas including reading, math, written language, oral language, and comprehensive achievement (Pierangelo & Giuliani, 2017). It is a battery of assessments that is individually administered to students (Pierangelo & Giuliani, 2017).
Age or Target Population
The KTEA-3 is appropriate for individuals ages 4 to 25 (Pierangelo & Giuliani, 2017),
Reliability
Split-half reliability was calculated for all untimed subtests in the KTEA-3 (Mackler, 2017). Reliability coefficients were relatively elevated with averages between .80 and .90, although the oral fluency coefficient was measured to be in the .70 range (Mackler, 2017). The strongest reliability was found in five subtests, mid to high .90s, with slightly lower coefficients in another 5 areas falling between .80 and .90 (Mackler, 2017). Interrater reliability coefficients were high for oral and written expression which is indicative of consistent scoring criteria (Mackler, 2017).
Validity
The KTEA-3 demonstrates concurrent validity, which is important for school assessments (Mackler, 2017). Previous studies found evidence of validity for the subtests and composite scores (Mackler, 2017).
Norm Group Population
The KTEA-3 was normed using 2,600 English-speaking students with no physical or perceptual deficits whose ages ranged from prekindergarten through grade 12 (Mackler, 2017).
Kinds of Scores
Five composite scores are generated via 19 subtests, which can then be converted to standard scores (Mackler, 2017). The standard scores can also generate percentile scores to compare individual academic achievement levels (Mackler, 2017).
Cost of Instrument
The total cost of the KTEA-3 is $992.45 for both forms, the scoring manual, and multiple score reports (Mackler, 2017). Additional pricing is available for individual forms and other resources (Mackler, 2017).
Cost of Protocols
The protocol for the KTEA-3 is $32.95 for only the record form (Mackler, 2017).
List All Personnel Using Instrument
This instrument must be administered by someone with a master’s degree in a related field or someone with appropriate licensures, such as an educational diagnostician (Mackler, 2017).
Assessment Information Use
The KTEA-3 is used to determine overall levels of academic achievement for individuals (Mackler, 2017). This instrument also uses parallel forms which can help compare pre and posttests, as well provide a means for progress monitoring (Mackler, 2017).
Justification and Recommendation for School Use
I would recommend the continued use of the KTEA-3 in my current school district. The batteries cover a variety of areas that help with the understanding of an individual’s academic strengths and challenges (Mackler, 2017).
References
Macker, K. (2017). Kaufman tests of educational achievement-Third edition. Mental Measurements Yearbook, 20. Retrieved from https://eds-b-ebscohost-com.libproxy.lamar.edu/ehost/detail/detail?vid=8&sid=81ce81ba-a760-4248-9d37-163dc65f5c45%40sessionmgr104&bdata=JmxvZ2luLmFzcCZzaXRlPWVob3N0LWxpdmU%3d#AN=test.6516&db=mmt
Pierangelo, R., & Giuliani, G. A. (2017). Assessment in special education: A practical approach (5th ed.). Boston, MA: Pearson.
Assessment Instrument Review Three
Name of Instrument
Wechsler Intelligence Scale for Children-Fifth Edition, Integrated (WISC-V Integrated)
Publication Date
The WISC-V Integrated was first published in 2004, with its most recent fifth edition being published in 2015 (Bugaj, 2017).
Purpose of Instrument
This instrument is used to assess intelligence (Pierangelo & Giuliani, 2017). The WISC-V Integrated generates five primary scales of index for verbal comprehension, visual spatial, working memory, fluid reasoning, and processing speed (Pierangelo & Giuliani, 2017). Together, these index scales produce a full scale IQ score (Pierangelo & Giuliani, 2017). The test is administered individually (Pierangelo & Giuliani, 2017).
Age or Target Population
The target age group for the WISC-V Integrated is 6-16 years of age (Pierangelo & Giuliani, 2017).
Reliability
Internal reliability of the WISC-V Integrated was found by using the split-half method (Bugaj, 2017). The coefficients were found to be high and ranged in average from .77 to .93 (Bugaj, 2017). The internal reliability was also found to be good among groups of special populations such as ADHD, brain injuries, and autism spectrum disorders (Bugaj, 2017).
Validity
Validity correlations were found between the subtest versions, and explanation was provided for results that were not expected (Bugaj, 2017). Concurrent validity was determined by using two other achievement tests (Bugaj, 2017).
Norm Group Population
The WISC-V Integrated was normed using approximately 500 individuals separated into 11 different age groups (Bugaj, 2017). Gender equality was achieved in all but four groups and the samples were accurate representations of the population for variables such as race, education level, and geographical location (Bugaj, 2017).
Kinds of Scores
The five primary index scales of the WISC-V Integrated generate a Full Scale IQ Score that represents an individual’s overall IQ (Pierangelo & Giuliani, 2017). Scaled scores can also be derived from this instrument, which are converted from raw scores on each of the 21 subtests (Pierangelo & Giuliani, 2017). The scaled scores classify an individual as being developmentally delayed, borderline, low average, average, high average, superior, and very superior in terms of IQ (Pierangelo & Giuliani, 2017).
Cost of Instrument
The cost of the WISC-V Integrated kit is $305 and includes manuals for administration and scoring, technical manuals, stimulus books, and a variety of response booklets and scoring resources (Bugaj, 2017).
Cost of Protocols
WISC-V Integrated protocols cost approximately $125 for a bundle of 25 (Bugaj, 2017).
List All Personnel Using Instrument
In order to administer the WISC-V Integrated, an individual must have a doctorate degree in a related field or certification by a professional organization that is related to the field of assessment (Pierangelo & Giuliani, 2017).
Assessment Information Use
The WISC-V Integrated provides information regarding an individual’s IQ score, which can then be used to assist in IEP development and decision-making about educational placement (Pierangelo & Giuliani, 2017).
Justification and Recommendation for School Use
I would recommend the continued use of the WISC-V Integrated at my current school. It appears to have excellent reliability and good overall validity (Gubaj, 2017).
References
Gubaj, A.M. (2017). Wechsler intelligence scale for children: Fifth edition, integrated. Mental Measurements Yearbook, 20. Retrieved from https://eds-a-ebscohost-com.libproxy.lamar.edu/ehost/detail/detail?vid=6&sid=98523c9f-6e5c-431b-8bcd-965692542f1c%40sessionmgr4009&bdata=JmxvZ2luLmFzcCZzaXRlPWVob3N0LWxpdmU%3d#AN=test.8594&db=mmt
Pierangelo, R., & Giuliani, G. A. (2017). Assessment in special education: A practical approach (5th ed.). Boston, MA: Pearson.
Assessment Instrument Review Four
Name of Instrument
Kaufman Assessment Battery for Children, Second Edition (KABC-II)
Publication Date
The KABC-II was originally published in 1983 and its most recent second edition was published in 2004 (Braden & Ouzts, 2005).
Purpose of Instrument
This instrument is used to assess overall intelligence and was developed to minimize language influence for child IQ measurement (Pierangelo & Giuliani, 2017). It is comprised of 18 subtests, which are utilized dependent on the tester’s age, to create four of five scaled scores, again dependent on age (Braden & Ouzts, 2005). The KABC-II is an intelligence test that is individually administered (Braden & Ouzts, 2005).
Age or Target Population
The target population for this instrument is children aged 3 to 18 (Pierangelo & Giuliani, 2017).
Reliability
The majority of reliability coefficients of KABC-II subtests range from .80 to .90 and are classified as good to excellent (Braden & Ouzts, 2005). The consistency of single and global scales were found to range from .81 to .97 and are classified as being good to excellent as well (Braden & Ouzts, 2005).
Validity
The current edition of this instrument does not measure its validity using up to date standards, so finding evidence of validity is challenging (Braden & Ouzts, 2005). However, with the standards that is does use there is evidence of validity for three of the five measured areas (Braden & Ouzts, 2005). According to the technical manual, satisfactory evidence of content validity has been found (Braden & Ouzts, 2005).
Norm Group Population
The KABC-II was standardized with a group of approximately 3,000 children which was composed to accurately reflect the relevant census data for the United States (Braden & Ouzts, 2005).
Kinds of Scores
This instrument provides scaled scores for each subtest with a standardization of a mean of 10 with a standard deviation of 3 (Braden & Ouzts, 2005). Index scores are also normed to an average of 100, but with a standard deviation of 15 (Braden & Ouzts, 2005).
Cost of Instrument
The complete KABC-II kit currently costs $935 (Braden & Ouzts, 2005).
Cost of Protocols
Record forms for this instrument cost $70.75 for a set of 25 (Braden & Ouzts, 2005).
List All Personnel Using Instrument
The KABC-II can be administered by professionals with a doctorate in a related field or those with a licensed certification in the field of assessment such as a campus diagnostician or Licensed Specialist in School Psychology (LSSP) (Braden & Ouzts, 2005).
Assessment Information Use
This instrument is used to screen students and the scores generated help in IEP development and decision-making regarding educational placement (Pierangelo & Giuliani, 2017).
Justification and Recommendation for School Use
Even though the KABC-II demonstrates good reliability, due to the unclear nature of the instrument’s validity (Braden & Ouzts, 2005) I would not recommend the KABC-II for continued use in my school district.
References
Braden, J.P. & Ouzts, S.M. (2005). Kaufman assessment battery for children: Second edition. Mental Measurements Yearbook, 16. Retrieved from https://eds-a-ebscohost-com.libproxy.lamar.edu/ehost/detail/detail?vid=3&sid=03ead0ab-fa10-4dbd-afb3-7b827696f0de%40sessionmgr4007&bdata=JmxvZ2luLmFzcCZzaXRlPWVob3N0LWxpdmU%3d#AN=test.2808&db=mmt
Pierangelo, R., & Giuliani, G. A. (2017). Assessment in special education: A practical approach (5th ed.). Boston, MA: Pearson.
Assessment Instrument Review Five
Name of Instrument
Draw-A-Person: Screening Procedure for Emotional Disturbance (DAP:SPED)
Publication Date
The DAP:SPED was published in 1991 (Cosden, 1995).
Purpose of Instrument
This instrument is used to assess the social and emotional development of individuals and can identify children who are at a higher risk of having an emotional disturbance (Pierangelo & Giuliani, 2017). It is a projective drawing test, which works to make individuals feel emotions by presenting them with emotional stimuli (Pierangelo & Giuliani, 2017). There aren’t any subtests in the DAP:SPED, but the scoring systems has two different criteria (Pierangelo & Giuliani, 2017). The first criteria measures eight different dimensions of the drawing while the second criteria scores each drawing based on 47 items (Pierangelo & Giuliani, 2017). The DAP:SPED can be administered either individually or in a group setting (Pierangelo & Giuliani, 2017).
Age or Target Population
The DAP:SPED has a target population for individuals from ages 6 to 17 (Pierangelo & Giuliani, 2017).
Reliability
This instrument has evidence of high reliability (Cosden, 1995). Reliability studies identify the correlation coefficients to fall between .80 and .90 for interrater reliability and intrarater reliability (Cosden, 1995).
Validity
Studies referenced in the manual of the DAP:SPED claim that its results accurately discriminate between control groups and groups with emotional disturbances (Cosden, 1995).
Norm Group Population
The DAP:SPED was normed nationally for a variety of age and gender groups (Cosden, 1995). However, it is important to keep in mind that a “normal sample” for this type of test is solely based on individuals with emotional disturbances (Cosden, 1995).
Kinds of Scores
The T- scores obtained from the DAP:SPED, which are based on the figures drawn by the individual, are compared to the normed sample to determine if they are at a higher risk for having or developing an emotional disturbance (Cosden, 1995).
Cost of Instrument
The complete kit costs $168 (Cosden, 1995).
Cost of Protocols
Protocols cost $56 for a pack of 25 (Cosden, 1995).
List All Personnel Using Instrument
The DAP:SPED can be administered by licensed professionals, such as educational diagnosticians or LSSPs (Cosden, 1995).
Assessment Information Use
This instrument is used to screen individuals to determine if they are at risk for an emotional disturbance (Cosden, 1995).
Justification and Recommendation for School Use
I would not recommend the use of the DAP:SPED as a tool for emotional disturbance screenings. This is my opinion because of the lack of additional information it provides beyond determining the risk for emotional disturbances.
References
Cosden, M. (1995). Draw a person: Screening procedure for emotional disturbance. Mental Measurements Yearbook, 12. Retrieved from https://eds-b-ebscohost-com.libproxy.lamar.edu/ehost/detail/detail?vid=3&sid=478cfdb0-295f-4d59-96b6-bc570bbf5ebe%40sessionmgr102&bdata=JmxvZ2luLmFzcCZzaXRlPWVob3N0LWxpdmU%3d#AN=test.1248&db=mmt
Pierangelo, R., & Giuliani, G. A. (2017). Assessment in special education: A practical approach (5th ed.). Boston, MA: Pearson.
Assessment Instrument Review Six
Name of Instrument
Adaptive Behavior Assessment System-3rd Edition (ABAS-3)
Publication Date
The ABAS-3 was originally published in 2000, with the most recent third edition being released in 2015 (Henington, 2017).
Purpose of Instrument
This instrument is used to assess adaptive behavior (Pierangelo & Giuliani, 2017). The ABAS-3 puts focus on one’s ability to complete practical daily activities, care for his or herself, independently take part in interaction with others and the environment, and communicate effectively (Pierangelo & Giuliani, 2017). It is a rating scale that can be filled out by family, teachers, employers, and other people who interact with the individual regularly (Pierangelo & Giuliani, 2017). Due to the nature of the rating scale, it is completed on an individual basis (Henington, 2017).
Age or Target Population
The target population for this instrument is anyone between two years and 89 years of age (Pierangelo & Giuliani, 2017).
Reliability
The author cites internal reliability across six normed samples, and high reliability coefficients between .96 and .99 for the General Adaptive Composite (Henington, 2017). Coefficients for the adaptive components of the test had an average of .85, while those for the adaptive areas ranged between .72 and .99 (Henington, 2017).
Validity
The technical manual of the ABAS-3 cites high correlations of concurrent validity when compared to the Vineland Adaptive Behavior Scale and the Behavior assessment System for Children (Henington, 2017).
Norm Group Population
This instrument was normed so that it could be used with a wide variety of populations (Henington, 2017). The sample consisted of approximately 4,000 people across 24 different states (Henington, 2017).
Kinds of Scores
Raters provide an ability rating score from 1-4 based on the frequency with which the individual can perform the skill (Henington, 2017). The scores provided are added together and converted to scaled scores with a mean of 10 and standard deviation of 3 (Henington, 2017). Scores from defined areas are combined for a General Adaptive Composite and three different domain scores (Henington, 2017). These scores are then identified into ability categories ranging from extremely low to high (Henington, 2017).
Cost of Instrument
The whole ABAS-3 kit costs $310 (Henington, 2017).
Cost of Protocols
The protocols for this instrument cost $75 for a bundle of 25 forms (Henington, 2017).
List All Personnel Using Instrument
The rating scaled can be completed by any individual who regularly interact with the person being assessed, but only licensed professionals can score the forms (Henington, 2017).
Assessment Information Use
The ABAS-3 to assess an individual’s abiity to function in their daily lives (Henington, 2017).
Justification and Recommendation for School Use
I would recommend the continued use of the ABAS-3 due to the variety of people who rate the individual being assessed.
References
Henington, C. (2017). Adaptive behavior assessment system, third edition. Mental Measurements Yearbook, 20. Retrieved from https://eds-b-ebscohost-com.libproxy.lamar.edu/ehost/detail/detail?vid=4&sid=d48e6029-d8fe-4c82-8152-a816416525dc%40sessionmgr101&bdata=JmxvZ2luLmFzcCZzaXRlPWVob3N0LWxpdmU%3d#AN=test.7501&db=mmt
Pierangelo, R., & Giuliani, G. A. (2017). Assessment in special education: A practical approach (5th ed.). Boston, MA: Pearson.
Assessment Instrument Review Seven
Name of Instrument
Developmental Test of Visual Perception-3rd Edition (DTVP-3)
Publication Date
The DTVP-3 was originally published in 1961 with its most recent update in 2014 (Pierangelo & Giuliani, 2017).
Purpose of Instrument
This instrument is used to assess specific deficits in and individuals visual perception (Pierangelo & Giuliani, 2017). The DTVP-3 measures eye-hand coordination, copying, figure-fround, visual closure, and form constancy to determine if any deficits are present (Pierangelo & Giuliani, 2017). It is a test that is individually administered and requires students to draw lines, simple figures, complete missing portions of figures, find hidden shapes andrecognize shapes that are hidden in complex backgrounds (Pierangelo & Giuliani, 2017).
Age or Target Population
The DTVP-3 is geared for individuals between ages 4 and 13 (Pierangelo & Giuliani, 2017).
Reliability
Internal consistency reliability was demonstrated and the coefficients across all ages fell were .80 and .95 for the visual closure subtest and general visual perception composite score (Alfonso, Wissel, & Lorimer, 2017). Test-retest reliability coefficients were studied and found to fall between .70 and .85 for the different subtests and .87 and .90 for the composite scores (Alfonso et al., 2017).
Validity
Average criterion-related validity coefficients fell between .54 and .76, however there were no additional studies to confirm the validity (Alfonso et al., 2017). Construct-related validity was studied and the correlation coefficients were classified as medium to high as it was found that subtest measurements increased with age (Alfonso et al., 2017). The validity coefficient for sub-scores and composites was found to have a median of .43 (Alfonso et al., 2017).
Norm Group Population
The DTVP-3 was standardized using a group of approximately 1,000 kids from 27 different states (Alfonso et al., 2017). The test manual states that the makeup of the sample mirrors the makeup of children across the United States (Alfonso et al., 2017).
Kinds of Scores
This instrument records raw subtest scores that are changed to scaled scores so they can be represented as percentiles that compares the norms across different ages (Alfonso et al, 2017). Composite scores are also represented as percentile ranks with additional descriptions of performance levels (Alfonso et al., 2017). Full interpretation of the DTVP-3 provides age equivalents, percentile ranks, subtest raw and scaled scores, as well as composite index scores (Alfonso et al., 2017).
Cost of Instrument
The cost of the complete kit costs $257 (Alfonso et al., 2017).
Cost of Protocols
Record forms for the DTVP-3 costs $39 for a set of 25 (Alfonso et al., 2017).
List All Personnel Using Instrument
This assessment instrument should only be administered and interpreted by professionals who have been trained in formal assessment practices such as an educational diagnostician (Alfonso et al., 2017).
Assessment Information Use
The DTVP-3 can be used to measure an individual’s visual and perceptual abilities as well as screen for possible deficits that may arise (Pieranelo & Giuliani, 2017).
Justification and Recommendation for School Use
I would recommend the use of this instrument only for students who are in primary school as the information it provides would be most apparent in younger students.
References
Alfonso, V.C., Wissel, A., & Lorimer, L. (2017). Developmental test of visual perception-Third edition. Mental Measurements Yearbook, 20. Retrieved from https://eds-a-ebscohost-com.libproxy.lamar.edu/ehost/detail/detail?vid=3&sid=09e9385e-1b8e-4e7c-8099-a093dec508b7%40sessionmgr4008&bdata=JmxvZ2luLmFzcCZzaXRlPWVob3N0LWxpdmU%3d#AN=test.6460&db=mmt
Pierangelo, R., & Giuliani, G. A. (2017). Assessment in special education: A practical approach (5th ed.). Boston, MA: Pearson.
Assessment Instrument Review Eight
Name of Instrument
Bruininks-Oseretsky Test of Motor Proficiency-2nd Edition (BOT-2).
Publication Date
The BOT-2 was first published in 1978 and most recently updated in 2005 (Snyder, 2010).
Purpose of Instrument
This instrument is used to assess an individual’s fine and gross motor skills (Pierangelo & Giuliani, 2017). It is administered in an individual setting and is both a paper-based and performance based test (Pierangelo & Giuliani, 2017). There are eight subtests that evaluate fine motor precision, fine motor integration, manual dexterity, bilateral coordination, balance, running speed and agility, upper-limb coordination, and strength (Pierangelo & Giuliani, 2017).
Age or Target Population
This instrument can be used on individuals between 4 ½ and 14 ½ years of age (Pierangelo & Giuliani, 2017).
Reliability
Split-half reliability coefficients for the subtests ranged between .70 and .80 for all but two tests, while the total motor composite reliability coefficient was measured to be in the mid-.90s (Snyder, 2010). Test-retest reliability was found to be approximately .80 when the tests were taken within 1-6 weeks of eachother (Snyder, 2010). Additionally, interrater reliability was found to be quite high when two individuals scored a single subject (Snyder, 2010).
Validity
The authors of the manual worked to verify content and construct validity, however evidence of construct validity was not convincing (Snyder, 2010). However, evidence of construct validity was more convincing due to studies showing that median scores in the eight subtests were indicative of appropriate development (Snyder, 2010).