Training for intensive intervention
Developed in cooperation with the
University of Florida
sponsored by
A charitable, not-for-profit educational foundation
The Literacy Trust, Inc.
3324 W University Avenue #116
Gainesville, FL 32607
ph: 352-256-3516
fax: 347-602-8768
alt: 888-377-0401 (toll free)
hoover
http://www.bestevidence.org/reading/strug/top.htm
The national reading center at
Florida State University (FCRR)
awarded Reading Rescue high marks
in its review of the model and of the research supporting its effectiveness.
http://www.fcrr.org/FCRRReports/PDF/ReadingRescue.pdf.
Reading Rescue: An Effective Tutoring Intervention Model
for First-Grade Struggling Readers
Linnea C. Ehri, Lois G. Dreyer, Bert Flugman, and Alan Gross
Graduate Center, City University of New York
Author Note
Linnea C. Ehri, Department of Educational Psychology, Graduate Center of the City University of New York; Lois G. Dreyer, Center for Advanced Study in Education, Graduate Center of the City University of New York; Bert Flugman, Center for Advanced Study in Education, Graduate Center of the City University of New York; Alan Gross, Department of Educational Psychology, Graduate Center of the City University of New York, 365 Fifth Avenue, New York, NY 10016
Reading Rescue was developed in cooperation with the University of Florida and is sponsored by a charitable, not-for-profit organization, The Literacy Trust, Inc. This study was an independent evaluation conducted with grants from The Literacy Trust, Inc. and Pearson Learning. All aspects of the research were under the control of investigators who were neither employed by nor affiliated with the funding agencies. We express gratitude to the following colleagues for their assistance: Nora Hoover, Kathy Kaufman, Alba Langenthal, Tara Mastrorilli, Belinda Nix, Henry Park, and Benedict Silverman. We thank the district and school personnel including principals, teachers and paraprofessionals for their cooperation and assistance in the conduct of our study. For more detailed information about the Reading Rescue program, contact The Literacy Trust, www.literacytrust.org.
Correspondence concerning this article should be addressed to Linnea C. Ehri, Program in Educational Psychology, CUNY Graduate Center, 365 Fifth Avenue, New York, NY 10016. E-mail: LEhri@gc.cuny.edu. Phone: 212-817-8294. Fax: 212-817-1516.
Abstract
A tutoring intervention model (RES) was investigated with 64 low SES first grade students at risk for reading difficulties. School staff including reading specialists, credentialed teachers, and paraprofessionals, provided a mean of 50 tutoring sessions to teach phonological awareness, systematic phonics, vocabulary, fluency, and reading comprehension. Tutored students made significantly greater gains in word reading and text comprehension than controls who received a small group intervention or no intervention. In fact, the majority of tutored students reached normal reading levels. Paraprofessionals tutored as well as the other tutors except in skills benefiting nonword decoding. Also they were less efficient than reading specialists. Contrary to conventional wisdom advocating text reading practice at an instructional level, regression analyses indicated greater benefits at an independent level.
Reading Rescue: An Effective Tutoring Intervention Model
for First-Grade Struggling Readers
Currently there is much interest in developing effective ways to intervene early with young children who are at risk of difficulty in learning to read. Studies indicate that children who fail to acquire adequate reading skills in first grade often continue to have difficulties and may never catch up (Foorman, 1997; Juel, 1988; Stanovich, 1986). Children who are at risk can be identified early (Vellutino et al.,1996). Results of the National Reading Panel Report (2000) showed that early intervention was more effective than later remediation. There is much interest in evaluating intervention models to provide schools with evidence as the basis for choosing programs to teach reading. In fact, some government initiatives have made funding contingent upon schools’ choosing reading programs that are evidence based. One purpose of the present study was to obtain evidence regarding the effectiveness of Reading Rescue, a comprehensive tutoring intervention model with struggling readers in first grade.
The best known tutoring intervention model aimed at struggling readers in the early grades is Reading Recovery developed by Clay (1985, 1993). Various reviews of the effectiveness of Reading Recovery have been conducted (e.g., Shanahan & Barr, 1995; Elbaum, Vaughn, Hugh, & Moody, 2000). In the majority of studies, students who received this tutoring made greater gains in reading achievement during first grade than students who were not tutored. Studies have also shown that if Reading Recovery instruction is enriched with more phonemic awareness or systematic phonics instruction, its impact on reading is enhanced (Greaney, Tunmer & Chapman, 1997; Hatcher, Hulme, & Ellis, 1994; Iversen & Tunmer, 1993; Santa & Hoien, 1999). A question of interest in the present study was whether the Reading Rescue model would prove effective as well, and whether this model would benefit students in high poverty schools from families whose first language is not English.
In developing effective reading intervention models, several features require specification including the following: the reading instructors, their qualifications; and the specific training they need to provide instruction; the instructional materials and procedures including the texts to be read; the students targeted for the intervention, their entering reading capabilities, and the match between these capabilities and the instruction provided; the observational tools and tests to monitor students’ progress and make instructional decisions; the costs to schools.
The Reading Rescue model was implemented with the following features in the present study. The instructors were personnel already employed by schools but not necessarily credentialed teachers. Paraprofessionals as well as reading specialists and credentialed teachers received on-site training that included workshops, supervision and coaching. The students were low SES first graders reading near the bottom of their class as indicated by an inability to read primer level text and low scores on other literacy tests. However, students needed to have substantial letter name knowledge (i.e., at least 17 letters) considered a prerequisite for making progress in the program. This is because the program focused on teaching letter-sound correspondences and their use to read and write.
The instruction involved one-to-one tutoring for 30 minutes daily or as frequently as possible during a week. It was comprehensive in covering all five components identified by the National Reading Panel (2000) as evidence based and essential for learning to read (i.e., phonemic awareness, systematic phonics, fluency, vocabulary, and reading comprehension). The instructional materials and procedures engaged students in word study and sentence writing that taught phonemic awareness, letter-sound relations, decoding, reading high frequency words, and spelling. Students reread little books to acquire oral text reading strategies, sight words, vocabulary, fluency, and comprehension. The texts were leveled, they contained both high frequency and decodable words, and they were tied to a scope and sequence chart used for phonics instruction. Observational tools and informal tests were used. Tutors kept records of students’ performance during each session, and they tested students to determine when they were ready to progress to higher-level lessons and texts. The costs were relatively modest because existing school staff members rather than outside teachers provided the tutoring.
Several features of the Reading Rescue intervention model were expected to contribute to its effectiveness as shown by previous research. Some of the activities were drawn from the Reading Recovery model, such as sentence writing and the rereading of little books. The inclusion of systematic, sequential phonics instruction was expected to enhance its impact (Ehri, Nunes, Stahl, & Willows, 2001). The use of many decodable books at various reading levels was expected to help students apply the letter-sound correspondences and decoding skills they had been taught (Juel & Roper-Schneider, 1985; Torgesen, et al., 1999). The inclusion of vocabulary instruction was expected to make the program especially appropriate for low SES students from bilingual families (Biemiller, 1999). Questioning students in various ways about the books they read and distinguishing between text explicit and text implicit questions were expected to foster text comprehension (Block & Pressley, 2002).
There are many reasons why tutoring in first grade has been especially attractive to educators as a means of preventing early reading failure (Clay, 1985). Instruction can be tailored to the difficulties of individual students. Tutors can easily monitor students’ progress and the need for review. Compared to group instruction, tutoring allows more time for students to read, and their errors and misconceptions receive immediate feedback. Also, having the tutor’s undivided attention is motivating.
Although one-to-one tutoring has many advantages over small group instruction for teaching reading, not all studies have reported positive results. Matthes & Fuchs (1994) detected a wide range of effect sizes across studies. In a meta-analysis evaluating phonemic awareness instruction, Ehri, Nunes, Willows, Schuster, Yaghoub-Zadeh, & Shanahan (2001) found significantly smaller effect sizes for tutoring than for small groups. In a meta-analysis of systematic phonics instruction, Ehri, et al. (2001) found that effect sizes for small groups did not differ significantly from effect sizes for individual tutoring. In their meta-analysis of tutoring studies with struggling readers, Elbaum et al. (2000) also found no difference. One purpose of the present study was to compare the effects of tutoring to the effects of small group instruction.
Another issue addressed here was whether paraprofessionals are as effective at tutoring struggling readers as reading specialists and other credentialed teachers. The use of paraprofessionals as instructors is especially valuable in schools serving low SES populations. Such schools may have larger numbers of children reading below grade level, and their limited budgets may preclude hiring enough certified reading specialists to work with all the students who are needy. If trained paraprofessionals can deliver effective supplementary reading instruction, this provides a more affordable means of addressing the need.
Findings regarding the effectiveness of paras, however, are mixed. Wasik and Slavin (1993) reviewed studies of programs that provided tutoring for struggling readers in first grade, including Reading Recovery and Success for All. All programs yielded significant positive effects, but those utilizing certified teachers showed larger effects than those using paraprofessionals. However, paras and certified teachers were not compared teaching the same programs. Moreover, the programs used by paras were more focused and less comprehensive in the reading components taught than the programs used by teachers. Thus, these findings are not definitive.
In a recent study, Brown, Morris and Fields (2005) compared the effectiveness of certified teachers and paraprofessionals. Both used the same reading program, Next Steps (Morris, 1999), to tutor second and third grade struggling readers. The program included guided oral reading of instructional level text, word study, fluency training through rereadings of text, and listening comprehension. Results revealed that students tutored by paraprofessionals read as well as students tutored by certified teachers on most measures of reading.
Although Brown et al. (2005) documented the effectiveness of paras with older students, it is not clear whether the same results would hold for first graders. In the Wasik and Slavin (1993) review, paras were less effective at tutoring at risk first grade readers than credentialed teachers. Perhaps teachers possess a greater knowledge base that gives them an advantage over paras in teaching reading to beginners. One limitation of the Brown et al. (2005) study was that paras were randomly assigned to students whereas classroom teachers tutored students from their own classes. The present study used complete random assignment to compare trained paraprofessionals to credentialed teachers and reading specialists. Apart from Brown et al., few researchers have conducted such comparisons systematically.
The Reading Rescue (RES) intervention model has been found effective in previous studies reported at conferences or in technical reports (Britt, 2002; Hoover, 1994, 1995; Hoover & Lane, 2001; Hoover & Sullivan, 1996). These studies found that RES tutoring raised students’ reading achievement from the bottom quartile to grade-level performance. Results of a recent experiment (Muller & Davies, 2004) showed that high-poverty, urban, predominantly Hispanic students who received RES tutoring outperformed a control group of students who received an alternative intervention. Results of the present study were intended to replicate and extend findings of these studies.
Several questions were addressed:
1. Will struggling first-grade readers who receive Reading Rescue tutoring make greater gains in their reading achievement than struggling readers who do not receive tutoring but are enrolled in the same schools, and greater gains than struggling readers who are enrolled in comparable schools not implementing the Reading Rescue intervention? Both types of control groups were included to take account of contamination effects on reading instruction possibly resulting from the presence of the RES program in the schools.
2. Will struggling readers who receive RES tutoring make greater gains in reading than struggling readers who receive a small group intervention program adopted by the district?
3. Will RES tutoring increase students’ reading achievement to expected levels of performance based on nationally normed tests?
4. Will entry-level abilities of students and features of RES tutoring predict how much first graders improve in their reading as a result of tutoring?
5. Will all types of tutors, including reading specialists, credentialed teachers, and trained paraprofessionals, be equally effective at improving the reading achievement of their students?
Method
Participants
Participants were enrolled in five low-SES urban public schools that had implemented the RES tutoring intervention for 2-3 years and three comparable schools that had not implemented RES but hoped to do so in the future. All schools were located in the same district in a large metropolitan city. The majority of students, above 90%, were from homes where Spanish was the first language, and 95% qualified for free or reduced-price lunch. Students judged by teachers to have no knowledge of English were excluded from the study. Most students had sufficient English to qualify. Students in special education at the beginning of first grade (i.e., those with speech, language, or physical disabilities or behavioral problems) were excluded by schools because they were already receiving special services.
First graders in the five RES schools (N=497) were group-administered the Reading Rescue Classwide Screening Assessments to identify lower performing students (N=203) who were tested individually on the RES pretests. Students who could name at least 17 letters1 but who were unable to read a preprimer passage on an informal reading inventory formed the pool of students who qualified for the intervention (N = 190). Within each school, these candidates were rank ordered on the RES individual pretests, and adjacent scores were used to form 64 matched pairs. Between the time that pairs were formed and the intervention began, four of the five schools assigned some of these students to a small group intervention mandated by the district, Voyager/Passport (2004). In these cases, the other member of the pair was assigned to receive RES. For the remaining pairs, members were randomly assigned to the RES and control groups. During the year, a few students left school (N = 8 RES and 9 controls). Replacements (i.e., 8 RES and 7 controls) were students drawn from the larger pool with the most similar pretest rankings.
To address the possibility that the presence of RES in schools would affect the reading instruction in those schools, a second control group of first graders was drawn from three comparable schools not implementing the RES intervention. This group consisted of struggling readers who had similar scores to students in the RES group on the fall GMRT4 pretest, and who completed both fall and spring GMRT4 pre- and posttests.
There were 64 students who received the RES intervention, 62 control students (C1) enrolled in schools offering the RES program, and 60 control students (C2) enrolled in non-RES schools. Students in the two control groups were reconfigured to distinguish those who received the district-mandated small group intervention (N=52) from those who did not receive this intervention (N=70). As evident in Table 1, none of the groups differed statistically in age or gender (all ps > .05).
All students received regular classroom instruction in reading and writing. The same instructional programs were used in all the schools: Month-by-Month Phonics for First Grade (Cunningham & Hall, 1997) and readers/writers workshop.
Tutoring was provided by 59 adults who were members of the school staff and had received formal RES training. The tutors included certified reading specialists with graduate degrees (N=17), adults certified in other areas (guidance counseling, math and social work) (N=15), and paraprofessionals (N=27). Those certified in counseling and math had completed at least 6 credits of coursework in reading and language arts. All paras were high school graduates, many had bachelor’s degrees or were working toward BAs and teacher certification, and a few had MAs in other fields. Half of the tutors were bilingual. All tutors had additional responsibilities within the school. For many, this included teaching the small group intervention program. Tutors were randomly assigned to tutees within each school and remained with those students throughout the intervention. All but three tutors worked with only one RES student.
Materials and Procedures
The pretests, posttests, and tutoring intervention were administered by school personnel. Researchers had contact with Reading Rescue trainers and with school coordinators who were liaisons to the study but no direct contact with other tutors or their students.
Pretests. Group administered and individually administered pretests were given to assess entry-level reading ability and to identify potential candidates for tutoring. Group tests were given in all 8 schools whereas individual tests were given only in the 5 RES schools.
1. Gates-MacGinitie Reading Test. The GMRT4, Level BR (Beginning Reading) test (MacGinitie, MacGinitie, Maria & Dreyer, 2000) was administered to classrooms of first graders in all eight schools in November. This multiple-choice test measured students’ knowledge of letter-sound correspondences and their ability to read high-frequency words. Teachers read aloud each question and answer choice and gave students sufficient time to respond. Raw scores were transformed to normal curve equivalent (NCE) scores based on national norms. NCEs are percentiles that have been transformed into a scale of equal units, and are therefore appropriate for computing averages. The K-R 20 reliability is reported to be 0.95. Scores were used to match students from the non-RES control schools to students in the RES group. Testing occurred before the interventions began.
2. ITBS Vocabulary Test. The Iowa Tests of Basic Skills, Level 6, Vocabulary subtest (Hoover et al., 2001) was administered to classrooms of students in November. The teacher read the questions and answer choices aloud, and students recorded their answers. The K-R 20 reliability is reported to be 0.75.
3. Reading Rescue Classwide Screening Assessments. The RES Screen consisted of four group-administered tests adapted from Clay (1993a, 2002) that were given by classroom teachers in mid-September to students in RES schools in a print-free testing environment. Scores were used to identify low-performing candidates for the study. The tests assessed alphabetic knowledge (i.e., students wrote all the lower- and upper-case letters they knew), word writing (i.e., students had 10 minutes to write all the words they knew how to spell), and developmental spelling (i.e., 10 words were dictated and students wrote letters for the sounds they heard).
4. Reading Rescue Individual Pretests. Tutors administered 7 RES pretests individually to confirm students’ candidacy for tutoring in early November. Students (1) named lower case letters, (2) named upper case letters, (3) gave sounds of the letters or examples of words beginning with the letter-sounds, (4) blended spoken sounds to form words, (5) segmented words into phonemes, (6) read aloud nonwords, and (7) read sight words from graded lists of the informal reading inventory. Scores in these tasks were summed to yield an overall score. Students needed to know at least 17 letter names to qualify for RES tutoring.
5. Informal Reading Inventory. The Ekwall/Shanker Reading Inventory, 4th Edition (Shanker & Ekwall, 2000) was administered in early November. Scores on the sight word task above determined the proper starting point for reading the graded passages. Each passage was followed by comprehension questions. Two grade-level scores were calculated: word reading accuracy, and comprehension. Students began with the preprimer passage and continued until their word reading accuracy dropped below 91% or their comprehension below 60%. Any student who read the primer or Grade 1 passage at a minimum of 95% accuracy and at least 60% comprehension was disqualified.
The Tutoring Intervention. The program was not guided by a teacher’s manual with predetermined content specifying each day’s lesson. Rather tutors made instructional decisions that were guided by the RES scope and sequence of skills as well as by their analyses of students’ performance on assessments, the previous written record of text reading, and observations of students’ response to instructional activities. Tutors were trained to teach explicitly and systematically rather than incidentally, while being responsive to individual needs.
The intervention began in December, occurred during school hours, and consisted of “easing in” sessions followed by regular instructional sessions, and an “easing out” period. During easing in sessions, tutors administered the individual assessments and established rapport with students through such activities as drawing pictures and reading easy books together. Regular instruction consisted of a sequence of lessons organized into phases that gradually presented more challenging concepts and material as students made progress. Lessons taught fluency, phonological awareness and phonics, comprehension and vocabulary development, with the amount of time for each dependent on the phase of the program.
To apply the strategies and skills acquired, students read fiction and nonfiction small illustrated books from the Ready Readers series (Englebretson et al., 2000). These books were used but without the supplementary materials accompanying the books. The books were organized by difficulty, and they systematically introduced and provided practice with high frequency sight words as well as sound-symbol relationships in decodable words. The books provided practice in the phonics instruction that students received, and they were read and reread to build fluency. During the easing out sessions, a final review of the application of learned skills and strategies was conducted and students’ skills were assessed.
The following components were taught in this order during each session:
1. Fluency
Students started each lesson by rereading one or more familiar books. Beginning in Phase 1, oral reading accuracy, sight word knowledge, and confidence as a reader were stressed. In Phase 2, students practiced the application of blending, segmenting and syllabication skills. Timed readings of familiar text were used to increase reading rate and reduce errors. In Phase 3 and beyond, reading speed and expression were emphasized.
2. Word Analysis and Comprehension Strategies
Students reread orally a book that had been introduced to them during the previous lesson (see component 5 below). A written record was taken indicating the student’s word reading accuracy and use of word and text reading strategies. This served as a basis for deciding when to move to a higher level of difficulty. In selecting texts, tutors were advised to avoid books that were too easy or too hard and to maintain reading accuracy levels between 90% and 97%. In deciding which features of words to correct, tutors consulted a phonics elements chart. Errors on letter-sound relations that were less regular or were too advanced were not corrected.
3. Phonological Awareness and Word Study
Multisensory approaches were used. A phonics elements chart guided instruction and was supported by text reading practice in the Ready Readers. The phonics chart grouped instruction in letter-sound correspondences by phases. Phase 1 covered basic consonants, short vowels, and consonant-H digraphs. Phase 2 covered the short-long vowel distinction, additional consonants, and ER and ED endings. Phase 3 covered contractions, R-controlled vowels, additional vowel patterns, hard and soft G and C, and compound words. In addition, 18 to 23 phonograms were taught in each phase. Phonics was taught systematically and sequentially using both analytic and synthetic approaches. For example, students saw and heard a phonetically regular word, they repeated the word slowly saying its separate sounds while pointing to its letters, and then they spelled the word by moving plastic letters to segment and represent the sounds. Additional activities were employed for students having difficulty, such as tracing letters while saying their sounds, and using a mirror to monitor articulatory movements in words.
Phonemic awareness was taught systematically. During Phase 1, Elkonin boxes were used to teach phonemic segmentation. Students progressed from two-phoneme words to four-phoneme words, each illustrated by pictures so that pronunciations were recognized. Students kept a personal book, the Letters and Words I Know Book. The names and sounds associated with unfamiliar letters were taught and practiced. Segmenting and blending were taught using fingers and magnetic letters. In Phase 2 students were taught to decode by analogy using key words. They focused on high frequency words containing common phonograms. In late Phase 2 and beyond, tutors taught syllabication and other ways of dividing words to decode them, for example, common content words like carrot, morning, talking, bathtub.
4. Writing to Develop Phonological Awareness, Phonics, and Comprehension
In Phases 1 through 3, tutors assisted students in writing one or more sentences that the students had produced in conversation about the books they read. Attention was directed at spelling words correctly as well as comprehending the texts. When students were deemed ready in Phase 3, tutors taught students about text structure and how to organize information in texts they had read, for example, through the creation of semantic maps for expository text.
5. Comprehension and Vocabulary Development with the New Book
In the initial phase, tutors introduced a new book by talking through it with students, page by page. As the books became longer, introductions focused more on making predictions based on the title and cover page. One purpose was to develop oral language by encouraging students to talk about the books and by explaining the meanings of new vocabulary words. These words were written in students’ personal books, and the meanings were reviewed each time the book was read. Another goal was to spark the student’s interest in the book. The tutor coached the student through the reading of the book and prompted the application of the reading skills and strategies that had been taught. Students were encouraged to decode unknown words by relying on their letter-sound knowledge and then cross checking with meaning and pictures to confirm the identities of the words. Students were asked literal, inferential, and evaluative questions after the text was read. In Phase 3, students were taught the difference between text explicit and implicit questions.
Graduation from Reading Rescue. The RES program specifies multiple criteria for determining when students have met the goals of the program. Data on graduation criteria and decisions were not collected in the present study. Rather the reading achievement of all the students who received tutoring was assessed at the end of first grade.
Training of Tutors and Coordinators. The tutors were taught how to provide the intervention by Reading Rescue program staff. Training was guided by the Reading Rescue Tutor Handbook (Hoover, 2001). The initial phase consisted of five days of workshops delivered at the schools. The first two days at the beginning of the school year developed tutors’ understanding of the reading process, the components of the intervention model, the purpose, administration and scoring of the Classwide Screening Assessments, and how to begin working with students. The next three days of training took place after tutoring had begun and focused on the lessons.
A school coordinator who was a member of the school’s staff served as an on-site resource whose role was to provide modeling, feedback and support to tutors, to help tutors form a Peer Coaching Team, and to monitor implementation of the program. The coordinator had other literacy-related responsibilities within the school, but did not have classroom teaching duties. Coordinators were in their 2nd or 3rd year of implementation. They had attended advanced training institutes. They were given all the materials necessary, including video-taped lessons, to continue the training of tutors in their school. They were expected to schedule regular staff development meetings. RES program staff returned to each school several times to provide continued training and technical assistance for tutors and coordinators, and for quality assurance visits.
Assessment of Tutoring. Tutors completed lesson record sheets as they tutored during each session. The sheets reminded tutors of assessments, lesson components, activities, and reading strategies to be taught and mastered in each phase. Tutors listed dates, materials used to teach components, and how students responded.
To assess characteristics of tutoring and tutors’ adherence to procedures, two professionals thoroughly familiar with the program scored the records kept by tutors whose names had been removed. The scorers analyzed various characteristics including number of easing in, easing out, and regular tutoring sessions, the number of readings or rereadings of books, and reading accuracy levels which consisted of the proportions of readings that were completed at a 98-100% accuracy level, at a 90-97% accuracy level, and below 90%. These reading measures were taken from students’ second exposure to the books when they read them independently while tutors kept a running record. The first exposure to the books had occurred during the previous session when tutors coached students through the book (see Components 2 and 5 of the intervention above).
To provide an index of tutors’ adherence to the intervention model, judges rated the following six dimensions of the tutoring records on a four-point scale to indicate whether expectations were met: the number of books read; the difficulty of the books; selection of elements for word study; features of the sentences written; instructional decisions made during sentence writing; provision of tutoring on a regular basis on the days available. Scores were summed to provide an adherence measure with a maximum score of 24. To illustrate, the criteria used to rate tutors’ selection of sentences was as follows:
Rating of 4: Sentences chosen by the tutor for the student to write conformed to the
recommended criteria with regard to their length, complexity, and lexical features.
Rating of 3: With few exceptions, sentences chosen by the tutor conformed to the
recommended criteria.
Rating of 2: Many sentences chosen by the tutor conformed to the recommended criteria.
Rating of 1: For the majority of lessons, sentences chosen by the tutor did not reflect program
recommendations, or many sentences were not recorded.
The two judges rated all of the records independently and were in agreement on 90 percent of their ratings. Disagreements were discussed and resolved. Non-conformance to the program corresponded to a rating of 1 on five dimensions and 2 on one dimension (total score of 7). Conformance to the program corresponded to ratings of 2 or above on five dimensions and 3 or above on one dimension for a minimum adherence score of 13.
Small Group Intervention. Some of the struggling readers in the control groups (C1 and C2) across the 8 schools received instruction in a small group (SG) intervention program, Voyager Passport (2004). The SG program was adopted by the school district as a supplemental intervention for first-grade struggling readers with the goal of bringing their reading up to grade level. This was the first year of implementation. The program provided intensive, explicit, scripted instruction consisting of 130 lessons scheduled to be taught daily for 26 weeks. Tutor training in the program was provided by the district and consisted of one session plus on-site assistance by a literacy coach. Because instruction was strictly controlled by a script, more extensive training was not regarded as necessary.
Sessions were conducted in groups (3-6 students) and lasted for 30-40-minutes. The program taught the same components as the RES program. These were phonemic awareness, phonics, daily story reading, vocabulary instruction that included multiple exposures to new words, reading and listening comprehension with both narrative and expository passages, and fluency. Phonics instruction was emphasized and included teaching letters and their sounds, decoding regular words, reading high frequency irregular words, teaching various word types, word building exercises, and spelling. Fluency included practice in speeded letter naming and word reading as well as text reading in a limited number of decodable books. The program included a Vital Indicators of Progress Guide for assessment which was scheduled to occur every fifth lesson. Parents were expected to provide home support with take-home readers. Once students entered the program, they remained for the duration of the school year. The small group intervention was taught by many of the same staff members who also tutored in the RES program in the five RES schools. However, it was also taught to some of the C2 students in the non-RES schools by teachers and paraprofessionals who were not trained as RES tutors.
Posttests. Students in the RES and C1 groups completed several individually administered posttests in May: two WRMT-R (Woodcock, l987) subtests given by trained reading specialists, and the RES Posttests given by tutors. Classroom teachers administered the GMRT4 to first graders in all eight schools and the RES Classwide Screen to first graders in the RES schools.
1. WRMT-R Word Identification Test. Form G of the Woodcock Reading Mastery Tests-Revised (Woodcock, 1987) required students to read a list of real words that increased in difficulty. The median split-half reliability of this measure is reported to be 0.98.
2. WRMT-R Word Attack Test. Students read a list of decodable nonwords. The median split-half reliability of this measure is reported to be 0.94 (Woodcock, l987).
3. Reading Rescue Individual Posttests. The seven subtests of the RES Individual Pretests were repeated as posttests along with the Ekwall/Shanker Reading Inventory, 4th Edition (Shanker & Ekwall, 2000). (See descriptions above.) The maximum score on the RES posttests was higher, 223, because the sight word test included 110 words. The highest passage given to any student in the posttest phase was the third grade passage.
4. Gates-MacGinitie Reading Test. The GMRT4, Level 1 (MacGinitie et al., 2000) was group administered by classroom teachers. Questions were read and answered by students independently. The test consisted of two subtests whose scores were treated separately as well as combined into a total score. The K-R 20 reliability of the total score is reported as 0.96 (MacGinitie et al., 2002).
a. Word Decoding. There were 43 items. On each, students identified which of four words corresponded to a picture. The foils resembled the correct words in appearance or sound. Knowledge of letter-sound correspondences and letter sequences was required to make correct choices. K-R 20 reliability of this subtest is reported as 0.94 for the spring of Grade 1 (MacGinitie, et al., 2002).
b. Reading Comprehension. Students read passages covering both fiction and nonfiction and answered each comprehension question by choosing from a panel of pictures the one that illustrated or answered a question about a segment of the text. K-R 20 reliability of this subtest is reported as 0.93 for spring of Grade 1 (MacGinitie et al., 2002).
Design and Analyses
Performance of the group receiving RES tutoring was compared to performance of two different configurations of the same control students. The first configuration divided the students into a control group drawn from the same five schools as the RES students, referred to as Control 1 (C1), and a control group drawn from the three schools that did not offer RES, Control 2 (C2). The purpose of including C2 was to check on the possibility that the presence of RES and the professional development it offered to schools might contaminate instruction provided to control students in those schools. The second configuration divided the students into those receiving the small group intervention (SG) and those who did not receive this intervention (NI). Students in SG and NI groups came from all 8 schools.
Analyses of variance (ANOVA) were applied to fall pretest scores to determine whether the groups differed in entry-level reading skills. To assess effects of the RES and SG interventions, analyses of covariance (ANCOVA) were applied to spring posttest scores with GMRT4 pretest scores as the covariate. The dependent variables included word decoding and reading comprehension scores on the GMRT4, word reading and nonword reading scores on the WRMT-R, and performance on the RES program tests. In some analyses, data were missing for a few students. Statistical tests were conducted on actual scores.
To compare the three types of tutors--reading specialists, credentialed staff, and paraprofessionals--the performance and tutoring experiences of their students were subjected to ANOVAs. To determine whether features of the tutoring instruction predicted students’ reading achievement at the end of first grade, regression analyses were conducted.
Results
Equivalence of Reading Rescue Treatment and Control Groups on the Pretests. Table 1 shows the mean pretest performance of students who received RES tutoring and control students. From mean percentiles on the fall GMRT4 reading test and ITBS vocabulary test in Table 1, it is apparent that participants were struggling readers at risk for future reading difficulties and had English language vocabularies well below average compared to national norms for first graders.
The ANOVA of pretest scores with treatment group as the independent variable revealed that the three groups did not differ on the vocabulary test but they did differ on the GMRT4 test. A post hoc Bonferroni test indicated that the RES and C2 groups performed equivalently and significantly better than the C1 group. This was unexpected because RES and C1 students had been matched on their performance on the RES individual tests. One possible reason for poorer performance of the C1 group is that four out of five RES schools undermined random assignment by assigning students they deemed struggling readers to the small group intervention.
To check on whether C1 students were poorer readers than RES students on the other literacy pretests, mean scores were compared. ANOVAs revealed that the groups did not differ on any of the three group-administered RES Classwide Screen tests (all ps > .05) or on any of the 7 individually administered RES pretests (all ps > .05). Correlations between the RES pretests and the GMRT4 were as high as r = .69 (developmental spelling) and r = .67 (sight word reading) indicating that the RES tests were not lacking in sensitivity. These findings raise doubt that the RES and C1 groups actually differed in their reading level in the fall.
Nevertheless, to take account of the possibility of initial differences in ability favoring the RES and C2 groups, fall scores on the GMRT4 were used as a covariate in ANCOVAs of spring posttest scores. This analysis and choice of a covariate to adjust for initial differences in reading ability were considered legitimate for several reasons: the GMRT4 test has high reliability; it was administered to all participants at the same time of year before tutoring began; it was strongly correlated with the next level in the GMRT4 test series given as a posttest (r = .68, p < .001, N=844).
Comparison of Reading Rescue Treatment to C1 and C2 Control Groups on Posttests. Results of the ANCOVA applied to posttest scores revealed significant main effects of treatment group. Table 1 reports means and test statistics on the nationally normed tests. Bonferroni post hoc comparisons on GMRT4 performance revealed that RES tutored students decoded significantly more words and comprehended text significantly better than both of the control groups. In turn, the C1 group, located in RES schools, decoded significantly more words than the C2 group, located in non-RES schools, but comprehension of the two groups did not differ. This finding of a difference suggests that the presence of the RES program in schools may have improved the decoding instruction that struggling readers in these schools received.
On the WRMT-R tests that were individually administered, the RES tutored group read significantly more words and decoded significantly more nonwords than the C1 group. Mean grade-equivalent scores on the WRMT-R tests in Table 1 show that RES students were equally skilled at reading real and nonwords, an indication of good phonics skills. In contrast, C1 students were weaker in reading nonwords than real words.
Effect sizes were calculated by dividing the difference between the RES and C1 adjusted means by the pooled standard deviation on each measure. According to Cohen (1988), an effect size of 0.20 is considered small, an effect size of 0.50 is moderate, and 0.80 and above is large. Results on the GMRT4 were 0.53 (decoding), 0.43 (comprehension), and 0.50 (total score). Results on the WMRT-R were 0.72 (word reading) and 0.65 (word attack). Thus, effect sizes favoring the tutoring treatment over no tutoring were in the moderate range.
Performance of the RES tutored and C1 control groups on posttests from the RES batteries are shown in Table 2. ANCOVAs were conducted with fall GMRT4 scores as the covariate. The groups did not differ on four of the five tests of letter knowledge where means were close to ceiling indicating that students in both groups had learned most of the letter names and sounds. However, significant main effects were observed on other posttests. RES tutored students outperformed control students in writing lower case letters, segmenting and blending phonemes, reading sight words and pseudowords, generating plausible spellings of words, and reading and comprehending text. Effect sizes ranged from moderate to large on these measures. These findings provided additional evidence that struggling readers who received RES tutoring read significantly better at the end of first grade than struggling readers who did not receive this program.
Another question of interest was whether RES tutoring would raise students’ reading achievement from below average in the fall to an average level in the spring based on national test norms. We regarded the minimum average level as performance at the 40th percentile. From Table 1, it is apparent that on the GMRT4, RES students performed at the 50th percentile in decoding words, thus exceeding the minimum average level. RES students performed at the 38th percentile on the reading comprehension test. Although this is somewhat below the national average, reaching this level of achievement on a grade-appropriate test is impressive given that these were primarily low SES students with low vocabulary scores from homes where English was not the first language. On the word and nonword reading subtests of the WRMT-R, RES tutored students’ mean grade-equivalent scores placed them at the 1.9 grade level which is the level expected at the end of first grade. These findings show that RES tutoring did indeed raise the means of the group from below average to average levels based on national norms. In contrast, both control groups performed well below the normal range (see Table 1).
To determine what proportion of the struggling readers in each group reached average levels by the end of the year on the GMRT4 posttests, scores of individual students were examined. As shown in Table 3, before tutoring began, very few struggling readers scored at or above the 40th percentile across the groups, only 5% to 9%. However, at the end of the year, a much greater proportion of RES-tutored students than C1 or C2 students’ scores reached average levels. When RES students were compared to the entire sample of first graders (i.e., the WS group in Table 3), a greater proportion of RES tutored students reached average levels on the decoding test, and about the same proportions of RES and WS students reached average levels on the comprehension test. These findings show that the RES intervention brought many more of the struggling readers to average levels than would have occurred without this program. In fact, the record of success for RES students was comparable if not better than the record for the entire sample of first graders.
Supplementary Comparison of RES Treatment Group to a Larger Control Group. Because the RES and C1 groups differed on one of the pretests, this left open the possibility that the groups reflected different populations of struggling readers. To address this possibility, a larger control group was created from the five schools implementing the tutoring program. This group included not only the 62 C1 struggling readers but also 62 struggling readers who qualified to receive tutoring but were not selected for the RES or C1 group. Comparison of entry-level performance of the RES group to this larger control group revealed no statistically significant differences on any of the pretests, including the GMRT4 (all ps > .05). The only pretest that approached significance was the RES individually administered phoneme segmentation test (p < .07), with means favoring the control group. Thus, entry-level reading capabilities did not distinguished the RES group from this larger control group. ANCOVAs of posttest scores with the fall GMRT4 as the covariate revealed significant main effects of treatment, with the RES group outperforming the control group on all three GMRT4 measures (decoding, reading comprehension, total score, all ps < .001). These findings replicate those above and show that the positive effects of the RES tutoring intervention held when compared to all struggling readers who were eligible for the program but did not receive it.
Comparison of Students Receiving the Tutoring Intervention, Small Group Intervention, and Neither Intervention. Students in the C1 and C2 control groups were reconfigured to distinguish those who received a small group intervention (SG) from those who received neither intervention. Performance of these groups was compared to that of students receiving RES tutoring. ANOVAs of pretest means reported in Table 4 revealed a significant main effect of treatment group on the GMRT4 but no significant effect on the ITBS vocabulary test. Bonferroni post hoc comparisons on the GMRT4 showed that the RES group did not differ significantly from the SG or NI groups, but the NI group performed significantly better than the SG group. These findings indicate that even though some of the schools assigned struggling readers to the SG group and thus undermined random assignment of matched pairs, the reading level of students receiving the small group intervention was not significantly lower than that of students receiving the tutoring intervention.
To adjust for initial pretest differences statistically, ANCOVAs were conducted on outcomes with fall GMRT4 scores as the covariate. The groups were compared only on the GMRT4 posttest because of incomplete data for the SG and NI groups on the other measures. Results revealed main effects of treatment (see Table 4). Bonferroni post hoc pairwise comparisons showed that in all three analyses, RES tutored students significantly outperformed the SG and NI groups who did not differ. These findings indicate that struggling readers who received RES tutoring decoded words and comprehended text more effectively than struggling readers who received the small group intervention as well as readers who received neither intervention.
The small group intervention was not effective in raising performance above that of students who did not receive this intervention. Effect sizes comparing mean performance of the SG and NI groups were very small and nonsignificant, ranging from -.08 on the word decoding measure to +.11 on the reading comprehension measure. Moreover, the mean performance of both groups was well below average based on national norms (see Table 4). These findings were somewhat surprising. The small group intervention program was comprehensive in its coverage of beginning reading skills with a strong emphasis on systematic phonics instruction.
We also assessed the proportion of RES, SG, and NI students whose scores rose to average levels (i.e., at or above the 40th percentile) on the GMRT4 at the end of first grade. As shown in Table 3, although proportions rose dramatically from fall to spring, many fewer SG and NI children than RES tutored children scored within the normal range at the end of the school year.
In sum, these findings show that RES tutoring helped struggling first-grade readers progress in their reading more than the small group intervention program and more than receipt of neither intervention. In contrast, the small group intervention did not improve the reading achievement of students over that of students not receiving this intervention.
Tutors and Characteristics of Tutoring. During each tutoring session, the tutors kept records. These were used to assess the occurrence of several features of tutoring. Mean values are reported in Table 5. The mean number of total tutoring sessions was 49.6, with a standard deviation of 18, indicating that individual students differed substantially in the amount of tutoring they received. The amount of text reading and rereading completed by students was extensive, M = 164 texts, with a large standard deviation also indicating much variability. Not surprisingly, the correlation between the number of sessions and the number of books read was high, r = .79, p < .01.
The program recommends that books be selected to enable students to practice reading at their instructional level, indicated by accuracy scores ranging from 90% to 97%. To examine text accuracy levels, running records were evaluated. These were recorded during the first independent reading of newly introduced books after the books had been previewed and coached during the previous session. As indicated by means in Table 5, only 37% of the books were read at the recommended instructional level. More books, 50%, were read at an independent level (98-100% accuracy), and only 15% of the readings were at a frustration level (below 90% accuracy). This reveals that book reading accuracy was even higher than that recommended by the program.
The records that tutors kept were used to assess their adherence to the program. Raters evaluated adherence on six dimensions, such as the number of books read in each session, the difficulty level of the books, the appropriateness of the sentences written, and whether tutoring was provided on a regular basis. Ratings were summed to provide an overall score (maximum of 24). A total score of 13 was considered the minimum level indicating full adherence to recommended practice. As seen in Table 5, the mean value, 13.5, shows that tutors on average were performing slightly above this level. The small standard deviation (3.2) indicates that there was limited variation in scores. Inspection of scores showed that 63 percent of the tutors displayed full adherence to program procedures. Two dimensions displayed the strongest adherence, indicating that tutors were best at selecting the appropriate number of books to read, and having students write appropriate sentences that contained sight words and words with the letter-sound relations being learned. These findings indicate that the majority of the tutors did conform to expected instructional procedures.
To determine whether tutors’ adherence to the program influenced how well their students performed on outcome measures, correlations were examined between adherence scores and the various posttests. Results revealed little relationship, with values ranging from r = 0.06 on the Woodcock word identification test to r = 0.20 on the GMRT4 total score, all ps > .05. Because tutors did not differ much in their adherence to the program, little relationship to outcomes was observed.
Three types of tutors worked with students: reading specialists, other credentialed personnel, and paraprofessionals. To determine whether the three types differed in their tutoring effectiveness, ANOVAs were conducted with type of tutor as the independent variable and various measures taken on their students as the dependent variables. All but three of the 59 tutors taught only one of the 64 RES students, so most data points emerged from different tutors.
The three types of tutors did not differ in the reading level of their students when tutoring began, as indicated by a non-significant main effect of tutor type on the GMRT4 fall pretest (F < 1, p > .05). This is as expected because students were randomly assigned to tutors. ANCOVAs of reading outcomes with fall GMRT4 as the covariate revealed no main effects of tutor type on the GMRT4 posttests, the WMRT-R word identification posttest, or the RES program posttests (all Fs < 1). However, a difference was detected on the WRMT-R nonword test assessing students’ ability to decode unfamiliar words. As shown in Table 5, Bonferroni post hoc analyses revealed that reading specialists were significantly better at teaching students to decode than paraprofessionals (p < .003).
Measures of events that occurred during tutoring were subjected to ANOVAs with tutor type as the independent variable. From Table 5, it is apparent that tutors differed significantly in the number of regular sessions they conducted with students. Post hoc analyses revealed that the paraprofessionals conducted significantly more sessions than the other two types of tutors. However, the tutors did not differ in the number of easing in or easing out sessions. In addition, the tutors differed significantly in the number of books that they had students read and reread. Paraprofessionals exceeded reading specialists in this regard, not surprisingly because they provided more tutoring sessions. Other tutoring events were examined as well, but as shown in Table 5, no other effects of tutor type were detected (all ps > .05). The fact that reading specialists required fewer sessions to achieve effects than paras suggests that reading specialists were more efficient in their tutoring.
From these findings we conclude that paraprofessionals tutored as effectively as the credentialed teachers who were not reading specialists. Moreover, they tutored as well as the reading specialists in strengthening students’ word reading and text comprehension skills, but not as well in teaching the skills that benefit nonword decoding. In addition, paras took more sessions to work with students than reading specialists, indicating they were less efficient in delivering the tutoring. Perhaps decoding skill was more effectively taught by reading specialists, because it requires special knowledge about the alphabetic system (Moats, 2000) or because reading specialists regarded it as more important to teach than paras did.
Prediction of Reading Achievement for Students Receiving RES Tutoring. The next question of interest was whether entry-level abilities of tutored students and characteristics of tutoring might explain why some students improved more in reading than others. The outcome measure was the total GMRT4 spring score, which was highly correlated with the other reading outcome measures (i.e., r’s ranged from .62 to .93) indicating that it was indicative of general reading performance. Correlation coefficients between this outcome and each of the other variables involving students’ entry-level skills and tutoring characteristics revealed several statistically significant values reported in Column 1 of Table 6. The significant predictors included performance on the fall GMRT4 pretest, letter writing on the pretests, RES program individual pretests (total score), the proportions of books read at the three accuracy levels, and the number of easing in sessions. The two accuracy levels, 90-97% and < 90%, were each strongly and negatively correlated with the 98-100% accuracy measure (see Table 6), indicating that all three measures reflect one factor, text reading accuracy. This factor is best represented by the 98-100% accuracy measure. The number of easing in sessions correlated negatively with the GMRT4. One reason may be that students with more severe reading problems often took longer to assess because tutors spaced out the tests to avert a sense of failure. None of the other student or tutoring variables was correlated significantly with reading achievement (all ps > .05).
Regression analyses were conducted with the spring GMRT4 total score as the reading outcome. The predictors were the four pretest measures, the 98-100% text accuracy measure, and the easing in measure. When all of the predictors were entered as a single set, only the 98-100% text accuracy variable explained significant variance in the model, with intercept = 21.3, slope = 25.5, p < .001. When the accuracy variable was entered first in a stepwise regression, none of the other variables explained any significant additional variance (all p’s > .05). When the accuracy variable was entered last in a stepwise regression, it explained significant additional variance (p < .001), with r2 increasing from .36 to .51, and contributions of each of the other predictors becoming non-significant (all ps > .05). These findings indicate that high text reading accuracy during tutoring was the strongest predictor and the only unique predictor of students’ reading achievement at the end of first grade.
The strong relationship between high text reading accuracy during tutoring and reading is open to at least two interpretations. One is that the relationship may reflect the effects of tutoring. Children whose tutors enabled them to read a greater proportion of texts at high accuracy levels became better readers by the end of the year than children whose tutors did not do this. Higher levels might have been achieved either by tutors’ selecting easier texts or by tutors previewing and coaching students more effectively through the texts during the previous session when the books were introduced. The other interpretation is that students who entered the study as the better readers were able to read texts at higher levels of accuracy during tutoring than the weaker readers and also achieved higher reading scores at the end of first grade. Findings of the regression analyses appear to rule out the second interpretation. Pretest reading skills of students taken when they began tutoring did not explain variance on the reading posttest once the impact of the text accuracy tutoring variable was considered. Also these students had very limited reading skills when tutoring began, so their growth in reading resulted primarily from the instruction that they received, not from entry-level reading skill. These findings favor the explanation that students’ experiences learning to read texts at high accuracy levels during tutoring influenced how well they read as a result of the tutoring.
Discussion
Several important findings emerged in this study. First grade struggling readers who received tutoring according to the Reading Rescue intervention model made significantly greater improvement in reading than struggling readers who did not receive this tutoring but were enrolled in the same schools, and also greater improvement than struggling readers who were enrolled in comparable schools not using the program. Also students receiving RES tutoring outperformed students receiving a commercially produced, small group intervention program mandated by the school district for struggling readers. An advantage occurred despite the fact that both intervention programs were comprehensive and provided instruction in the same components. Greater gains in reading by tutored students were observed on standardized measures of word and pseudoword reading and reading comprehension. In fact, RES tutoring raised the majority of students’ reading from below average to average levels. This contrasted with many fewer students reaching average levels in the comparison groups.
The reading achievement of students who received RES tutoring appeared to be explained primarily by one aspect of their tutoring experience, reading texts at a high level of accuracy, between 98 and 100%. This finding needs more careful consideration because it appears to contradict a commonly accepted view about the quality of text reading that is most effective for building fluency. Rasinski (2003) stated this shared wisdom in his book, The Fluent Reader (p. 63), “As with any direct-teaching activity, the greatest gain will occur when the difficulty of the material is at the student’s instructional level (i.e., 90-95 percent accuracy in word recognition)—neither too hard nor too easy, just right!” In fact, the developer of the RES tutoring model adopted this view and recommended it to tutors. However, present findings suggest otherwise.
Following program guidelines, tutors in the present study coached students through a book during one session and then in the next session had students reread the book, this time independently, and recorded their accuracy. Results revealed that the proportion of texts read at an instructional level (90-97% accuracy) was strongly and negatively correlated with positive growth in reading, whereas the proportion of texts read at a higher, independent level (98-100% accuracy) was strongly and positively correlated with reading growth. In fact, the latter, high accuracy measure figured more prominently than any other variable in explaining variance on the reading posttest. Even when students’ entry-level reading skill was controlled statistically, this tutoring variable still explained significant variance. Before concluding that reading text at high accuracy levels is important for promoting growth in struggling readers, we need to consider alternative interpretations.
In the present study, the high accuracy levels that proved important occurred on text that was read independently after prior coaching. It may be that conventional wisdom regarding recommended accuracy levels (i.e., 90-97%) applies to text read without prior coaching. Reading text cold is certainly susceptible to more errors. What difference this makes deserves further research.
Monitoring high text accuracy levels may have proven important for reading achievement because it reflected the impact of the preview session for improving students’ word reading. According to theories of word learning (Ehri, 1992; Perfetti, 1992; Rack, Hulme, Snowling, & Wightman, 1994; Share, 1999, 2004a), readers remember how to read words when they possess knowledge of grapheme-phoneme relations that provide the connections securing the written words to their pronunciations and meanings in memory. Perhaps more thorough coaching by tutors to help students use decoding skill to secure words in memory as they previewed the decodable texts throughout tutoring was an important factor strengthening students’ word and text reading accuracy and their general reading ability. This possibility needs further study.
Another possible factor contributing to the effectiveness of the RES model merits consideration, specifically, the criteria used to qualify struggling readers for the RES program. Rather than selecting all first graders at the bottom of the class, the program selected those who knew at least 17 letters yet who received low scores on other literacy tasks such as phonemic awareness and who lacked any text reading ability. Many studies have shown that letter name knowledge is the best single kindergarten predictor of success in learning to read (Scarborough, 2001). If students have not learned the shapes and names or sounds of letters, it is impossible for them to acquire word decoding skill. Share (2004b) showed that if students already knew the names of letters it was much easier for them to learn their sounds because most names contain their sounds. Letter names or sounds are typically taught in kindergarten, so first graders are expected to know them. Selecting students who already possess substantial letter knowledge reduces the need for RES tutors to spend time teaching letters and allows instruction to focus primarily on the application of letter knowledge to acquire reading and spelling skills. According to the model, low performing readers who lack sufficient letter knowledge at the beginning of first grade are admitted for RES tutoring later in the year once they qualify. This more refined tailoring of program characteristics to suit student capabilities very likely enhanced its effectiveness.
Although having letter knowledge when they began RES tutoring may have enabled students to make better progress, it is important to note that this factor does not explain why RES students outperformed control students at the end of the year. In the present study, control students met the letter knowledge criteria as well when they were selected for the study. This suggests that the nature of the instruction as it capitalized on entry-level letter knowledge was of primary importance in explaining the greater success of RES students in learning to read.
Another purpose of the study was to assess the effectiveness of various types of tutors. Present findings indicated that paraprofessionals delivered RES tutoring as effectively as reading specialists and credentialed teachers except in two respects. Paras were less successful in impacting students’ ability to decode pseudowords than reading specialists were. Paras provided more tutoring sessions than the other tutors and, as a result, had students read and reread more books. The fact that paras’ students were just as good at reading real words and comprehending text as students taught by the other tutors suggests that paras were less efficient in their tutoring.
Interestingly, one of these findings replicates and extends that reported by Brown et al. (2005) who found that students tutored by paras did not differ from students tutored by teachers on tests of word and text reading but only on the WMRT-R pseudoword reading test, the same test used in the present study. Although both studies show agreement, confidence in the present finding is greater because tutors here were randomly assigned to students whereas tutors in the Brown et al. study were not.
What might explain the shortcoming in paras’ tutoring that was observed in both studies? Perhaps paras spent less time teaching decoding than the reading specialists did. In the present study, paras had their students read more books than other tutors. Perhaps paras lacked the background knowledge to teach phonics effectively, or perhaps they did not consider decoding as important to teach. According to Moats (2000), a high level of knowledge about the alphabetic system is needed to teach systematic phonics programs effectively. These findings and their explanation deserve further study to determine whether more extensive staff development in how to teach decoding is needed to prepare paraprofessional tutors adequately.
Comparison of the RES tutoring intervention to the small group intervention suggested that tutoring was more effective. Differences in reading outcomes were observed despite the fact that the reading instruction components taught were similar in the two programs. Tutoring may have proven especially effective because of the greater ease of adapting instruction to the needs of individuals, and the greater amount of reading practice with feedback that was possible. Alternatively, extenuating circumstances may have created an advantage for tutoring in the present study. The RES program had been in use for two to three years in the five schools, whereas the SG program was in its first year of implementation. RES tutors and staff were more experienced delivering RES than SG, so this may explain the difference. On the other hand, new programs can benefit from a novelty effect in the first year, and this would be expected to benefit the SG program.
The author of the Reading Rescue Program suggests that implementing the program in schools may raise the reading achievement of even those first graders who are not tutored (Hoover, 1996). This general benefit may result from the professional development and tutoring experience that teachers and paras on the staff receive. Some evidence for this was detected in the present study. Control students enrolled in RES schools outperformed control students in non-RES schools on the GMRT4 word decoding posttest. The possibility that participation in the RES program improved teachers’ ability to teach decoding outside of the RES program in their classrooms merits systematic study.
Possible limitations of the present study need to be considered. Because we were unable to assign all the students randomly to the tutored and control groups, our design was quasi-experimental. However, we took steps to match students in assigning them to the groups, and we adjusted for any entry-level differences in our statistical analyses of outcomes. In addition we compared the tutored group to several different comparison groups and found an advantage for tutoring in all cases.
A second possible limitation is that our paraprofessionals were relatively well educated, with many holding college degrees. Our findings may not generalize to paraprofessionals with less than a high school diploma. This possibility awaits study.
A third limitation is that only short-term effects of the RES tutoring model were tested, at the end of the year when instruction ended. Of equal interest is whether the effects of tutoring in first grade are lasting and maintain students’ reading at average levels in subsequent grades. This matter awaits study.
A fourth limitation bears on the comparison of tutoring to small group intervention. These two forms of instruction were confounded with other variables. The two systematic phonics programs were similar but not identical. RES tutors received more training and were more experienced using their program than SG tutors were. The time that students spent receiving SG instruction was not monitored so comparability to RES time is unknown. As a result, it is not clear that the instructional delivery unit was the critical factor explaining the difference in students’ performance following the interventions, so any conclusion about the greater effectiveness of tutoring over small group instruction remains tentative.
Present findings do make it clear, however, that the small group intervention was not effective, whatever the cause. Struggling readers who received this intervention performed no better than struggling readers who did not receive it. This came as a surprise. The program held a strong reputation and was mandated by the district. The program was scripted and explicit regarding the steps for delivering instruction. Reading specialists as well as paraprofessionals served as the teachers. Sessions were conducted on a regular basis at all the schools. Because this program is used widely, uncertainty about its effectiveness merits further study. One possibility to explore is that admission of students without letter knowledge to the small groups necessitated attention to letter instruction and as a result limited the progress of group members who already knew letters (i.e., students in the SG group of the present study).
Despite some limitations, the present study has many strengths. This was a well designed, field-based research study, not all that common in the literature, and never perfectly executed due to the capriciousness of school environments. The study was conducted in several typical urban, low SES elementary schools. Procedures of the intervention programs were implemented by school personnel with few if any modifications introduced by researchers. As a result, the study has high external validity. Random assignment was employed with tutors so the internal validity of conclusions regarding their effectiveness is high. Although random placement of students in treatment and control groups was not fully implemented, statistical adjustments applied to multiple outcome measures maintained the internal validity of the study.
Findings advance our understanding of the nature and impact of early intervention, and they carry important implications for practice. Results offer evidence for the effectiveness of a specific tutoring intervention model providing comprehensive instruction in multiple components of reading needed by struggling beginning readers. The model proved effective with the neediest students, those in urban, high-poverty schools with large numbers of minority students and students from bilingual homes. The core of the intervention model is not a commercial program but rather involves a service offered by a nonprofit foundation whose staff helps schools institute a tutoring program by using existing teachers and paraprofessionals already employed at the school. As a result, implementation costs substantially less than programs requiring the hiring of credentialed, specially trained teachers. The present study contributes by documenting the effectiveness of this approach to reading intervention. In addition, findings show how other interventions programs for struggling readers in first grade might be structured to maximize their effectiveness.
References
Biemiller, A. (1999). Language and reading success: From reading research to practice: A series for teachers (Volume 5). Cambridge, MA: Brookline Books.
Block, C. & Pressley, M. (Eds.) (2002). Comprehension instruction: Research-based best practices. New York: Guilford Press.
Britt, L.A. (2002) Reading Rescueâ in District Six Community Schools of the New York City Public School System. (Technical Report). Gainesville, FL: The Literacy Trust, Inc.
Brown, K., Morris, D., & Fields, M. (2005). Intervention after Grade 1: Serving increased numbers of struggling readers effectively. Journal of Literacy Research, 37, 61-94.
Clay, M. (1985). The early detection of reading difficulties. (3rd ed.) Portsmouth, NH: Heinemann.
Clay, M. M. (1993a). An observation survey of early literacy achievement. Portsmouth, NH: Heinemann.
Clay, M. (1993b). Reading Recovery: A guidebook for teachers in training. Portsmouth, NH: Heinemann.
Clay, M. M. (2002). An observation survey of early literacy achievement (2nd ed). Portsmouth, NH: Heinemann.
Cohen, J. (1988). Statistical power analysis for the behavior sciences (2nd ed.). Hillsdale,
NJ: Erlbaum.
Cunningham, P. & Hall, D. (1997). Month-by-Month Phonics for First Grade: Systematic, Multilevel Instruction. Greensboro, NC: Carson-Dellosa.
Ehri, L. C. (1992). Reconceptualizing the development of sight word reading and its relationship to recoding. In P. Gough, L. Ehri, & R. Treiman (Eds.), Reading Acquisition, (pp. 107-143). Hillsdale, NJ: Erlbaum.
Ehri, L., Nunes, S., Willows, D., Schuster, B., Yaghoub-Zadeh, Z., & Shanahan, T. (2001). Phonemic awareness instruction helps children learn to read: Evidence from the National Reading Panel's meta-analysis. Reading Research Quarterly, 36, 250-287.
Ehri, L., Nunes, S., Stahl, S., & Willows, D. (2001). Systematic phonics instruction helps students learn to read: Evidence from the National Reading Panel's meta-analysis. Review of Educational Research, 71, 393-447.
Elbaum, B., Vaughn, S., Hughs, M., & Moody, S. (2000). How effective are one-to-one tutoring programs in reading for elementary students at risk for reading failure? A meta-analysis of the intervention research. Journal of Educational Psychology, 92, 605-619.
Englebretson, R., Hiebert, E., & Juel, C. (2000). Ready Readers. Parsippany, NJ: Modern Curriculum Press.
Foorman, B., Francis, D., Shaywitz, S., Shaywitz, B., & Fletcher, J. (1997). The case for early reading intervention. In B. Blachman (Ed.), Foundations of reading acquisition and dyslexia: Implications for early intervention (pp. 243-264). Mahwah, NJ: Erlbaum.
Greany, K., Tunmer, W., & Chapman, J. (1997). Effects of rime-based orthographic analogy training on the word recognition skills of children with reading disability. Journal of Educational Psychology, 89, 645-651.
Hatcher, P., Hulme, C., & Ellis, A. (1994). Ameliorating early reading failure by integrating the teaching of reading and phonological skills: The phonological linkage hypothesis. Child Development, 65, 41‑57.
Hoover, H., Dunbar, S., Frisbie, D., Oberley, K., Bray, G., Naylor, R., et al. (2003). Iowa Test of Basic Skills Complete/ Core Battery Norms and Score Conversions. Itasca, IL: Riverside.
Hoover, H., Dunbar, S., Frisbie, D., Oberley, K., Ordman, V., Naylor, R., et al. (2001). The Iowa Tests of Basic Skills. Itasca, IL: Riverside.
Hoover, N. (1994). Reading Rescue for Elementary Readers: An Early Literacy Intervention Program 1993-1994 End-of- Pilot Year Report. (Tech. Report No. 1). Gainesville, FL: University of Florida.
Hoover, N. (1995). Reading Rescue for Elementary Readers: An Early Literacy Intervention Program 1994-1995 End-of-Second Year Report on Student Progress at Interlachen Elementary School, Putnam County, Florida. Gainesville, Florida. (Tech. Report No. 2). Gainesville, FL: University of Florida.
Hoover, N., & Sullivan, L. (1996, April). Teaching at-risk children to read: One school’s successful implementation of cost-effective tutoring. Paper presented at the annual meeting of the American Educational Research Association, New York, N.Y.
Hoover, N. (1996, April). Primary teachers learn to teach at-risk students to read: Empowerment and improvement through a school-based inservice model. Paper presented at the annual meeting of the American Educational Research Association, New York, NY.
Hoover, N. (2001). Reading Rescueâ: A Handbook for Tutors. Gainesville, FL: The Literacy Trust. Inc.
Hoover, N., & Lane, H. (2001, May). Preventing Literacy Failure through Cost-effective Tutoring: A Report on Outcomes in High-poverty and Rural Schools. Paper presented at the annual meeting of the International Reading Association, San Francisco, CA.
Iversen, S., & Tunmer, W. (1993) Iversen, S., & Tunmer, W. (1993). Phonological processing skills and the Reading Recovery Program. Journal of Educational Psychology , 85, 112‑126.
Juel, C. (1988). Learning to read and write: A longitudinal study of 54 children from first through fourth grades. Journal of Educational Psychology, 80, 437-447.
Juel, C. & Roper-Schneider, D. (1985). The influence of basal readers on first grade reading. Reading Research Quarterly, 20, 134-152.
MacGinitie, W., MacGinitie, R., Maria, K., & Dreyer, L. (2000). Gates-MacGinitie Reading Tests, 4th Edition. Itasca, IL: Riverside.
MacGinitie, W., MacGinitie, R., Maria, K., & Dreyer, L. (2002). Gates-MacGinitie Reading Tests, 4th Edition Technical Report. Itasca, IL: Riverside.
Matthes, P. & Fuchs, L. (1994). The efficacy of peer tutoring in reading for students with mild disabilities: A best-evidence synthesis. School Psychology Review, 23, 59-80.
Moats, L. (2000). Speech to print: Language essentials for teachers. Baltimore, MD: Brookes.
Morris, D. (1999). The Howard Street tutoring manual: Teaching at-risk readers in the primary grades. New York: Guilford.
Muller, P. & Davies, R. (2004). Reading Rescue® in Inner City Schools: An Experimental Study Examining Reading Outcomes of a One-on-One Tutoring Intervention. Bloomington, IN: Indiana University, The Indiana Evaluation Center.
National Reading Panel, (2000). Report of the National Reading Panel: Teaching children to read: An evidence-based assessment of the scientific research literature on reading and its implications for reading instruction: Reports of the subgroups. Rockville, MD: NICHD Clearinghouse.
Perfetti, C. (1992). The representation problem in reading acquisition. In P. Gough, L. Ehri, & R. Treiman (Eds.), Reading acquisition. (pp. 107-143). Hillsdale, NJ: Erlbaum.
Rack, J., Hulme, C., Snowling, M., & Wightman, J. (1994). The role of phonology in young children learning to read words: The direct-mapping hypothesis. Journal of Experimental Child Psychology, 57, 42-71.
Rasinski, T. (2003). The fluent reader: Oral reading strategies for building word recognition, fluency, and comprehension. New York: Scholastic.
Santa, C., & Hoien, T. (1999). An assessment of Early Steps: A program for early intervention of reading problems. Reading Research Quarterly, 34, 54-79.
Scarborough, H. (2001). Connecting early language and literacy to later reading (dis)abilities: Evidence, theory, and practice. In S. Neuman & D. Dickinson (Eds.), Handbook of early literacy research (pp. 97-110). New York: Guilford.
Shanahan, T. (1998). On the effectiveness and limitations of tutoring in reading. Review of Research in Education, 23, 217-234.
Shanahan, T. & Barr, R. (1995). Reading Recovery: An independent evaluation of the effects of an early instructional intervention for at-risk learners. Reading Research Quarterly, 30, 958-996.
Shanker, J., & Ekwall, E.(2003). Ekwall/Shanker Reading Inventory (4th ed). Needham Heights, MA: Allyn and Bacon.
Share, D. (1999). Phonological recoding and orthographic learning: A direct test of the self-teaching hypothesis. Journal of Experimental Child Psychology, 72, 95-129.
Share, D. (2004a). Orthographic learning at a glance: On the time course and developmental onset of self-teaching. Journal of Experimental Child Psychology, 87, 267-298.
Share, D. (2004b). Knowing letter names and learning letter sounds: A causal connection. Journal of Experimental Child Psychology, 88, 213-233.
Stanovich, K. (1986). Matthew effects in reading: Some consequences of individual differences in the acquisition of literacy. Reading Research Quarterly, 21, 360-406.
Torgesen, J., Wagner, R., Rashotte, C., Rose, E., Lindamood, P., Conway, T., et al. (1999). Preventing reading failure in young children with phonological processing disabilities: Group and individual responses to instruction. Journal of Educational Psychology, 91, 579-593.
Vellutino, F., Scanlon, D., Sipay, E., Small, S., Pratt, A., Chen, R. et al. (1996). Cognitive profiles of difficult-to-remediate and readily remediated poor readers: Early intervention as a vehicle for distinguishing between cognitive and experiential deficits as basic causes of specific reading disability. Journal of Educational Psychology, 88, 601-638.
Voyager Expanded Learning (2004). Voyager Passport. Dallas, TX: Voyager Expanded Learning, Inc.
Wasik, B., & Slavin, R. (1993). Preventing early reading failure with one-to-one tutoring: A review of five programs. Reading Research Quarterly, 28, 178-200.
Woodcock, R. (1987). Woodcock Reading Mastery Tests-Revised. Circle Pines, MN: American Guidance Service.
Woodcock, R. (l998). Woodcock Reading Mastery Tests-Revised Examiner’s Manual Normative Update. Circle Pines, MN: American Guidance Service.
Footnotes
1Substantial letter knowledge is required to benefit from the RES program. Low-performing students who do not know at least 17 letters in the fall are typically enrolled in the program later in the year after they have learned letters in their classrooms. Because tutoring began late in this study, it was not possible to tutor a second wave of students.
Table 1
Characteristics, Mean Performance and Standard Deviations on Fall Pretests and Spring Posttests
for Reading Rescue (RES) Students, Control students in RES Schools (Control 1), and Control Students in Non-RES Schools (Control 2)
__________________________________________________________________________________________
Characteristics and Reading Control 1 Control 2 Post Hoc F-stata Number of
Tests Rescue Comparisons Students b (N=64) (N=62) (N=60) (Bonferroni) (RES/C1/C2)
__________________________________________________________________________________________
Age (Months) 75.3 (3.9) 74.3 (5.4) 74.2 (3.6) - - 1.26 n.s. (64/62/59)
Gender (F/M) 30/34 30/32 30/28 - - < 1 n.s. (64/62/58)
Fall Pretestsc
GMRT4 NCE 29.2 (9.2) 25.3 (9.1) 32.5 (8.5) RES=C2>C1 9.91** (63/61/60)
Percentile 16 12 20
ITBS Vocabulary (SS) 125.3 (8.5) 125.6 (8.5) 128.3 (9.0) - - 2.18 n.s. (64/62/59)
Percentile 21 22 27
Spring Posttestsc
GMRT4 Word Decoding
NCE 49.9 (15.9) 42.1(13.5) 35.2 (11.8) RES> C1>C2 19.41** (62/60/60)
Percentile 50 36 25
GMRT4 Read Comprehension
NCE 43.5 (12.9) 37.8 (13.8) 35.3 (11.2) RES> C1=C2 7.50** (62/60/60)
Percentile 38 29 25
Table 1 (Continued)
Characteristics, Mean Performance and Standard Deviations on Fall Pretests and Spring Posttests
for Reading Rescue (RES) Students, Control students in RES Schools (Control 1), and Control Students in Non-RES Schools (Control 2)
__________________________________________________________________________________________
Characteristics and Reading Control 1 Control 2 Post Hoc F-stata Number of
Tests Rescue Comparisons Students b (N=64) (N=62) (N=60) (Bonferroni) (RES/C1/C2)
__________________________________________________________________________________________
GMRT4 Total
NCE 47.5 (12.9) 41.2 (12.1) 36.4 (10.4) RES> C1=C2 15.47** (62/60/60)
Percentile 45 33 28
WRMT-R Word Reading 35.1 (9.3) 27.5 (11.8) - - - RES>C1 19.14** (62/61)
Grade Equivalent 1.9 1.7
WRMT-R Word Attack 14.7 (9.0) 9.4 (7.4) - - - RES>C1 12.93** (62/61)
Grade Equivalent 1.9 1.5
_________________________________________________________________________________________
Note. * p < .05; ** p < .01; n.s. not statistically significant.
aANOVAs were conducted on fall pretest scores. ANCOVs were conducted on posttests scores with fall
GMRT4 scores as the covariate. Posttest means are adjusted values, and posttest standard deviations are unadjusted. NCE refers to normal curve equivalent scores.
bMissing data account for the varying number of students contributing to each analysis.
cIn the fall, Level BR (Grade 1 Beginning Reading) of the GMRT4 was administered. In the spring, Level 1 (Grade 1) of the GMRT4 was administered. All tests were group administered except the WRMT-R tests which were individually administered.
Table 2
Posttest Means, Standard Deviations, and Test Statistics on the Group-administered RES Classwide Screen and the Individual RES Tests for the Reading Rescue Group and the C1 Control Group
__________________________________________________________________________________________
Tests Reading Control 1 F-stata Number of Effect
Rescue Students b Sizec (N=64) (N=62) (RES/C1)
__________________________________________________________________________________________
Classwide Screen Tests
Write Upper Case Letters (26 max) 22.8 (4.0) 21.8 (5.8) 1.08 n.s. (62/60) 0.20
Write Lower Case Letters (26 max) 23.3 (3.6) 20.9 (7.3) 5.48 * (62/60) 0.44
Write Words 26.2 (14.9) 23.3 (13.3) 1.34 n.s. (62/60) 0.21
Developmental Spelling (40 max) 27.6 (5.8) 24.6 (7.9) 5.66 * (62/60) 0.44
RES Individual Tests
Name Lower Case Letters (26 max) 25.7 (.69) 25.4 (1.0) 2.42 n.s. (61/59) 0.36
Name Upper Case Letters (26 max) 25.8 (.50) 25.9 (.72) < 1 n.s. (61/59) -0.16
Give Sounds of Letters (26 max) 25.5 (.92) 25.1 (1.6) 2.72 n.s. (61/59) 0.32
Phoneme Blending (10 max) 9.72 (.54) 9.34 (1.2) 5.22 * (61/59) 0.44
Phoneme Segmentation (10 max) 8.78 (1.6) 7.03 (2.7) 18.23 ** (61/59) 0.81
Read Pseudowords (15 max) 11.37 (3.1) 7.62 (3.8) 34.00 ** (61/59) 1.09
Read Sight Words (110 max) 37.13 (13.9) 28.77 (13.7) 11.30 ** (61/59) 0.61
Sum of Scores (223 max) 144.0 (17.5) 129.1 (18.6) 20.97 ** (61/59) 0.83
Informal Reading Inventory
Oral Text Accuracy (3.5 max) 2.1 (1.3) 0.9 (1.1) 33.54** (61/59) 1.00
Comprehension (3.5 max) 2.1 (1.2) 1.1 (1.0 ) 33.16** (62/59) 0.96
Table 2 (Continued)
Posttest Means, Standard Deviations, and Test Statistics on the Group-administered RES Classwide Screen and the Individual RES Tests for the Reading Rescue Group and the C1 Control Group
__________________________________________________________________________________________
Note. * p < .05; ** p < .01; n.s. not statistically significant.
aANCOVs were conducted on spring posttests scores with fall GMRT4 scores as the covariate. Posttest means are adjusted values, and posttest standard deviations are unadjusted.
bMissing data account for the varying number of students contributing to each analysis.
cEffect size was calculated by subtracting adjusted mean of RES group from adjusted mean of C1 group and dividing by the unadjusted pooled standard deviations.
Table 3
Proportion of Students in the Reading Rescue Intervention (RES), Small Group Intervention (SG), Comparison Control Groups (C1, C2 and NI) and Whole First Grade Sample (WS) Who Performed At or Above the 40th Percentile on the fall and spring Gates-MacGinitie Reading Tests, 4th Edition (GMRT4)
__________________________________________________________________________________________
GMRT4 RES C1 C2 SG NI WS
Tests (N=64) (N=62) (N=60) (N=52) (N=70) (N=844)
__________________________________________________________________________________________
Fall Pretest 5% 5% 7% 9% 3% 35%
Spring Posttests
Decoding 68% 35% 23% 27% 31% 58%
Read Compreh. 52% 29% 23% 24% 28% 51%
Total Score 60% 34% 23% 27% 30% 53%
__________________________________________________________________________________________
Table 4
Characteristics and Statistics on Fall Pretests and Spring Posttests for Reading Rescue (RES) Students, Small Group Intervention (SG) Students, and Neither Intervention (NI) Control Students
__________________________________________________________________________________________
Characteristics Reading Small Group Neither Post Hoc F-stata Number of
and Tests Rescue Intervention Intervention Comparisons Studentsb
(N=64) (N=52) (N=70) (Bonferroni) (RES/SG/NI)
__________________________________________________________________________________________Age (Mo.) 75.3 (3.9) 74.4 (4.0) 74.1 (5.1) - - - 1.30 n.s. (64/52/69)
Gender (F/M) 30/34 25/26 35/34 < 1 (64/51/69)
Fall Pretestsc
GMRT4 (NCE) 29.2 (9.2) 26.4 (9.2) 30.8 (9.4) NI>SG; 3.40* (63/52/69)
Percentile 16 13 18 RES=SG, NI
ITBS Vocabulary (SS) 125.3 (8.5) 125.7 (7.9) 127.7 (9.4) - - - 1.51 n.s. (64/52/69)
Percentile 21 22 26
Spring Posttestsc
GMRT4 Word Decoding
NCE 49.9 (15.9) 38.1 (12.9) 39.1 (12.4) RES>SG=NI 14.90** (62/52/68)
Percentile 50 28 30
GMRT4 Read Comprehension
NCE 43.5 (12.9) 37.4 (12.5) 36.0 (12.7) RES>SG=NI 7.05** (62/52/68)
Percentile 38 28 26
GMRT4 Total
NCE 47.5 (12.9) 38.9 (11.7) 38.7 (10.9) RES>SG=NI 12.51** (62/52/68)
Percentile 45 30 31
Table 4 (Continued)
Characteristics and Statistics on Fall Pretests and Spring Posttests for Reading Rescue (RES) Students, Small Group Intervention (SG) Students, and Neither Intervention (NI) Control Students
_________________________________________________________________________________________
Note. * p < .05; ** p < .01; n.s. not statistically significant.
aANOVAs were conducted on pretest scores. ANCOVAs were conducted on spring posttests scores with fall GMRT4 scores as the covariate. Posttest means are adjusted values, and posttest standard deviations are unadjusted. NCE refers to normal curve equivalent scores.
bMissing data account for the varying number of students contributing to each analysis.
cIn the fall, Level BR (Grade 1 Beginning Reading) of the GMRT4 was administered. In the spring, Level 1 (Grade 1) was administered.
Table 5
Mean Performance of Reading Rescue students on Tutoring Measures and Reading Outcomes as a Function of Whether the Tutor Was a Reading Specialist, Other Certified Teacher or Paraprofessional _________________________________________________________________________________________
Tutoring Reading Others Para- Mean F-Stat Bonferroni
Events & Specialists Certified professionals Post Hoc
Outcomes (N=17) (N=15) (N=26)
_________________________________________________________________________________________
Outcomes
GMRT4 Total 47.0 (14.9) 50.3 (13.8) 46.5 (11.0) 47.7 (12.9) < 1 ns
WMRT-R Word 35.2 (11.3) 37.4 (10.8) 36.3 (6.8) 36.3 (9.3) < 1 ns
WMRT-R Nonword 19.6 (10.5) 16.6 (9.4) 10.8 (5.4) 15.0 (9.0) 6.39 ** R>P; O=R,P
Events
Easing-in Sessions 1.31 (1.8) 3.31 (2.7) 2.4 (2.2) 2.3 (2.3) 2.91 ns
Regular Sessions 33.9 (16.2) 41.5 (14.8) 55.7 (13.6) 46.0 (17.3) 11.76 ** P>R=O
Easing-out Sessions 1.0 (1.7) 1.9 (2.4) 1.2 (1.4) 1.3 (1.8) < 1 ns
Books read/reread 128.1 (54.8) 154.2 (52.6) 191.7 (59.7) 164.3 (62.2) 6.50 ** P>R; O=P,R
Text acc.98-100% 54% (.26) 48% (.27) 48% (.23) 50% (.25) < 1 ns
Text acc. 90-97% 37% (.17) 33% (.15) 40% (.13) 37% (.14) 1.14 ns
Text acc. < 90% 13% (.11) 19% (.19) 14% (.15) 15% (.15) < 1 ns
Adherence to program 14.4 (2.7) 12.6 (3.3) 13.4 (3.4) 13.5 (3.2) 1.15 ns
_________________________________________________________________________________________
Note. * p < .05; ** p < .01; n.s. not statistically significant. GMRT4 means are NCE scores; WRMT-R means are raw scores. Number of regular sessions does not include easing in and easing out sessions. Adherence to program was based on judgments of tutors’ records with 13 the minimum acceptable and 24 the maximum.
Table 6
Entry-Level Characteristics of Tutored Students and Features of Tutoring That Were Significantly Correlated With GMRT4 Posttest Scores
_________________________________________________________________________________________
Measures 1 2 3 4 5 6 7 8 Mean SD
_________________________________________________________________________________________
1 GMRT4 Posttest -- 47.4 12.8
Pretests
2 GMRT4 .46** -- 29.2 9.2
3 RES Individual .43** .41** -- 89.6 15.0
4 Write Upper Case Letters .26* .17 .15 -- 16.0 6.2
5 Write Lower Case Letters .26* .28* .12 .57** -- 11.3 6.8
Features of Tutoring
6 Text Acc. 98-100% .66** .49** .41** .22 .22 -- 49% .24
7 Text Acc. 90-97% -.56** -.24 -.40** -.19 -.05 -.76** -- 38% .15
8 Text Acc. < 90% -.45** -.53** -.36** -.10 -.26 -.77** .14 -- 15% .15
9 Easing in sessions -.32* -.02 -.05 -.10 -.38** -.19 .11 .07 2.3 2.2
_________________________________________________________________________________________
Note. There were between 56 and 64 observations contributing to each correlation as a result of a few missing values. * p < .05; ** p < .01.
Reading Rescue® in Inner City Schools:
An Experimental Study Examining
Reading Outcomes of a One-on-One Tutoring Intervention
Patricia Muller, Ph.D., Associate Director
Randy Davies, Ph.D., Research Associate
Center for Evaluation and Education Policy
Indiana University
509 E Third Street
Bloomington, Indiana 47401-3654
Manuscript prepared for submission to
Reading Research and Instruction
May 10, 2004
Reading Rescue® in Inner City Schools: An Experimental Study Examining Reading Outcomes of a One-on-One Tutoring Intervention
Abstract
The purpose of this study was to examine the effectiveness of the one-on-one tutoring component of Reading Rescue® , specifically addressing the following research question: Do students who receive one-on-one Reading Rescue® tutoring achieve better reading results compared to similar students who do not receive one-on-one Reading Rescue® tutoring? A randomized pretest-posttest control group experimental design was used to study the effects of the Reading Rescue® tutoring program on reading achievement in six elementary schools located in one inner-city school district in New York City with predominantly Hispanic students. Findings of this study support those from research conducted on other structured one-on-one tutoring programs, showing that the reading achievement of elementary students at risk of reading failure can be improved through the use of supplemental, adult-instructed one-to-one reading intervention. Results of the ANCOVA analyses comparing treatment and control groups on posttest reading achievement, after controlling for pretest differences, provides evidence that Reading Rescue® increases students’ reading achievement. Students who received one-on-one Reading Rescue® tutoring scored significantly higher (p < .05) on the posttest measure of reading achievement than did similar students who were randomly assigned to the control group. The findings from this study are particularly significant given that the majority of research on literacy interventions is based on program implementation in one particular setting with primarily monolingual students, whereas this study is based on data drawn from six separate elementary schools with large percentages of English Language Learners.
Reading Rescue® in Inner City Schools: An Experimental Study Examining Reading Outcomes of a One-on-One Tutoring Intervention
Educators and researchers agree that good reading skills are important to successful learning and that such skills are best learned at an early age (Kennedy & Others, 1986). Early reading success is also important for developing children’s self-confidence and motivation to learn (Ambruster, Lehr, & Osborn, 2003). However, according to the U.S. Department of Education’s National Center for Education Statistics (2003), the overall reading achievement of the nation’s fourth grade students has remained constant since 1992. In 2003, 37% of fourth grade students performed below basic levels of proficiency on the National Assessment of Educational Progress (NAEP) reading test. Responding to this trend, the Elementary and Secondary Education Act (ESEA) of January 2002, or No Child Left Behind Act, has renewed expectations for the improvement of reading achievement and evidence that reading programs are effective. Research has identified early identification of, and early intervention for, at-risk readers as one of the best ways to increase reading achievement (Hiebert, 1994; Torgesen, 2002; Wasik & Slavin, 1993). As a consequence, interest in remedial reading and preventative assistance programs for students with reading difficulties continues to be a focus for educational reform in schools across the country.
In particular, one method that has proven effective in providing remediation for struggling readers in the primary grades is one-to-one pull out intervention in the form of tutoring (Allington, 2004; Elbaum,Vaughn, Hughes & Moody, 2000; Institute of Education Sciences, 2003; Pinnel, Lyons, Deford, Bryk, & Seltzer, 1994; Shanahan, 1998; Wasik & Slavin, 1993). Given the increasing national focus on scientifically-based research and evaluation, and the U.S. Department of Education’s identification of experimental design as one of the most rigorous and valid methods of evaluating program impact, the purpose of this study was to examine the effects of a structured one-on-one tutoring program on students’ reading achievement. Specifically, this study used a randomized experiment to examine the impact of Reading Rescue® tutoring on the reading achievement of inner city students in six New York City elementary schools serving low socio-economic and primarily Hispanic students.
Background
Reading Rescue®, a comprehensive early intervention model developed in cooperation with the University of Florida and now sponsored by a charitable, non-profit organization (Literacy Trust, Inc.), is designed to prevent reading failure through one-on-one tutoring for the lowest performing students, while also improving the quality of literacy instruction provided for all students in the classrooms of participating teachers. Reading Rescue® consists of the following components: (1) a three-year professional development sequence delivered on site, within a school or district, that trains a school’s staff (typically classroom teachers, but also reading and language teachers, paraprofessionals, media specialists, administrators, etc.) to provide research-based one-on-one tutoring for at-risk students in the early grades, (2) technical support for the school restructuring required to implement a multi-tiered assessment and instruction delivery model comprised of large and small group instruction and one-on-one tutoring during the regular school day, (3) large group screening and individual assessments of emergent literacy and reading, (4) Summer Institutes that prepare each school’s Reading Rescue® Coordinator as a Literacy Coach and in-house trainer, (5) formation of a school-based Peer Coaching team under the direction of each school’s Reading Rescue® Coordinator, and (6) continuous data collection monitoring program effectiveness. Each component supports the goal of fluent reading for the lowest performing students, as well as enhanced instruction for all students, in the early grades. The model is based on several diverse bodies of research in the fields of teacher professional development and adult learning; school reform and restructuring; and early literacy acquisition, reading disabilities, and reading instruction.
Over the three-years of adoption, the tutors participate in professional development sessions provided by The Literacy Trust, Inc. Tutors’ theoretical understanding and pedagogical skills are developed through reading and discussion, analysis of tutoring videotapes, and hands-on practice with feedback. In addition, the professional development is on-going through meetings of each school’s peer coaching group. During the second year of adoption, the Coordinator observes tutors to ensure compliance with the instructional model, and leads the tutors in the analysis of the Coordinator’s own tutoring recorded on videotapes. Over the three year implementation process, which includes the training of an in-school Literacy Coach during the Summer Institute, capacity is built for self-maintenance of the program within the school.
Students’ performance on large group, pencil-and-paper assessments (i.e. Reading Rescue® Classwide Literacy Screening Assessments) are used to identify students who can be provided remediation effectively in large and small groups, and those students who can be taught most effectively through one-on-one tutoring during the first semester.
Students whose scores on these screening assessments place them at the top of the bottom quartile are considered the best candidates for one-on-one tutoring, with levels of emergent literacy skills sufficient to enable them to accelerate with skilled tutoring. Meanwhile, Reading Rescue® recommends the very lowest performing students (bottom of the bottom quartile) receive small group instruction designed to develop the basic knowledge and skills needed to benefit from the one-on-one tutoring in the second semester. One-on-one tutoring is provided through daily thirty minute tutoring sessions, conducted by trained and monitored classroom teachers and paraprofessionals during times when other students are engaged in non-instructional activities (e.g. recess, lunch, etc.). Each Reading Rescue® lesson includes work on phonological awareness in keeping with students’ needs, sequential phonics instruction informed by assessment data, fluency building, and the development of vocabulary and comprehension.
Originally developed in 1993, Reading Rescue® has trained teachers and other committed staff in over one hundred schools in seven states to provide research-based tutoring. Preliminary evidence suggests that Reading Rescue® has been effective in improving students’ reading achievement at these schools. In addition to strong anecdotal and selfreport evidence from long-standing programs, evaluations and statistical analyses have been conducted by individual schools and school districts (Capella, 2003; Parman, 2001; Price, 2002; Pugh, 2001). All schools who adopt Reading Rescue® are also contractually required to regularly submit test results for tutored students and their peers to the Literacy Trust Inc., who in turn uses these data to continuously monitor program effectiveness.
Although the primary focus of Reading Rescue® has been on program improvement, the Literacy Trust Inc. has also periodically used these data to conduct statistical analyses that document program impact (Britt, 2002; Gibson, 2002; Hoover, 1996; Hoover, 1999; Hoover & Sullivan, 1996).
However, as with most education programs, the type of scientifically-based evaluation discussed in the No Child Left Behind legislation has not been conducted to validate the program’s effectiveness. This study addresses the need for such evaluation through the use of a rigorous, valid methodology (i.e. random, experimental-control group design) to examine the impact of the one-on-one tutoring component of Reading Rescue® on students’ reading achievement.
Participants/Method
The purpose of this study is to examine the effectiveness of the one-on-one tutoring component of Reading Rescue® , specifically addressing the following research question: Do students who receive one-on-one Reading Rescue® tutoring achieve better reading results compared to similar students who do not receive one-on-one Reading Rescue® tutoring?
A randomized pretest-posttest control group experimental design (Gay & Airasian, 2000) was used to study the effects of the Reading Rescue® tutoring program on reading achievement in six elementary schools located in one inner-city school district in New York City with predominantly Hispanic students. Three of the schools were in the first year of the three-year implementation process for Reading Rescue®, and three schools were in the second year of implementation of the program.
In terms of demographics, five of the six schools serve students in Kindergarten through fifth grade, with enrollment sizes ranging from approximately 1100 to 1800 students; and one elementary school serves students from Kindergarten through second grade, with an enrollment size of approximately 400 students. All schools are Title I elementary schools, with approximately 82-99% of enrolled students eligible for free lunch (Mean = 92.0%, Median = 93.5%). The majority (approximately 87-98%) of the student body at each school is Hispanic; and approximately 27-41% of students at these schools are English Language Learners with limited English ability, receiving Bilingual or English as a Second Language instruction (Mean = 35.0%, Median = 37.0%). Across these schools approximately 26-46% of students met or exceeded state standards in English Language Arts during the 2001-2002 academic year (Mean = 35.3%, Median 34.0%).
Each of the participating schools (N=6) administered the Reading Rescue® large group, pencil-and-paper screening assessment (i.e. Reading Rescue® Classwide Literacy Screening Assessments) to its first grade students, and subsequently identified those students in the bottom quartile. From the students identified as potential participants for one-on-one Reading Rescue® tutoring, students within each school were randomly selected for participation in the reading intervention. Subsequently, students in the treatment group received one-on-one Reading Rescue® tutoring in addition to their regular classroom instruction, and students in the control group received regular classroom instruction without one-on-one Reading Rescue® tutoring. Some control group students also received small group instruction in literacy and/or ESL, although this was not a systematically applied intervention for these students. Reading Rescue coordinators from these schools reported that approximately 90 to 100% of the first graders who received one-on-one tutoring were English Language Learners (i.e. students who come from homes in which a language other than English is spoken, in this case primarily Spanish); and given the random assignment of treatment and control group students, similar percentages of control group students who did not receive Reading Rescue® tutoring are also likely to have been English Language Learners.
The study used the Gates-MacGinitie Reading Test (GMRT), Fourth Edition, as the dependent variable to provide a standardized measure of reading achievement. The GMRT is a timed multiple-choice test administered in groups, providing valid and reliable norm-referenced scores that include extended scale scores and Normal Curve Equivalents (NCEs). As opposed to studies that use outcome measures that specifically target the strategies taught by the program, this study used the GMRT as the outcome measure in order to provide a more valid and standardized measure of actual impact on students’ reading achievement, regardless of the specific reading strategies or technique used. In other words, if an evaluation uses a measure that is closely aligned with the strategies taught by a particular program, results are biased in favor of the program (e.g. children taught to use context to predict words rather than sounding them out will score better on reading measures based on predictable text rather than measures based on authentic text). Therefore, the use of a norm-referenced, standardized measure of achievement in this particular study provides a more fair measure of achievement for both treatment and control group students.
As part of this study, the GMRT Fourth Edition, BR Level (Beginning Reading Level) was administered to all first grade students, including regular classroom students (i.e. those students who did not score in the bottom quartile on the large group assessment, and therefore were not identified as potential candidates for the one-on-one tutoring assistance). This test was administered during Fall 2002, prior to any of the students receiving the one-on-one tutoring, and constitutes the pretest data for the study. In Spring 2003, when most students had completed the one-on-one Reading Rescue® tutoring, posttest data were collected using the GMRT, Level 1, Form S. In addition, the GMRT was administered to the treatment and control groups at the end of the 2002-2003 school year to provide follow-up data for the examination of any residual effects of the tutoring. Given the costs of purchasing and scoring the GMRT, and the ability to examine the residual effect without the test score data from regular classroom students, a decision was made to not administer the GMRT to this particular group of students at the end of the school year.
To determine the impact of one-on-one Reading Rescue® tutoring on reading achievement, this study compared students in the treatment group to similar students assigned to the control group. For a treatment-control design with both premeasures and postmeasures, analysis of covariance (ANCOVA) is the most appropriate analysis (Tabachnick & Fidell, 1996). Therefore, ANCOVA was used to compare the posttest reading achievement of students who received one-on-one Reading Rescue® tutoring to similar students in the control group who did not receive the tutoring intervention, with the pretest treated as the covariate to control for random individual differences1. Unlike research on Reading Recovery that has been criticized for only including those students who are determined to have completed the program successfully, resulting in inflated reports of impact on achievement (Elbaum,Vaughn, Hughes & Moody, 2000; Shanahan & Barr, 1995), this study includes all students who were initially identified as part of the treatment group and received any Reading Rescue® one-on-one tutoring. This paper also provides descriptive data that compares both treatment and control group students to regular classroom students, in order to provide a context for better understanding students’ reading achievement2.
In addition to the pretest-posttest analysis, repeated measures analysis of variance (ANOVA) was used to examine differences between treatment and control groups using data from all three test occasions: the pretest, posttest and follow-up scale score achievement data. Given that two of the six participating schools did not administer the 1 Potential candidates for the tutoring intervention (i.e. those students in the top of the bottom quartile on the large group reading assessment) were randomly assigned to groups. However, this random assignment does not guarantee equality among groups, but rather only assures there are no systematic differences between groups to begin with, within probability limits. The experimental design does not diminish the effects of random individual differences that can both spread out scores among subjects within a group, and create differences among groups that are not associated with the treatment. ANCOVA diminishes the effects of these individual differences by adjusting for them statistically, thereby providing a more powerful test of differences (Tabachnick & Fidell, 1996). 2 Given the focus of the study, and to protect against inflated Type I error, the data for regular classroom students is provided as contextual data rather than incorporated into the primary analysis. Although the study could have used ANCOVA to make comparisons among the three groups (treatment, control, and regular classroom students), the post hoc comparisons necessary to determine where the actual differences occur would inflate the Type I error rate. In addition, differences in reading achievement are to be expected between those students identified as candidates for reading intervention (i.e. scoring in the bottom quartile of the class on a large group reading assessment), and regular classroom students (i.e. primarily those students in the top three quartiles on the large group reading assessment). Therefore, running this larger number of comparisons is unwarranted.
third testing due to logistical and scheduling issues, sample sizes for both the treatment and control groups are much more limited for this particular analysis, and therefore are presented separately.
For all analyses, data was aggregated across schools given that the purpose of the study was to examine program effects across schools, rather than individual differences that might exist between schools in the implementation or effectiveness of Reading Rescue® . Preliminary data screening and analyses supported this decision given that no outliers based on school groupings were detected that might affect the results.
Results
Preliminary analyses were conducted to ensure that there were, in fact, no statistically significant differences between the randomly assigned treatment and control groups. Results indicated that there were no statistically significant differences in reading achievement between students randomly assigned to the treatment and control groups, as measured by the GMRT , with F(1, 128) = .12, p >.05. The extended scale score pretest group means for the treatment and control groups were 306.8 (SD = 22.9, N = 58) and 308.2 (SD = 25.1, N=71), respectively.
Preliminary analyses also indicated that the large group screening assessment used by Reading Rescue® (i.e. Reading Rescue® Classwide Literacy Screening Assessments) was effective at differentiating students’ reading achievement levels, and identifying low performing students. Students identified through the Reading Rescue® screening assessment as low performing also scored significantly lower on the GMRT than other students in their respective classrooms, with statistically significant differences between the treatment/control group students and regular classroom students [F (1, 753) = 40.49, p < .001]. Whereas the extended scale score pretest group mean for the combined treatment and control groups was 307.4 (SD = 23.8, N =129), the extended scale score pretest group mean for the regular classroom students was 329.8 (SD = 38.5, N =625).
Comparison of Pretest and Posttest Achievement Data.
Analysis of covariance (ANCOVA), with pretest reading achievement (extended scale scores) as the covariate, was performed on students’ posttest GMRT reading achievement (extended scale scores). As summarized in Table 1, results indicate pretest reading achievement scores were statistically significant as a source of variance in posttest reading achievement, with F (1, 126) = 63.49, p < .001. After adjustment by covariate, posttest achievement scores varied significantly with treatment/control group status, as also summarized in Table 1, with F (1, 126) = 4.75, p < .05. The standardized effect size, or standardized mean difference3, of the effect of treatment/control group status was .325.
Table 1. Analysis of Covariance of Posttest Reading Achievement
Source of Variance Adjusted SS Df MS F
Corrected Model 30597.4 2 15298.7 33.62*
Pretest Reading
Achievement
(Covariate)
28895.99 1 28896.99 63.49*
Treatment/Control
Group Status
2163.76 1 2163.76 4.75**
Error 57342.63 126 455.10
* p < .001
** p < .05
3 Computed as the difference between the mean post-test score of the treatment group minus the mean posttest score for the control group, divided by the standard deviation of the control group.
Table 2 provides descriptive statistics for the posttest reading achievement measures. In addition to the unadjusted means4 and standard deviations for the extended scale scores, Normal Curve Equivalent (NCE) mean scores5 are provided to enhance the interpretation of the scores. In addition, unadjusted posttest data are included for regular classroom students in order to provide a context for understanding how the treatment and control groups’ performance compares to other students in their respective classrooms who were not identified as potential candidates for Reading Rescue® .
Table 2. Descriptive Posttest Reading Achievement Data
Treatment Group
(Reading Rescue® )
N = 58
Control Group
(Non-Reading Rescue® )
N =71
Regular Classroom Students
N = 625
Mean SD Mean SD Mean SD
Mean Scaled
Score
363.2
28.7 355.9 22.4 378.7 38.3
Normal
Curve
Equivalent
(NCE)
35.8 11.2 32.3 10.4 41.6 14.9
Repeated Measures Analyses Using Follow-Up Data. Repeated measures ANOVA was conducted using GMRT reading achievement extended scale scores obtained for the same students at three separate points in time: prior to any students receiving Reading Rescue® tutoring (Fall 2002, pretest data), after the majority of students had completed Reading
4 Unadjusted means are presented for the descriptive statistics to allow comparison to the descriptive statistics for regular classroom students not included in the ANCOVA analysis (i.e. there are no comparable adjusted means and standard errors for these students given that they are not included in the analyses).
However, the adjusted means for treatment and control groups are very similar to the unadjusted means: 363.6 adjusted mean (2.5 SE) for the treatment group, as compared to 363.2 for the unadjusted mean; and 355.4 adjusted mean (2.8 SE) for the control group, as compared to 355.9 for the unadjusted mean.
5 Normal Curve Equivalent Scores (NCE) are standard scores with a known mean and standard deviation, such that NCE scores of 1, 50 and 99 correspond to percentile ranks of 1, 50 and 99. The score is a statistical (normalized) transformation of percentile ranks in which the range of reading achievement is divided into 99 equal units with a mean of 50 and a standard deviation of 21.06.
Rescue® tutoring (Spring 2002, posttest data), and at the end of the 2002-2003 academic year (follow-up data). As noted in Table 3, the analyses indicate that there were statistically significant differences both between test occasions, with F (1,76) = 567.4, p <.001, and between treatment and control groups, with F (1,76) = 7.91, p < .05. The
between groups analysis indicates that the difference in improved reading achievement for students receiving Reading Rescue® assistance (N = 32) was significantly greater than that of students in the control group (N=46).
Table 3. Summary Table for Repeated Measures ANOVA
Source of Variance SS Df MS F Partial Eta
Squared
Between Tests 212654.0 1 212654.0 567.4* .892
Between Groups 8659.3 1 8659.3 7.91** .094
* p < .001
** p < .05
The reading achievement, over time, for both the treatment and control groups is also graphically displayed in Figure 1 using the GMRT extended scale scores. As visually depicted in this figure, the achievement gap between the treatment and control group students appears to increase over time, with students who receive the one-on-one Reading Rescue® intervention improving in reading achievement at a greater rate than the control group, even after the intervention has concluded. For example, the Reading Rescue® students scored, on average, 11.1 scaled score points higher than the control group students at the time of the posttest. However, at the end of the 2002-2003 academic year, those students who had received Reading Rescue® tutoring scored, on average, 22.3 scaled score points higher than the control group students who began the year similar in terms of reading achievement.
Figure 1: Longitudinal GMRT Extended Scale Score Comparison
385.2
362.9 361.6
300.8
350.5
297.2
290
310
330
350
370
390
Pretest Posttest Follow-up
Treatment
Group
(Reading
Rescue)
Control
Group
Discussion and Conclusions
As noted previously, the purpose of this study was to examine the effectiveness of the one-on-one tutoring component of Reading Rescue® , specifically addressing the following research question: Do students who receive one-on-one Reading Rescue® tutoring achieve better reading results compared to similar students who do not receive one-on-one Reading Rescue® tutoring? The randomized pretest-posttest control group experimental design of this study not only provides the type of evidence encouraged by the No Child Left Behind Act, but also allows causal links to be established between the Reading Rescue® program and its effects on reading achievement. Without such a design, it would be difficult to discern whether any differences found between those students who receive Reading Rescue® tutoring, and similar students who do not receive the tutoring intervention, are directly attributable to the one-on-one tutoring program.
Results of the ANCOVA analyses comparing treatment and control groups on posttest reading achievement, after controlling for pretest differences, provides evidence that Reading Rescue® increases students’ reading achievement. Students who received one-on-one Reading Rescue® tutoring scored significantly higher (p < .05) on the posttest measure of reading achievement than did similar students who were randomly assigned to the control group. After controlling for pre-program differences, students who received Reading Rescue® tutoring (N = 71) scored an average of 363.2 (extended scale score), as compared to 355.9 (extended scale score) for the control group students (N = 58). Despite a smaller sample size, the repeated measures ANOVA using three points of data also found statistically significant differences between the treatment and control groups, with students who received Reading Rescue® tutoring having higher levels of reading achievement over time.
The findings from this study are particularly significant given that the majority of research on literacy interventions is based on program implementation in one particular setting with primarily monolingual students (Elbaum, Vaughn, Hughes, & Moody, 2000), whereas this study is based on data drawn from six separate elementary schools with large percentages of English Language Learners. Research based on implementation in one particular setting can be problematic because the results, and its generalizability, are highly dependent on the actual implementation of the program in one particular context. While it is easier to detect differences in these controlled settings, the practical relevance of the data is more limited. In this particular study, the effects of Reading Rescue® are examined across six distinct settings, each with its own specific contextual factors impacting implementation of the program. Although these variations in implementation and context can make it more difficult to detect differences across settings, findings from such studies are more generalizable, reliable and valid. The results indicate that across the six elementary schools, without any measures of program fidelity included in the study, there was still evidence that Reading Rescue® positively impacts the reading achievement of at–risk students.
These findings are also of particular significance given that English Language Learners (i.e. students who come from homes in which a language other than English is spoken) represent one of the fastest-growing student populations in U.S. schools (Slavin & Cheung, 2004); and the findings from this study suggest that Reading Rescue® is effective in improving the English reading performance of English Language Learners. Both the treatment and control groups across the six schools in this study were predominantly English Language Learners, and the study found that students who received the Reading Rescue® one-on-one tutoring outperformed the control group students who did not receive Reading Rescue® tutoring.
In addition to examining statistical significance, however, it is also important to address the practical significance and meaningfulness of findings. For example, should a difference of 10 points between the means be considered large, medium, or small? For this study, the standardized effect size for the ANCOVA analysis was .325, representing a medium or moderate effect size for this type of intervention. For example, Elbaum, Vaughn, Hughes and Moody’s (2000) meta-analysis of 42 samples of students investigated
in 29 studies of supplemental, one-to-one reading interventions for elementary students found a mean weighted effect size of .41 when compared to controls. Although the calculated effect size for this study is slightly lower than the mean effect size of literacy interventions included in the meta-analysis, the findings in this study include data across a variety of contexts without checks for treatment fidelity, which likely decreases overall effect size in the aggregated data. In addition, the intensive professional development provided by Reading Rescue® to the classroom teachers and paraprofessionals likely impacts not only tutoring, but also the quality of literacy instruction provided for all students in the classrooms of participating teachers. Therefore, control group students are likely to also be benefiting from the Reading Rescue® program indirectly (as are regular classroom students), mitigating the differences between the treatment and control group students. Also, standardized measures such as the GMRT generally yield smaller effects than the nonstandardized measures used in many of these studies (Elbaum, Vaughn, Hughes, & Moody, 2000); and Reading Rescue® had not yet finished its three year implementation process in these schools (i.e. half the schools were in their first year of implementation, and half the schools were in the second year of implementation).
Although Reading Rescue® students did not reach the same level of reading achievement
on the GMRT posttest as did their classroom peers and the national norming sample, the gains made in decreasing the gap might be great enough to allow these students to keep up with classroom instruction and to avoid academic failure. The results of the repeated measures ANOVA regarding residual program effects support this hypothesis. Findings suggest that those students who received Reading Rescue® tutoring continued to increase their reading achievement at a greater rate than did those students who did not receive Reading Rescue® tutoring, even after the tutoring has ended. In other words, even though Reading Rescue® students did not reach the reading achievement level of their classmates by the time of the posttest administration of the GMRT, the increase in reading achievement due to the tutoring program might have provided these students with enough of an academic gain to take better advantage of the regular classroom instruction. Thus, there appear to be residual gains even after the one-on-one tutoring ends, with Reading Rescue® students appearing to have an academic advantage over similar students who do not receive tutoring. These findings are particularly meaningful given that their relevance to impoverished urban school districts, with large percentages of English Language Learners with limited English ability. The positive program effects in this study resulted from a program that was implemented across six low socio-economic urban elementary schools enrolling large percentages (27-41%) of students who receive Bilingual or English as a Second language instruction.
In conclusion, the findings of this study support those from research conducted on other structured one-on-one tutoring programs, showing that the reading achievement of elementary students at risk of reading failure can be improved through the use of supplemental, adult-instructed one-to-one reading interventions (Elbaum,Vaughn, Hughes & Moody, 2000; Pinnel, Lyons, Deford, Bryk, & Seltzer, 1994). Previously conducted self-evaluations by individual schools and districts (Capella, 2002; Price, 2002; Parman, 2001; Pugh, 2001), and other statistical analyses and research on the impact of Reading Rescue® have consistently noted positive impacts of the program (Britt, 2002; Gibson, 2002; Hoover, 1996; Hoover, 1999; Hoover & Sullivan, 1996). This study confirms the previous findings regarding the impact of Reading Rescue® tutoring, providing evidence based on a randomized experimental design. Although very difficult to implement in realworld educational settings, this design allows the strongest possible causal attributions to be made, and as noted by the No Child Left Behind legislation, provides the most scientifically based findings. Further research is needed to replicate these findings, and should include measures of program fidelity at both the school and teacher/tutor levels.
Including a fidelity of treatment check will likely result in findings of greater effect sizes for those students receiving the one-on-one tutoring as intended by the Reading Rescue® program. In addition, research and evaluation is needed that examines the impact of other aspects of Reading Rescue® (e.g. effects of professional development on the quality of instruction in the regular classroom), as well as research that accounts for differences in program effect between students.
References
Allington, R. L. (2004). Setting the record straight. Educational Leadership, 61(6), 22-25.
Ambruster, B.B., Lehr, F., & Osborn, J. (2003). Put Reading First: The Research Building Blocks for Teaching Children to Read. Retrieved August 13, 2003
http://www.ciera.org/library/products/others/putreadingfirst/
Britt, L. (2002). Reading Rescue® in District Six Community Schools of the New York City Public School System: An Evaluation of Student Achievement in Year One of Program Adoption, 2001-2002. Gainesville, FL: Report submitted to the Literacy Trust, Inc.
Campbell, D.T. & Stanley, J.C. (1971). Experimental and quasi-experimental design for research. Chicago: Rand McNally.
Capella, S. (2002). Invited testimony before the Council of the City of New York City, Committee on Education Oversight Hearing on Reading Instruction. New York City, New York. September 30, 2002.
Cohen, J. (1988). Statistical power analysis for the behavioral sciences. Hillsdale, NJ: Lawrence Erlbaum.
Clay, M.M (1985). The early detection of reading difficulties. Exeter, NH: Heinemann.
Elbaum, B., Vaughn, S., Hughes, M.T., & Moody, S.W. (2000). How effective are one-to-one tutoring programs in reading for elementary students at risk for reading failure? A meta-analysis of the intervention research. Journal of Educational Psychology, 92 (4), 605-619.
Gay, L.R. & Airasian, P. (2000). Educational Research: Competencies for analysis and application. Upper Saddle River, NJ: Prentice-Hall, Inc.
Gibson, N. (2002). An evaluation of Reading Rescue programs in Georgia’s Dodge, Fayette, Putnam and McDuffie County Schools, 2000-2001. The Literacy Trust, Inc. Gainesville, Florida.
Hiebert, E. H. (1994). Reading Recovery in the United States: What differences does it make to an age cohort? Educational Researcher, 23(9), 15-25.
Hoover, N. L. (1996). Primary teachers learn to teach at-risk students to read: Empowerment and improvement through a school-based inservice model. Presented at the annual meeting of the American Evaluation Research Association. New York City, N.Y.
Hoover, N.L. (1996). Reading Rescue® at Interlachen Elementary School. Title I Newsletter. University of Florida, Gainesville, Florida.
Hoover, N. L. (1997). Literacy – Realized or Rescued: The Primary School Mission. Education Times. University of Florida, Gainesville, Florida.
Hoover, N. (1999). Preventing literacy failure through cost-effective tutoring: A statistical report on high-poverty & rural schools. Gainesville, Florida: The Literacy Trust, Inc.
Hoover, N. L., and Sullivan, L. (1996). Teaching at-risk children to read. Presented at the annual meeting of the American Evaluation Research Association. New York City, N.Y.
Institute of Education Sciences. (2003). Identifying and implementing educational practices supported by rigorous evidence. Washington, DC: U.S. Department of Education. Available: www.ed.gov/rschstat/research/pubs/rigorousevid/index.html
Kennedy, M. M. and Others. (1986). The effectiveness of Chapter 1 Services. Second interim report from the national assessment of Chapter 1. Office of Educational Research and Improvement. Washington DC.
Parman, L. (2001). An analysis of Gates MacGinitie Reading Test Scores documenting achievement among Strom Lake, Iowa’s Reading Rescue® students. Presented at the Iowa Reading Conference. Des Moines, Iowa.
Pedhazur, E. J. and Schmelkin, L. P. (1991). Measurement, Design, and Analysis: An Integrated Approach. Hillsdale, NJ: Lawrence Erlbaum Associates Inc.
Pinnel, G.S., Lyons, C.A., Deford, D.E., Bryk, A.S., & Seltzer, M. (1994). Comparing instructional models for the literacy education of high-risk first graders. Reading Research Quarterly, 29 (1), 9-39.
Price, B. (2002). Effectiveness of Reading Rescue® Programs. Paper presented at the 47th Annual Convention of the International Reading Association. San Francisco, CA.
Pugh, M. (2001). Reading Rescue® in Walton County’s Title I Schools: A report on success in helping the lowest performing students learn to read. Presented at the annual meeting of the Florida Reading Association. Orlando, FL.
Shadish, W.R., Cook, T.D., & D.T. Campbell (2002). Experimental and quasiexperimental designs for generalized causal inference. Boston, MA: Houghton Mifflin Company.
Shanahan, T. (1998). On the effectiveness and limitations of tutoring. In P.D. Pearson & A. Iran-Nejad (Eds.), Review of research in education, 23 (pp. 217-234). Washington, DC: American Educational Research Association.
Shanahan, T., & Barr, R. (1995). Reading Recovery: An independent evaluation of the effects of an early instructional intervention for at-risk learners. Reading Research Quarterly, 30 (4), 958-996.
Slavin, R. E. & Cheung, A. (2004). How do English language learners learn to read? Educational Leadership, 61(6), 52-57.
Tabachnick, B. G., & Fidell, L.S. (1996). Using Multivariate Statistics. Third Edition. New York, NY: HarperCollins.
Torgesen, J. K. (2002). The prevention of reading difficulities. Journal of School Psychology, 40(1), 7-26.
U.S. Department of Education, Office of Elementary and Secondary Education, No Child Left Behind: A Desktop Reference, Washington, D.C., 2002.
U.S. Department of Education, National Center for Education Statistics, Reading 2003 State snapshot report, NCES 2004–456, Washington, DC: U.S. Government Printing Office, 2003. Last viewed on the internet 11/15/03 http://nces.ed.gov/nationsreportcard/reading/results2003/.
Wasik, B.A., & Slavin, R.E. (1993). Preventing early reading failure with one-to-one tutoring: A review of five programs. Reading Research Quarterly, 28, 178-200.
The Literacy Trust, Inc.
3324 W University Avenue #116
Gainesville, FL 32607
ph: 352-256-3516
fax: 347-602-8768
alt: 888-377-0401 (toll free)
hoover