November 7, 2003

Senate Democrats are Committed to Improving Head Start Assessments

On October 29, the Senate Health, Education, Labor and Pensions (HELP) Committee approved legislation by a vote of 21 to 0 to reauthorize Head Start. The Senate version enables more poor children to get a head start; strengthens the Head Start workforce; enhances Head Start coordination and collaboration; helps improve outcomes for children; strengthens the academic portion of Head Start; and enhances the accountability of Head Start programs. One of the most controversial proposals - assessment of young children - deserves further attention and analysis because accountability for program performance is crucial, but the role of assessments in that context is a difficult one.

Background

Measuring what and how well students learn is an important building block in the process of adequately preparing young children for school. Just over a year ago, the Administration began developing a national child assessment for Head Start, called the National Reporting System (NRS) - a test every four year-old participating in Head Start must take twice a year. The NRS assessment is not even mentioned in the Senate bill, yet it has potentially major repercussions for Head Start - at best, strengthening programs by improving the way teachers teach; at worst, leading to de-funding of high-quality programs, and excluding some of our neediest children. More expert guidance on and review of the development, implementation, and uses of this major new initiative are warranted.

Currently, children in Head Start are assessed three times a year on all of the domains of early learning and development, including literacy and math. The NRS is an assessment developed by the Department of Health and Human Services (HHS), which creates an additional test for all four year-olds in Head Start (roughly 500,000 children) on literacy and math skills only. Rather than the traditional "observational assessment" used by current Head Start programs, the NRS is a one-on-one question and answer test for each child. Children are assessed twice a year and, according to Administration documents, changes in children's scores would be used to judge the success of individual Head Start programs. The new testing program is expected to cost about $20 million each year. Some pilot testing was begun in April and May of 2003 and HHS has begun to fully implement this assessment for every four year-old this fall.

Top Ten Concerns

1. The NRS has been developed and implemented too quickly, compromising quality. Assessing young children is extremely difficult and requires carefully crafted assessments. Young children are not used to taking tests and often do not have the emotional maturity to sit still and focus on the task at hand. Their test scores tend to fluctuate across time and can reflect many factors unrelated to their skills (their mood, whether they sleep enough the night before a test, their susceptibility to distraction depending on the test location, their ability to respond to questions posed by a stranger, etc.).

The NRS assessment was both developed and implemented within a year; by contrast, the time frame for the development of tests under the No Child Left Behind Act (NCLB) is three to four years. As a result of the Department's haste, the quality is suspect. Hundreds of experts have concerns about the foundation of the assessment, including its validity, reliability, and fairness. Earlier this year, over 200 researchers in early childhood development and assessment signed a letter to Congress expressing significant reservations about the NRS, and the rush to put an assessment system in place in six to eight months.

2. More time is critical to develop and pilot the assessment in an appropriate way. Academic experts and Head Start staff alike are requesting a more scientific, thoughtful approach to the development of this assessment - one that includes some Congressional oversight. The National Academy of Sciences (NAS) report, "Eager to Learn: Educating Our Preschoolers," made it clear that more research on assessing young children is needed before such assessments should be used for accountability purposes. Because of this, it is crucial that the assessment instruments to be used in the NRS are properly validated and deemed to be appropriate for four year-old children. At this point, we simply do not know that this is the case.

3. The assessment has been developed and implemented in secrecy. This process has been surrounded by an air of secrecy. An assessment of this magnitude and importance must be developed and implemented with great care and openness. An independent body, such as the NAS, should examine how to appropriately assess young children. We know that good and useful assessments can be developed, even for young children, but they must be developed carefully and appropriately if they are to resolve, and not create, educational problems.

The process of developing assessment instruments should involve consultation with the public and experts in early childhood development and research methodology. Although HHS claims that they have had many meetings with "experts," there is little or no information publicly available that clarifies what went on at these meetings, what decisions were reached, and whether the advice of the experts was or was not heeded in developing the NRS. This process must be more open and public.

4. How the data will be analyzed and used is unclear. The Administration has been inconsistent in its statements about the use of the data, at times saying it's for program improvement and accountability and at other times saying it's for measuring individual child progress. Because results from this assessment could lead to de-funding programs, the assessment has the features of a high-stakes test - putting test pressures on young children and negatively impacting the way they view themselves and their ability to learn and achieve. In addition, linking test results to grantees' award amounts would create a disincentive to serving those children who most need Head Start. The Administration needs to be explicit about its intentions on uses of the data.

5. Expert reviews and pilot testing have been inadequate. The Administration claims there has been appropriate expert review and pilot testing. The process has not been responsive to expert and public concerns, leading to concerns about the adequacy of the review. Since the assessment has been made public, many experts in early childhood education and assessment have raised concerns about the test. Supposed feedback periods overlapped with the data collection and analysis phase of the pilot study, making feedback uninformed.

The Erikson Institute, a prestigious Chicago graduate school specializing in child development, surveyed national leaders in the early childhood field about the best way to measure achievement in preschoolers. In this survey, standardized testing (the method utilized by the NRS) was ranked tenth. Clearly, these experts do not view the methodology in use by the NRS as optimal.

6. The assessment is not balanced. The current NRS assessment is narrowly focused on literacy, language, and mathematics skills. Head Start is a comprehensive program that addresses child development through a whole-child approach, including social, emotional, physical, and cognitive development. The NRS at this time does not reflect development of the whole child. The assessment targets only a few of the skills that Head Start seeks to instill in children. For example, social skills are not being assessed; without them, children are simply not ready to learn.

7. The assessment may not be valid and reliable. Many questions surround the scientific soundness of the test. For example, the early math scale was adapted from a test for kindergarteners rather than from a test known to be valid for preschoolers. As another example, sections of the Spanish-version assessment have extremely low predictive validity. Many individuals have expressed concerns about the scientific soundness of the training for test givers. The NRS uses a "train-the-trainer" approach, which means that a relatively small number of Head Start staff nationwide receive training and then they are expected to return to their states and train the vast majority of test-givers. This is especially questionable since Head Start staff varies greatly in terms of education and ethnicity in ways that may lead to testing inconsistencies.

Sufficient time must be taken to ensure that English language learners are not placed at a disadvantage by having to take a test that is inappropriate for them. The test is currently available in English and Spanish, yet many Head Start children speak Asian or other languages. In addition, some Native American Head Start Directors are concerned that the NRS, in its current form, is not appropriate for their students, who often do not speak English in the home. Many Head Start directors question the appropriateness of these tests for children with developmental delays, speech and language delays, or other disabilities. We should take time to ensure that the assessment tool ultimately used is valid and reliable; assesses the gamut of skills that children acquire in Head Start; and is appropriate for children from a wide variety of cultural backgrounds.

8. The assessment could be culturally-biased and unfair. Many reviewers feel the assessment is not fair to children from diverse cultural and geographic backgrounds. For example, one question asks children to identify a swamp from a set of four drawings. Yet, how many children living in an inner city or on an arid Indian reservation have ever seen or experienced a swamp? Another question asks a four year-old in Head Start to distinguish between a vase and a decanter.

9. This year's assessment will not measure progress across the full year. The Administration has had to extend the deadline for receipt of fall data because it was unable to get testing materials to programs in a timely way. As a result, some children will not be tested until November, so their progress from August and September through November will be missed. Teachers and directors are very upset about this situation, wanting to ensure fair measuring of children's progress.

10. The assessment fails to include observations and teacher and parent input. The way data are measured is too narrow since it only includes a direct (face-to-face) child assessment. Direct assessments are useful, but they do not capture the full progress of the child at the preschool age. Multiple methods of data collection should be used for high-quality and accurate assessments, in which the NRS does not yet engage.

While the Senate bill is silent on the NRS - it does not mandate its use in determining program quality or in de-funding decisions - given the Administration's investment in this assessment and plans for the future, Congress must ensure that this assessment is not fully implemented until the test can be shown to be a reliable and appropriate measure, and be administered in a fair and even-handed manner.


Prepared by the Senate Democratic Policy Committee
Byron L. Dorgan, Chairman
419 Hart Senate Office Building
Washington, D.C. 20510