Become a Catalyst member

Join the conversation

We encourage our readers to leave comments and engage in dialogue about our stories. But before you do, please check out our "rules of the road."

Current Issue

School closings

As CPS prepares to close a record number of schools, the fate of students and communities is in question.

Testing and the Common Core

Better thinking about assessment can help us change our culture of teaching and learning from “know and be able to do” to “understand and be able to explain”

Everybody agrees: We need Common Core standards because “many 17-year-olds do not possess the higher order intellectual skills we should expect of them.” 

The thing is, that quote doesn’t come from recent work on the Common Core.  It comes from A Nation at Risk, the document that launched the standards and accountability movement close to 30 years ago.  Thirty years earlier, the same critique launched the curricular reforms of the 1960’s and before that, spawned the Progressive Era of John Dewey.   

We’ve been agreeing about this problem for quite a while.  But we’re still not very good at teaching all or most of our students “the higher-order intellectual skills we should expect of them.”  

What are we missing?

The Common Core is all about deep understanding.  One way to think about deep understanding is that it’s mostly about adding more to what you already know. But the evidence from modern learning science points in a different direction: It says deeper understanding typically starts by letting go of something you already “know”  so you can reincorporate that knowledge into a deeper, more comprehensive system of explanation.

Consider this example.  Despite what all of us were taught in grade school, most of the American population still believes that temperatures get warmer in summer because the Earth moves closer to the sun.  That’s a good guess, but bad science.  One explanation for why we keep getting this one wrong is that most of us aren’t very smart.  A better explanation is that common sense and intuition always trump formal knowledge until there’s a compelling reason to let intuition go.

How does all this relate to the Common Core?  For generations, our intuitions have told us that “teaching the curriculum” is mostly about adding skills and information, and filling in gaps left by holes in prior experience. Over the last two decades, the standards and accountability movement has doubled down on those intuitions, first by formalizing them into massive constellations of standards and performance indicators, and second by pairing those constellations up with equally massive systems of commercially-developed assessments.  

So we trusted our intuition, we doubled down our bet, and we lost. Now we have a choice.  Do we double down again, or do we let go of some comfortable intuitions and start putting our money on a different horse? 

Rethinking our intuitions

Here are four familiar intuitions that we need to confront which are deeply reinforced by existing approaches to assessment.  If we don’t find a way to get past them, they’ll kill the Common Core.

Intuition #1:  Mastery of skills and procedures is the main show

James Stigler, James Hiebert and their international research team have spent over 15 years reviewing thousands of hours of videotaped instruction from around the world as part of the Third International Math and Science Study (TIMSS).  One of their most unexpected findings was that teaching is more of a “cultural activity” than an individual one. They found that underlying patterns in the way people teach are very different from one country to the next, but are remarkably similar within countries. 

The pattern that Stigler and Hiebert found in every single American classroom they studied was that teachers spent large amounts of time reviewing material and practicing mathematical procedures without expecting students to grasp the underlying concepts on which skills and procedures were based. 

By contrast, teaching strategies in every one of the world’s higher-achieving countries regularly engaged students in active struggle with core mathematics concepts and procedures.  Teachers in some high-achieving countries might lecture while teachers in others might focus more on group problem-solving.  The technique wasn’t the key.  The key was regularly engaging students in active struggle with core mathematics concepts and procedures.

Intuition #2:  Commercial test design is objective, precise and scientific

The American bias toward teaching skills and procedures in ways that are divorced from conceptual underpinnings is a familiar target of progressive reform.  A major irony of No Child Left Behind is that, far from confronting this bias, NCLB led us to write standards and report test results in ways that reinforced the bias more systematically than ever.

The process began when states and districts formalized standards by reducing them to lengthy lists of discrete skills and procedures. Then commercial publishers were contracted to produce testing systems that matched.  On its face, this approach seemed pretty straightforward. But test publishers knew they had a problem.  They knew that standardized tests are poorly designed to measure discrete skills and procedures. 

Publishers finessed this problem by sorting test questions into a small number of “content strands” that purport to measure mastery of specific standards.  They did that knowing full well that standardized test items almost always measure more than one standard at a time, and are less about specific skills than about students’ ability to handle different kinds of academic complexity. In the end, states and test publishers fulfilled their NCLB obligations by putting their stamp of approval on a deeply compromised reporting procedure that is at best ambiguous, and at worst downright misleading. So much for scientific precision.  

How this could have happened on a nationwide scale without someone blowing the whistle is not really clear.  What is clear is that both content strands on standardized tests, and the “know and be able to do” mantra from which they derive, reflect a skill-based mindset that is out of sync with modern learning science and runs contrary to the goals of the Common Core. 

An old adage in systems theory is that, “Your system, any system, is perfectly designed to produce the results you’re getting.”  In recent years, we’ve done a more perfect job of designing our system so that it reduces what we teach to discrete skills and procedures.  Without confronting that bias, we will continue to assess and report learning in ways that will doom the Common Core. 

Intuition #3:  The best way to improve assessment at scale is to do that job for teachers so that teachers have more time to “just teach”

Another powerful irony of No Child Left Behind is that the rise of outsourced assessment coincided with strong evidence from the research community that frequent, high-quality classroom assessment produces achievement gains that far exceed those of any other single intervention strategy.   “Inside the Black Box,” the now-classic study of classroom assessment practices, reported strong academic gains when frequent, high-quality classroom assessment was practiced.  At the top end of the range, these gains were roughly equivalent to the difference between overall averages on state and national tests, and the averages posted by our lowest-achieving schools.

Given what we know about the culture of American teaching and the power of high-quality classroom assessment, the troubling thing about current work on Common Core assessment is that we seem to be doubling down again on outsourcing, this time with tests that are being developed for teachers by the PARCC and SMARTER multi-state consortia.  We’re not hearing much yet about how these systems will help teachers learn more about classroom assessment practices like the ones described in “Inside the Black Box”. 

What would it take to improve local assessment at scale?   Finland confronted the problem in the 1970s and 80s by investing deeply in teacher learning. A generation later, Finland vaulted from the middle of the pack to one of the highest-achieving nations in the world.  

During the same period, American schools invested heavily in externally-developed systems and tightened administrative oversight at the school and district level. In 2006, Dennis Shirley and Andy Hargreaves assessed the impact of this approach in an article for Education Week called, “Data Driven to Distraction.” Their conclusion was that teachers were typically left with “little chance to consider how best to respond to the figures in front of them . . .  There are few considered, professional judgments . . . just simplistic solutions driven by the scores and the political pressures behind them.”

Intuition #4:  Standardized testing is inherently sterile and inauthentic

So why not just get rid of standardized assessment altogether?  Grant Wiggins, a longtime advocate of progressive curriculum and assessment reform and a vocal critic of rote teaching and learning, offered up some solid reasons in a recent article for Education Leadership called “Why We Should Stop Bashing State Tests.” 

In it, he writes that test-prep ‘teaching’ and test bashing both have it wrong.  Why?  His analysis of released test items from Massachusetts, Florida and Ohio shows that “. . . the test items that our students do most poorly on demand interpretation and transfer, not rote learning and recall.  Better teaching and (especially) better local testing would raise state test scores.”

The surprising implication of Wiggins’ analysis is that standardized testing doesn’t have to be the Darth Vader of school reform.  Released test items and full reports of student responses can actually deepen the way we think about teaching and learning in ways that other forms of assessment cannot.  They can also give us better insights about how to improve local assessment practices in ways that directly support the goals of the Common Core. 

The Urban Education Leadership Program at the University of Illinois at Chicago has been working at this kind of reporting for a number of years now.  The result has been a growing range of protocols that are designed to help grade-, department-, and school-level learning teams get a clearer picture of where students are getting stuck and why.  Rather than producing laundry lists of discrete skills that need to be remediated, these protocols help teachers identify patterns of thinking and forms of academic complexity that stump students. 

Rather than producing pre-packaged sets of answers about who-should-be-taught-what-tomorrow-morning, the aim of these protocols is to support collective analysis and adult learning.  The purpose of that learning is to produce more thoughtful and challenging assignments  that can only be created by classroom teachers and collaborative teacher teams.

Better thinking about assessment

In School Reform from the Inside-Out, Richard Elmore writes that, “Improvement at scale is largely a property of organizations, not the pre-existing traits of the individuals who work in them.”  One key property that supports improvement at scale is information systems that foster adult collaboration and accelerate adult learning.  In most American schools, systems like that are still in short supply.   

Teachers and students both learn best when they can depend on frequent, high-quality feedback about the work they do. For the most part, the feedback we’re getting from outsourced assessment systems is poorly designed to improve either student or adult learning and does not support the goals of the Common Core.  This problem has less to do with assessments themselves than with how assessment results are reported. Reporting that emphasizes mastery of discrete skills works against Common Core goals by steering attention away from more complex aspects of teaching and learning.  Reporting that pre-packages results for teachers denies teachers access to more nuanced aspects of student thinking that hold the key to deeper learning.

Common Core standards pose the most fundamental challenge to the culture of American teaching since the Progressive Era of John Dewey.  To succeed where Dewey and others have failed, we need to build coordinated systems of local and external assessment that work together to support ambitious learning by students and adults. 

Insisting on more thoughtful reporting of state and district assessments will be an important first step toward scaling up improvement of local assessment, where the pay-offs can be huge and where the potential for improvement is enormous. 

Paul Zavitkovsky is a former CPS principal and currently works as a leadership coach and assessment specialist at UIC’s Urban Education Leadership Program.

12 comments

cs_parent wrote 1 year 6 weeks ago

Nicely written

Thanks for the links

me wrote 1 year 6 weeks ago

common core

I hate to be the bearer of bad news....but the common core standards are not that great. the language is pompous and vague. it claims to be "deep" ....however just read the standards hey have writen. it makes assumptions...and gives no insight on how to address children who fall behind? i dont know...i may be the minority, but i dont really see them as much more than the "new state standards". i remember when the states standards were the holy grail....i bet this will be the case with the common core standards.....everyone wil claim their books cover the common core standards. a new set of "standardized tests" will come out...teachers will teach to the test to save their skins..and the children will be in the same place. if you think i am cynical talk to CPS and the common core standard test they rolled out. it was so badly written the skipped the second one!! it had misspellings and bizzare rubrics......they are no more than words...not the magic words that critics claim they are!

Margo Criscuola wrote 1 year 5 weeks ago

Path to improvement

I applaud the way this article lays out a path to improvement, starting with the adjustments in thinking all of us in education must make. A strength of the standards especially in reading are they way they lay out the stages of mastery in a logical but also seamless way, instead of chopping them into many small skills and strategies.

Teaching reading by the skill has led to mini-teaching--working strategy by strategy and text level by text level, without considering how students bring skills together in a seamless effort. At the same time, advice to reading teachers has focused on making sure students don't tackle texts that are "too hard for them" or for which they "don't have prior experience," cutting students off from the experience of new discovery through reading.

I hope we all thoroughly question and reshape the intuitions about reading that have brought us to our present crisis.

Noel McNally wrote 1 year 5 weeks ago

Common Core

Let's look at the research. LOWER THE CLASS SIZES. Then, we can talk about research that reveals: students should be able to discuss how text structure contributes to the meaning and author's purpose of informational text, elaborate feedback is neccessary for higher order critical thinking, and using misconceptions in text analysis uncovers deeper comprehension.

Tom McDougal wrote 1 year 4 weeks ago

excellent article

Great article. Key points:
- frequent, high-quality *classroom* assessment is powerful
- we need to make deep changes in how we teach
- we should invest in teacher learning.

But how do we make those deep changes, and what does it mean to invest in teacher learning?

Professional development in the U.S. typically uses the "transmission" approach in which some presumed expert tells teachers what they should do. We recognize that this approach doesn't work for students; it doesn't work for teachers either.

Teachers need to be engaged in their own learning. Stigler and Heibert, and others, praise "lesson study" (from Japan) as a way to engage teachers to overcome cultural forces. Teachers in Chicago and elsewhere are using lesson study to work with colleagues and make fundamental changes in their teaching. It is an unusual point of agreement between CTU and CPS: they are both promoting lesson study.

Paul Bischoff wrote 1 year 3 weeks ago

A standing ovation for this

A standing ovation for this article....
Finally, someone wrote about education who knows something about process, concept and learning...go figure!
The reduction of the intellectual process called thinking, conceptualizing as related to learning to merely demonstrating detached skills...one after the other...like the CRS Strands-Skills reduction of thinking, integrating and conceptualizing to be ultimately measured in math with 60 problems in 60 minutes by the "god" of all measure the ACT is ludicrous.

Finally, we'll have something that dignifies what teachers in classrooms do that hasn't been produced by some bean counter/corporation trying to usurp the learning function of professionally trained educators!

Mark Flynn wrote 51 weeks 2 days ago

So - standardized testing is inherently sterile and inauthentic?

So, we no longer:
1. Announce test dates to parents and tell them to schedule everything around it?
2. Purchase test prep materials and shutdown learning to prepare?
3. Use MAP and other intermittent tests to "test to the test"?
4. Teach to the test?
5. Send letters home to parents telling them to make sure their kids get a good night's sleep and a good breakfast JUST during the testing window?
6. Take away kids' learning tools (e.g. calculators, technology)?
7. Putting signs up that say "Sh! Testing in Progress!"?
8. Setting aside instructional time instead of integrating naturally the testing with the learning?
9. Not having local teachers involved in test development?
10. Remove the "stakes" from the testing for kids, teachers, and schools?
11. Treating kids LITERALLY as a number, instead of as a human?
12. (i could go on, you know!)

Mark Flynn wrote 51 weeks 2 days ago

Std. Testing (cont.)

12. End testing as trivia contests?
13. Provide kids with unlimited options for expressing their thinking?
14. (to be continued)

David wrote 50 weeks 3 days ago

Proof your work

Your misspellings, poor grammar and overall lack of attention to detail does not really help you make your case.

Anonymous wrote 50 weeks 2 days ago

David and Mark

Mark I like your points...did you mention that NWEA scores vary greatly from one day to the next. I have seen a retake with a 20 point difference! Up and down!!

This is a blog? Let's not start the grammar game! Actually if Proof Your Work is your title, shouldn't it be all capitalized?

People write blogs to convey messages. This is not a newspaper.

Mark Flynn wrote 50 weeks 1 day ago

Testing

David - Your point is taken, on the grammar part. I had some problems as I posted my response from my mobile device where display is limited. Sorry.
Anonymous - Thank you for providing #14. NWEA, MAPS, NCLB tests, etc. indeed have limited reliability and lack validity.

Paul Zavitkovsky wrote 50 weeks 1 day ago

Can anything useful be salvaged from standardized testing?

Mark, David and Anonymous,

Amen to all 14 of the items you listed, and to 50 more that we could easily come up with at the drop of a hat if somebody asked us to. The point I'd ask you all to consider is that each of the things you've mentioned come from public policies that: a) insist on using standardized test information to drive high stakes accountability (for which it is poorly designed); and, b) ignore or reject more formative ways of reporting standardized test information that teachers and grade/departmental teams can actually use to improve teaching and learning. As I see it, the problem we've got is that standardized testing has been misused for so long, and in so many ways, that we just want to get rid of the filty bathwater as quickly as possible . . . without seeing that a heathy baby is still drowning under the surface.

That's the point that I think Grant Wiggins was trying to make in "Why We Should Stop Bashing State Tests" and is pretty much the same thing Marla Ucelli-Kashyap of the AFT was saying in a recent (May 22) EdWeek post when she wrote, "The problem lies neither with the existence of testing nor the idea of accountability. Rather, it's the kind of tests our system prioritizes, the scale at which they are used, and the narrow definition of accountability dictated by current public policy."

Clearly, standardized test information from a battery that's administered one a year can never replace the kind of local assessment information that Black and Wiliam wrote about in "Inside the Black Box." If fact, when annual standardized test information is reported out in bogus "content strands" it actually impedes the work of local assessment. Interim assesment clones of annual state tests can't replace local assessment either. So the sooner we all recognize that we can't teacher-proof the assessment process, the better we'll all be. That said, it's also helpful to recognize that thoughtfully-reported standardized test information can supplement locally-developed assesment info in ways that can't be replicated using local info alone.

The problem with standardized testing isn't that it's inherently inauthentic. The problem with standardized testing is that it has become so closely fused with high-stakes accountability and has been assigned such an out-sized role in school reform under NCLB that its hard to see the formative contributions it's still uniquely positioned make.

Thanks for taking the time to read and respond to the article. FYI, the web address for Marla Ucelli-Kashyap's piece in EdWeek article is: http://blogs.edweek.org/edweek/transforming_learning/2012/05/fix_the_fix... T

Add your comment

The content of this field is kept private and will not be shown publicly.
go here for more