skip to main content
10.1145/3173574.3173868acmconferencesArticle/Chapter ViewAbstractPublication PageschiConference Proceedingsconference-collections
research-article
Open access

Juxtapeer: Comparative Peer Review Yields Higher Quality Feedback and Promotes Deeper Reflection

Published: 21 April 2018 Publication History

Abstract

Peer review asks novices to take on an evaluator's role, yet novices often lack the perspective to accurately assess the quality of others' work. To help learners give feedback on their peers' work through an expert lens, we present the Juxtapeer peer review system for structured comparisons. We build on theories of learning through contrasting cases, and contribute the first systematic evaluation of comparative peer review. In a controlled experiment, 476 consenting learners across four courses submitted 1,297 submissions, 4,102 reviews, and 846 self assessments. Learners assigned to compare submissions wrote reviews and self-reflections that were longer and received higher ratings from experts than those who evaluated submissions one at a time. A second study found that a ranking of submissions derived from learners' comparisons correlates well with staff ranking. These results demonstrate that comparing algorithmically-curated pairs of submissions helps learners write better feedback.

Supplementary Material

suppl.mov (pn2815-file5.mp4)
Supplemental video

References

[1]
Louis Alfieri, Timothy J. Nokes-Malach, and Christian D. Schunn. 2013. Learning Through Case Comparisons: A Meta-Analytic Review. Educational Psychologist 48, 2: 87--113.
[2]
J.R Anderson and G. H Bower. 1972. Recognition and Retrieval Process in Free recall. Psychological review 79, 2: 97--123.
[3]
Anish Athalye. 2016. Gavel. Retrieved from https://github.com/anishathalye/gavel
[4]
Elizabeth L Bjork and Robert A Bjork. 2011. Making Things Hard on Yourself, But in a Good Way: Creating Desirable Difficulties to Enhance Learning. Psychology and the Real World: Essays Illustrating Fundamental Contributions to Society: 55--64.
[5]
David Boud. 1995. Enhancing learning through self-assessment. Routledge.
[6]
David Boud, Ruth Cohen, and Jane Sampson. 2014. Peer learning in higher education: Learning from and with each other. Routledge.
[7]
P A Carlson and F C Berry. 2003. Calibrated Peer Review and assessing learning outcomes. Frontiers in Education Conference 2: 1--6.
[8]
Xi Chen, Paul N. Bennett, Kevyn CollinsThompson, and Eric Horvitz. 2013. Pairwise ranking aggregation in a crowdsourced setting. Proceedings of the Sixth ACM International Conference on Web Search and Data Mining: 193-- 202.
[9]
Michelene T. H. Chi, Paul J. Feltovich, and Robert Glaser. 1981. Categorization and Representation of Physics Problems by Experts and Novices*. Cognitive Science 5, 2: 121--152.
[10]
Kwangsu Cho and Christian D. Schunn. 2007. Scaffolded writing and rewriting in the discipline: A web-based reciprocal peer review system. Computers and Education 48, 3: 409--426.
[11]
Robert B Cialdini. 2001. Influence: Science and Practice. Book 3rd: 262.
[12]
Steven P. Dow, Alana Glassco, Jonathan Kass, Melissa Schwarz, Daniel L. Schwartz, and Scott R. Klemmer. 2010. Parallel prototyping leads to better design results, more divergence, and increased selfefficacy. ACM Transactions on Computer-Human Interaction 17, 4: 1--24.
[13]
Dedre Gentner, Jeffrey Loewenstein, and Leigh Thompson. 2003. Learning and transfer: A general role for analogical encoding. Journal of Educational Psychology 95, 2: 393--408.
[14]
Dedre Gentner and Arthur B Markman. 1994. Structural Alignment in Comparison: No Difference Without Similarity. Psychological Science 5, 3: 152--159.
[15]
Eleanor Jack Gibson. 1969. Principles of perceptual learning and development. AppletonCentury-Crofts.
[16]
Michael D Greenberg, Matthew W Easterday, and Elizabeth M Gerber. 2015. Critiki: A scaffolded approach to gathering design feedback from paid crowdworkers. C&C 2015: 1--10.
[17]
Brian Greer. 1997. Modelling reality in mathematics classrooms: The case of word problems. Learning and Instruction 7, 4: 293--307.
[18]
Adriaan D De Groot, Fernand Gobet, and Riekent W Jongman. 1996. Perception and memory in chess: Studies in the heuristics of the professional eye. Van Gorcum & Co.
[19]
Catherine M Hicks, Vineet Pandey, C Ailie Fraser, and Scott Klemmer. 2016. Framing Feedback. Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems - CHI '16: 458--469.
[20]
David A. Joyner. 2017. Scaling Expert Feedback. Proceedings of the Fourth (2017) ACM Conference on Learning @ Scale - L@S '17: 71--80.
[21]
Philip J. Kellman, Christine M. Massey, and Ji Y. Son. 2010. Perceptual learning modules in mathematics: Enhancing students' pattern recognition, structure extraction, and fluency. Topics in Cognitive Science 2, 2: 285--305.
[22]
Avraham N. Kluger and Angelo DeNisi. 1996. The effects of feedback interventions on performance: A historical review, a meta-analysis, and a preliminary feedback intervention theory. Psychological Bulletin 119, 2: 254--284.
[23]
Pushkar Kolhe, Michael L. Littman, and Charles L. Isbell. 2016. Peer Reviewing Short Answers using Comparative Judgement. Proceedings of the Third ACM Conference on Learning @ Scale: 241--244.
[24]
Markus Krause, Tom Garnack, and Steven P. Dow. 2017. Critique Style Guide: Improving Crowdsourced Design Critiques using a Natural Language Model. CHI '17 Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, May.
[25]
Chinmay E. Kulkarni, MS Michael S Bernstein, and Scott R. Klemmer. 2015. PeerStudio: Rapid Peer Feedback Emphasizes Revision and Improves Performance. In Proceedings of the Second (2015) ACM Conference on Learning @ Scale - L@S '15, 75--84.
[26]
Chinmay E. Kulkarni, Richard Socher, Michael S. Bernstein, and Scott R. Klemmer. 2014. Scaling short-answer grading by combining peer assessment with algorithmic scoring. Proceedings of the first ACM conference on Learning @ scale conference L@S '14: 99--108.
[27]
Chinmay Kulkarni, Kathryn Papadopoulos, Justin Cheng, Daphne Koller, and Scott R Klemmer. 2013. Peer and Self Assessment in Massive Online Classes. 20, 6.
[28]
Kenneth J. Kurtz, Chun-Hui Miao, and Dedre Gentner. 2001. Learning by Analogical Bootstrapping. Journal of the Learning Sciences 10, 4: 417--446.
[29]
Marianne LaFrance. 1989. The quality of expertise: implications of expert-novice differences for knowledge acquisition. ACM SIGART Bulletin, 108: 6--14.
[30]
Penelope Lockwood and Ziva Kunda. 1997. Superstars and me: Predicting the impact of role models on the self. Journal of Personality and Social Psychology 73, 1: 91--103.
[31]
Jeffrey Loewenstein, Leigh Thompson, and Dedre Gentner. 1999. Analogical encoding facilitates knowledge transfer in negotiation. Psychonomic Bulletin & Review 6, 4: 586--597.
[32]
Kristi Lundstrom and Wendy Baker. 2009. To give is better than to receive: The benefits of peer review to the reviewer's own writing. Journal of Second Language Writing 18, 1: 30--43.
[33]
Kurt Luther, Jari-Lee Tolentino, Wei Wu, Amy Pavel, Brian P. Bailey, Maneesh Agrawala, Björn Hartmann, and Steven P. Dow. 2015. Structuring, Aggregating, and Evaluating Crowdsourced Design Critique. Proceedings of the 18th ACM Conference on Computer Supported Cooperative Work & Social Computing - CSCW '15: 473--485.
[34]
Thomas Mussweiler. 2003. Comparison processes in social judgment: Mechanisms and consequences. Psychological Review 110, 3: 472--489.
[35]
Duyen T Nguyen, Thomas R Garncarz, Felicia Ng, Laura Dabbish, and Steven Dow. 2016. Fruitful Feedback: Positive affective language and source anonymity improve critique reception and work outcomes. Manuscript submitted for publication: 1024--1034.
[36]
Laura R Novick. 1988. Analogical transfer, problem similarity, and expertise. Journal of Experimental Psychology: Learning, Memory, and Cognition 14, 3: 510.
[37]
Tiffany Potter, Letitia Englund, James Charbonneau, Mark Thomson MacLean, Jonathan Newell, and Ido Roll. 2017. ComPAIR: A New Online Tool Using Adaptive Comparative Judgement to Support Learning with Peer Feedback. Teaching & Learning Inquiry 5, 2: 89-- 113.
[38]
K Raman and T Joachims. 2014. Methods for ordinal peer grading. Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining: 1037--1046.
[39]
Bethany Rittle-Johnson and Jon R. Star. 2007. Does comparing solution methods facilitate conceptual and procedural knowledge? An experimental study on learning to solve equations. Journal of Educational Psychology 99, 3: 561--574.
[40]
T. Rogers and A. Feller. 2016. Discouraged by Peer Excellence: Exposure to Exemplary Peer Performance Causes Quitting. Psychological Science 27, 3: 365--374.
[41]
D.Royce Sadler. 1989. Formative assessment and the design of instructional systems. Instructional Studies 18, 2: 119--144.
[42]
Donald A Schön. 1984. The reflective practitioner: How professionals think in action. Basic books.
[43]
Daniel L. Schwartz, Catherine C. Chase, Marily A. Oppezzo, and Doris B. Chin. 2011. Practicing versus inventing with contrasting cases: The effects of telling first on learning and transfer. Journal of Educational Psychology 103, 4: 759--775.
[44]
Daniel L Schwartz, Jessica M Tsang, and Kristen P Blair. 2016. The ABCs of How We Learn: 26 Scientifically Proven Approaches, How They Work, and When to Use Them. WW Norton & Company.
[45]
Nihar B Shah, Joseph K Bradley, Abhay Parekh, Martin Wainwright, and Kannan Ramchandran. 2013. A Case for Ordinal Peer-evaluation in MOOCs. NIPS Workshop on Data Driven Education: 1--8. Retrieved from http://lytics.stanford.edu/datadriveneducation/paper s/shahetal.pdf
[46]
Nancy Sommers. 1982. Responding to student writing. College composition and communication 33, 2: 148--156.
[47]
Leigh Thompson, Dedre Gentner, and Jeffrey Loewenstein. 2000. Avoiding Missed Opportunities in Managerial Life: Analogical Training More Powerful Than Individual Case Training. Organizational Behavior and Human Decision Processes 82, 1: 60--75.
[48]
David Tinapple, Loren Olson, and John Sadauskas. 2013. CritViz: Web-Based Software Supporting Peer Critique in Large Creative Classrooms. Bulletin of the IEEE Technical Committee on Learning Technology 15, 1: 29--35. Retrieved from http://lttf.ieee.org/issues/january2013/Tinapple.pdf
[49]
Maryam Tohidi, William Buxton, Ronald Baecker, and Abigail Sellen. 2006. Getting the right design and the design right. Proceedings of ACM CHI 2006 Conference on Human Factors in Computing Systems 1: 1243--1252.
[50]
Amos Tversky and Itamar Simonson. 1993. Context-dependent preferences. Management science 39, 10: 1179--1189.
[51]
A.E.a Waters, D.b Tinapple, and R.G.a Baraniuk. 2015. BayesRank: A bayesian approach to ranked peer grading. L@S 2015 - 2nd ACM Conference on Learning at Scale: 177--183.
[52]
David Scott Yeager, Valerie Purdie-Vaughns, Julio Garcia, Nancy Apfel, Patti Brzustoski, Allison Master, William T. Hessert, Matthew E. Williams, and Geoffrey L. Cohen. 2014. Breaking the cycle of mistrust: Wise interventions to provide critical feedback across the racial divide. Journal of Experimental Psychology: General 143, 2: 804-- 824.
[53]
A Yuan, K Luther, and M Krause. 2016. Almost an Expert: The Effects of Rubrics and Expertise on Perceived Value of Crowdsourced Design Critiques. (CSCW'16) Proceedings of the 19th ACM Conference on Computer-Supported Cooperative Work & Social Computing: 1005-- 1017.

Cited By

View all

Index Terms

  1. Juxtapeer: Comparative Peer Review Yields Higher Quality Feedback and Promotes Deeper Reflection

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CHI '18: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems
    April 2018
    8489 pages
    ISBN:9781450356206
    DOI:10.1145/3173574
    Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 21 April 2018

    Check for updates

    Author Tags

    1. comparative peer review
    2. contrasting cases
    3. feedback

    Qualifiers

    • Research-article

    Conference

    CHI '18
    Sponsor:

    Acceptance Rates

    CHI '18 Paper Acceptance Rate 666 of 2,590 submissions, 26%;
    Overall Acceptance Rate 6,199 of 26,314 submissions, 24%

    Upcoming Conference

    CHI 2025
    ACM CHI Conference on Human Factors in Computing Systems
    April 26 - May 1, 2025
    Yokohama , Japan

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)257
    • Downloads (Last 6 weeks)17
    Reflects downloads up to 21 Dec 2024

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media