![]() |
|||||||||||||||||||||
|
The NTLF resource library is a cooperative project undertaken in cooperation with the POD Network. The POD Network is the oldest professional organization in the United States devoted to professional, faculty, and organizational development within higher education. To learn more about POD and its dedicated efforts to enhance the quality of college teaching visit http://www.podnetwork.org Faculty
evaluation is a term used to describe a range of processes and practices
for determining work performance of academics, usually driven by the
need for decision-making one of two areas - formative (improvement) or
summative (judgmental). The literature on faculty evaluation is quite
extensive, encompassing over 4,000 works on topics such as student
ratings of instruction (the most commonly collected data used in faculty
evaluation), peer/colleague evaluation, the construction and use of
performance portfolios, psychological measurement, and personnel process
and policy. A significant amount of research on faculty evaluation is
closely associated with faculty development studies and initiatives. |
|
||||||||||||||||||||
|
|
|||||||||||||||||||||
|
Faculty have traditionally been held responsible for (and thus rewarded based upon) their performance in three areas; teaching, scholarship, and service. A particular emphasis on any of these three areas is usually tied to an institution's mission. For example, faculty at small liberal arts colleges are often valued more for their teaching effectiveness than their research productivity, while the reverse may hold true at a research university. Historically, most types of institutions - from colonial colleges to land grant universities - rewarded their faculty for effective teaching and service, but early in the twentieth century the universities began to move toward a European model of higher education that considered research to be the primary responsibility of the institution's faculty. Dramatic increases in federal funding for research during the Cold War - providing a new stream of income for both faculty and their universities - fueled a shift at these institutions toward an emphasis on evaluating scholarly productivity. This emphasis on research and publication assumed that good scholars were good teachers, but over time students, parents, and the general public began to demand evidence of teaching effectiveness as well, and by 1980, student ratings of instruction were common throughout higher education. Major contributions to the literature were made by Centra (1979), Doyle (1975), Scriven (1967) and others. However, it was not until the last fifteen years of the century that attempts were made to develop a broader framework for faculty evaluation that included all the roles of the faculty. Shulman (1986) described three kinds of teacher knowledge: content knowledge [knowledge of one's subject]; pedagogical content knowledge [knowledge for teaching]; and curricular knowledge [a deep blending of the other two characterized by a broad repertoire of techniques, alternatives, materials, and the ability to relate the subject to a wide range of other fields].
In 1990, Ernest Boyer suggested four categories of scholarship: discovery, integration, application, and teaching. "Discovery" refers to the traditional scholarship of investigation and dissemination of research results to peers. Faculty who succeed in this realm are seen as creating new knowledge, and often receive grants to support their research and publication or presentation of their findings. "Integration" describes the kind of scholarship that synthesizes and gives meaning to isolated facts by putting them into perspective. Faculty demonstrating this form of scholarship go beyond strict disciplinary boundaries to bring together diverse viewpoints and to look at new questions and pressing problems. "Application" refers to a form of scholarship that uses traditional scholarship to address significant problems in order to help individuals, institutions, and society. Boyer's thesis incorporated the traditional notion of service into this aspect of scholarship. Finally, the "scholarship of teaching" describes the work of a professor becoming meaningful only when it is understood by others, and is something that both educates and entices future scholars. Boyer's reconceptualization was hailed as one that would allow the recognition of teaching, and the more complete and appropriate evaluation of faculty performance.
Following this reconsideration of scholarship itself, new guidelines for assessing scholarship were proposed by researchers and policymakers in the last several years. Boyer's colleagues (Glassick, Huber, & Maeroff, 1996) proposed schema for assessing scholarship and outlined the qualities of a scholar. Considerable dialogue on the "roles and rewards" of academic life, and the criteria used to evaluate the full range of faculty performance, has been led most vocally by Eugene Rice and an initiative sponsored by the American Association for Higher Education. (See: http://www.aahe.org/initiatives/facultyroles.htm for more information.)
Recent trends in the research literature have attempted to meld evaluation into an ongoing faculty development process, rather than viewing faculty evaluation as a stand-alone activity whose only result is a judgment on (and not a learning opportunity for) the individual faculty member. Arreola, Theall, & Aleamoni (2003) presented an expanded view of faculty work, distinguishing between the "base profession" (one's disciplinary area of expertise) and the larger "meta-profession" (the work of the professoriate as defined to include base profession expertise, teaching, and many other related areas of responsibility). The model is based on the premise that appropriate evaluation of faculty performance should begin with a full definition of the responsibilities of the individual, followed by the development of mechanisms to assess performance in the various areas, and coupled with sufficient resources for training and development of the necessary skills.
Evaluation by one's peers is common at virtually all institutions of higher education. This form of faculty evaluation is influenced considerably by academic disciplines and departments (as opposed to institutionally standardized forms of course evaluations), and thus encompasses several dimensions of departmental and disciplinary politics, collegiality, and the perspectives of one's departmental peers regarding the rigor of one's academic research. Often, a professor's research interests are so specific that nobody else in his or her department is abreast of the field, which makes a proper evaluation of research quality impossible. In these cases, a department chair or others within their department will seek other members of the academic discipline with similar interests and familiarity of the subject under review. However, the core activities of faculty evaluation are by necessity local constructs. Clearly, it makes sense for a review of research quality to be provided by someone with appropriate expertise, usually from the same or a closely allied discipline. Above all, senior administrators or institutional governing boards participation in these faculty evaluation activities are typically met with fervent resistance from the faculty. Although recent decades have witnessed a dramatic increase of attention paid to questions of institutional accountability, new approaches to faculty evaluation are still rare at most colleges and universities.
Conflicting views about student ratings of teaching
However, the actual processes of collecting, analyzing, interpreting, and using evaluation data on most campuses remain largely unaffected by this dialogue. Student ratings are the primary and in some cases, the sole source of data for summative decision making. In part, this limited practice had led to ongoing, and sometimes contentious debate about the technical and applied merits of ratings. A long history of ratings research consistently has shown these data to be reliable, valid, and useful. Cohen's (1981) often-cited multisection meta-analysis established a significant relationship between ratings and learning. Marsh's (1987) review considered a wide array of variables and concluded that ratings were reliable and valid. The string of substantial studies by Feldman (e.g., 1998) explored a number of variables in depth arriving at the basic conclusion that ratings were not significantly biased by these variables. However, a small number of studies have reported results interpreted by critics to prove ratings to be unreliable and invalid (e.g., Naftulin, Ware, & Donnelly, 1973), and these studies, along with a few newer ones, continue to be trumpeted as evidence that ratings should be dismissed or discontinued despite the fact that the positive evidence is overwhelming. The real problem is poor practice and misuse or misinterpretation of ratings data: a situation that has been decried for over a decade (e.g., Theall & Franklin, 1990)
Recommendations
The following are recommendations for good practice in faculty evaluation distilled from research and development efforts over the past four decades
New approaches require the investment of time, energy, and resources, as well as the commitment of all concerned parties. There are no quick or simple solutions, but many approaches hold the promise of allowing the development of fair, reliable, valid, accurate, credible, and accepted processes that will lead to improved evaluation. Lastly, the most promising institutional approaches to faculty evaluation - and which have the most potential to improving faculty performance - are those most closely tied to well-supported, long-term faculty development efforts.
References
Arreola, R. A., Theall, M. & Aleamoni, L. M. (2003) Beyond scholarship. Recognizing the multiple roles of the professoriate. Paper presented at the 84th annual meeting of the American Educational research Association. Chicago: April 22. Available on-line at: http://cedanet.com/meta. Boyer, E. L. (1990) Scholarship reconsidered. San Francisco: Jossey Bass. Centra, J. A. (1979) Determining faculty effectiveness: assessing teaching, research, and service for personnel decisions and improvement. San Francisco: Jossey Bass. Cohen, P. A. (1981). Student ratings of instruction and student achievement: A meta-analysis of multisection validity studies. Review of Educational Research, 51, 281-309 Doyle, K. O. (1975). Student Evaluation of Instruction. Lexington, MA: D. C. Heath. Feldman, K. A. (1998) "Reflections on the effective study of college teaching and student ratings: one continuing quest and two unresolved issues." In J. C. Smart (ed.) Higher education: handbook of theory and research. New York: Agathon Press. Glassick, C. E., Huber, M. T. & Maeroff, G. I. (1997) Scholarship assessed. San Francisco: Jossey Bass. Marsh, H. W. (1987). Students evaluations of university teaching: Research findings, methodological issues, and directions for future research. International Journal of Educational Research, 11, 253-388. Naftulin, D. H., Ware, J. E., & Donnelly, F. A. (1973). The Doctor Fox lecture: A paradigm of educational seduction. Journal of Medical Education, 48, 630-635. Shulman, L. S. (1986) "Those who understand: knowledge growth in teaching." Educational Researcher, 15, 4-14. Theall, M., & Franklin, J. L. (1990) (Eds.) Student ratings of instruction: Issues for improving practice. New Directions for Teaching and Learning #43. San Francisco: Jossey Bass.
Annotated readings:
Angelo, T. A., & Cross, K. P. (1993) Classroom assessment techniques: a handbook for college teachers. (2nd ed.) San Francisco: Jossey Bass. This widely read text is derived from work at the National Center for Research into Postsecondary Teaching and Learning at the University of Michigan. The book offers extensive description and application of various strategies for gathering and using information to assess student learning. Since evidence of student learning is an important part of the larger evaluation of faculty performance, knowledge and use of effective assessment techniques is critical to good evaluation practice.
Arreola, R. A. (2000) (2nd ed.) Developing a comprehensive faculty evaluation system. Bolton, MA: Anker Publishing Company. This is the best practical guide to developing and implementing a comprehensive evaluation system. It contain, a thorough overview, grounded discussion of operational issues, samples of instruments and process, and proven techniques for generating dialogue and consensus. If one could have only one book to guide faculty evaluation, this should be it. Arreola, R. A., Theall, M. & Aleamoni, L. M. (2003) Beyond scholarship. Recognizing the multiple roles of the professoriate. Paper presented at the 84th annual meeting of the American Educational research Association. Chicago: April 22. Available on-line at: http://cedanet.com/meta. This paper and associated website allow readers to access the full presentation as wellas several matrices which describe in some detail, the ‘meta-profession' concept and its application to faculty work. A separate section is included demonstrating how ‘the scholarship of teaching and learning" represents a special case of the meta-professional construct.
Birnbaum, R. (1988) How colleges work: the cybernetics of academic organization and leadership. San Francisco: Jossey Bass ___________ (1992) How academic leadership works. Understanding success and failure in the college presidency. San Francisco: Jossey Bass. ___________ (2001) Management fads in higher education. Where they come from, what they do, and why they fail. San Francisco: Jossey Bass. Birnbaum's work is clear, insightful, realistic, and very readable. His books provide many descriptive examples and the work is grounded in a substantial and broad literature base. Few authors have described higher education as clearly and realistically as Birnbaum. While this work centers on the ways higher education works, it is important to evaluation because it provides a better understanding of the context in which evaluation takes place. Evaluation systems must be responsive to the environments in which they exist, thus this literature is relevant and useful.
Bloom, S. (1998) Structure and ideology in medical education: an analysis of resistance to change. Journal of Health and Social Behavior, 29, 294-306. This is an interesting view of the change process and a sobering description of "reform without change", that is, the futility of responding to the need to alter or improve something without implementing any long-lasting or meaningful changes. In conjunction with Birnbaum's text on "fads", it alerts readers to the need for making change relevant and connecting to stakeholder frames of reference. By its very nature, evaluation is often threatening, and it is potentially an explosive issue. Faculty evaluation is often (unfortunately) done in a ‘pro forma' manner in response to pressure for reform or change and Bloom's article points out the dangers and frustrations associated with this error. Boyer, E. L. (1990) Scholarship reconsidered: the priorities of the professoriate. San Francisco: Jossey Bass. This short work has had tremendous impact in the decade since its publication, generating dialogue and action on campuses of all types. It's reconceptualization of scholarship, particularly the introduction of "the scholarship of teaching" as an equal partner with discovery, integration, and application has led to numerous projects and attempts to broaden the roles of faculty. Faculty evaluation must include clear and public definition of its expectations, process, and intended uses. By further defining the "priorities of the professoriate, Boyer opened a dialogue that has brought more attention to teaching and the scholarship that underlies it. Braskamp, L. A. & Ory, J. C. (1994). Assessing faculty work. San Francisco: Jossey-Bass. and Centra, J. A. (1993). Reflective faculty evaluation: enhancing teaching and determining faculty effectiveness. San Francisco: Jossey-Bass. These are two of the most substantial, recent books on the general topic of faculty evaluation, both by widely respected authors.. They are comprehensive and discuss a range of issues, more theoretical than the Arreola text, but excellent additions to a basic library on the topic. Chickering, A. W. & Gamson, Z.F. (1987) Seven principles for good practice in undergraduate education." Wingspread Journal, 9 (2), special insert. The seven principles have been used as guidelines for assessment in traditional and even on-line instruction and there has been widespread use of the faculty, student, and institutional inventories devised to measure implementation of the principles on campuses. There is debate about the extent to which these general guidelines should be used in evaluating faculty performance, but their impact can not be overlooked. Cross, K. P. & Steadman, M. H. (1996) Classroom research: implementing the scholarship of teaching. San Francsico: Jossey Basss. This work expands on Cross' work in classroom assessment to incorporate Boyer's definitions of scholarship and to provide teachers with strategies for conducting classroom research that supports better understanding of classroom process and simultaneously has the rigor to be incorporated into more traditional outlets for scholarship. Classroom research can be legitimate scholarship and its results can lead to improved teaching and learning. As such, the efforts of faculty who undertake classroom research should be considered valid evidence of professional activity and/or scholarship and should be among the considerations in evaluating performance. Diamond, R. M. (2002) A field guide to academic leadership. San Francisco: Jossey Bass This new book considers a broad range of topics related to effective leadership in higher education including a section that deals with assessment, faculty evaluation, program review, and the faculty reward system.
Farmer, D. W. (1999) Institutional improvement and motivated faculty: a case study. In M. Theall (Ed.) "Motivation from within: approaches for encouraging faculty and students to excel" New Directions for Teaching and Learning # 78. San Francisco: Jossey Bass. ___________ (1988) Enhancing student learning: emphasizing essential competencies in academic programs. Wilkes Barre, PA: Kings College Press. Farmer's work as an academic administrator and leader in change processes is summarized in the New Directions chapter and his specific efforts at Kings College are discussed in detail in the book. Both are useful for those in a position to effect change through leadership. While the thrust of Farmer's work was in assessment, the principles he discusses also apply to institutional programs for evaluation.
Feldman, K. A. (1976 through 1998; see extended bibliography) A series of research syntheses in Research in higher Education. In a series of over a dozen articles, Feldman has provided the most in-depth reviews of specific issues relating to faculty evaluation and student ratings of instruction. These definitive works have explored all of the major issues raised as potential biases to faculty evaluation. Feldman's findings concur with other major work on evaluation, noting that in general, student ratings provide valid, reliable, and useful information. Feldman, K. A. & Paulsen, M. B. (1998) (2nd ed.) Teaching and learning in the college classroom. Needham Heights, MA: Simon & Schuster Custom Publishing. This is the most comprehensive collection of readings on this topic. It is broad and substantial, and includes landmark items like Boyer's definitions of the four kinds of scholarship, Chickering & Gamson's "Seven Principles", and also incorporates some of Feldman's studies of college teaching and Paulsen & Feldman's work on creating a campus teaching culture. This is probably the best single compendium of the research on college teaching. Its collection of articles make clear, the complexity and multi-dimensionality of college teaching and the associated notion that the evaluation of that teaching is equally complex.
Franklin, J. & Theall, M. (1989) "Who reads ratings: knowledge, attitudes, and practices of users of student ratings of instruction." Paper presented at the 70th annual meeting of the American Educational Research Association. San Francisco: March 31. ERIC # ED 306-241. This was the first large-scale study of faculty and administrator knowledge of and attitudes about ratings. It included a survey determining user knowledge of evaluation practice, basic statistics, and the literature of the field. Faculty and administrators in general scored significantly lower than staff in centers for teaching, learning, assessment, and evaluation, who, in turn, scored lower than a group of 40 experts in the field. Less knowledge of the literature and practice in the field was significantly correlated with negative views about evaluation and about students as providers of data.
Glassick, C. E., Huber, M. T., & Maeroff, G. I. (1997) Scholarship assessed: evaluation of the professoriate. San Francisco: Jossey Bass. This work continues where Boyer's text left off, providing discussion and guidelines for evaluating and documenting scholarly work, particularly in the realm of the scholarship of teaching. Completed after Boyer's death, it nonetheless includes a prologue by Boyer. The outline of criteria for assessing scholarship has been widely accepted but its limitation with respect to faculty evaluation is that it considers only scholarship while thorough faculty evaluation must consider all aspects of performance.
Marsh, H. W. (1987) Students evaluations of university teaching: Research findings, methodological issues, and directions for future research. International Journal of Educational Research, 11, 253-388. This review of the faculty evaluation literature remains the most cited work of its kind. It depth and breadth are unusual and its conclusions, supported by virtually all major researchers in evaluation, are that student ratings are, "1) multidimensional; 2) reliable and stable; 3) primarily a function of the instructor who teaches a courserather than the course that is taught; 4) relatively valid against a variety of indicators of effective teaching; 5) relatively unaffected by a variety of variables hypothesized as potential biases; and 6) seen to be useful by faculty…by students…and by administrators." .
Miller, R. I. (1987) Evaluating faculty for promotion and tenure. San Francisco: Jossey Bass. This text lists ten useful characteristics of effective systems that are the basis for many of the recommendations in this chapter, and includes legal issues, administrative roles, and discussion of promotion and tenure procedures.
New Directions for Teaching and Learning / Institutional Research These series from Jossey Bass include several relevant titles. Those dealing with evaluation and closely related issues are identified below.
This is the first in a series of volumes exploring evaluation issues and it includes contributions by many of the leading writers in the field.
On-line teaching and learning present new questions and issues in terms of the effect of context on instructional effectiveness. There are important questions about the evaluation of on-line teaching that have not yet been addressed and this volume's emphasis on assessment can contribute to more accurate and effective evaluation of on-line teaching.
In its discussions of the use of assessment to improve teaching and learning; to help new instructors; and to support program accountability, this volume is a model partner to effective evaluation. Since student learning is one measure of effective teaching, assessment can and should be part of good evaluation practice,
This volume considers evaluation from several perspectives, connecting it to the scholarship of teaching, teaching awards, the use of technology, portfolio development, accreditation, and other issues.
This volume deals with useful techniques for avoiding common errors in reporting and interpreting ratings results. Since such errors are one of the most common problems with ratings, the guidelines in this volume can be particularly useful.
This volume considers past practice and new approaches , recommending practical strategies for improving current methods, and for developing new ones.
This is one of the first discussions of ratings and evaluation to focus on context and the systematic nature of evaluation and the requirements it imposes. Several aspects of evaluation from ethical issues to disciplinary differences are presented.
This volume deals with teaching improvement but connects such efforts to gathering, analyzing, and interpreting evaluation and assessment data for purposes of improvement.
This volume looks ahead to possible changes in practice affected by issues such as the increasing demand for distance and on-line learning, and the effects of new definitions of validity.
Pescolozodi, B. & Aminzade, R. (1999) The social worlds of higher education: handbook for teaching in a new century. Thousand Oaks, CA: Pine Forge Press. This is a very interesting collection of works by sociologists and considers a range of higher education issues. Its treatment of higher education as a sociological microcosm is unique and worthwhile, and like the work of Birnbaum, it offers insights into the context for evaluation.
Scriven, M. (1967) Methodology of evaluation. In R. Tyler, R. Gagne, & M. Scriven (Eds.) Perspectives of curriculum evaluation. Chicago: Rand McNally & Co. If there is a seminal article on modern evaluation, this is it. Scriven's terminology and approach are the basis for contemporary evaluation and assessment.
Extended bibliography of related publications:
The
following bibliography is not complete. There are far too many items on
evaluation to include here. Those works listed below are almost all in
accessible publications or books. Presentations at conferences not
available via ERIC or on-line, are not included. Some items connect
evaluation with faculty or instructional development, institutional
policy, leadership, and related topics, and are thus included. |
|
Abrami, P. C. (1993). Using student rating norm groups for summative evaluation. Faculty Evaluation and Development, 13, 5-9. Abrami, P. C. (1999). Improving judgments about teaching using teacher rating forms. In M. Theall, P. Abrami, & L. A. Mets (Issue Eds.), "The student ratings debate. Are they valid? How can we best use them?" New Directions for Institutional Research # 109. San Francisco: Jossey-Bass. Abrami, P. C. & d'Apollonia, S. (1990a) The dimensionality of ratings and their use in personnel decisions. In M. Theall & J. Franklin (eds) "Student ratings of instruction: issues for improving practice". New Directions for Teaching and Learning #43. San Francisco: Jossey Bass. Abrami, P. C., d'Apollonia, S., & Cohen, P. A. (1990b). The validity of student ratings of instruction: What we know and what we don't. Journal of Educational Psychology, 82, 219-231. Abrami, P. C., d'Apollonia, S., & Rosenfield, S. (1996). The dimensionality of student ratings of instruction: What we know and what we do not. Higher Education: Handbook of Theory and Research, XI, 213-264. Also published in: Perry, R. P. & Smart, J.C. (Eds.). Effective Teaching in Higher Education: Research and Practice. Bronx, N.Y.: Agathon Press, pp. 321- 367. Abrami, P. C., Dickens, W. J., Perry, R. P., & Leventhal, L. (1980). Do teacher standards for assigning grades affect student evaluations of instruction? Journal of Educational Psychology, 72, 107-118. Abrami, P. C., Leventhal, L., & Perry, R. P. (1982). Educational seduction. Review of Educational Research, 52, 446-464. Aleamoni, L. M. (1987). Typical faculty concerns about student evaluation of teaching. In L. M. Aleamoni (Ed.) "Techniques for evaluating and improving instruction." New Directions for Teaching and Learning #31. San Francisco: Jossey Bass. Ambady, N., & Rosenthal, R. (1992). Half a minute: Predicting teacher evaluations from thin slices of nonverbal behavior and physical attractiveness. Journal of Personality and Social Psychology, 64, 431-441. Angelo, T. A., & Cross, K. P. (1993) Classroom assessment techniques: a handbook for college teachers. (2nd ed.) San Francisco: Jossey Bass. Arreola, R. A. (2nd ed.) (2000) Developing a Comprehensive Faculty Evaluation System. Bolton, MA: Anker Publishing Company. Bejar, I. I., & Doyle, K. O. (1981). Factorial invariance in student ratings of instruction. Applied Psychological Measurement, 5, 307-312. Bess, J. L. & Associates (2000) Teaching alone, teaching together. San Francisco: Jossey Bass. Birnbaum, R. (1992) How academic leadership works. Understanding success and failure in the college presidency. San Francisco: Jossey Bass. Birnbaum, R. (1988) How colleges work: the cybernetics of academic organization and leadership. San Francisco: Jossey Bass. Birnbaum, R. (2001) Management fads in higher education: where they come from, what they do, why they fail. San Francisco: Jossey Bass. Blackburn, R. T., & Clark, M. J. (1975). An assessment of faculty performance: Some correlates between administrators, colleagues, students, and self-Ratings. Sociology of Education, 48, 242-256. Boice, R. (1991) Quick starters: new faculty who succeed. in M. Theall & J. Franklin (eds.) "Effective practices for improving teaching." New Directions for Teaching and Learning # 48. San Francisco: Jossey Bass. Boice, R. (1992) The new faculty member. San Francisco: Jossey Bass. Brandenburg, D. C., Slinde, J. A. & Batista, E. E. (1977). Student ratings of instruction: Validity and normative interpretations. Journal of Research in Higher Education, 7, 67-78. Braskamp, L. A. & Ory, J. C. (1994). Assessing Faculty Work. San Francisco: Jossey-Bass. Braskamp, L. A., Ory, J. C., & Pieper, D. M. (1981). Student written comments: Dimensions of instructional quality. Journal of Educational Psychology, 73, 65-70. Brinko, K. T. (1991) The interactions of teaching improvement, in M. Theall & J. Franklin (Eds.) "Effective Practices for Improving Teaching." New Directions for Teaching and Learning. # 48. San Francisco: Jossey Bass. Brinko, K. T. & Menges, R. J. (1997) Practically speaking: a source book for instructional consultants in higher education. Stillwater, OK: New Forums Press. Cashin, W. E. (1990) Students do rate different academic fields differently. In M. Theall & J. Franklin (eds.) "Student ratings of instruction: issues for improving practice." New Directions for Teaching and Learning # 43. San Francisco: Jossey Bass. Centra, J. A. (1974). The relationship between student and alumni ratings of teachers. Educational and Psychological Measurement, 1974, 34 (2), 321-326. Centra, J. A. (1976). The influence of different directions on student ratings of instruction. Journal of Educational Measurement, 13(4), 277-282. Cashin, W. E. (1992). Student ratings: The need for comparative data. Instructional Evaluation and Faculty Development, 12 (2), 1-6. Centra, J. A. (1993). Reflective Faculty Evaluation: Enhancing Teaching and Determining Faculty Effectiveness. San Francisco: Jossey-Bass. Centra, J. A. (1994) The use of the teaching portfolio and student evaluations for summative evaluation. Journal of Higher Education, 65 (5), 555-570.
Centra, J. A. (2003) "Will teachers receive higher student
evaluations by giving higher grades and less coursework?"
Insttructional Evaluation and Faculty Development, 22 (1). Available
on-line at: Centra, J. A., & Gaubatz, N. B. (1998) Is there gender bias in student ratings of instruction? Journal of Higher Education, 70 (1), 17-33. Chickering, A. W. & Ehrmann, S. C. (1997) Implementing the seven principles: technology as lever. American Association for Higher Education. Available on-line at: http://www.tltgroup.org/programs/seven.html Chickering, A. W. & Gamson, Z. F. (1991) "Applying the seven principles for good practice in undergraduate education. New Directions for Teaching and Learning # 47. San Francisco: Jossey Bass. Clark, D. J., & Bekey, J. (1979) "Use of small groups in instructional evaluation. Insight into teaching excellence. 7 (1), 2-5. Arlington, TX: University of Texas at Arlington. Cohen, P. A. (1980). Effectiveness of student-rating feedback for improving college instruction: A meta analysis. Research in Higher Education, 13, 321-341. Cohen, P. A. (1982). Validity of student ratings in psychology courses: A research synthesis. Teaching of Psychology, 9, 78-82. Costin, F., Greenough, W. T. & Menges, R. J. (1971) Student ratings of college teaching: reliability, validity, and usefulness. Research in Higher Education, 41, 511-536. Crittenden, K. S., & Norr, J. L. (1973) "Student values and teacher evaluation: a problem in person perception." Sociometry, 36 (2), 143-151.. d'Apollonia, S., & Abrami, P. C. (1997a). Navigating student ratings of instruction. American Psychologist, 52 (11), 1198-1208. d'Apollonia, S., & Abrami, P. C. (1997b). Scaling the Ivory Tower Part I: Collecting evidence of instructor effectiveness. Psychology Teaching Review, 6, 46-59. d'Apollonia, S., & Abrami, P. C. (1997c). Scaling the Ivory Tower Part II: Student ratings of instruction in North America. Psychology Teaching Review, 6, 60-76. d'Apollonia, S., & Abrami, P.C. (1997d). "How'm I Doing?": The effects of expressive writing style on reader attention. Change, 29(5), 18-19. Davis, B. G. (1993) Tools for teaching. San Francisco: Jossey Bass. Diamond, R. M. (1989) Designing and improving courses and curricula in higher education. San Francisco: Jossey Bass. Doyle, K. O. (1983) Evaluating teaching. Lexington, MA: Lexington Books Doyle, K. O., & Crichton, L. A. (1978). Student, peer, and self-evaluations of college instruction. Journal of Educational Psychology, 70, 815-826. Ebel, K. E. (ed.) (1980) "Improving teaching styles." New Directions for Teaching and Learning #1. San Francisco: Jossey Bass. Fairweather, James 1993. Academic Values and Faculty Rewards. The Review of Higher Education, 17(1), 43-68 Farmer, D. W. (1999) Institutional improvement and motivated faculty. In M. Theall (ed.) "Motivation from within: encouraging faculty and students to excel." New Directions for Teaching and Learning #78. San Francisco: Jossey Bass. Feldman, K. A. (1976). The superior college teacher from the student's view. Research in Higher Education, 1976, 5, 243-288. Feldman, K. A. (1977). Consistency and variability among college students in rating their teachers and courses: A review and analysis. Research in Higher Education, 6, 223-274. Feldman, K. A. (1978). Course characteristics and college students' ratings of their own teachers: What we know and what we don't. Research in Higher Education, 9, 199-242. Feldman, K. A. (1979). The significance of circumstances for college students' ratings of their teachers and courses: A review and analysis. Research in Higher Education, 10, 149-172. Feldman, K. A. (1984). Class size and college students' evaluations of teachers and courses: A closer look. Research in Higher Education, 21, 45-115. Feldman, K. A. (1988). Effective college teaching from the students' and faculty's view: Matched or mismatched priorities? Research in Higher Education, 28, 291-344. Feldman, K. A. (1989a). Instructional effectiveness of college teachers as judged by teachers themselves, current and former students, colleagues, administrators and external (neutral) observers. Research in Higher Education, 30(2), 137-194. Feldman, K. A. (1989b). The association between student ratings of specific instructional dimensions and student achievement: Refining and extending the synthesis of data from multisection validity studies. Research in Higher Education, 30, 583-645. Feldman, K. A. (1990). An afterward for "The association between student ratings of specific instructional dimensions and student achievement: Refining and extending the synthesis of data from multisection validity studies". Research in Higher Education, 31, 315-318.
Feldman, K. A. (1992a). College students views of male and female college teachers. Part 1: Evidence from the social laboratory and experiments. Research in Higher Education, 33 (3), 317-375.
Feldman, K. A. (1992b). College students views of male and female college teachers. Part 2: Evidence from students' evaluations of their classroom teachers. Research in Higher Education, 33 (4), 415-474. Feldman, K. A. (1995). Some unresolved issues in studying instructional effectiveness in student ratings. Invited address presented at the 76th annual meeting of the American Educational Research Association. San Francisco: April 21 Feldman, K. A. (1997) Identifying exemplary teachers and teaching: evidence from student ratings. In R. P. Perry & J. C. Smart (Eds.) Effective teaching in higher education: research and practice. New Yoprk: Agathon Press. Feldman, K. A. & Paulsen, M. B. (1998) Teaching and Learning in the college classroom. (2nd ed) Needham Heights, MA: Simon & Schuster. Feldman, K. A. & Paulsen, M. B (1999) "Faculty motivation: the role of a supportive teaching culture." In M. Theall (ed.) "Motivation from within: encouraging faculty and students to excel." New Directions for Teaching and Learning #78. San Francisco: Jossey Bass. Franklin, J. & Berman, E. (1998) "Using student written comments in evaluating teaching." Instructional Evaluation and Faculty Development, 18 (1). Available on-line at: http://www.uis.edu/~ctl/sigfed/backissues.html Franklin, J. L., & Theall, M. (1990). Communicating ratings results to decision makers: Design for good practice. In M. Theall & J. Franklin (Eds.) Student ratings of instruction: Issues for improving practice. New Directions for Teaching and Learning #43. San Francisco: Jossey Bass. Franklin, J. L., & Theall, M. (1995) The relationship of disciplinary differences and the value of class preparation time to student ratings of instruction. In N. Hativa & M. Marincovich, (eds.) "Disciplinary differences in teaching and learning: implications for practice." New Directions for Teaching and Learning # 64. San Francisco: Jossey Bass.
Frey, P. W. (1976). Validity of student instructional ratings. Does timing matter? Journal of Higher Education, 3, May-June, 327-336. Frey, P. W. (1978). A two-dimensional analysis of student ratings of instruction. Research in Higher Education, 9, 69-91. Frey, P. W. (1979). The Dr. Fox Effect and its implications. Instructional Evaluation, 3, 1-5. Grasha, A. F. (1996) Teaching with style. Pittsburgh, PA: Alliance Press. Greenwald, A. G. (1997). Validity concerns and usefulness of student ratings of instruction. American Psychologist, 52, 11, 1182-1186. Greenwald, A. G., & Gillmore, G. M. (1997a). Grading leniency is a removable contaminant of student ratings. American Psychologist, 52(11), 1209-1217. Greenwald, A. G., & Gillmore, G. M. (1997b). No pain, no gain? The importance of measuring course workload in student ratings of instruction. Journal of Educational Psychology, 89, 743-751. Hativa, N. (1993). Student ratings: A non-comparative interpretation. Instructional Evaluation and Faculty Development, 13 (2), 1-4. Hativa, N. (1999) "Towards a conceptual framework of dimensions of effective instruction: the role of high-, intermediate-, and low-inference teaching behaviors." Instructional Evaluation and Faculty Development, 19 (1), 3-12. Available on-line at: http://www.uis.edu/~ctl/sigfed/backissues.html Hativa, N. & Marincovich, M. (eds.) (1995) "Disciplinary differences in teaching and learning: implications for practice." New Directions for Teaching and Learning # 64. San Francisco: Jossey Bass. Howard, G. S., Conway, C. G., & Maxwell, S. E. (1985). Construct validity of measures of college teaching effectiveness. Journal of Educational Psychology, 77, 187-196. Kulik, J. A. (2001) Student ratings: validity, utility, and controversy. In M. Theall, P. Abrami, & L. Mets (Eds.) "The student ratings debate. Are they valid? How can we best use them?" New Directions for Institutional Research #109. San Francisco: Jossey Bass Kulik, J. A., & McKeachie, W. J. (1975). The evaluation of teachers in higher education. In F. N. Kerlinger (Ed.), Review of Research in Education (Vol. 3, pp. 210-240). Itasca, IL: Peacock. Machell, D. F. (1989) "A discourse on professorial melancholia." Community Review, 9 (1&2), 41-50. Marlin, J. W. J. (1987). Student perception of end-of-course evaluations. Journal of Higher Education, 58(6), 704-716. Marsh, H. W. (1977). The validity of students' evaluations: Classroom evaluations of instructors independently nominated as best and worst teachers by graduating seniors. American Educational Research Journal, 14, 441-447. Marsh, H. W. (1982a). SEEQ: A reliable, valid, and useful instrument for collecting student's evaluation of university teaching. British Journal of Educational Psychology, 52, pp. 77-95. Marsh, H. W. (1982b). Validity of students' evaluations of college teaching: A multitrait-multimethod analysis. Journal of Educational Psychology, 74, 264-279. Marsh, H. W. (1984). Students' evaluations of university teaching: Dimensionality, reliability, validity, potential biases, and utility. Journal of Educational Psychology, 76, 707-754. Marsh, H. W., Fleiner, H., & Thomas, C. S. (1975). Validity and usefulness of student evaluations of instructional quality. Journal of Educational Psychology, 67, 833-839. Marsh, H. W. & Hocevar, D. (1984) "The factorial invariance of student evaluations of college teaching." American Educational Research Journal, 21 (2), 341-366. Marsh, H. W., Overall, J. U., & Kesler, S. P. (1979). Validity of student evaluations of instructional effectiveness: A comparison of faculty self-evaluations and evaluations by their students. Journal of Educational Psychology, 71, 149-160.
Marsh, H. W., & Ware, J. E. (1982). Effects of expressiveness, content coverage, and incentive on multidimensional student rating scales: New interpretations of the Dr. Fox Effect. Journal of Educational Psychology, 74, 126-134.
McCallum, L. W. (1984). A meta-analysis of course evaluation data and its use in the tenure decision. Research in Higher Education, 21, 150-158.
McKeachie, W. J. (1987). Can evaluating instruction improve teaching? In L. M. Aleamoni (Ed.), Techniques for evaluating and improving instruction. New Directions for Teaching and Learning #31. San Francisco: Jossey Bass. McKeachie, W. J. (1991) "Effective teaching behaviors in the college classroom." in J. C. Smart (Ed.) Higher Education Handbook of Theory and Research. New York: Agathon, 135-172. McKeachie, W. J. (1996). Do we need norms of student ratings to evaluate faculty? Instructional Evaluation and Faculty Development, 15 (1 & 2), 14-17. McKeachie, W. J. (1997). Student ratings: The validity of use. American Psychologist, 52(11), 1218-1225. McKeachie, W. J. (1999a) Teaching tips: strategies, research, and theory for college and university teachers. 10th ed. Boston: Houghton Mifflin. McKeachie, W. J., Lin, Y., & Mann, W. (1971). Student rating of teacher effectiveness: Validity studies. American Educational Research Journal, 8, 435-445. Menec, V. H., Perry, R. P., Struthers, C. W., Schonwetter, D. J., Hechter, F. J., & Eichholz, B. L. (1994) "Assisting at-risk college students with attributional retraining and effective teaching." Journal of Applied Social Psychology, 24, 675-701. Menges, R. J. & Associates (1999) Faculty in new jobs: a guide to settling in, becoming established, and building institutional support. San Francisco: Jossey Bass. Menges, R. J. & Brinko, K. T. (1986) Effects of student evaluation feedback: a meta-analysis of higher education research. Paper presented at the 70th annual meeting of the American Educational Research Association. San Francisco: April 21 . ED 270 408. Menges, R. J. & L'Hommedieu, R. (1992). Toward more conclusive studies of student ratings feedback. Instructional Evaluation and Faculty Development, 12 (2), 17-23. Menges, R. J. & Mathis, B. C. (1988) . San Francisco: Jossey Bass. Menges, R. J. & Svinicki, M. D. (eds.) (1991) "College teaching: from theory to practice." New Directions for Teaching and Learning #45. San Francisco: Jossey Bass. Miller, R. I. (1987). Evaluating Faculty for Promotion and Tenure. San Francisco: Jossey Bass. Moss, P. A. (1992). Shifting conceptions of validity in educational measurement: Implications for performance assessment. Review of Educational Research, 62, 229-258. Moss, P. A. (1996). Enlarging the dialogue in educational measurement: Voices from interpretive research traditions. Educational Researcher, 25(1), 20-28. Murray, H. G. (1983). Low inference classroom teaching behaviors and student ratings of college teaching effectiveness. Journal of Educational Psychology, 71, 856-865. Ory, J. C., Braskamp, L. A., & Pieper, D. M. (1980). The congruency of student evaluative information collected by three methods. Journal of Educational Psychology, 72, 181-185. Overall, J. U., & Magee, K. N. (1992). Estimating individual rating reliabilities. Applied Psychological Measurement, 16, 77-85. Overall, J. U., & Marsh, H. W. (1979). Midterm feedback from students: Its relationship to instructional improvement and students' cognitive and affective outcomes. Journal of Educational Psychology, 71, 856-865. Overall, J. U., & Marsh, H. W. (1980). Students' evaluations of instruction: A longitudinal study of their stability. Journal of Educational Psychology, 72, 321-325. Pascarella, E. T. & Terenzini, P. T. (1991) How college affects students. San Francisco: Jossey Bass. Perry, R. P. (1991) "Perceived control in the college classroom." In J. C. Smart (ed.) Higher education: handbook of theory and research (Vol 7). New York: Agathon. Perry, R. P., Abrami, P. C., & Leventhal, L. (1979). Educational seduction: The effect of instructor expressiveness and lecture content on student ratings and achievement. Journal of Educational Psychology, 71 (1), 107-116.
Perry, R. P., Magnusson, J. L., Parsonson, K. L., & Dickens, W. J. (1986). Perceived control in the college classroom: Limitations in instructor expressiveness due to noncontingent feedback and lecture content. Journal of Educational Psychology, 78 (2), 96-107. Pintrich, P. R. (ed.) (1995) "Understanding self-regulated learning." New Directions for Teaching and Learning # 63. San Francisco: Jossey Bass. Pintrich, P. R., McKeachie, W. J., Smith, D. A. F., Doljanac, R., Lin, Y., Naveh-Benjamin, M., Crooks, T., & Karabenick, S. (1988) "Motivated Strategies for Learning Questionnaire" (revised edition). Ann Arbor, MI: National Center for Research to Improve Postsecondary Teaching and Learning (NCRIPTAL). Ann Arbor, MI: The University of Michigan. Remmers, H. H., & Brandenburg, G. C. (1927). Experimental data on the Purdue Rating Scale for Instruction. Educational Administration and Supervision, 13, 519-527. Rodin, M., & Rodin, B. (1972). Student evaluations of teachers. Science, 177, 1164-1166. Schuster, J. H.., & Wheeler, D. W., & Associates (1990) Enhancing faculty careers. San Francisco: Jossey Bass Scriven, M. (1983). Summative teacher evaluation. In J. Milman (Ed), Handbook of teacher evaluation (pp. 244-271). Beverly Hills, CA: Sage. Scriven, M. (1988). The state of the art in tertiary teacher evaluation. Research and Development in Higher Education, 10, 2-27. Seldin, P. (1980). Successful Faculty Evaluation Programs: A Practical Guide to Improve Faculty Performance and Promotion/Tenure Decisions. Cruger, NY: Coventry Press. Seldin, P. (1984). Changing Practices in Faculty Evaluation. San Francisco: Jossey Bass. Seldin, P. (1991) The teaching portfolio: a practical guide to improved performance and prootion/tenure decisions. Bolton MA: Anker Publications. Seldin, P. (1993). The use and abuse of student ratings of professors. The Chronicle of Higher Education, July 21, 1993, pp. A40. Seldin, P. (1999) Changing practices in faculty evaluation. San Francisco: Jossey Bass. Shore, B. M., Foster, S. F., Knapper, C. K., Nadeau, G. G., Neill, N. & Sim, V. W. (1986) "The teaching Dossier." Ottawa: Canadian Association of University Professors. Smart, J. C. & Feldman, K. A. (1998) "Accentuation effects of dissimilar academic departments: an application and exploration of Holland's theory." Research in Higher Education, 39 (4), 385-418. Sorcinelli, M. D. & Austin, A. E.. (1992) "Developing new and junior faculty. New Directions for Teaching and Learning # 50. San Francisco: Jossey Bass Stark, J.S., Lowther, M. A., Ryan, M. P., Bomotti, S. S., Haven, G. L., & Martens, G. (1988) Reflections on course planning: faculty and students consider influences and goals. Ann Arbor, MI: National center for Research to Improve PostSecondary Education. Theall, M. (1994). What's wrong with faculty evaluation: A debate on the state of the practice. Instructional Evaluation and Faculty Development, 14 (1 & 2), 27-34. Theall, M. (1996). When Meta-analysis isn't enough: A report of a symposium about student ratings, conflicting results, and issues that won't go away. Instructional Evaluation and Faculty Development, 15 (1 & 2), 1-14. Theall, M. (Ed.) (1999) "Motivation from within: encouraging faculty and students to excel." New Directions for Teaching and Learning # 78. San Francisco: Jossey Bass. Theall, M. (2002a) "Assessment and evaluation: an institutional context." In R. M. Diamond (Ed.) A field guide to academic leadership. San Francisco: Jossey Bass. Theall, M. (2002b) "Leadership in faculty evaluation. In R. M. Diamond, R. (Ed.) A field guide to academic leadership. San Francisco: Jossey Bass. Theall, M., & Franklin, J. L. (1990). Student ratings in the context of complex evaluation systems. In M. Theall & J. Franklin (Eds.), Student ratings of instruction: Issues for improving practice. New Directions for Teaching and Learning #43. San Francisco: Jossey Bass.
Theall, M., & Franklin, J. L. (Eds.) (1991a) "Effective Practices for Improving Teaching." New Directions for Teaching and Learning. # 48. San Francisco: Jossey Bass.
Theall, M., & Franklin, J. L. (1991b). Using student ratings for teaching improvement. In M. Theall & J. Franklin (Eds.), Effective practices for improving teaching. New Directions for Teaching and Learning #48. San Francisco: Jossey Bass.
Theall, M., & Franklin, J. L. (1995) The relationship of disciplinary differences and the value of class preparation time to student ratings of instruction. In N. Hativa & M. Marincovich (eds.) "Disciplinary differences in teaching and learning: implications for practice." New Directions for Teaching and Learning # 64. San Francisco: Jossey Bass Theall, M., & Franklin, J. L. (2001). Looking for bias in all the wrong places: A search for truth or a witch hunt in student ratings of instruction. ." In M. Theall, P A. Abrami, & L. Mets (Eds.), "The student ratings debate. Are they valid? How can we best use them?" New Directions for Institutional Research No. 109. San Francisco: Jossey Bass. Theall, M., Franklin, J. L., & Ludlow, L. (1990) Attributions and retributions: student ratings and the perceived causes of performance. Paper presented at the 70th annual meeting of the American Educational Research Association. Boston: April 16. ERIC # ED 319-764. Tierney, W G. and E. M. Bensimon. 1996. Promotion and Tenure. Buffalo: SUNY Press. Wilson, R. C. (1986) "Improving faculty teaching: effective use of student evaluations and consultants." Journal of Higher Education, 57 (2), 196-211. |
|
Table of Contents | Issues |
Credits |
©
Copyright 1996-2008. Published by James Rhem & Associates, Inc.
|