Volume 1, Issue 2, 2007    
       
  Head’s Up: Research and Student Evaluations of Instructor    
       
 

Susan Pass, Department of Teacher Education, Clemson University, USA, spass@clemson.edu

   
       
 

Abstract

Recent research indicates that there appears to be a statistically significant negative impact upon student evaluations of instructors (SEIs) if consistent research is done on those students by the instructor (i.e. one’s students are the subject of one’s research), F (1/15)=12.035, p= .004, N=367.  This finding impacts all professors whose SEIs are included in their review for promotion and tenure.

Introduction

Few SEI rating scales include an item by which students can rate their reaction/concern when their instructor does research on them.  Is it possible, therefore, that students use other items in the scale to report such dissatisfaction?  A brief review of the literature reveals that there are few studies available on the impact that doing research on one's college students has with one's student ratings of professor/course effectiveness.  Yet, an unofficial survey of current and past students of the assistant professor studied in this work indicates that the data in this article is valid and is useful information for those professors going up for tenure and promotion.

The Problem

Data from this case study reveals that, if a professor does research on his/her college students, the SEI ratings of that professor will be low.  Yet, many such professors are urged to do such research not only to improve their programs but to also increase their publications in peer-refereed journals.  Most institutions of higher education include student evaluations of instructor's scores when considering faculty promotion.  While doing research on one's students might result in program improvement and more academic publications, it can also result in a professor not receiving tenure/promotion, because the professor's student evaluation of instructor (SEI) ratings will be too low.  Thus, professors turn to other research that is done outside of their classes.  This might result in college programs not being adequately improved because doing research on current students is the best vehicle towards assessing program needs. The scope of this research prompted two research questions.

Two Research Questions

  • ·What is the effect of doing research on one's students upon SEI ratings?

  • ·Even if no research is being done, does the perception that research is being done on them (i.e., the instructor's students) have a negative impact upon SEI ratings?

Review of the Literature in College Education

Because the assistant professor in this case study used constructivist teaching techniques when doing research ("because it builds the higher-thinking skills, which I was concentrating on in my research") but traditional teaching techniques ("because it is easier and more efficient") when not doing research, this review of the literature is divided into three sections in order to better explain the research questions being investigated.

Student Evaluations of Professors

Some college seniors believe that SEIs are not worth filling out because, instead of reading them and getting rid of bad professors, administrators ignore the data (Mcilwain, 2006).  However, hard-working professors say that SEIs "worry them because they matter in my [department chair's] evaluations and in my raises" (Maines, 2006, State and Regional News).

Since the early 1970s, SEIs have been added to faculty folios for evaluation and promotion (Nilson, 2003).  Since the late 1980s, researchers have recommended that other data (besides SEIs) be included in tenure-and-promotion assessments (Cashin, 1998; Aleamoni, 1999; Arreola, 2000). Included in this would be alumni surveys and candidate exit surveys.  Both Cohen (1981) and Feldman (1998) found in independent studies that student achievement on an external exam correlates "decently with student ratings of teaching effectiveness” (Nilson, 2003, p. 228).  While some universities use SEIs to improve programs (as well as rate professors), Paul Grayson found that students' satisfaction with their professors' teaching had little to do with students' sense of program satisfaction, which was determined mainly by their satisfaction with their program in the previous year (Grayson, 2004).  Finally, Robert Thorson wrote that the best evaluation of college instruction comes long after the student has graduated.  "At this point, they will be mature, objective and grateful enough to decide which professors really mattered, and which did not" (Thorson, p. A11).

Literature on Problems with SEIs

SEIs might be too subjective in nature.  In 1994, Peter Brady found that positive professor attitudes towards their students result in statistically significantly higher student perceptions of professor effectiveness (Brady, 1994).  In 1997, Susan Lang found that student evaluations are statistically significantly higher when a professor has enthusiasm (Lang, 1997).  Researchers found that instructor friendliness and personality relate only weakly to achievement (Nilson, 2003).

However, other researchers have discovered that there are biases that exist and SEIs need to be read with these biases in mind (Nilson, 2003).   Both Cashin (1988) and Aleamoni (1999) state that some of the biases that do exist include instructor status (regular faculty are usually rated higher than adjuncts) and expressiveness.  In addition, they found that, if students' prior interest in the subject was high and the course was an elective, the SEI ratings were higher (Cashin, 1988; Aleamoni, 1999).  It also was discovered that the higher the level of the course, the higher the SEI ratings.  In addition, humanities courses receive higher ratings than social science courses do (Nilson, 2003).  Students also tend to rate more generously if they believe that their ratings will be used for personnel decisions and if they do not have to sign their SEI (Nilson, 2003).

Literature on Professor's Actions and SEIs

Contrary to popular opinion, the more demanding the course, the higher the SEI ratings (Nilson, 2003) but Greenwald and Gilmore found this was only the case if the professor of the demanding course graded leniently (1997). 

In 2006, Thomas Benton wrote that "The steady erosion of tenure and the use of SEIs as a faculty-culling device are turning college teaches into spineless crowd pleasers"(p.1) and ill prepares graduates for the world outside of college.  Michael Gordon (2006) wrote, "Grade inflation has many well-publicized roots, including the reluctance of faculty members to confronting students who have received low grades, or to endanger their careers by getting low ratings from aggrieved students on end-of-semester evaluations (p. 10).  Because of this tendency towards grade inflation plus the documented biases in SEI ratings, Nilson (2003) believes, "relying too much on student ratings to assess teaching effectiveness is …being unsystematic and capricious" and suggests that universities adopt a more comprehensive system for evaluating faculty performance -- especially for purposes of tenure and promotion (p. 228-229).

Data Collection

Because the SEIs and students of only one assistant professor were studied, this study used only education courses in its three data collections.  All 367 students were college upperclassmen. Data was retrieved on an assistant secondary social studies professor's SEI ratings over a period of eleven semesters and four courses.  .  The data was then sorted into two groups; namely, Group A (when research was being done on the college students) and Group B (when research was not being done on the college students). The table documenting all the data sets collected on the professors' SEI ratings is located at the end of this article in Figure One. Use of this data was done in three different collections:

  • First, with all the classes that the assistant professor taught -- see Group A and Group B (page 5)

  • Second, with only those two courses that had exactly the same students in them but in two different semesters -- see Group One and Group Two (pages 6-7).

  • Third, with two groups enrolled in the same course taught in the same semester but in two different sections; namely Group Alpha (who thought research was being done on them when it was not) and Group Beta (who knew that research was not being done on them) to discover if just the perception of research being done on them can impact SEIs (page 7).

Data Findings

After the data was collected, but before the statistics were computed, some interesting facts emerged and the data tables in Figure One at the end of this article will verify the following facts:

  • ·No research was done in the spring of 2001 and the spring of 2004 with Course A. The SEI ratings of those two semesters were significantly higher than the other semesters in which research was done on the students of Course A.

  • Interestingly, this tendency also works for exactly the same students in different courses.  In the spring of 2004, no research was done on the Course A students. However, the following semester, in Course B, research was done. Notice that (with the same students) the SEI ratings are lower in the semester that research was done on those students. Conversely, those same students gave a much higher rating to the professor in the prior semester (when no research was done on them). 

  • Although the SEI ratings were lowest during the first year that the assistant professor taught, the overall SEI ratings of all the assistant professor’s courses seem to have a significant pattern. Notice how much higher SEI ratings are of the same professor in courses C, D, and E (when no research was done on the students) than courses A and B (when research was done on the students).

The Instrument

A factor analysis was done on the SEI instrument (N=307) and there appeared to be two factors; namely, student assessment of instructor effectiveness and student assessment of course (as set up by the instructor) effectiveness.  A Cronbach's alpha of .80 revealed mild reliability (perhaps not high enough for an instrument upon which tenure and promotion is based). 

First Data Analysis and Results

The Impact of Doing Research on One's Students on SEI Ratings: Courses A, B, C and D

First, data was collected on all SEIs for all courses that the assistant professor taught over a period of 5 ½ years (N= 367).  Two groups were considered.  Group A were those classes that no research was done with the students and Group B were those classes where research was done on the students.  Group A gave significantly lower SEI ratings:
_____________________________________________________

Groups                         N                     Mean                  Std. Deviation  

A                                    3                      4.1800             .6286              

B                                  13                     3.1938             .4049                           .

Total                            16                     3.3788             .5847
_____________________________________________________

A one-way T-test between the mean items 1-14 (on the SEI instrument) between the classes that had no research done with them (Group A) and the classes that did (Group B) revealed that the difference in mean scores was statistically significant with 163 students enrolled in 15 classes, t (14) =3.46, p=.004,  (2-tailed), d=.98615:

_______________________________________________________________

Scores                                                  t           df         Sig. (2-tailed)   Mean Difference

Equal variances assumed                       3.469   14        .004                 .98615

Equal variances not assumed                2.596   2.398   .101                .98615

_______________________________________________________________
 

Since these education courses consisted of college upperclassmen specializing in different education programs (e.g., elementary, secondary English, etc.), a one-way ANOVA was computed to take into account those differences.  This statistic revealed that the differences between the two groups were still statistically significant, with lower SEI ratings given in the classes that the professor did research with, F (1/15) =12.035, p= .004:

_______________________________________________________________

Scores                          Sum of Squares            df         Mean Square    F                      Sig.

Between groups            2.370                           1          2.370               12.035           .004

Within groups                 2.758                         14        .197

Total                                 5.128                         15

_______________________________________________________________
 

Second Data Analysis and Results

The Impact of Doing Research on One's Students on SEI Ratings: Courses A and B (Same Students)

To further work with the data, a colleague suggested that data be run only on courses that had exactly the same students.  So, two similar courses that contained exactly the same students enrolled in exactly the same education program (students were required to take course A (methods) their first semester of their senior year followed by course B (student teaching seminar in the second semester of their senior year) were compared. 

Courses A and B data. Data was retrieved on the assistant professor's 164 SEI ratings over a period of eleven semesters with the two courses.  The data was then sorted into two groups; namely, Group One (class sections of courses A and B, when research was not being done on the assistant professor's college students) and Group Two, when research was done on the professor's college students. 

Comparing the means of the two groups revealed statistically significant higher SEIs scores given by the group that had no research done on them (Group One) even though the students were exactly the same in both groups. Conversely, the SEI scores were statistically significantly lower (Group Two) if research was done with those same students:

_______________________________________________________________

Groups                   N                 Mean            Std. Deviation        Std. Error Mean

One                         3                 4.183           .6286                     .3630

Two                       12                3.186           .4218                     .1218

_______________________________________________________________
 

A one-way T-test between the mean items 1-14 (on the SEI instrument) between the classes that had no research done on them (Group One) and the classes that did (Group Two) revealed that the difference in mean scores was statistically significant, t (13) =3.50, p=.005 (2-tailed), d=.9942.  Again, the SEI ratings were higher with students who had no research was being done on them:

_______________________________________________________________

t-test for Equality of Means

Scores                                       t         df       Sig. (2-tailed)         Mean Difference

Equal variances assumed             3.35   13      .005                       .9942

Equal variances not assumed       2.597 2.47   .098                     .9942

_______________________________________________________________
 

Each course consisted of students who were specializing in one of the following subjects: economics, geography, history, political science, psychology or sociology.  Because of this variety of subjects represented among the students being studied, a one-way ANOVA was done.  This revealed that the classes which had research done on them scored the professor statistically significantly lower than the classes which did not have research done on them, F (1/13)=11.224, p= .005:

_______________________________________________________________

Scores                       Sum of Squares      df       Mean Square           F                  Sig.

Between groups      2.372                       1        2.372                     11.224         .005

Within groups           2.747                     13      .211

Total                            5.120                     14

_______________________________________________________________
 

Third Data Analysis and Results

The Impact of Perception that Research is Done on Them on SEI Ratings: (Course B)

In the same spring 2006 course (student teaching capstone), two sections were compared.  Group Alpha was section 001 that believed research was being done on them and Group Beta was section 002 that knew no research was being done on them.  Perception appears to be reality as there were statistically significant higher SEI ratings in the section that knew no research was being done on them and statistically significantly lower SEI ratings in the section that thought research was being done on them, t (25)=8.072, (2-tailed) p=.000, df 23.:

_______________________________________

Groups         N                 Mean            Std. Deviation        

Alpha           12                3.4967                   .2067          

Beta             13                4.2600                   .2604                     .

Total            25                3.8936                   .4527

_______________________________________


_______________________________________________________________

t-test for equality of means

Scores                                       t         df                 Sig. (2-tailed)         Mean Difference

Equal variances assumed             8.072 23      .         000                        .7633

Equal variances not assumed      8.149 22.524       .000                       .7633

_______________________________________________________________
 

Each section consisted of students who were specializing in one of the following subjects: economics, geography, history, political science, psychology or sociology.  Because of this variety of subjects represented among the students being studied, a one-way ANOVA was done.  This revealed that the section which had research done on them still scored the professor statistically significantly lower than the section which knew no research was being done on them, F (1/23)=65.156, p= .000:

_______________________________________________________________

Scores                       Sum of Squares      df       Mean Square           F                  Sig.

Between groups      3.636                      1        3.636                     65.156         .000

Within groups           1.283                     23      5.580E-02

Total                           4.919                     24     

_______________________________________________________________
 

Comments

Surveys were done with the candidates a year after graduation.  One former student wrote that, while he did not appreciate the course while he was in it, he now realizes that what was done was "good" for him.  Nevertheless, he still "resented" the research.  Another student wrote that the students in the class were "under stress" and having research done with them “just added” to the stress.  Another former student stated that he always "resented" when professors were doing research because he felt that he was "used." Finally, a person in charge of our university's research compliance unit wrote, " Please address the discomfort that may be experienced by students in the class regarding how their participation or refusal to participate in the research study (that is, their decision to allow or not to allow their reflective essays to be used for research purposes) might influence the instructor’s interest in them as students and might influence their grade in the course."  When, asked, this administrator sent the following email message to clarify her position:

My statement was just a general statement, reflecting concerns that students might be uncomfortable with being asked to participate in research or might feel they don't truly have a choice to not participate in research conducted by a course instructor. I don't know of published research about negative student reactions to having research conducted in courses. However, it seems to me that there's a widespread belief in the IRB community that students function in an environment that is inherently coercive when it comes to participation in academic research and that care must be taken to reduce even the possible perception of coercion in order to ensure the voluntariness of student participation in research. I don’t know whether this belief is based in research, but I can tell you I've bought into it.  If you know of research that suggests it's not true, I'd be really interested in learning about it.

Limitations

Only one professor's students were used in this study. Second, this study's results might be impacted by the fact that constructivism was used in all research with the college students. It might have been the constructivism and not the actual research that led to negative SEI ratings. A review of the literature states that critical thinking is often an implicit characteristic of higher education (King & Kitchener, 1994; Browne & Freeman, 2000; Mentkowski et a., 2000). One colleague suggested that most college seniors fear their professors and having to use critical thinking (i.e, constructivist) techniques in a college classroom is perceived as too threatening. However, in the third data analysis and results (see pages 8 and 9), the same course was taught in two different sections – one section (Beta) knew that no research was being done one them and one section (Alpha) thought research was being done with them. Both sections were taught with traditional teaching methods; yet, the section that thought research was being done on them reported statistically significantly lower SEI scores. Recent research seems to indicate that college students react negatively on SEIs when constructivist teaching strategies are used because they find them threatening (Nilson, 2003). So, this research plans to duplicate the above study with just traditional means of instruction being done while doing research on one's students in order to compare the results with the above data, that used constructivism in the research.

Conclusion

Given the above data results, it is difficult to recommend research on one's own students who will eventually evaluate your teaching effectiveness. This case study has shown that such action results in lower SEI ratings. This finding is valid with both different students in different courses or the same students in different courses over time. Finally, this study also revealed that just the perception research is being done on them will lead students to give lower SEI ratings. More research needs to be done on this topic.

References

Aleamoni, L.M. (1999). Student ratings myths versus research facts: An update.  Journal of Personnel Evaluation. 13 (2), 11-16.

 

Arreola, R.A. (2000). Developing a Comprehensive Faculty Retention System. Bolton, MA: Anker Publishing.

 

Benton, Thomas. (2006, June 9). A tough-love manifesto for professors. The Chronicle of Higher Education. 52 (4), 1.

 

Brady, Peter. (1994, June). How likeability and effectiveness ratings of college professors by their students are affected by course demands and professors' attitudes.  Psychological Reports 74 (3), 197-204.

 

Browne, M.N. & Freeman, K. (2000). Distinguishing features of critical thinking classrooms.  Teaching in Higher Education.  5 (3), 301-309.

 

Cashin, W.E. (1988). Student ratings of teaching: A summary of the research. IDEA Paper No. 1, Center for Faculty Evaluation and Development, Kansas State University.

 

Cohen, P.A. (1981). Student ratings of instruction and student achievement: A meta-analysis of multi-section validity studies.  Review of Educational Research.  51, 281-309.

 

Fledman, K.A. (1998).  Identifying exemplary teachers and teaching: Evidence from student ratings.  In Teaching and learning in college classrooms.  K.A.Feldman & M.B. Paulsen (Eds.).  Needham Heights, MA: Simon & Schuster.

 

Gordon, Michael. (2006, August 11). When B's are better. The Chronicle of Higher Education. 52 (49), 10.

 

Grayson, J.P. (2004). The relationship between grades and academic program satisfaction. Journal of Higher Education. 24(2), 1-34.

 

Greenwald, A.G. & Gillmore, G.M. (1997). No pain, no gain?  The importance of measuring course workload in student ratings of instruction.  Journal of Educational Psychology. 90 (4), 743-751.

 

Howard, G.S. & Maxwell, S.E. (1982).  Do grades contaminate student evaluations of instructors? Research in Higher Education.  16 (2), 175-188.

 

King, P.M. & Kitchener, K.S. (1994). Developing reflective judgment: Understanding and promoting intellectual growth and critical thinking in adolescents and adults.  San Francisco, CA: Jossey-Bass.

 

Knox, John. (2005, September 2). Professor says rank better from students. The Augusta Chronicle. Metro, Bo5.

 

Lang, Susan.  (1997, fall). Student ratings soar when professor use enthusiasm. Human Ecology Forum. 25 (4), 24-25.

 

Maines, Sophia. (2006, September 18). Task force to review evaluation process. Journal World. State and Regional News section.

 

Mcilwain, Ben. (2006, April 25). The Diamondback. Column.

 

Mentkowski, M., Rogers, G., et al. (2000). Learning that lasts: Integrating learning, development, and performance in college and beyond.  San Francisco, CA: Jossey-Bass.

 

Murray, H.C. & Renaud, R.D. (1998). Disciplinary differences in classroom teaching behaviors.  In Teaching and learning in the college classroom. K.A. Feldman & M.H. Paulsen (Eds.).  Needham Heights, MA: Simon & Schuster.

 

Thorson, Robert M. (2006, August 24). Defining best professors. Hartford Courant. Editorial, A11.
 










   
       
  Return to top.    
 
| Home | Contact UsEditorial Board | Current Issue | Submission |
 
 
© Copyright 2006, Scientific Journals International.  All Rights Reserved.