医药类院校教师教学水平学生评教的多元概化分析

程楠; 邓皓远; 殷建忠; 吴蒙; 罗媛; 孟琼

doi:10.12259/j.issn.2095-610X.S20220702

医药类院校教师教学水平学生评教的多元概化分析

doi: 10.12259/j.issn.2095-610X.S20220702

程楠¹,
邓皓远¹,
殷建忠²,
吴蒙³,
罗媛⁴,
孟琼^1, ,

1.
昆明医科大学公共卫生学院，云南昆明　650500
2.
保山中医药高等专科学校，云南保山　678000
3.
南京医科大学公共卫生学院，江苏南京　210000
4.
贵州医科大学公共卫生学院，贵州贵阳　550025

基金项目: 云南省教育厅教研教改项目（JG2018083）；昆明医科大学教研教改重点特色项目（2022-JY-Z-03）

详细信息

作者简介:
程楠（1998～），女，湖北武汉人，在读硕士研究生，主要从事现代测量理论的应用工作

通讯作者:
孟琼，E-mail： mengqiong@kmmu.edu.cn

中图分类号: G642
计量
- 文章访问数: 2838
- HTML全文浏览量: 1852
- PDF下载量: 78
- 被引次数: 0
出版历程
- 收稿日期: 2022-04-11
- 刊出日期: 2022-07-25

Multivariate Generalization Analysis for Students’ Evaluation on Teaching Level of Teachers in Medical Colleges

1.
Institute of Public Health，Kunming Medical University，Kunming Yunnan 650500
2.
Baoshan College of Traditional Chinese Medicine，Baoshan Yunnan 678000
3.
School of Public Health，Nanjing Medical University Nanjing，Jiangsu 210000
4.
School of Public Health，Guizhou Medical University，Guiyang Guizhou 550025，China

摘要

摘要: 目的采用多元概化理论评价《医药类院校教师课堂教学水平学生评价量表》信度的同时对各维度条目数优化提出建议，并确定学生评教实践中适宜的学生人数。方法收集整理通过该问卷调查的某医科大学422名学生数据，使用mGENOVA进行多元概化分析。先在G研究中估计各种误差来源的方差分量，然后实施一系列改变条目数和学生数D研究获得不同情况下信度系数以评价量表信度。结果 G研究中每个领域均呈现学生嵌套于教师的方差分量最大。D研究中，在领域水平，除了教学组织和教学方法2个领域外，其余领域的概化系数和可靠性指数均大于0.80；在总量表水平，合成概化系数和合成可靠性指数均高于0.85。保证可靠性指数在0.80及以上的前提下，每班至少抽取的学生数为25人；保证概化系数在0.80及以上的前提下，每班至少抽取的学生数为28人。结论基于多元概化分析此量表总体上有很好的信度，若下一步需要修订可考虑在教学组织和教学方法2个领域进行内容调整，在高校学生评教实践中各班抽取28名学生来进行调查最合适。
- 多元概化理论 /
- 学生评教 /
- 教学水平 /
- 量表
Abstract: Objective To evaluate the reliability of the Student Evaluation Scale for the Teaching Level of Teachers in Medical Universities （SESTLTMU） and determine the appropriate number of students in the teaching evaluation based on Multivariate Generalizability Theory （MGT） . Methods The data of 422 students from a medical university who were surveyed by this scale were collected and analyzed by using mGENOVA, a special software of multivariate generalizability theory. The variance components of various error sources were estimated in Generalizability Study （G-study）, and then several Decision Studies （D-studies） with varying numbers of items and numbers of students were analyzed to obtain reliability coefficients including generalizability coefficient （G） and the indexes of dependability （Ф） in order to evaluate the reliability of the scale. Results In the G-study, the most prominent variation in every domain was introduced by student nested in teacher effect. In the D-study, at the level of domain, the G coefficients and the Ф coefficients for three of the five domains were approximately equal to or greater than 0.80, except for the teaching organization domain and teaching method domain （> 0.70 but < 0.80）. For the overall scale, the compositeG and composite Ф coefficients were larger than 0.85. Under the premise that the Ф is 0.80 or above, the minimum number of students selected from each class should be 25. Under the premise that the G is 0.80 or above, the minimum number of students selected from each class should be 28. Conclusions The scale has good reliability as a whole based on the results of MGT. If this scale needs to be revised in the future, it can be considered to adjust the content in the teaching organization domain and the teaching method domain. It is the most appropriate to select 28 students from each class for investigation in the practice of teaching evaluation by university students.
- Multivariate generalizability theory /
- Teaching evaluation by students /
- Teaching level /
- Scale

HTML全文

表 1 各领域方差及协方差分量估计

Table 1. The estimated variance-covariance components for every domain

效应	教学组织	教学内容	教学方法	教学态度	教学效果
t	0.0200	1.0282	1.0582	1.0586	1.0764
	0.0262	0.0324	1.0068	1.0223	1.0118
	0.0333	0.0404	0.0497	0.9881	1.0431
	0.0243	0.0299	0.0358	0.0264	1.0172
	0.0325	0.0389	0.0496	0.0353	0.0455
s:t	0.2235
	0.1911	0.1908
	0.2055	0.1976	0.2648
	0.1726	0.1643	0.1820	0.1901
	0.2112	0.2020	0.2686	0.1991	0.3510
i	0.0142
		0.0138
			0.0774
				0.0012
					0.0022
ti	0.0063
		0.0008
			0.0344
				0.0073
					0.0031
si:t	0.2083
		0.1472
			0.3348
				0.1083
					0.1697
对角线上加粗标注的值为各效应的方差分量，对角线以上的值是典型相关系数，而对角线以下值是各个领域的协方差分量。

下载: 导出CSV

表 2 基于原始测量长度条件下多元D研究结果

Table 2. D-study results for design based on original test length

指标	教学组织	教学内容	教学方法	教学态度	教学效果	总量表
指标	$n'_i =5$	$n'_i =7$	$n'_i=8 $	$n'_i =7$	$n'_i =3$	$n'_i=33 $
$\sigma^2_P $	0.0200	0.0324	0.0497	0.0264	0.0450	0.0356
$\sigma_{\delta}^2 $	0.0049	0.0030	0.0085	0.0039	0.0058	0.0033
$\sigma_\Delta^2$	0.0078	0.0050	0.0182	0.0041	0.0061	0.0040
$\sigma_{X_Pl}^2 $	0.0078	0.0091	0.0213	0.0062	0.0106	0.0085
G	0.8023	0.9145	0.8535	0.8720	0.8878	0.9152
Ф	0.7203	0.8664	0.7318	0.8671	0.8816	0.8981
$\sigma^2_P $全域分数方差， $\sigma_{\delta}^2 $：相对误差方差， $\sigma_\Delta^2 $：绝对误差方差， $\sigma_{X_Pl}^2 $ ：用样本均数来估计全域分数时的误差方差，G：概化系数，Ф：可靠性指数。

下载: 导出CSV

表 3 各个领域的领域条目数比例与方差贡献率间比较

Table 3. Comparison between the CRCUS and the PDS in every domain

指标	教学组织	教学内容	教学方法	教学态度	教学效果
条目数	5	7	8	7	6
领域条目数比例/权重系数（%）	15.15	21.21	24.24	21.21	18.18
领域全域分数对合成全域分数的方差贡献率（%）	11.79	20.26	28.76	18.29	20.89
方差贡献率与领域条目数比例间的绝对差（%）	−3.36	−0.95	4.52	−2.92	2.71
方差贡献率与领域条目数比例间的相对差（%）	−22.19	−4.49	18.64	−13.78	14.89
绝对差 = 方差贡献率−领域条目数比例；相对差 = （方差贡献率−领域条目数比例）/领域条目数比例×100%。

下载: 导出CSV

表 4 不同测量长度下各领域及共性量表的两信度系数间比较

Table 4. Comparison of two reliability coefficients of every domains and universe under different test length

领域	条目数			概化系数（G）			可靠性指数（Ф）
领域	模型1	模型2	模型3	模型1	模型2	模型3	模型1	模型2	模型3
教学组织	5	6	7	0.8023	0.8123	0.8196	0.7203	0.7411	0.7567
教学内容	7	6	4	0.9145	0.9128	0.9069	0.8664	0.8574	0.8272
教学方法	8	9	10	0.8535	0.8615	0.8681	0.7318	0.7497	0.7646
教学态度	7	6	4	0.8720	0.8660	0.8457	0.8671	0.8603	0.8376
教学效果	6	5	3	0.8878	0.8847	0.8724	0.8816	0.8773	0.8605
总量表	33	32	28	0.9152	0.9135	0.9088	0.8981	0.8937	0.8818

下载: 导出CSV

表 5 不同样本下各领域及共性量表的两信度系数间比较

Table 5. Comparison of the two reliability coefficients of every domains and universe under different samples size

样本模型	合计样本数	概化系数（G）	可靠性指数（Ф）
模型A	420	0.9152	0.8981
模型B	281	0.8815	0.8600
模型C	212	0.8524	0.8376
模型D	140	0.7944	0.7815
模型E	450	0.9289	0.9112
模型F	300	0.9010	0.8844
模型G	150	0.8264	0.8124
模型H	140	0.8168	0.8031
模型I	135	0.8115	0.7980
模型J	125	0.8000	0.7868
模型K	100	0.7633	0.7513

下载: 导出CSV

参考文献(14)

[1]	本书编委会. 全国普通高校本科教育教学质量报告（2020年度）[M]. 北京: 高等教育出版社, 2021: 1-264.
[2]	陈银燕. 高校发展性评价体系构建:教师和机构的双维度评价[J]. 内蒙古师范大学学报(教育科学版),2016,29(03):76-78.
[3]	Debroy A,Ingole A,Mudey A. Teachers’ perceptions on student evaluation of teaching as a tool for faculty development and quality assurance in medical education[J]. Educ Health Promot,2019,8:218-225.
[4]	Constantinou C,Wijnen-Meijer M. Student evaluations of teaching and the development of a comprehensive measure of teaching effectiveness for medical schools[J]. BMC Med Educ,2022,22(1):113. doi: 10.1186/s12909-022-03148-6
[5]	黎光明,甄锋泉,王幸君,等. 多元概化理论在教育测量与评价中的多维化分析[J]. 教育测量与评价(理论版),2016,180(2):13-17.
[6]	孟琼,张美霞,陈莹,等. 医科院校教师教学水平学生评价量表的信度效度分析[J]. 卫生软科学,2016,30(7):46-48+53. doi: 10.3969/j.issn.1003-2800.2016.07.012
[7]	张志明, 张雷. 测评的概化理论及其应用[M]. 北京: 教育科学出版社, 2003: 52-53.
[8]	Nicaise V,Bois J E,Fairclough S J,et al. Girls’ and boys’ perceptions of physical education teachers' feedback:effects on performance and psychological responses[J]. Sports Sci,2007,25(8):915-926. doi: 10.1080/02640410600898095
[9]	Wolbring T,Riordan P. How beauty works. Theoretical mechanisms and two empirical applications on students’ evaluation of teaching[J]. Soc Sci Res,2016,57:253-272. doi: 10.1016/j.ssresearch.2015.12.009
[10]	Doubleday A F,Lee L M. Dissecting the voice:Health professions students’ perceptions of instructor age and gender in an online environment and the impact on evaluations for faculty[J]. Anat Sci Educ,2016,9(6):537-544. doi: 10.1002/ase.1609
[11]	Briesch A M,Swaminathan H,Welsh M,et al. Generalizability theory:A practical guide to study design,implementation,and interpretation[J]. Sch Psychol,2014,52(1):13-35. doi: 10.1016/j.jsp.2013.11.008
[12]	Vispoel W P,Morris C A,Kilinc M. Applications of generalizability theory and their relations to classical test theory and structural equation modeling[J]. Psychol Methods,2018,23(1):1-26. doi: 10.1037/met0000107
[13]	Keller L A,Clauser B E,Swanson D B. Using multivariate generalizability theory to assess the effect of content stratification on the reliability of a performance assessment[J]. Advances in Health Sciences Education,2010,15(5):717-733. doi: 10.1007/s10459-010-9233-8
[14]	Ibrahim A M. Using generalizability theory to estimate the relative effect of class size and number of items on the dependability of student ratings of instruction[J]. Psychol Rep,2011,109(1):252-258. doi: 10.2466/03.07.11.PR0.109.4.252-258

相关文章(20)

[1]	崔继华, 李抒瑾, 宋玉, 杨艳飞, 孙艳会, 孙晶晶, 凌昱. 气质特点在注意缺陷多动障碍儿童中的预测价值, 昆明医科大学学报. doi: 10.12259/j.issn.2095-610X.S20230518
[2]	周继萍, 张俊玲, 刘青, 李兴梅, 范沛友, 姜和玲. “模块化教学法”在肾内科本科实习护生带教中的实践与优化, 昆明医科大学学报. doi: 10.12259/j.issn.2095-610X.S20220523
[3]	张江, 杨秉坤, 平文波, 赵喜娟, 苏艳, 吴江. 基于目标教学理论的雨课堂联合情景模拟在放射治疗科实习护生带教中的应用, 昆明医科大学学报. doi: 10.12259/j.issn.2095-610X.S20221123
[4]	武春桃, 林珂. 日间手术患者健康素养评估量表的构建与应用及其对早期术后恢复质量的影响, 昆明医科大学学报. doi: 10.12259/j.issn.2095-610X.S20220515
[5]	阮艳琴, 李茂涓, 和丽梅, 宋莹, 黄巧云, 陈莹, 刘畅. 以患者为主的炎症性肠病患者PRO量表特异模块条目筛选, 昆明医科大学学报. doi: 10.12259/j.issn.2095-610X.S20220310
[6]	宋肖肖, 白珊, 杨玥娜, 杨兵兵, 李赛仙, 蔺应辉, 马喆, 曾加, 屈凡伟. 来华留学生对全英文授课教学服务满意度量表的信度和效度分析----以昆明医科大学为例, 昆明医科大学学报. doi: 10.12259/j.issn.2095-610X.S20210337
[7]	孙春意, 卿清, 李岱株, 许长俊, 黄娅娟, 刘洋. 自我导向学习理论结合翻转课堂对《妇产科学》教学过程中SRSSDL评分及理论成绩的影响, 昆明医科大学学报.
[8]	谭睿璟, 廖芮, 和丽芬, 王俊瑛, 丁莉, 刘敏, 李红梅. 模块化+项目驱动教学法在医学生信息素养教学中的应用, 昆明医科大学学报.
[9]	杨琳琳, 杨宏英, 张磊, 赵凌锋, 王薇, 谭树芬. 布卢姆教育目标分类学在妇产科理论课教学中的应用, 昆明医科大学学报.
[10]	雷雯, 吴文娟, 李振坤, 李晶, 杨梅娟, 董昭兴. 参与式教学在MBBS学生见习教学中的效果评价, 昆明医科大学学报.
[11]	丁哲宇, 寸英丽, 查勇, 代佑果, 张文, 张中红, 杜锋, 万崇华. 生命质量测定量表体系之胃癌量表 (QLICP-STV2.0) 的修订, 昆明医科大学学报.
[12]	张曦鸿. 昆明市471例老年慢性牙周炎口腔健康相关生活质量的调查, 昆明医科大学学报.
[13]	罗赛美. 癌症患者生命质量测定量表体系之前列腺癌量表QLICP-PR的条目筛选, 昆明医科大学学报.
[14]	周艺. 135名低出生体重儿的贝利婴幼儿发展量表测试分析, 昆明医科大学学报.
[15]	张晓磬. 慢性病患者生命质量测定量表体系之骨关节炎量表的研制及考评, 昆明医科大学学报.
[16]	张晓磬. 慢性病患者生命质量测定量表体系之骨关节炎量表的研制及考评, 昆明医科大学学报.
[17]	张丹霞. 大学生抑郁自评量表（SDS）调查结果因子分析, 昆明医科大学学报.
[18]	孟琼. 胃癌患者生命质量测定量表EORTC QLQ-STO22中文版的制定和评价, 昆明医科大学学报.
[19]	唐新明. 师资队伍状况对高等医学教学水平的影响, 昆明医科大学学报.
[20]	. 概化理论在《医学统计学》期末考试成绩评估中的应用研究, 昆明医科大学学报.

施引文献

资源附件(0)

访问统计

点击查看大图

效应	教学组织	教学内容	教学方法	教学态度	教学效果
t	0.0200	1.0282	1.0582	1.0586	1.0764
	0.0262	0.0324	1.0068	1.0223	1.0118
	0.0333	0.0404	0.0497	0.9881	1.0431
	0.0243	0.0299	0.0358	0.0264	1.0172
	0.0325	0.0389	0.0496	0.0353	0.0455
s:t	0.2235
	0.1911	0.1908
	0.2055	0.1976	0.2648
	0.1726	0.1643	0.1820	0.1901
	0.2112	0.2020	0.2686	0.1991	0.3510
i	0.0142
		0.0138
			0.0774
				0.0012
					0.0022
ti	0.0063
		0.0008
			0.0344
				0.0073
					0.0031
si:t	0.2083
		0.1472
			0.3348
				0.1083
					0.1697
对角线上加粗标注的值为各效应的方差分量，对角线以上的值是典型相关系数，而对角线以下值是各个领域的协方差分量。

表(5)

计量

文章访问数: 2838
HTML全文浏览量: 1852
PDF下载量: 78
被引次数: 0

姓名
邮箱
手机号码
标题
留言内容
验证码

留言板

医药类院校教师教学水平学生评教的多元概化分析

doi: 10.12259/j.issn.2095-610X.S20220702

作者简介:
程楠（1998～），女，湖北武汉人，在读硕士研究生，主要从事现代测量理论的应用工作

通讯作者:
孟琼，E-mail： mengqiong@kmmu.edu.cn

计量

Multivariate Generalization Analysis for Students’ Evaluation on Teaching Level of Teachers in Medical Colleges

计量

目录

留言板

医药类院校教师教学水平学生评教的多元概化分析

doi: 10.12259/j.issn.2095-610X.S20220702

作者简介: 程楠（1998～），女，湖北武汉人，在读硕士研究生，主要从事现代测量理论的应用工作

通讯作者: 孟琼，E-mail： mengqiong@kmmu.edu.cn

计量

出版历程

Multivariate Generalization Analysis for Students’ Evaluation on Teaching Level of Teachers in Medical Colleges

计量

出版历程

目录

作者简介:
程楠（1998～），女，湖北武汉人，在读硕士研究生，主要从事现代测量理论的应用工作

通讯作者:
孟琼，E-mail： mengqiong@kmmu.edu.cn