Comparison of Scoring Reliability and Feedback Content between iWrite and DeepSeek

DOI 10.12208/j.sdr.20250263
Journal
Scientific Development Research
Year, Volume (Issue) 2025, 5(7)
Author
胡婕妤
Affiliation
School of Foreign Languages, Wuhan Textile University, Wuhan, Hubei

Abstract
As large language models (LLMs) continue to improve, teachers increasingly use them to score student essays and provide feedback. How their scoring reliability and feedback quality compare with those of a dedicated automated essay scoring system such as iWrite remains unclear, as does whether they can serve as trustworthy tools for scoring and feedback. To address this question, this study compares the scoring reliability and feedback content of iWrite and DeepSeek on a sample of 46 IELTS essays written by two classes of sophomore art majors in a Sino-foreign cooperative program at a Chinese university. The study aims to offer guidance for teachers and researchers choosing automated scoring and feedback tools.
Keywords
iWrite; DeepSeek; English writing; scoring reliability; feedback
Pages 25-30
Cite this article

胡婕妤. Comparison of scoring reliability and feedback content between iWrite and DeepSeek [J]. Scientific Development Research, 2025, 5(7): 25-30.
