Journal Contents
all articles of volume 7 issue 2 | return to Journal Contents
Article of Volume 7, Issue 2, June 2012
The ACODEA framework: Developing segmentation and classification schemes for fully automatic analysis of online discussions
Authors: Karsten Stegmann, Jin Mu, Elijah Mayfield, Carolyn Rosé, Frank Fischer
Abstract: Research related to online discussions frequently faces the problem of analyzing huge corpora. Natural Language Processing (NLP) technologies may allow automating this analysis. However, the state-of-the-art in machine learning and text mining approaches yields models that do not transfer well between corpora related to different topics. Also, segmenting is a necessary step, but frequently, trained models are very sensitive to the particulars of the segmentation that was used when the model was trained. Therefore, in prior published research on text classification in a CSCL context, the data was segmented by hand. We discuss work towards overcoming these challenges. We present a framework for developing coding schemes optimized for automatic segmentation and context-independent coding that builds on this segmentation. The key idea is to extract the semantic and syntactic features of each single word by using the techniques of part-of-speech tagging and named-entity recognition before the raw data can be segmented and classified. Our results show that the coding on the micro-argumentation dimension can be fully automated. Finally, we discuss how fully automated analysis can enable context-sensitive support for collaborative learning.
Keywords: Online discussion, Automatic content analysis, Text classification
Citation: Mu, J., Stegmann, K., Mayfield, E., Rosé, C., & Fischer, F. (2012) The ACODEA framework: Developing segmentation and classification schemes for fully automatic analysis of online discussions. ijcscl 7 (2), pp. 285-305
DOI: 10.1007/s11412-012-9147-y
Preprint: mu_stegmann_mayfield_rose_fischer_7_2.pdf
About this article at link.springer.com [http://dx.doi.org/10.1007/s11412-012-9147-y] including a link to the official electronic version.