Distance learning, offline presentations (presentations that are pre-recorded rather than delivered live), and other activities whose main goal is to convey information are becoming increasingly relevant with digital media such as Virtual Reality (VR) and Massive Open Online Courses (MOOCs). While MOOCs are a well-established reality in the learning environment, VR is also being used to promote learning in virtual rooms, be it in academia or in industry. These methods are often based on written scripts that guide the learner through the content, making the scripts critical components of these tools. Given this central role, ensuring the effectiveness of these scripts is essential.
Confusion is a non-basic emotion associated with learning. Learning often involves cognitive disequilibrium, caused either by the content itself or by the way it is conveyed, particularly its syntactic and lexical features. We propose a supervised model that predicts the likelihood that an input text excerpt will confuse the learner. To achieve this, we performed syntactic and lexical analyses over 300 text excerpts and collected 5 confusion-level ratings (on a 0–6 scale) per excerpt from a pool of 51 annotators, using the mean rating of each excerpt as its label. The excerpts that compose the dataset were drawn from randomly selected presentation transcripts across various fields of knowledge. The model was trained on this data, and the results are reported in the body of the paper.
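The labeling scheme described above can be sketched as follows; the data layout, function name, and values are illustrative, not the authors' actual pipeline:

```python
# Illustrative sketch of the labeling scheme: each excerpt receives
# 5 confusion ratings on a 0-6 scale, and the mean rating serves as
# the regression label. All data below are hypothetical.

def mean_confusion_labels(ratings_per_excerpt):
    """Map each excerpt's list of ratings to its mean confusion label."""
    labels = []
    for ratings in ratings_per_excerpt:
        # Ratings are assumed to lie on the 0-6 scale from the paper.
        assert all(0 <= r <= 6 for r in ratings)
        labels.append(sum(ratings) / len(ratings))
    return labels

# Toy example: three excerpts, five ratings each.
ratings = [
    [1, 2, 1, 0, 1],   # clear excerpt -> low mean confusion
    [4, 5, 3, 4, 4],   # confusing excerpt -> high mean confusion
    [2, 2, 3, 2, 1],
]
print(mean_confusion_labels(ratings))  # [1.0, 4.0, 2.0]
```

These continuous mean labels are what make a supervised regression-style formulation possible, rather than forcing a single annotator's discrete rating onto each excerpt.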
This model supports the design of clearer scripts for offline presentations and similar formats, and we expect it to improve the effectiveness of such speeches. While the model is applied to this specific case, we hope it paves the way toward generalizing this approach to other contexts where textual clarity is critical, such as MOOC scripts or academic abstracts.