Multiple-choice questions (MCQs) are commonly used in educational assessments and professional certification examinations. However, managing vast collections of MCQs presents numerous challenges, including maintaining their quality and relevance. A notable issue in such repositories is the occurrence of conceptually identical questions presented in varied forms. These duplicates, while different in wording, fail to enhance the value of the repository. In this extended abstract, we present our approach for identifying and handling potential duplicate questions in large MCQ databases. Our proposed method involves three primary stages: initial pre-processing of MCQs, calculation of similarity based on Natural Language Processing (NLP) techniques, and a graph-based method for exploring these similarities.

Resolving duplicates in Large Multiple-Choice Questions Repositories / Albano, V.; Firmani, D.; Laura, L.; Mathew, J. G.; Paoletti, A. L.; Torrente, I.. - 3643:(2024), pp. 34-40. ( 20th Conference on Information and Research science Connecting to Digital and Library science Bressanone, Italy ).

Resolving duplicates in Large Multiple-Choice Questions Repositories

Firmani D.;Mathew J. G.;
2024

Abstract

Multiple-choice questions (MCQs) are commonly used in educational assessments and professional certification examinations. However, managing vast collections of MCQs presents numerous challenges, including maintaining their quality and relevance. A notable issue in such repositories is the occurrence of conceptually identical questions presented in varied forms. These duplicates, while different in wording, fail to enhance the value of the repository. In this extended abstract, we present our approach for identifying and handling potential duplicate questions in large MCQ databases. Our proposed method involves three primary stages: initial pre-processing of MCQs, calculation of similarity based on Natural Language Processing (NLP) techniques, and a graph-based method for exploring these similarities.
2024
20th Conference on Information and Research science Connecting to Digital and Library science
Digital Libraries; Data Integration; NLP
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
Resolving duplicates in Large Multiple-Choice Questions Repositories / Albano, V.; Firmani, D.; Laura, L.; Mathew, J. G.; Paoletti, A. L.; Torrente, I.. - 3643:(2024), pp. 34-40. ( 20th Conference on Information and Research science Connecting to Digital and Library science Bressanone, Italy ).
File allegati a questo prodotto
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1739424
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
social impact