10th International Congress on Information and Communication Technology in concurrent with ICT Excellence Awards (ICICT 2025) will be held at London, United Kingdom | February 18 - 21 2025.
Authors - Debjani Mazumder, Jiaul H. Paik, Anupam Basu Abstract - The large volume of online educational materials makes it difficult for learners to find adequate resources for better learning. Understanding these materials relies on identifying key concepts essential for comprehension. Automatic concept extraction is an important task in educational data mining and is similar to keyphrase extraction in Natural Language Processing (NLP). This process helps identify key ideas, organize documents, and build an insightful learning path. We present a probabilistic approach for concept extraction. Candidate concepts are generated using Wikipedia anchor texts. We identify the necessary concepts based solely on the educational context of a particular document using a graph-based probabilistic model. Evaluation of our method on two datasets (namely, a Physics school textbook and Physics articles 3) outperforms existing unsupervised and supervised methods.