dc.contributor.author | HABBAT, Nassera | |
dc.contributor.author | ANOUN, Houda | |
dc.contributor.author | HASSOUNI, Larbi | |
dc.date.accessioned | 2021-08-22T23:53:35Z | |
dc.date.available | 2021-08-22T23:53:35Z | |
dc.date.issued | 2021-08-23 | |
dc.identifier.issn | 2210-142X | |
dc.identifier.uri | https://journal.uob.edu.bh:443/handle/123456789/4504 | |
dc.description.abstract | Topic modeling algorithms can better understand data by extracting meaningful words from text collection, but the results are often inconsistent, and consequently difficult to interpret. Enrich the model with more contextual knowledge can improve coherence. Recently, neural topic models have emerged, and the development of neural models, in general, was pushed by BERT-based representations. We propose in this paper, a model named AraBERTopic to extract news from Facebook pages. Our model combines the Pre-training BERT transformer model for the Arabic language (AraBERT) and neural topic model ProdLDA. Thus, compared with the standard LDA, pre-trained BERT sentence embeddings produce more meaningful and coherent topics using different embedding models. Results show that our AraBERTopic model gives 0.579 in topic coherence. | en_US |
dc.language.iso | en | en_US |
dc.publisher | University of Bahrain | en_US |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | * |
dc.subject | Neural topic model | en_US |
dc.subject | ProdLDA | en_US |
dc.subject | AraBERT | en_US |
dc.subject | topic coherence | en_US |
dc.title | AraBERTopic: A Neural Topic Modeling Approach for News Extraction from Arabic Facebook Pages using Pre-trained BERT Transformer Model | en_US |
dc.identifier.doi | http://dx.doi.org/10.12785/ijcds/140101 | |
dc.volume | 14 | |
dc.contributor.authoraffiliation | RITM Laboratory, CED ENSEM Ecole Superieure de Technologie Hassan II University | en_US |
dc.contributor.authoraffiliation | RITM Laboratory, CED ENSEM Ecole Superieure de Technologie Hassan II University | en_US |
dc.contributor.authoraffiliation | RITM Laboratory, CED ENSEM Ecole Superieure de Technologie Hassan II University | en_US |
dc.source.title | International Journal Of Computing and Digital System | en_US |
dc.abbreviatedsourcetitle | IJCDS | en_US |
The following license files are associated with this item: