University of Bahrain
Scientific Journals

AraBERTopic: A Neural Topic Modeling Approach for News Extraction from Arabic Facebook Pages using Pre-trained BERT Transformer Model

Show simple item record

dc.contributor.author HABBAT, Nassera
dc.contributor.author ANOUN, Houda
dc.contributor.author HASSOUNI, Larbi
dc.date.accessioned 2021-08-22T23:53:35Z
dc.date.available 2021-08-22T23:53:35Z
dc.date.issued 2021-08-23
dc.identifier.issn 2210-142X
dc.identifier.uri https://journal.uob.edu.bh:443/handle/123456789/4504
dc.description.abstract Topic modeling algorithms can better understand data by extracting meaningful words from text collection, but the results are often inconsistent, and consequently difficult to interpret. Enrich the model with more contextual knowledge can improve coherence. Recently, neural topic models have emerged, and the development of neural models, in general, was pushed by BERT-based representations. We propose in this paper, a model named AraBERTopic to extract news from Facebook pages. Our model combines the Pre-training BERT transformer model for the Arabic language (AraBERT) and neural topic model ProdLDA. Thus, compared with the standard LDA, pre-trained BERT sentence embeddings produce more meaningful and coherent topics using different embedding models. Results show that our AraBERTopic model gives 0.579 in topic coherence. en_US
dc.language.iso en en_US
dc.publisher University of Bahrain en_US
dc.rights Attribution-NonCommercial-NoDerivatives 4.0 International *
dc.rights.uri http://creativecommons.org/licenses/by-nc-nd/4.0/ *
dc.subject Neural topic model en_US
dc.subject ProdLDA en_US
dc.subject AraBERT en_US
dc.subject topic coherence en_US
dc.title AraBERTopic: A Neural Topic Modeling Approach for News Extraction from Arabic Facebook Pages using Pre-trained BERT Transformer Model en_US
dc.identifier.doi http://dx.doi.org/10.12785/ijcds/140101
dc.volume 14
dc.contributor.authoraffiliation RITM Laboratory, CED ENSEM Ecole Superieure de Technologie Hassan II University en_US
dc.contributor.authoraffiliation RITM Laboratory, CED ENSEM Ecole Superieure de Technologie Hassan II University en_US
dc.contributor.authoraffiliation RITM Laboratory, CED ENSEM Ecole Superieure de Technologie Hassan II University en_US
dc.source.title International Journal Of Computing and Digital System en_US
dc.abbreviatedsourcetitle IJCDS en_US


Files in this item

The following license files are associated with this item:

This item appears in the following Issue(s)

Show simple item record

Attribution-NonCommercial-NoDerivatives 4.0 International Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivatives 4.0 International

All Journals


Advanced Search

Browse

Administrator Account