Abstract:
The impact of fake news is far-reaching, affecting journalism, the economy, and democracy. In response, there has been a surge
in research focused on detecting and combatting fake news, resulting in the development of datasets, techniques, and fact-verification
methods. One crucial aspect of this effort is the creation of diverse and representative datasets for training and evaluating machine
learning models for fake news detection. This review paper examines the available datasets relevant to detecting fake news, with a
particular emphasis on those available in the Indian context, where few resources exist. By identifying research opportunities and
highlighting existing corpora, this paper aims to assist researchers in improving their fake news detection studies and contributing to
more comprehensive research on the topic. To the best of our knowledge, no survey has specifically focused on accessible corpora in
the Indian context, making this review a valuable resource for researchers in the field.