Please use this identifier to cite or link to this item: http://dspace.aiub.edu:8080/jspui/handle/123456789/1356
Title: Authorship Attribution for Bengali Language Using the Fusion of N-Gram and Naive Bayes Algorithms
Authors: Anisuzzaman, D. M.
Salam, Abdus
Keywords: naive bayes
n gram
authorship attribution
bengali language
natural language processing
Issue Date: Oct-2018
Publisher: MECS Press
Citation: 11
Abstract: This research shows the authorship attribution for three Bengali writers using both Naïve Bayes method and a new method proposed by us which performs better than Naïve Bayes for authorship attribution. Though a lot of works exist in the field of authorship attribution for other languages (especially English); the amount of work in this field for Bengali language is very low. For this experiment, we make our own dataset having 107380 words and 21198 unique words. For both methods, we pre-process our dataset to be compatible to work with the method experiments. For our dataset, Naïve Bayes gives an accuracy of 86% while our method gives an accuracy of 95%. The main inspiration behind our method is that every author has a nature to write some adjacent words and some single words repeatedly.
URI: http://dspace.aiub.edu:8080/jspui/handle/123456789/1356
Appears in Collections:Publications: Journals



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.