Please use this identifier to cite or link to this item:
http://dspace.aiub.edu:8080/jspui/handle/123456789/1356
Title: | Authorship Attribution for Bengali Language Using the Fusion of N-Gram and Naive Bayes Algorithms |
Authors: | Anisuzzaman, D. M. Salam, Abdus |
Keywords: | naive bayes n gram authorship attribution bengali language natural language processing |
Issue Date: | Oct-2018 |
Publisher: | MECS Press |
Citation: | 11 |
Abstract: | This research shows the authorship attribution for three Bengali writers using both Naïve Bayes method and a new method proposed by us which performs better than Naïve Bayes for authorship attribution. Though a lot of works exist in the field of authorship attribution for other languages (especially English); the amount of work in this field for Bengali language is very low. For this experiment, we make our own dataset having 107380 words and 21198 unique words. For both methods, we pre-process our dataset to be compatible to work with the method experiments. For our dataset, Naïve Bayes gives an accuracy of 86% while our method gives an accuracy of 95%. The main inspiration behind our method is that every author has a nature to write some adjacent words and some single words repeatedly. |
URI: | http://dspace.aiub.edu:8080/jspui/handle/123456789/1356 |
Appears in Collections: | Publications: Journals |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Authorship Attribution for Bengali Language Using the Fusion of N-Gram and Naive Bayes Algorithms (DSpace).docx | 4.55 MB | Microsoft Word XML | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.