skip to main content
research-article

Optimisation of plagiarism detection using vector space model on CUDA architecture

Published: 01 January 2022 Publication History

Abstract

Plagiarism is a rapidly rising issue among students during submission of assignments, reports and publications in universities and educational institutions, due to easy accessibility of abundant e-resources on the internet. Existing tools become inefficient in terms of time consumption when dealing with the prolific number of documents with large content. Therefore, we have focused on software-based acceleration for plagiarism detection using CPU/GPU. Initially serial version of vector space model was implemented on CPU and tested with 1,000 documents, which consumed 1,641 s. As processing time was a bottleneck of performance, we indented to develop parallel version of the model on the graphics processing units (GPUs) using compute unified device architecture (CUDA) and tested with the same dataset which consumed only 36 s and gained 45x speed up compared to the CPU. Then the version was optimised further and took only 4 s for the same dataset which was 389x faster than the serial implementation.

Index Terms

  1. Optimisation of plagiarism detection using vector space model on CUDA architecture
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image International Journal of Innovative Computing and Applications
        International Journal of Innovative Computing and Applications  Volume 13, Issue 4
        2022
        63 pages
        ISSN:1751-648X
        EISSN:1751-6498
        DOI:10.1504/ijica.2022.13.issue-4
        Issue’s Table of Contents

        Publisher

        Inderscience Publishers

        Geneva 15, Switzerland

        Publication History

        Published: 01 January 2022

        Author Tags

        1. graphics processing units
        2. GPUs
        3. compute unified device architecture
        4. CUDA
        5. plagiarism detection
        6. vector space model
        7. CPU
        8. VSM
        9. parallel computing
        10. speed up
        11. acceleration
        12. idf
        13. web-based commercial tool
        14. kernel
        15. Google Cloud

        Qualifiers

        • Research-article

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • 0
          Total Citations
        • 0
          Total Downloads
        • Downloads (Last 12 months)0
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 05 Jan 2025

        Other Metrics

        Citations

        View Options

        View options

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media