Publication Details
Issue: Vol 2, No 11 (2024)
ISSN: 2995-486X
Abstract
We propose a new way to spot search engine spam created by computer programs. Our method looks at how often certain writing styles and genres appear in a text. We use machine learning to automatically identify spam based on these patterns. Experiments show that our method can successfully detect spam created by a common type of text generator called a Markov chain.
Keywords
Search Engine Spam
Spam Detection
Machine Learning
Text Generation
Markov Chains
Spam Filtering Algorithms
Automated Text Generation
Natural Language Processing
Language Patterns
Duplicate Content Detection
Non-Dictionary Features