DOI: 10.1145/1571941.1572090
poster

Is spam an issue for opinionated blog post search?

Published: 19 July 2009

Abstract

In opinion-finding, the retrieval system is tasked with retrieving not just relevant documents, but those that also express an opinion towards the query target entity. This task has been studied in the context of the blogosphere by groups participating in the 2006-2008 TREC Blog tracks. Spam blogs (splogs) are thought to be a problem on the blogosphere. In this paper, we investigate the extent to which spam has affected the participating groups' retrieval systems over the three years of the TREC Blog track opinion-finding task. Our results show that spam can be an issue, with most systems retrieving some spam for every topic. However, removing spam from the rankings does not markedly change the relative performance of opinion-finding approaches.
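The comparison described in the abstract (remove spam from each system's ranking, then check whether the relative ordering of systems changes) can be illustrated with a minimal sketch. This is not the authors' code or evaluation setup: the function names, the three toy systems, and the document identifiers are all hypothetical, average precision stands in for whatever measures the paper reports, and Kendall's tau is one common way to compare two system orderings.

```python
from itertools import combinations


def average_precision(ranking, relevant):
    """Average precision of one ranked list against a set of relevant doc ids."""
    if not relevant:
        return 0.0
    hits, total = 0, 0.0
    for rank, doc in enumerate(ranking, start=1):
        if doc in relevant:
            hits += 1
            total += hits / rank
    return total / len(relevant)


def remove_spam(ranking, spam):
    """Drop known spam documents while preserving the order of the rest."""
    return [doc for doc in ranking if doc not in spam]


def kendall_tau(scores_a, scores_b):
    """Kendall's tau-a between two score dicts keyed by system name.

    Tied pairs count as neither concordant nor discordant.
    """
    pairs = list(combinations(scores_a, 2))
    concordant = discordant = 0
    for s, t in pairs:
        product = (scores_a[s] - scores_a[t]) * (scores_b[s] - scores_b[t])
        if product > 0:
            concordant += 1
        elif product < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)


# Invented toy data for a single topic (NOT results from the paper).
relevant = {"d1", "d3", "d5"}   # opinionated, relevant posts
spam = {"s1", "s2"}             # posts from known splogs
runs = {
    "sysA": ["d1", "s1", "d3", "d5"],
    "sysB": ["s1", "s2", "d1", "d3"],
    "sysC": ["s2", "d5", "s1", "d2"],
}

before = {name: average_precision(r, relevant) for name, r in runs.items()}
after = {
    name: average_precision(remove_spam(r, spam), relevant)
    for name, r in runs.items()
}
tau = kendall_tau(before, after)

print("AP before spam removal:", before)
print("AP after spam removal: ", after)
print("Kendall's tau between system orderings:", tau)
```

In this toy example every system scores higher once spam is removed, yet the ordering of the systems is unchanged, so tau is 1.0. A tau well below 1 on real runs would indicate that spam removal reshuffles the systems.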

References

[1] P. Kolari, A. Java, and T. Finin. Characterizing the Splogosphere. In Proceedings of 3rd WWE Workshop at WWW'06, Edinburgh, UK, 2006.
[2] I. Ounis, C. Macdonald, and I. Soboroff. On the TREC Blog Track. In Proceedings of ICWSM-2008, Seattle, USA, 2008.
[3] I. Ounis, C. Macdonald, and I. Soboroff. Overview of TREC-2008 Blog track. In Proceedings of TREC-2008, Gaithersburg, USA, 2009.
[4] C. Macdonald and I. Ounis. The TREC Blog06 Collection: Creating and Analysing a Blog Test Collection. DCS Technical Report TR-2006-224. Univ. of Glasgow. 2006. http://www.dcs.gla.ac.uk/~craigm/publications/macdonald06creating.pdf


    Published In

    SIGIR '09: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
    July 2009
    896 pages
    ISBN:9781605584836
    DOI:10.1145/1571941

    Publisher

    Association for Computing Machinery

    New York, NY, United States


    Author Tags

    1. blogs
    2. opinion-finding
    3. spam
    4. splogs

    Qualifiers

    • Poster

    Conference

    SIGIR '09

    Acceptance Rates

    Overall Acceptance Rate 792 of 3,983 submissions, 20%
