DISQUS

Plagiarism Today: 5 Steps: How Not to Look Like a Spam Blog

  • Voyagerfan5761 · 1 year ago
    Hmm, that first tip of yours would flag me as spam automatically. That might be why my visitor count in the last month has only been around 1,500. I have been thinking about grabbing my own domain at some point, though. (I hope to set one up by the end of the year, actually. I won't mention the options I'm considering in case someone else would try to register them before I do and then sell one to me for an inordinately high price.)
  • Jonathan Bailey · 1 year ago
    Voyagerfan: I have to be honest and say that, the first time I ran across your site, I had assumed it was spam. I quickly learned otherwise and haven't questioned it since, but the use of blogspot did make me nervous.

    For the record, it was your writing that convinced me it wasn't spam. That is something I should had probably added to this piece but, then again, most people really don't have that much control over their writing style.

    Good idea to not mention the domains, though I doubt there are too many domain prospectors on this site.
  • Steve Imparl · 1 year ago
    Jonathan,

    This is very helpful information. It can sometimes be difficult to spot the splogs. Your pointers give a useful way to to separate the spam from real blogs.

    I create daily news headlines for some of my sites and I am always facing the issue of deciding what is spam and what is not. While creating those brief e-mailings, I will find an article that has a good title and even text that looks like something informative and interesting to my readers. Often, however, upon closer inspection, it becomes clear that the text was scraped from some other site or an article in the MSM, of course, with no attribution.

    In most cases, I will simply not use a link to that article. Unless the text is really extraordinary, it is just too much work to go hunting all over the Web for the original source of the writing. That's unfortunate, but it's the reality. If I know some content has been scraped or plagiarized from another site, I won't use it. Sometimes, it's a close call, but I tend to err on the side of caution in this area because it bugs me.

    One other point that is a huge warning about spam is the absence of any paragraphs or white space in the text. I can't count the number of times I have visited what I thought was a legitimate blog, only to be greated by the default blue-and-white WordPress theme and a column of about 5,000 words of text. As soon as I see that kind of content, I sprint away from that blog (actually, I click the "back" button on my browser as quickly as I can) and go on to the next entry my news aggregator shows me.

    When I first started that practice of dropping those blogs like hot potatoes, I was a little worried that I might be discarding some useful content that was just formatted poorly. While that remains a remote possibility, I console myself by remembering that my readers are looking to me to suggest interesting articles to them; they won't appreciate it if I send them to a sites that really appear to be splogs, due to their awful formatting, even if they aren't.
  • ian in hamburg · 1 year ago
    Wordpress.com is definitely not immune to spam blogs. A few months ago wp.com was awash in a splog shower, a huge dump of spam sites all pushed out there at once.

    But a great feature of wordpress is tag surfing - sort of like an internal stumbleupon where you can click through a selection of blogs that match your selection of tags. If you come across a spam blog, you simply report it to wordpress.com as spam for a modlook and it gets zapped over the following day. During the height of the splog storm I was doing my part, helping to zap 15 - 20 a day - didn't have time for more.
  • Jonathan Bailey · 1 year ago
    Steve: I'm glad that you found the information useful. They are just some broad pointers but do seem to work most of the time, especially when you look at the spam blog above that scraped this post out of sheer stupidity.

    However, your problem with finding original articles is really indicative of the whole problem. The one suggestion I would make is, if you find an article that is suspect, take a quote from it and do a quick Google search, limit yourself to news results if possible. That's how I track down stories for the Copyright 2.0 Show.

    Over all though, I think you have the right idea. You're being careful about it and not intentionally linking to spam sites, that is more than what many do!

    Ian: WordPress is definitely not immune to spam but, compared to Blogspot, has a much higher ration of ham to spam. The team there has been much more effective at stopping spam blogs and their policies don't attract as many bad guys.

    Thank you for doing your part though with the spam blog wave. It seems to have died down there, I suspect the sploggers are now using dedicated servers they set up. It is a much more reliable system for them.

    Thank you again for all that you've done!
  • Voyagerfan5761 · 1 year ago
    @Jonathan (comment 2): Now that I think about it, perhaps the fact that I've stayed hosted on Blog*Spot has contributed to my high (80-90%) bounce rate. People click the result, see it's Blog*Spot, and hit Back, like Steve says he does in comment 3. There is, of course, the question of: "How will changing my domain after a year and a half affect my search engine presence?" But if it leads to lower bounce rates and more readers, I'd gladly undergo the Google Transition Period. Sorry if this is bordering (or just flat-out) off-topic, but any thoughts? I'm pretty sure Blogger sets up correct 301 redirects to properly transfer PageRank and all that.
  • Recording Studio · 1 year ago
    I have revived a personal site with a proper domain and in about two week's time shall launch a revived blog rather than a static site via wordpress. I have every intention of following your advise. I am unlikely to be posting many write ups on plagiarism except what I would consider to be of interest to you and give proper reference as I have been doing through email. I am glad that I have not yet built up any great traffic on my personal blog as I would be in the same trouble as voyagerfan5761.
  • Voyagerfan5761 · 1 year ago
    Just as a response to @Recording Studio: Not only do I have lots of visits through Google, but countless links from Google Groups postings that I put in my signature. That redirection is definitely the way to go; I'm glad Blogger supports it. But I am nervous that when I move I'll lose a lot of traffic. I don't have ads or anything to generate revenue on the site, but I don't think I could stand writing to a vacuum again like it was when I started blogging.

    But enough of my whining. ;-) Good luck launching your site!

    And this will probably be lost to the wind, but is there any chance of you launching OpenID support, Jonathan? Personally, I think it should be built into WordPress by default. (Is it? I'm not super familiar with the application.)
  • Jonathan Bailey · 1 year ago
    Voyagerfan: I really have no clue how Blogspot handles that kind of transfer. I am probably the worst person in the world to ask. My guess would be yes but I can't make any promises.

    However, your bounce rate isn't that unusual for a blog. Most blogs I know have a very high bounce rate as people come, read the article they want and leave. I don't know what PTs is off the top of my head but I know it is well over 50%. I've taken steps to reduce it but it is the nature of the Web.

    RS: Good luck launching your site. However, as I said, don't be too worried about your bounce rate. An extremely high one might be an indication of a larger problem but, let's face it, how many sites do you and I click more than one page on?

    Good luck with your new site, definitely send me an email when it goes live as I'm going to want to subscribe.
  • cybele · 1 year ago
    Voyagerfan - I moved my personal blog from blogspot to my own domain about 2 years ago. I left a redirect message on blogspot and exported all my old posts off (I couldn't get all my comments because they were on haloscan, which I installed long before blogger even HAD a commenting system, that's how oldskool I was). I moved to Expression Engine, a smaller platform, but it had lots of other options that I was looking for in a content management system.

    Anyway, yes, my traffic took a dip for a couple of months, but slowly built back up. It's something to do sooner, not later. If you're going to continue using blogger, it will completely re-create your site for you anyway, so it's pretty much no-muss-no-fuss. So even if you migrate to wordpress or MT, at least folks are with you for that. Feedburner makes the feed move pretty easy as well.
  • Jonathan Bailey · 1 year ago
    Cybele: Thanks for the information. However, that is very oldskool. I can remember when Blogger didn't have comments, but I certainly wasn't using it or any other blog platform then.
  • Karl · 1 year ago
    I noticed something else in that screenshot you posted of the spamblog copying your post. The Categories list shows over 4000 posts "uncategorised" and no actual user-created categories used at all. A genuine blog would (or should) use categories and/or tags; a spam-blog can't use them correctly (or at all) because it's just pasting someone else's feed without looking at or analysing its content.
  • Jonathan Bailey · 1 year ago
    Karl: I kind of file that under the "other defaults" heading. If you can't at least create real categories, don't bother creating a blog. I've at least replaced uncategorized with "Articles" as my default.

    But yes, an excellent point. categories are crucial. You can't have just one (the default) and you can't have eighteen million either. I've seen both and both are clearly spammy in nature...
  • Lee · 1 year ago
    Huh, call me oblivious but I'd never even thought a blog could be spam- it might be because I've never really went looking for them or much blogs outside of the Livejournal realm (call me old-fashioned, I don't care). Now I'm kind of curious to find out more about them. I rather like the fact I came across your site (via a link about the Photobucket issue), thank you for all the info.
  • Jonathan Bailey · 1 year ago
    Lee: No, not oblivious. Actually LJ has done a very good job keeping the spam blogs at bay so the fact you haven't seen anything spammy on the service is actually a good sign. Very welcome for all of the info though, let me know if I can help in any way!
  • Voyagerfan5761 · 1 year ago
    @Cybele: Sounds good. I'm planning to just use Blogger's custom domain feature, and I already have FeedBurner for my site feed, so it sounds like I'm set. Thanks for sharing your experience! There's also WordPress, but I'm looking to use that much later, if ever. I hear the import capabilities are good, though; it even grabs your comments?
  • Jonathan Bailey · 1 year ago
    Voyagerfan: If you follow WPs import instructions, it can grab your comments. I noticed that when I was doing my research on data portability a while back.