Recently in Spammers Category

Spammers of the Internet clogging available resources.

  1. Nov21

    The Spam Farms of the Social Web

    Blogs and other social media tools have changed the publishing landscape over the past few years, making it easier than ever to share information with the world. The ease of use and focused attention of the medium has also helped create new opportunities for spammers to automatically generate content, buy links, and get noticed by search engines and other points of aggregation. In this post I will break down the operations of one spam network utilizing social media technologies such as WordPress, Digg, del.icio.us, and more to climb the search results and generate revenue through ads and affiliate programs.

    Last weekend I noticed a Digg submission about weight loss tips had climbed the site's front page, earning a covetous position in the top 5 technology stories of the moment. The 13 sure-fire tips were authored by "Dental Geek" and posted to the "Discount Dental Plan" category on his WordPress blog. Scanning the sidebar links and adjacent content it was obvious this content was out of place on a page optimized for dental insurance. The webmaster of i-dentalresources.com had inserted some Digg bait, seeded a few social bookmarking services, and waited for links and page views to roll in, creating a new node in a spam farm fueled by high-paying affiliate programs and identity collection for resale.

    eBizzSol portfolio snapshot

    The spammer's domain is managed by eBizzSol, a company with fake domain registration information including the address block of a Christian church in Fullerton, California. The dental site is registered to an address in Dhaka, the capital of Bangladesh. Based on the broken English I've found on the network's sites an offshore base of operations would not surprise me. eBizzSol mentions about 200 sites in its portfolio, including real estate, mortgage, casinos, and more. They even advertise a content generation service for SEOs offering six blog posts a month for $75 optimized for specific keywords, including guarantees for blog directory and ping submissions. There are other sources of content generation available for hire online, creating a flow of content republished across a target category optimized for specific terms.

    Follow the money

    Why would someone want to create a site optimized for dental services? A search engine such as Google or Yahoo! discovers the site, indexes its pages, and starts including its content in search results for targeted keywords. Web searchers associate search engine rank with authority on a subject such as lowering an insurance premium or mortgage and generate a large amount of money per action. This particular site is collecting $40 or more per dental plan sold through a dental plan reseller and targeting specific keywords of value and boasts search engine index inclusion of "just a few hours" on its pages.

    The dental terms targeted cost up to $18 a click, offering incentives for top organic search conversion. Below is a price estimate from Google for keyword targeting in the United States.

    Google AdWords pricing
    Search termCPC ($)
    teeth whitening18.66
    sedation dentistry12.80
    cosmetic dentistry12.76
    dental plans9.78
    dental implant6.85
    pediatric dentist6.77
    discount dental plans5.93
    oral surgery4.95
    braces3.39
    cavity1.88

    Gathering links

    Directories

    Yahoo! directory pricing

    This webmaster bought links from the Yahoo! directory, the Microsoft Small Business Directory, Business.com, and a few others, placing a link to their site within targeted categories. They are cheaper than the $1000 links purchased on sites such as the W3C, but these listings are often just as spammy.

    Virality

    Digg sample count

    The article link was submitted to Digg by a user who joined Digg last month yet is already ranked in the top 150. The story received over 900 Diggs and is currently buried. A newly minted user posted to Reddit, posted to Newsvine, and posted to del.icio.us using the same name on each service. Seeding and voting up the content worked, as the blog post made its way to the top story listings on each social news service.

    As of this evening the spam site has 353 inlinks from 212 external pages, mostly due to its viral marketing efforts on social networks. Some social bookmarking users include their bookmarked links in their blog sidebar, creating additional direct links throughout their entire site in addition to the original bookmarking service location. The spam network had successfully spread a piece of content throughout multiple user communities, and onto individual blogs in the process.

    Summary

    Certain topics are especially well suited for baiting the technology-oriented crowds of social news and bookmarking sites. Stories focused on Apple, Firefox, Google, Nintendo, history of computers, top X lists, or the target social site itself are common baiting practices used to attract attention and place a new content node on the map. Opportunists will continue to jump into new networks of influence and promote their own sites, gathering search engine juice even when the brief blip of attention has passed and the crowd moves on to another story of the moment.

    World of Warcraft female human with shovel

    I believe social media accounts are currently available for rent or for sale, rewarding active users with paid placements or account resells in much the same way as a World of Warcraft character might be resold on eBay. Social media sites and search engines need to stay on top of this new form of content creation, continually analyzing data and scrubbing out the dirt. Sites overrun with web spam quickly lose their utility and might be banned from search engines.

    Social media sites continue to change the way we interact with data but expect more activity and content shaping in the future from marketers targeting the social media space for a quick link injection.

  2. May31

    AdSense API enters beta

    AdSense has a new API, allowing users to create and manage AdSense accounts programmatically using SOAP.

    Sounds ideal for all the spam bots creating new scraper pages for asbestos and cancer news. If your bot creates a new bot account and earns over $100, you get $100 too!

    Yes, there are more serious uses such as a reputable blog provider creating an AdSense ID for its members, put I just see the piles of web spam getting worse.

  3. Oct23

    Google spam suite primer

    Google provides a full suite of services for the entry-level blog spammer. There are plenty of legitimate uses for all of these Google services, but Google's market-leading position in search creates a spam ecosystem that inflates corporate revenues, index size, and user data. Google's blog hosting service, Blog*Spot, received a lot of attention this week as blogosphere neighbors threw up their arms in protest of the host, which is like the seedy motel at the edge of town that rents by the-hour. It's cheap and inviting to those who know no better, but those in the know don't want anything to do with it.

    I will describe the Google elements that contribute to a spam farm in an attempt to create more understanding about how your content ends up where you may not want it.

    The host

    Blogger's Blog*Spot hosting is a quick and easy way to create new blogs. It's free, you can post via e-mail, and many people think a Blog*Spot blog is the quickest way into Google's search index since the blog hosting servers might be only a few rows away from the Google crawler and of course Google knows how to find all of the content inside its own system.

    Blogger CAPTCHA

    The image above is a completely automated public Turing test to tell computers and humans apart, commonly referred to by its acronym: CAPTCHA. A CAPTCHA is supposed to be easy for a human to decipher, but difficult for computers using image recognition software.

    Blogger requires users to solve the above CAPTCHA before creating a new blog. Yet the system is bypassed daily and thousands of new blogs are created.

    A simple CAPTCHA can be broken using optical character recognition, the same technology that scans a printed page and converts the words to plain text.

    A common way to bypass a CAPTCHA system is to offer humans a reward for successfully entering the scrambled word. Some sites trade free porn for a CAPTCHA solutions, others hire people in low-income areas of the world to sit in front of a computer and solve CAPTCHAs all day.

    The content

    Google provides a lot of free content for someone to repurpose on their newly created Blog*Spot blog. Search Google's web, news, or blog results for the keyword of your choice and you will receive a list of content sources Google has determined is most relevant to the query. Copying from the top of these results is an easy way for spammers to obtain content already deemed relevant by Google for inclusion in its own pages.

    You will often see spam blogs composed of a group of results including a title, link, and except for targeted keywords. These pages are meant to attract search referrals for advertising or create more pages linking to a site the spammer would like to promote.

    Google blog search is the newest Google search service with relevant content available for scraping. Many of the cries from bloggers over the past week were most likely a result of a spammer using a script to retrieve the top search results on Google's blog search ranked by relevance for inclusion on a newly created Blog*Spot blog.

    The payout

    Google AdWords places text advertisements across the web related to the textual content of a page. Every time someone clicks on a Google text ad for "refinance" it costs the advertiser over $35 and makes the site owner some money. "Vioxx" pays about $16.50 a click, "poker" pays about $2.50 a click, and "camcorder" pays about $2.60 a click on Google's advertising network. The newly created blog can make money from these advertisements based on how many people are searching for their targeted keyword, the likelihood of a visitor to click on an ad, and the payout for such keywords.

    Automation

    The above process becomes even easier through the use of automated tools for blog creation, content retrieval, and advertising placement. More expensive tools include the use of pre-configured Blog*Spot blogs for a quick start.

    Conclusion

    Free web hosts have hidden costs. You don't have friendly neighbors and it's possible that search engines will not want to help others discover your area of the web.

    Google has taken more steps to protect its e-mail service, Gmail, from spammers than it has taken them away from Blog*Spot. There is a lot more that Google can do to reduce spam, reduce click fraud, and improve their Blogger service, but it might involve losing some advertising revenue in the short-term. I think no company in the business of content generation, indexing, or payment can afford to ignore the problem.

  4. Jun29

    Microsoft bidding on Claria?

    The New York Times reports Microsoft is currently in talks to acquire Claria for $500 million. Claria, formerly known as Gator, is known for its software installed on Windows computers to track browsing behavior and serve personalized advertisements based on this acquired user behavior. The article reports MSN is very interested in personalization technologies and the increased advertising revenues they provide and is pursuing companies in the space in an attempt to close the gap on Google.

    I am not a big fan of the methods used by Claria to deliver personalized listings. I think MSN could accomplish similar tracking behavior as an option in MSN Messenger or a toolbar before poisoning their evil empire image with Claria.

    Tags: ,

  5. Oct25

    Shirky spam?

    Over the last few days I received multiple e-mails about a message submitted using an e-mail address I use only on my weblog and it's feeds. The interesting thing is it appears this e-mail address was used to send a message to Clay Skirky's NEC list. It looks like a spammer might be crawling weblogs specifically to spam weblog mailing lists and it's the first time I have observed such a thing. Watch your inbox for a message from "nec-bounces@shirky.com" for approval of a message subject of "Re:" and you might observe part of a trend.
  6. Jul28

    MailFrontier tests phishing

    MailFrontier showed 1,000 consumers examples of phishing e-mails as well as legitimate e-mails. The respondents fell for the phishing message 28 percent of the time. Take the sample quiz and see how you score. (via Slashdot) "About 20 percent of the Web sites devoted to stealing information are hosted in South Korea; another 16 percent are in China, and 7 percent are in Taiwan."
  7. May31

    NY Times: When Software Fails to Stop Spam, It's Time to Bring In the Detectives

    Monday's New York Times article about Microsoft's fight against junk e-mail senders. I receive over 500 spam messages a day and would love to do something to stop the senders. "Microsoft's two-year-old digital integrity unit - which also fights online fraud, identity theft and spyware - employs more than 100 people around the world and has an annual budget of more than $10 million." "In the last 15 months, Microsoft has filed 53 civil cases against spammers."
  8. Mar17

    Sam Ruby on Wiki spam

    Sam Ruby has observed Wiki spammers reaching his sites through Google crawls. The WikiSpam has begun.

Niall Kennedy Niall Kennedy is a web technologist in San Francisco, California in the United States. I am very interested in the world of... MORE »

Search this weblog:

Subscribe:

Latest feature: Widget development

Archives: Popular Categories

Sites: More from Niall