Upcoming Events

Cloud Connect
Santa Clara
Feb 13-16, 2012

Cloud Connect brings together the entire cloud eco-system to better understand the transformation we're experiencing and promises to be the defining event of the cloud computing industry. Learn about the latest cloud technologies and platforms from thought leaders in Cloud Connect’s comprehensive conference.

Register Now!

More Events »

Subscribe to Newsletter

  • Keep up with all of the latest news and analysis on the fast-moving IT industry with Network Computing newsletters.
Sign Up

Email Email  Print  Share


Filters Take a Bite out of SPAM

Tags: , , , , , , , , , , , , , ,

Channel: Other, Data Protection

Weighty Matters

Because our weighted accuracy rating determined the products that made it into this review, it's important for you to understand our definition of accuracy. We used both false positives and false negatives to determine an accuracy score for each spam filter because both measurements represent classification mistakes. But because false positives are more costly to your organization than false negatives, we took our accuracy ratings a step further by weighting each false positive by a factor of five (for our definitions of false positives and negatives and other spam-related terms, see Glossary). We include the nonweighted accuracy in our table (page 62) for comparison but used the weighted ratings to determine which vendor would make the final cut.


The Long List
Click to Enlarge

Note that our weighted accuracy scores are lower than the accuracy ratings published by antispam vendors. This is due, in part, to our giving more weight to false positives. In addition, procedural issues had a larger effect on some products than others. For example, Postini complained (after the fact) that our test methodology caused it an unduly large number of false negatives because its transport heuristics were rendered useless. Postini uses transport heuristics to examine the content of the SMTP conversation prior to the data command in the SMTP protocol and drops up to 30 percent of inbound SMTP connections as spam before any message content is received. Because our messages were mirrored from our production e-mail server, Postini's transport heuristics didn't come into play, forcing its content filters to do 100 percent of the spam detection. Likewise, vendors that rely on customer training for their Bayesian engines fared worse than vendors with Bayesian engines that ship with an extensive pretrained database.

Let's Talk SPAM
Join us Tuesday and Thursday (May 18th and 20th) at 12:30pm eastern to talk live with Ron Anderson about his recent review of 35 Anti-Spam hardware and software solutions.

Another reason our accuracy numbers are lower than the vendors' is because their stats look at only part of the picture and are based on best-case scenarios. Vendors usually report their tuned catch rate, which counts only true positives and reflects customer-specific tuning to help increase accuracy, or their false-positive rate. For example, Brightmail reports its product to be 99.9999 percent accurate based on its claim of 1 in 1,000,000 false positives, with no reference to false negatives.


Accuracy Test Results

Click to Enlarge

Finally, our test bed used real e-mail directed to NETWORK COMPUTING editors, including scads of press releases, HTML-formatted industry newsletters and other spammy-looking legitimate missives that are tough to analyze correctly. Remember that this is a point-in-time test that emphasized out-of-the-box performance and defined accuracy in a certain way--your mileage may vary.


Page:  1 | 2 |3 |4 |5 |6 |7 |8 |9 |10 |Next Page »

Related Reading


More data-protection Insights



Currently we allow the following HTML tags in comments:

Single tags

These tags can be used alone and don't need an ending tag.

<br> Defines a single line break

<hr> Defines a horizontal line

Matching tags

These require an ending tag - e.g. <i>italic text</i>

<a> Defines an anchor

<b> Defines bold text

<big> Defines big text

<blockquote> Defines a long quotation

<caption> Defines a table caption

<cite> Defines a citation

<code> Defines computer code text

<em> Defines emphasized text

<fieldset> Defines a border around elements in a form

<h1> This is heading 1

<h2> This is heading 2

<h3> This is heading 3

<h4> This is heading 4

<h5> This is heading 5

<h6> This is heading 6

<i> Defines italic text

<p> Defines a paragraph

<pre> Defines preformatted text

<q> Defines a short quotation

<samp> Defines sample computer code text

<small> Defines small text

<span> Defines a section in a document

<s> Defines strikethrough text

<strike> Defines strikethrough text

<strong> Defines strong text

<sub> Defines subscripted text

<sup> Defines superscripted text

<u> Defines underlined text

Network Computing encourages readers to engage in spirited, healthy debate, including taking us to task. However, Network Computing moderates all comments posted to our site, and reserves the right to modify or remove any content that it determines to be derogatory, offensive, inflammatory, vulgar, irrelevant/off-topic, racist or obvious marketing/SPAM. Network Computing further reserves the right to disable the profile of any commenter participating in said activities.

 
Disqus Tips To upload an avatar photo, first complete your Disqus profile. | View the list of supported HTML tags you can use to style comments. | Please read our commenting policy.
 

Research and Reports

Hypervisor Derby
August 2011

Network Computing: August 2011

TechWeb Careers