Search DominoPower's 11,320 Lotus-related article archive 
Home
EasyPrint
News details Click here for the RSS feed's XML code. This is not a browser URL.
Articles-only Click here for the RSS feed's XML code. This is not a browser URL.
Twitter Feed Click here for the Twitter feed.
DOCUMENT MANAGEMENT
Making business sense of classification technology choices
By Bain McKay

Last month, we talked about the need for automated classification systems. This month, we go into more detail.

While automated classification systems are coming to market using advanced technology, not all are created equal. How can you be sure you understand what matters about automated classification systems so you can make the right choice for your corporation? This month, we look at the two main and yet very different types of classification technology: neural network and adaptive clustering or classification.

Interestingly enough, these technologies use opposite methods to arrive at the clustering or classification of hierarchies. Neural network clustering classifies by difference in a batch process, whereas adaptive clustering technology classifies on sameness through incremental convergence in real-time. As a result, they each deliver a different business value beyond the immediate value of records classification for records management.

Neural network classification systems
Neural network clustering technology, which has been around for some time, is aimed at simulating the way the brain classifies the volumes of information it compartmentalizes every second of every day -- clearly a massive volume information management problem in its own right. Neural networks compartmentalize data by highlighting the differences between documents based on a set of significant representative phrases contained in the documents. In effect, it builds walls between the data, delivering data silos.

Neural network classification hierarchies are developed by training the network on a representative sample of documents from the target domain. Because neural network technology is so time consuming, decisions must be made as to how much data should be sampled to build the classification hierarchy. Statistical methods are used to ensure that proper domain data sampling is done to arrive at an approximate representation of the document domain. Building a neural network can take two or more days of processing, depending on the size of the data sample and the power of the computer used in processing the neural network.

After neural network processing has been completed, the names of the nodes in the classification hierarchy must be edited to provide meaningful names that you understand. This is much like the manual editing process used in structured data modeling. After editing, the classification hierarchy is published to a server to begin its document classification work. Documents are processed through the classification hierarchy, which acts as a sorting bin, placing each document in the closest-fit classification node in the hierarchy.


1  ·  2  ·  3  ·  4  ·  5  ·  Next »
Other articles you might like
Home > Strategies > Knowledge Management (22 articles)
   Inside the architecture of a hyperspatial Knowledge Management application
   Leveraging components in a hyperspatial knowledge management application
   Making sense of the Knowledge Management jargon
Get Weekly Email Updates
Subscribe to our regular weekly email newsletter. It's packed with tips, reviews, deep analysis, and the latest news.
 
Recent DominoPower Articles
Lotusphere 2010: mobility and collaboration
2010: A Lotusphere of change
Five trends for 2010
DominoPower TV Episode 1: Inside a strategy session with Teamstudio
More about Domino log files
Say goodbye to the Uh-Ohs. Long live the Tens.
Why your log.nsf might not be purging properly
Latest Lotus Headlines
SnTT: XPages Blank Calendar Control (Part 2), adding data
Have your Lotus Notes calendar display multiple time zones
Sample Database for Microsoft Office and Lotus Symphony Integration
Symphony 3.0 beta signals another attack on Office
Enabling DAOS on a database - new recommendation
Need your opinion on some new policy settings for Mail
Sometimes IBM Lotus Domino HTTP RPC Agents aren't the answer...
>> Read all the news
More from the ZATZ journals
Computing Unplugged: The iPad: Apple's latest heartbreaker
David Gewirtz Online: CNN commentary and analysis
OutlookPower: Running auto-respond rules when Outlook is closed
-- Advertisement --

Learn Notes and Domino 8 at your place and pace!
Learn Notes and Domino in your office and/or home! TLCC's highly acclaimed distance learning courses for users, developers, and admins will enhance your career and your resume.

The many included activities and demos will make you a pro! Expert instructor help is a click away.

Click here to try a FREE demo course!!

-- Advertisement --

Struggling with exporting Notes data to spreadsheets? No More!
Try IntelliPRINT, The world's leading Reporting, Dashboards, and Analysis solution for Notes & Domino

  • Don't spend unproductive time maintaining different versions of the same spreadsheet
  • Preserve data integrity and security in multi-user environments
  • Create reports in minutes INSIDE Notes
  • Get freedom from iterative report requests, deliver self-serve capabilities

Experience Reporting, Dashboards, and Analysis INSIDE Notes.

Try IntelliPRINT NOW!

ZATZ Home  ·  News  ·  Back Issues  ·  Credits/Trademarks ·  Link To Us
Copyright © 1998-2010, ZATZ Publishing. All rights reserved worldwide.
Editor's Login