DOCUMENT MANAGEMENT
Making business sense of classification technology choices
By Bain McKay

Last month, we talked about the need for automated classification systems. This month, we go into more detail.

Automated classification systems built on advanced technology are coming to market, but not all are created equal. How can you be sure you understand what matters about these systems so you can make the right choice for your corporation? This month, we look at the two main, and very different, types of classification technology: neural network clustering and adaptive clustering.

Interestingly enough, these technologies use opposite methods to arrive at their clustering or classification hierarchies. Neural network clustering classifies by difference in a batch process, whereas adaptive clustering technology classifies on sameness through incremental convergence in real time. As a result, each delivers a different business value beyond the immediate value of records classification for records management.
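To make "classifying on sameness through incremental convergence" more concrete, here is a minimal sketch of one way an incremental clusterer could work. It is not the algorithm of any particular product: the bag-of-words vectors, the cosine similarity measure, the `IncrementalClusterer` class, and the 0.3 threshold are all assumptions chosen for illustration. Each document is handled as it arrives. It joins the most similar existing cluster if it is similar enough, and otherwise starts a new one.

```python
import math
from collections import Counter


def vectorize(text):
    """Turn text into a bag-of-words vector of term counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two term-count vectors (0.0 if either is empty)."""
    dot = sum(a[term] * b[term] for term in a if term in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class IncrementalClusterer:
    """Hypothetical illustration: groups documents by sameness, one at a time."""

    def __init__(self, threshold=0.3):
        self.threshold = threshold  # assumed similarity cut-off
        self.clusters = []          # one summed term-count vector per cluster

    def add(self, text):
        """Assign one document to a cluster and return that cluster's index."""
        vec = vectorize(text)
        best_index, best_score = None, 0.0
        for index, centroid in enumerate(self.clusters):
            score = cosine(vec, centroid)
            if score > best_score:
                best_index, best_score = index, score
        if best_index is not None and best_score >= self.threshold:
            # Similar enough: fold the document into the existing cluster.
            self.clusters[best_index].update(vec)
            return best_index
        # Nothing similar enough yet: the document starts a new cluster.
        self.clusters.append(Counter(vec))
        return len(self.clusters) - 1


clusterer = IncrementalClusterer(threshold=0.3)
first = clusterer.add("invoice payment due invoice")
second = clusterer.add("payment invoice overdue")
third = clusterer.add("server crash log")
print(first, second, third)  # 0 0 1
```

No separate training run is needed in this sketch. The clusters take shape as documents arrive, which is the contrast with the batch approach described in the next section.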

Neural network classification systems
Neural network clustering technology, which has been around for some time, is aimed at simulating the way the brain classifies the volumes of information it compartmentalizes every second of every day -- clearly a massive-volume information management problem in its own right. Neural networks compartmentalize data by highlighting the differences between documents, based on a set of significant representative phrases contained in the documents. In effect, they build walls between the data, delivering data silos.

Neural network classification hierarchies are developed by training the network on a representative sample of documents from the target domain. Because training a neural network is so time-consuming, decisions must be made as to how much data should be sampled to build the classification hierarchy. Statistical methods are used to ensure that the domain data is sampled properly, so that the sample is an approximate representation of the document domain. Building a neural network can take two or more days of processing, depending on the size of the data sample and the power of the computer used for the processing.

After neural network processing has been completed, the names of the nodes in the classification hierarchy must be edited to provide meaningful names that you understand. This is much like the manual editing process used in structured data modeling. After editing, the classification hierarchy is published to a server to begin its document classification work. Documents are processed through the classification hierarchy, which acts as a sorting bin, placing each document in the closest-fit classification node in the hierarchy.
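The "sorting bin" step can be sketched in a few lines. This is a simplified illustration, not a neural network: it assumes each node in a published hierarchy carries a list of representative phrases, and it scores a document by counting how many of those phrases it contains. The `Node` class, the `classify` function, and the sample hierarchy are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One node in a hypothetical classification hierarchy."""
    name: str
    phrases: list
    children: list = field(default_factory=list)


def score(node, text):
    """Count how many of the node's representative phrases occur in the text."""
    lowered = text.lower()
    return sum(1 for phrase in node.phrases if phrase in lowered)


def classify(root, text):
    """Walk down from the root, taking the closest-fit child at each level.

    Returns the list of node names from the root to the chosen node.
    The walk stops early when no child matches the document at all.
    """
    path = [root.name]
    node = root
    while node.children:
        best = max(node.children, key=lambda child: score(child, text))
        if score(best, text) == 0:
            break
        path.append(best.name)
        node = best
    return path


# Sample hierarchy with hand-edited node names (illustrative data only).
hierarchy = Node("All records", [], [
    Node("Finance", ["invoice", "purchase order"], [
        Node("Accounts Payable", ["payment due"]),
        Node("Payroll", ["salary"]),
    ]),
    Node("Legal", ["contract", "non-disclosure"]),
])

print(classify(hierarchy, "Invoice attached: payment due in 30 days"))
# ['All records', 'Finance', 'Accounts Payable']
print(classify(hierarchy, "Lunch menu for Friday"))
# ['All records']
```

A document that matches nothing below a node stays at that node. A real system would more likely route it to the nearest node by some distance measure, or flag it for review.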


Copyright © 1998-2009, ZATZ Publishing. All rights reserved worldwide.