By vijay.ramanathan on Mar 31, 2009
Some of you might have noticed that we recently added a new component called "UCM Clean Content Component" to the UCM Document Management 10gR3 bundle. If you haven't downloaded it yet, you'll find it on the Universal Content Management Product Download page on OTN under the "Oracle Universal Content Management Document Management 10gR3" section.
So what is "Clean Content?" Clean Content is Oracle's really cool SDK that allows you to examine Office documents for hidden content, and if desired, scrub the hidden information from the files. It has been available to Oracle Outside In Technology OEM customers for quite some time, and now we're making it available to UCM customers. Clean Content nicely complements the suite of content filtering and extraction technology offered by Oracle to OEM and UCM customers.
Why is this useful? Many Microsoft Office documents contain hidden content that you're not aware of and would not want exposed to a wide audience. For example, Word documents might contain hidden comments, deleted slides from PowerPoint files might still be in the file, or Excel files that connect to databases might contain database user names, passwords and other connection information that you don't want released.
The Clean Content Component for UCM 10gR3 uses Oracle's Clean Content technology to allow users to examine managed documents for hidden content. The component provides a service that analyzes a managed document, and generates a Document Risk Analysis Report. Users can access this functionality via the "Examine Document" option in the "Action" menu on the Content Information page of the desired item. The component does not automatically scrub content at this time (for example, before publishing files to an extranet or at the end of the workflow before saving the final revision). However, we look forward to adding that functionality in a future release -- stay tuned!