AIIM — The Enterprise Content Management Association

The source for solving your business content challenges.

SharePoint Micro Site

On the Record with SharePoint

A look at enterprise information lifecycles and content governance. Not thinking about these items today? Read this.

Jan 08, 2010


This column focuses on the enterprise content management (ECM) and records management (RM) dimensions of SharePoint, including both Microsoft Office SharePoint Server (MOSS) 2007 and the upcoming release of SharePoint 2010. We explore the broad solution ecosystem that exists for SharePoint content and records management. This week, we focus on Enterprise Information Lifecycles and Content Governance.

Solving Business Problems with an Enterprise Information Lifecycle
First, what is a lifecycle? A lifecycle is a succession of conditions through which information is processed from creation or receipt to its final disposition. A lifecycle is comprised of states, such as “draft” or “work in process” or “final/record.” Each lifecycle state is a point in the life of information during which processes, ordered activities which initiate the application of a set of business rules, are carried out. The enterprise information lifecycle establishes states for all information that is created or received in an enterprise through to its final disposition.

The reason that a standard enterprise information lifecycle is so important is that it enables the management of content according to an enterprise taxonomy or file plan (providing classification and naming standards) and retention schedule (providing retention management rules) regardless of whether the content is electronic or physical or whether the decisions are made by humans or automated processes. The information lifecycle defines processes, rules, and repositories that are globally applied across the enterprise to all information and are separate from most business processes and applications. The lifecycle can be defined to support the full existence of information across multiple types of repositories and can support the infrastructure requirements of the largest organizations. The result is consistent content governance combined with consistent user expectations for how they should participate in the management and retention of enterprise information.

Below is a graphical example of an Enterprise Information Lifecycle:

 Lifecycle diagram
 
Figure 1 - Information Lifecycle Diagram

These states and process components (represented by movement between states) comprise an enterprise information lifecycle model. How these states relate to one another might be different depending on the business unit, repository, type of content, and associated business rules; but information always needs to exist in one of these states.

Business rules identify the constraints to information which prevent or enable it to move between lifecycle states. Business rules are defined for a variety of information elements such as a business event (e.g., document approved for use) or system trigger, audit and retention requirements, file format requirements, de-duplication rules, security rules, metadata rules, etc. Each of these “rules” represent a specific set of enterprise guidelines that apply based on each unique representation of the lifecycle model for an application or process. Defining business rules for each lifecycle state provides the basis for consistency in the enforcement of these rules. For example, transition to the “final” (record) lifecycle state will be the trigger that ensures the appropriate retention policy is applied.

Without a standard enterprise information lifecycle, organizations will continue down the same path they have been experiencing:
• Business units find themselves with too much information
• Productivity and business effectiveness decreases as users struggle to find the information they need to do their jobs
• Business units implement processes to control their information that are inconsistent and incompatible
• Information is often stored in a random and inconsistent locations
• Minimal information is captured about the document or it is not standardized
• Information may become orphaned, and no one knows what it is
• Users have too much or too little permission to access the information
• There is no clear determination how accurate or stale information might be
• There is no clear determination what should be kept – too much information will be retained for too long or not long enough

These issues are even more common in SharePoint environments that have been implemented without consistent standards or content governance.

Defining an Enterprise Information Lifecycle in SharePoint
A SharePoint information lifecycle typically begins with the definition of a baseline enterprise information lifecycle model that defines the states and the transitions through which content flows from creation through to destruction. The enterprise information lifecycle is further defined into a specific set of components, including an enterprise retention schedule, business rules, requirements for viewing and navigating to content, tagging content for search and retrieval, and rules and procedures for metadata management for each type of content in the SharePoint repository.

Processes and procedures for information lifecycle management identify the formalized steps or actions which must be undertaken or performed to meet these rule conditions and address the application and integration of retention schedules. These processes and procedures include but are not limited to:
• Information creation
• Information storage
• Information organization, including creating taxonomies and assigning metadata
• Information searching and retrieval
• Records declaration
• Information disposition

A lifecycle model provides business units with a default set of best practices for information management within their SharePoint sites. Use cases depict how the model and its associated rules, processes and procedures would be defined using specific examples, such as the creation of Word documents, InfoPath forms, email, file shares, SharePoint lists and libraries, and paper. These should indicate default events and common exceptions, such as the Hold state, that information may enter. Use cases are also often used as tools for promoting user awareness and change management.

The information lifecycle model can be used to support many different types of applications, processes and content types. Nearly all recorded information in all sizes of organizations can be managed using fairly simple enterprise information lifecycle states. While the use of lifecycles in SharePoint typically results in a single lifecycle model, multiple models may be necessary to support the wide variety of information and record types created by the organization.

Mapping Lifecycles and Content Governance to a SharePoint Site Hierarchy
As SharePoint sites multiply across the organization, it is imperative that the information and documents contained in these sites be managed in a consistent way. Figure 2 below illustrates how a large organization might implement a hierarchy of SharePoint sites to include My Sites and Team Sites that support the requirements of a wide variety of types of teams as well as enterprise sites that provides portal capabilities and application connectivity to an organization.

heirarchy diagram
 
Figure 2 - Example SharePoint Site Hierarchy

SharePoint 2007 introduced new capabilities for records management. This included the SharePoint Records Center which contained the Records Routing Table and was the repository for content that had been declared as a “record.” The information lifecycle defines standards for the flow of information into the Records Center from other SharePoint sites. The role of the Records Center is illustrated in Figure 3:

record center diagram 
Figure 3 - SharePoint Record Center

All information is either manually or programmatically submitted to the Record Center defined for the SharePoint farm. Depending on the content types, logical content definitions within SharePoint, the information is stored in the appropriate document libraries within the Record Center based on rules defined in the Record Routing table.

The SharePoint 2007 Record Center has a unique set of default behaviors that results in the requirement for careful analysis, planning and, potentially, customization to fully support the typical lifecycle requirements for content and records management in large organizations.

Information Lifecycles for SharePoint Sites
A primary focus of SharePoint Content Governance is to provide an information lifecycle to manage the retention and disposition of content that is maintained in SharePoint sites. The figure below illustrates an approach for SharePoint sites that incorporates the SharePoint Records Center, a Master Site for standard Content Types and Information Policy, an automated provisioning tool for SharePoint sites, and a File Plan and Retention Schedule tool to manage the synchronization and coordination of retention and disposition policies for the SharePoint Records Center.
 
Master Sites diagram
Figure 4. SharePoint Master Sites and Auto-Provisioning

In SharePoint 2007, a Master Site is often used to maintain standard set of content business rules that are managed in a single location. This set of business rules includes the standard SharePoint content type structure, a base set of metadata properties for the enterprise, and retention management policies. The master site contains these rules and structures and is used to publish this classification information to sites within the farm. This provides the ability to classify and manage content in a standard manner across and within numerous sites.

The master site includes the definition of metadata properties, or attributes. These attributes are inherited in the content type hierarchy, and this is integrated with the SharePoint Records Center and Records Routing Tables to enforce an enterprise lifecycle. A single Master Content Type, which is a parent that manages and controls the federation of a common definition of enterprise standard metadata, defines the subset of metadata that is required for all content to be classified. We found this is to be a best practice in SharePoint 2007.

Automated Provisioning is an approach to rapidly create and deploy structured and governed SharePoint sites. These tools insert all of the enterprise rules and SharePoint features that enable a standard information lifecycle, taxonomy elements, site navigation, search, security, and other required capabilities into every site in the same way. This ensures that the information lifecycle and governance policies are consistently deployed and consistently enforced.

In addition to the consistent deployment of a common set of SharePoint structures, the Automated Provisioning includes the definition of site administration features that ensure that enterprise governed controls are enforced and not circumvented through user or site administration activities.

SharePoint Content Governance needs to let SharePoint be SharePoint. Deployment standards are required, but users need to be able to customize their SharePoint for their purposes. Organizations need to manage the consistency of the overall Information Architecture while enabling site owners to create both customizations and documents using a variety of standard document templates in a governed and structured way. We believe it is a best practice to enforce standards through automation and by limiting the choices that are available to both users and local site administrators.

In SharePoint 2010, the Managed Metadata Service and the Content Type Hub use a similar model to incorporate content governance into overall SharePoint governance across SharePoint deployments. This is an important advancement and will make site content administration much more consistent. Additionally, records management will be extended into local sites and records will be able to be declared outside of the Record Center, so records management goals will be able to be achieved more flexibly. We believe the new capabilities of SharePoint 2010 make it even more important to think through your requirements for enterprise content and records management across the organization before deploying hundreds or thousands of sites.

A standard enterprise information lifecycle provides a powerful approach to content governance and records classification that enables users to manage their information consistently across the SharePoint sites of an organization in SharePoint 2007 today and in SharePoint 2010 in the future.

Mike Alsup is a Sr. Vice President with Gimmal Group , an ECM and RM systems integrator. He blogs at (kqj109.wordpress.com). He welcomes comments or scathing remarks at malsup@gimmal.com.

Preferred Solution Providers







SharePoint Recent Expert Blogs

Daniel Antion

Rogue Users by Daniel Antion Vice President Information Services

Working in a small company makes it hard to blog about the things people do. You have to be careful when complementing people, so you don’t steer too much credit in any one d... read more

Nick Inglis

Mapping Content Types & Metadata (White Board Video) by Nick Inglis SharePoint Program Manager

I created this quick white board tutorial to explain the relationship between content types and metadata (or columns). read more

Dave Martin

Welcome to the club SharePoint (Club ECM) by Dave Martin Director | Microsoft Solutions Group

It was about six months ago when a friend of mine from Microsoft told me a story where he had been in a room with all of the established ECM solution vendors as well as some ... read more

View All