Thursday, July 28, 2011

Open XML Opens Office Document Metadata (without Office)

Microsoft has been releasing a number of Open XML SDK v2 samples on the MSDN Code Gallery in the past few days, samples that interested me professionally, and so wanted to round the up for easy reference.

The key point is that these provide examples of doing things for the old/binary formats, required using Office COM, whereas now we can do it all without installing Office. Just another reason to love the more modern approach taken with the Open XML file format.

Here's an example of one of the above links;

Retrieving Comments from Word 2010 Documents by Using the Open XML SDK 2.0

"The sample provided with this article includes the code necessary to retrieve the XML block that contains all the comments from a Word 2007 (or later) document. The following sections walk you through the code, in explicit detail. When you use the sample code to retrieve the comments, the procedure returns an XML element, named w:comments, which contains the XML block of information from the original document. It's up to you (and your application) to interpret the results of retrieving the comments.




Related Past Post XRef:
Checking for Microsoft Word DocX/DocM Revisions/Track Changes without using Word... (via OpenXML SDK, LINQ to XML or XML DOM)
LINQ to XlsX... Using VB.Net, LINQ, the OpenXML SDK and a little C# helper, to query an Excel XlsX

Official boat-load, as in supertanker, sized OpenXML content list (Insert "One OpenXML content list to rule them all" here)
So how do I get from here to OpenXML? Got a map for you, an Open XML SDK Blog Map…
Where to go to scratch your OpenXML dev info itch…
"Open XML Explained" Free eBook (PDF)

Open XML SDK 2.0 for Microsoft Office Released – Automate Office documents without Office
Opening OpenXML, the Open XML Package Editor Power Tool for Visual Studio 2010
Open XML 2.0 Code Snippets for VS2010 (and VS2008 too)
Open XML Format SDK 2.0 Code Snippets for Visual Studio 2008 – 52 C#/VB Code Snippets to help ease your Open XML coding

OpenXML Viewer 1.0 Released – Open source DocX to HTML conversion, with IE, Firefox and Opera (and/or command line) support

Powering into OpenXML with PowerShell

Microsoft Office File Formats and Microsoft Office Protocols Documentation Refreshed
Microsoft Office File Formats and Protocols documentation updated for Office 2010 (Think “Now with added ‘X’ flavor… DocX, PptX, XlsX, etc”)

No comments: