2010-03-23 - I have a new client who has provided me with an extensive library of PDF documents. These are technical documents that are well formed and utilize proper markup from the originating source. The client have provided the documents already converted to PDF. I wanted to share with you the steps that I'm taking to make sure that each and every PDF are Optimized for Search.
This quick Tutorial assumes that you have Adobe Acrobat and/or a similar authoring program for the editing of PDF Documents.
Note that you DO NOT need the publishing program originally used to create the documents e.g. InDesign. I'm using Adobe Acrobat for this Tutorial/Pictorial.
My example uses the Google Guidelines for Quality Raters PDF which is optimized for search.
Let's get started! From Adobe Acrobat go to the File menu and select Properties.
The Description Tab is where most of your search optimized content resides.
<title> - If a Title is not present (see next), the file name is used by default. This is why it is imperative that your file names are relevant (suggest hyphens as keyword separator) and that you've crafted an optimized document title.
<title> - This is the single most important element of your PDF Document Properties. This is used as the title in the SERPs. Remember, if this is not populated, the file name e.g.
quality-rater-guidelines-2007.pdf is used by default.
This can be a company name or an individual's name.
Subject = Snippet - The Subject serves as the META Description for your PDF Document. It is the second most important element after the Title.
The Keywords dialog is here for a reason, use it effectively. I keep them brief, usually 3-5 primary keywords, no fluff.
Once you've populated the Description information, you can build Additional Metadata into your PDF Document. Click on the Additional Metadata button and you are presented with this dialog window.
Notice how most of the information you entered in the Description Tab is already populated in this dialog. The Additional Metadata you can enter are;
If you look at the left menu, you'll notice an Advanced selection at the bottom. If you choose that, you'll see a good overview of how your PDF Document may be interpreted by crawlers. See the 5 examples below.
The above covers just the basics of PDF Document Optimization for Search. Once you start to optimize PDFs on a regular basis, you find out how important they can be in the overall equation.
There are a variety of strategies that can be utilized when working with a library of PDF Documents, the above is first and foremost with ALL documents, not just PDFs. This applies to Word, Excel, PowerPoint, and/or any publishing software.
While we're in the Document Properties dialog, we might as well optimize the document for our visitors viewing pleasure.
These are the default settings for the Document's Initial View, what the visitor sees when they open the PDF from a link. In many instances, this is really not the best setting as the user ends up with a document that is not optimized for viewing.
Most of the time we're working with multi-page PDF Documents. If there are too many pages and the document can be broken down into logical mini-documents, we may suggest that. It's like taking really long web pages and breaking them down into groups of highly targeted pages.
I like using the Pages Panel and Page as Initial View.
I assume that most users have a viewport that may not display two-up pages large enough to read on initial open. In fact, I've had to make adjustments myself when opening PDFs that are configured using the two-up options. I suggest the Single Page Initial View.
I like fluidness in layout. Using any of the Fit options provide a fluid experience for the viewer. I prefer using the Fit Visible as most users have their browser viewports set to a comfortable width and height that allows them to read most content without changing the size of their viewport.
I'll choose the
Resize window to initial page and
Center window on screen options. Be sure to choose the Show: Document Title option.
That's it for optimizing PDF Document Properties. This article discusses just one aspect of SEO for PDFs. The actual document itself can also be optimized for search. Use PDFs to your advantage. Think about taking larger multi-page PDFs and breaking them down to their least common denominator then offer them up as a group of PDFs from within an HTML Document that is well formed and semantically structured.
Review the first page of your PDF Document and make sure that it best represents the content of your PDF. In many instances, this may be a Table of Contents which usually isn't the most optimal page to have as the first page, not from a search perspective. That is why using the Document Properties to their full extent is imperative when Optimizing PDFs for Search.
If you can, I'd suggest making an introduction page for the PDF that contains a Heading and an Abstract of the document. Use an Inverted Pyramid Writing approach when constructing the Abstract and I typically suggest a target of 140 to 160 characters per paragraph.