Raising the Bar: An Internet Marketing Blog for Lawyers.

Blogging on Law Firm, Professionals and Business Web Design

Peter Boyd

How to Create Search Friendly PDFs.

A question from our readers:

“How would” I create a text version of these documents so that they can be crawled by search engines?”

Easy. Google already indexes PDFs. In fact, Google can index just about anything these days. See a chart here.

  • Adobe Portable Document Format (.pdf)
  • Adobe PostScript (.ps)
  • Atom and RSS feeds (.atom, .rss)
  • Autodesk Design Web Format (.dwf)
  • Google Earth (.kml, .kmz)
  • Lotus 1-2-3 (.wk1, .wk2, .wk3, .wk4, .wk5, .wki, .wks, .wku)
  • Lotus WordPro (.lwp)
  • MacWrite (.mw)
  • Microsoft Excel (.xls)
  • Microsoft PowerPoint (.ppt)
  • Microsoft Word (.doc)
  • Microsoft Works (.wks, .wps, .wdb)
  • Microsoft Write (.wri)
  • Open Document Format (.odt)
  • Rich Text Format (.rtf)
  • Shockwave Flash (.swf)
  • Text (.ans, .txt)
  • Wireless Markup Language (.wml, .wap)

Now from an SEO perspective, we have found the following to be helpful. In order to give them a boost, you may want to do one of the following:

  1. Create a new page summarizing the PDF’s contents and then link to the PDF itself. The theory is that the summary will add additional keyword phrases to index and by linking to the article, you will increase the documents authority.
  2. Rename the PDFs to be keyword phrases you want to rank high for (i.e. instead of article1.pdf it becomes “real estate how to guide.pdf.”)”  The theory is that the file name gives Google another indicator about what the article is about AND all links to the file will contain keyword phrases.
  3. Alternatively, you could cut/paste the entire PDF as HTML text. The theory is that HTML documents tend to rank higher than PDF documents. You would create links to the PDF internally on the site and from other web sites. This will give the document more authority and rank it higher in the search engines. Make sure that you do not copy and paste the entire PDF document AND have the PDF indexed. That is technically duplicate content. You may want to indicate to Google to not index the PDF.You can do this by disallowing the PDF folder of all files.
  • http://www.designedbylucas.com Pit: Miami Web Design

    Wow! I never thought of that, I am in the middle of a self seo campaign and PDF files will actually increase the content on my site.




Categories:

About Us

PaperStreet creates new web sites and revitalizes aging ones. In addition to creating web sites that are engaging, we also have a knack for getting results.

Read More »



Featured Portfolio:

View All
Contact Us