Google Base blog import instructions

Google Base logo

Google Base launched last Tuesday as a new repository of information for distribution across Google’s network of sites including Google search, Froogle, and Google Local. You can add your existing content to the Google Base for broad distribution with only a few easy steps. I’ll show you how.

  1. You need a Google account associated with Google Base to submit items. Sign-in to Google Base to get started.
  2. Complete your extended profile. Market yourself with your full name or the name of your site, a description up to 400 characters in length, your URL, location and contact information.
  3. Create a new feed containing additional elements from the “news and articles” information type. These additional elements include author name, tags and categories (label), and a publication date. I set the number of pages to 1 because all my posts exist on their own individual web page. I used the Atom template because both Google and Atom require dates in ISO 8601 format.
    • Do you use Movable Type or TypePad Pro advanced templates? You can use my Movable Type Google Base template to easily output your last 1000 blog posts. You may output up to 100,000 items but the resulting file must be under 10 MB in size.
  4. If the resulting feed is under 195 KB you can use the Feed Validator to test your markup.
  5. Download the resulting file.
  6. Register your bulk upload. Your item type should be Reference Articles until there is a better pre-defined category for distributing your content.
  7. Upload your file via the Google Base dashboard bulk upload.
  8. Wait a long while for Google to consume all of your content. They claim 20-60 minutes but in my experience the process takes hours.
  9. Search Google Base to see your newly created content.

I created my own Google Base template for Movable Type to make the process a lot easier. I think Google should index Dublin Core elements and other information in the feed other than its own namespaced creation, so I left some extra items in the feed that currently serve no purpose.

You can submit an image link (image_link) for each item to stand out in the search results. Logo images are not acceptable but Google does not seem to be enforcing the rule: I have seen many logos throughout their result pages.

Once your blog posts are successfully updated you can edit each post, view the description, and add additional labels and attributes by clicking “edit” next to the item title on your dashboard.

I also recommend creating a separate People Profile entry for yourself. Your entry will expire in 31 days but you will drive new traffic to your site for keywords and topics of interest.

Why should you go to the trouble of submitting your information to Google Base? You will be completely sure that Google has all your latest content complete with the appropriate link back to your site. Feeding the content directly to Google may help your posts place better in Google search results.

11 comments

Commentary on "Google Base blog import instructions":

  1. Sam Ruby on wrote:

    Step 3.5: validate.

    As of this weekend, the feedvalidator understands and will validate elements in the Google base namespace.

  2. ME Strauss on wrote:

    Thanks Niall,
    For this information. This makes a whole lot of sense. I appreciate it.
    Liz

  3. Steve Hall on wrote:

    Yikes. doesn’t Google SiteMap accomplish the same thing without all this extra work?

  4. Niall Kennedy on wrote:

    Steve,
    Google Sitemaps tell the Google crawler where it should look to find your latest entries. Google Base allows you to enter your full content directly into Google where you can see your content’s status in real-time and add additional metadata such as the location of the post and an associated logo.

    Google Base allows much more control over the content submission process but it’s a good idea to output both to make sure you are doing everything you can to drive the most traffic to your site.

  5. CT on wrote:

    I was going to say… Is this really an intended use for Google Base? SiteMap is a dedicated way to help Google index your site; and even the simple “Submit Your Site” form should cover you as well.

    Not only do I not see the point, but I wonder if Google wouldn’t pull the plug on this, especially if it gets flooded with both legitimate sites and splogs etc.

  6. liberalcowboy on wrote:

    I definetly think Google will pull the plug on this. Because the real purpose of Base is for it to become a porn search engine. It is so slow lately. We know why now. Google Base is chock full of porn, soon enough it will be full of splogs.

    Sometimes I don’t like Google. If MSN were smart they would jump on the bandwagon take all that other stuff off their site and create a good search engine. Because I am almost tired of Google.

  7. Dave Carpe on wrote:

    why not just use it for select entries – as in, the good ones only and not the random blurbs? that makes sense. i’ve reposted articles from competitive intelligence magazine on my blog (that i wrote, obviously) and was mulling over moving certain posts over as reference articles – though then worry about duplication of results when the site integrates more smoothly…seems the sitecrawler takes care of most of it (they’re all easily found by search engines, so really not sure where base takes it)

  8. Jeroen on wrote:

    I tried out your MT template, but Google Base did not accept the results. I found out through the feed validator that part of the problem was the date format, which were not conform ISO8601.

  9. Niall Kennedy on wrote:

    Jeroen,
    Date output and Google Base input works for me. Perhaps you have a configuration issue.

  10. Spanish Googler on wrote:

    I’ve found an Open Source library for helping PHP developers to generate and upload through FTP Google Base valid data.

    The website is http://www.phpbase.org

    It has a plug-in style for attributes and schemes. But currently only Housing scheme has been defined.

  11. Raja on wrote:

    Useful info Niall. I’ve tried google submit site, but my pages were not included in the google index [or] not showing up in the search results. I find the method you suggested to be more easy. I could see the results immediately(after a few hours).
    Thanks and Regards,
    Raja AG