Speaking at PubCon on Tuesday

I’ll be in Las Vegas early next week speaking at PubCon on feed syndication best practices. The session takes place from 10:15-11:30 a.m. on Tuesday if you are attending the search conference. I have not been to Las Vegas in a few years so I’ll be checking out new pieces of grandeur at the Wynn, new Caesars Palace, Treasure Island, etc. Hopefully there will be lots of search geeks in attendance leading to interesting conversations….

Google collaborative appliance on the way?

Google’s moves into application bundles and collaboration software are setting it up for a bigger enterprise play, taking on Microsoft in an area that consistently feeds their R&D. Maybe you read about the JotSpot acquisition this morning on the Google enterprise blog. We look forward to putting those wikis to work. Google currently searches the enterprise through its search appliance, a brightly colored box you place in your rack and configure to crawl behind the firewall. Just one application on this box seems like a waste of space and could perhaps open up some more applications for small to…

Bookmarking and social sharing trends

The ability to save a URL has been around since Mosaic 0.2 but is currently experiencing a transformation as we learn more about the pages and content behind the pointers and share our findings with others through social networks. Hotlists, bookmarks, and favorites are changing and this month’s SF Tech Sessions next Monday will take a look at a few new companies changing the way we think about sharing bookmarks. Photo by Scott Beale / Laughing Squid The inspiration for this month’s SF Tech Sessions came out of a conversation with Jeff Weiner and Joshua Schachter of Yahoo! earlier this…

Google Alerts for blog content

Google Alerts now supports blog search content. If you subscribe to a Google News alert for your brand or topic of interest you can now receive the same style alerts for content in the Google Blog Search index. Google Alerts tracks news, blogs, Usenet groups, and Google Groups discussions using the same search syntax found on their respective website. The new blog search e-mail notification will be an easy extension of vertical search for existing users. Advanced users can setup an advanced search, or choose to receive general updates via web feeds and critical updates via e-mail. I expect Google…

New Googlebot controls for webmasters

Google has added new features to its tools for webmasters, allowing us to request Google index our site faster and more thoroughly than before. Crank it up! Control Googlebot crawl rate You can now control how frequently Googlebot crawls your site over the next 90 days. Webmasters can ask Googlebot to slow down or speed up for the next 90 days. Your choice may affect your total bandwidth usage but the tradeoff is possibly more frequent visits from Google’s discovery and indexing tools. Enhanced image search Webmasters can now opt-in to Google enhanced image search. If you opt-in Google may…

The current state of video search

When I lived in L.A. it seemed like everyone wanted to be a movie star. The Starbucks barista waiting to be discovered as he pronounced “Frappuccino,” friends scheming to be placed on a reality show and win a trip to a tropical island, and the many writers trying to get their latest script into the hands of Steven Spielberg. The recent boom in online video and its associated capture hardware has created a new class of stars. The next American Idol might submit a cover song to YouTube and video of a child’s first steps are uploading to the Web…

The current state of audio search

Online audio is definitely on an upswing, fueled by the iPod revolution, improved online playback, and broadband penetration. Audio search is keeping up with demand for new content, thanks in part to national security spending in the Cold War and beyond. In this post I will outline the current state of audio search, and how machines make sense of spoken word, progressing from easy to difficult. First, let’s define the space. I’m interested how a search engine might index content with non-professionally produced metadata. The President’s weekly radio address contains a full transcript. Music catalogs are available for purchase from…

The current state of image search

A picture is worth a thousand words, especially to search engines trying to match a brief search query to a set of appropriate visual results. How can a web search engine collect enough data about a particular image to provide a user with relevant results? In this post I will outline image search concepts, the current state of the art, and outline some of the challenges with still image search. Image on your website You might recognize the depiction above as Yoda, a popular character the Star Wars movie series. More specifically this is a picture of a Yoda statue…

Google acquires YouTube for $1.65 billion

Google shocked the online world this weekend with its acquisition of leading video site YouTube for $1.65 billion in Google stock. YouTube will maintain its brand and site, and move into its new San Bruno offices this week as planned. Hitwise estimates YouTube’s market share in September at 46%, an even stronger share in Europe. Google Video had an estimated market share of 11% in the same period. The $1.65 billion acquisition places YouTube at about the same purchase price adjusted for inflation as eBay’s acquisition of PayPal in 2002 for $1.5 billion. I’m sure the similarities are not lost…

Google Blog Search adds ping beacon, changes.xml

Google Blog Search is now accepting pings and republishing the updates it observes. You can submit an update using XML-RPC or REST, similar to other blog services and easily added to your weblog’s ping configuration. RPC endpoint: http://blogsearch.google.com/ping/RPC2 Google publishes the last 5 minutes of ping activity in its changes.xml file. It is possible to receive pings of different recency by adding the last parameter to your request with a number of seconds between 1 and 300. The new service is a change in Google’s view of the web, accepting the value of fresh index content within minutes instead…