Matt McAlister moves to YDN

Matt McAlister has changed jobs within Yahoo!, moving away from the RSS group and into the Yahoo Developer Network. He follows former RSS PM Scott Gatz into the advanced products and media side of the business under Bradley Horowitz.

Matt’s move is interesting timing given the recent addition of a Yahoo! Mail full beta and all the users that come with it to the backend RSS platform.

The Product Strategy group at Yahoo! continues to attract company talent into a side of the business I call “Jerry’s slush fund.” Employees are given more opportunity to think big and take risks without revenue concerns. The group includes free developer APIs, a small incubator, and research groups in Berkeley and London.

Yahoo Hack Day, a career fair for an era of participation

Yahoo! is hosting an open hack day at their Sunnyvale headquarters next Friday and Saturday, introducing developers to well-known Yahoo! employees and development tools. I view the whole thing as a new take on the career fairs of the past, where introductions happen over clever code instead of a carefully crafted resume attempting to make its way through the various cogs on its way to a decision maker. You can try navigating the Yahoo! careers site to figure out which of the 115 open PHP positions are right for you.

There are few details available about the event but that hasn’t stopped people from flying in from as far away as Australia to attend. A few Yahoo! employees have told me there will be big-name music acts performing each night in what could be the biggest corporate party since Pixelon’s $10 million iBash in 1999.

I’ll be attending Hack Day as a curious observer. There should be good talks next Friday on YUI, JSON, and PHP, and the big party is on Saturday night. I’d like to hear a little bit about how Yahoo’s tweaked versions of Apache and PHP differ from trunk and learn about some of their real-time attack and statistics analysis. Perhaps a few people will gather on the lawn to talk about running business and product.

I’ll post pictures and updates from the event. Some things that are on my mind that might come into more focus over the next 10 days include:

  • What are reward and recognition methods for top contributors and developers other than being hired by the API or platform provider?
  • Where is the current state of commercial API agreements at large providers? Do popular mashups receive priority access and support?
  • How have structured corporate collaboration events such as hack days changed product group interactions within big companies?

NewsGator syncs Windows RSS platform into its cloud

NewsGator just released NewsGator Desktop Sync in beta. The Windows application sits in your desktop tray and keeps your feeds, folders, and item read states synchronized between NewsGator Online and the Windows RSS Platform. NewsGator Online may be viewed in a web browser or synchronized to your mobile phone, Mac, Windows Media Center, or other applications hooked into NewsGator Online.

NewsGator Desktop sync connects Windows PCs running the Windows RSS Platform present in Internet 7 and above. Synchronization is like a personal teleporter for your feed data, breaking up your data into many tiny bits capable of being reassembled wherever you would like to go. Desktop sync allows NewsGator to connect the siloed Windows RSS Platform to its own teleportation device, retransmitting the bits to any application using the NewsGator API.

Authenticated and private feeds

Some syndication feeds are not meant to be displayed for the world to see. Our everyday lives contain private and confidential data we wouldn’t want anyone else to see, and especially not search. There are a few options for trying to keep things private in your feed aggregator but the implementations require proper coding and privacy from all implementors.

Examples of private feeds intended for 1:1 communication include bank balances, e-mail notifications, project status, and the latest bids on that big contract. Data in the wrong hands could be dangerous, and many companies will stay away from the feed syndication space until they feel their users’ personal data is secure.

A private feed’s data could be exposed in a variety of ways. A desktop aggregator’s feed content might be available to other users on the same computer, either through directory access or desktop search. An online aggregator might expose a feed and its content in search results or a preview mode.

Security through obscurity

Sites such as Flickr hide private photos from navigational view, but do not restrict access to photo data if shared, or if someone were to guess the semi-scrambled URI. Private things are kept private because no map or directions are available for public navigation.

Permission-based exclusion

A feed might specify a desire not to be available to other users through means such as search but it’s up to each to obey a publisher’s preferences. I proposed feed exclusion using category last February and Bloglines recently introduced a feed access control namespace specific to their application.

Publishers cannot rely on an application to nicely obey access control specified in their feeds of sensitive data.

HTTP Authentication

Feeds can be accessed using HTTP/1.1 and access authentication. A few feeds online currently use this method to deliver personalized information to their users.

  • 37signals Basecamp – Track your project status.
  • Measure Map – a daily update of your blog stats.
  • Daring Fireball – paying members receive a user-name and password to access a full-content version of the site’s feed.

HTTP Authentication works with most desktop aggregators but runs into trouble with most online aggregators which rely on a common feed store based on feed and/or link URIs. Bloglines and Google Reader fail to load authenticated feeds, do not request credentials, and do not provide a meaningful error message. NewsGator Online supports secured feeds.

Feeds with authentication might be exposed to a broader audience than the original provider of proper credentials. A search for “Basecamp” on My Yahoo! exposes the private project management data of over 25 customers of 37signals’ Basecamp.

Summary

A feed publisher could whitelist the user-agents it knows comply with its access policies. SSL encryption might not be a bad idea either as shared aggregation spaces might not store content requested over HTTPS. It would place extra load on the server as each request requires extra processing, but if the alternative is placing your customer’s data in the Yahoo! search index then that’s not such a bad thing.

I believe large publishers such as Salesforce.com or eBay would produce more feed content if they knew their customers’ data was kept private and secure. There’s a definite demand for more content transmitted over feed syndication formats but it will take the cooperation and collaboration of security formats and consistent aggregation practices to really move the needle in the right direction.

Expressing threads and comments in feeds

A feed entry is not a silo, but often connected to other entries in the feed world. In this post you will learn how to express relationships between data using RSS, Atom, and common namespaces.

Comment count

Reading a feed entry can sometimes evoke action in the form of commentary. Readers often become writers, leaving comments attached to a post or article. Popular technology news site Slashdot has had an active commenting community for years, and created a namespaced element to express the total number of comments associated with a specific item.

<slash:comments>42</slash:comments>

The above example uses the slash module to define the number of comments associated with a specific item or entry. This additional data helps a user determine the popularity (or existence) of conversations around the individual entry. Aggregators may choose to sort on comment count, or simply display the total number of comments before linking to the entry’s comments area (defined in RSS 2.0).

Comment feeds

Each entry might output a separate feed for comments. Aggregators can count the total number of individual entries contained in this feed to display a total comment count, or display the comments within the post view.

<wfw:commentRss>http://example.com/post1/comments/rss</wfw:commentRss>

The above example uses the Well-Formed Web Comment namespace to define a RSS feed containing comments for the given individual entry. An aggregator can pass this URI to its parser to better understand the conversation happening in the individual entry’s comments.

Citing the source

A source element may be specified to provide more information about the origination of described data. An entry might be a copy of another resource or inspired by another piece of work.

The Flickr explore feature contains one picture sourced from an individual member’s photo stream for example. A user might want to explore more photographs from that Flickr member or immediately subscribe to his or her photo feed. Another example might be crediting an original source, such as a blogger commenting on a story reported by The New York Times.

RSS 2.0

RSS source elements exist to give credit for links and provide a method for tracking story origination. You may specify the feed URI and title in this item element.

<source url=”http://example.com/rss.xml”>Example Title</source>

If a reader notices Bob tends to find really good stories from Sue he might add Sue to a list of subscriptions. A machine might notice the relationships between stories, and follow multiple connections until it locates a node of origin.

Atom 1.0

Atom’s source element (IETF 4287, 4.2.11) credits another feed when the current entry is a copy of an original source. This applies to the Flickr example given above, a search result within a feed search engine, or a spliced feed. Elements such as feed author, copyright, and links are maintained, allowing a user to subscribe directly to a source feed, visit the corresponding web page, or view more information about the author.

Atom Threading Extensions

Atom Threading Extensions combines comment counts, comment feeds, and giving credit for the source of a story.

<link rel="replies"
      type="application/atom+xml"
      href="http://example.com/post1/comments/atom"
      thr:count="42"
      thr:updated="2006-06-28T12:11:10Z" />

The above example defines a feed containing comments about the individual entry and a count of comments received including the time of last comment. Each attribute may be interesting enough to spark an action.

Summary

There are multiple ways to express relationships between resources and spawn related actions for individual feed entries. Implementations vary, but exposing these connections and actionable data leads to a richer and more intertwined web.

Implementations vary by each piece of feed parser but here are some “in the wild” numbers taken from a recent snapshot of Google Reader subscriptions:

  1. Well-Formed Web: 8.39%
  2. Slash: 3.66%
  3. Atom Threading: 0.54%

Feed syndication beyond blog updates

Many people in the technology world view the world of RSS, RDF, and Atom as a way of outputting blog content and reading the latest information in a feed aggregator. While that simplified view is true, the full world of feed syndication is a bit more complex.

Syndication feeds are extremely popular expressions of structured XML. The popularity of content production and consumption using feeds has resulted in widespread deployment of parsers able to turn something simple such as a title, publication date, and a body of text into an easily displayed message, communicating recent updates and atomic changes in our online world.

Power users might subscribe to hundreds of blogs within an aggregator such as NewsFire or Bloglines but feeds are also used to transfer specific data related to specific applications such as e-mail or a supply chain status.

Google

Gmail logo

Google Mail produces an Atom feed for every user. The feed contains information about the number of unread e-mail messages in the account as well as entries detailing each unread message. An authenticated user can view the message title, body, sender, and send date through data contained in the Atom feed. Google uses Atom 0.3, a deprecated version of the standard, and doesn’t properly namespace their added elements, but that doesn’t stop third party sites such as Netvibes or Gmail desktop notifiers from parsing the feed to display timely information for their users.

Most of Google’s APIs are expressed using Atom or RSS syndication formats. You can access your Google search history or access your Google Calendar data expressed as a feed.

Netflix

Netflix users can keep track of the latest movie releases, their movie queue, or recommendations using Netflix data expressed as RSS. Netflix members can access this data anywhere and at any time, or other services might build upon the Netflix data.

Earthquakes

USGS logo

The United States Geological Survey collects earthquake data from around the world and outputs the data in multiple formats for use by enthusiasts and researchers. The RSS feed includes the time of the earthquake, the epicenter expressed as latitude and longitude, the depth of rupture, and other data. Standard RSS parsers can understand basic data contained in the feed while more advanced readers can comprehend and display detailed information present in namespaced elements.

Summary

What commonly updated data do you produce that can be expressed to the world in the form of feed standards? Widely deployed feed parsers are ready to listen, delivering your latest data to interested people and/or devices around the world.

Yahoo! bundled with Acer computers

Yahoo! and Acer announced a multi-year agreement to bundle Yahoo! Search, Toolbar, and start page with all Acer computers sold around the world. Acer is the top notebook vendor in Europe, Middle East, and Africa and third largest in Asia Pacific according to Gartner. The new deal should give Yahoo!’s international market share a nice boost.

The press release mentions a cobranded homepage in the style of Yahoo!’s recent redesign. Acer will set Yahoo! as the default search in Internet Explorer 7 and Yahoo! Toolbar will come pre-installed. Yahoo! services will be present on all Acer computers shipping after October 1.

TechSession: Mobile communities

The mobile industry in the United States is starting to take advantage of new data networks and cheaper messaging costs to connect subscribers and their interests. It’s a world filled with acronyms describing ways to help you stay connected as you leave your home-base and wander around cell space with an info beacon in your pocket pinging the network and collecting new data in between voice conversations. What can EV-DO, HSDPA, SMS, MMS, WAP, and XHTML do for you?

Next week’s SF Tech Sessions event is all about mobile communities will bring together three companies exploring new methods of communication through a mobile phone. WAPtags leverages the power of your friends to display the most relevant search results on-the-go. Twitter helps you send messages to a group of friends with your latest status and thoughts. TextMarks combines a short code and keyword to create data services similar to a mobile mailing list or alerts service.

SF Tech Sessions is demo-focused, with the product and development teams leading discussions about new technologies and their company’s unique perspective. It all takes place next Wednesday, September 20, from 7-9 p.m. at CNET in San Francisco. For more details, and to RSVP, visit the SF Tech Sessions blog entry.

Fox Interactive Media Labs testing widget platform

TheSpringBox RSS widget

A small group of of Fox Interactive Media employees are working on a new widget platform called TheSpringBox. TheSpringBox allows users to embed Flash widgets in any webpage and download their favorite widgets to the desktop engine for more frequent use. The new widget engine was developed by FIM Labs just outside Atlanta in Marietta, Georgia. (via Mashable)

TheSpringBox desktop

The existence of another desktop and webpage widget engine isn’t too big of a deal, but this is a platform under development from Fox Interactive Media, home of MySpace and a huge widget economy. Peter Chernin, COO of parent company News Corp. News Corporation, recently told investors it’s time to cut out the middleman in MySpace widget services.

If you look at virtually any Web 2.0 application, whether its YouTube, whether it’s Flickr, whether it’s Photobucket or any of the next-generation Web applications, almost all of them are really driven off the back of MySpace.

If we build adequate, if not superior, competitors, I think we ought to be able to match them, if not exceed them.

By providing its own widget platform Fox Interactive Media would be able to whitelist widgets included on the pages and desktops of MySpace users, restricting options and promoting their own services. TheSpringBox could also be a small project explored by a small group of creatives 2000 miles away from FIM headquarters, but based on Peter Chernin’s recent comments I can’t help but think of the protectionist business practices MySpace might introduce.

Microsoft is the only other company to currently bridge the web and full desktop widget space with Microsoft Gadgets on Live.com, Spaces, and the desktop. Google currently powers sidebar widgets on the desktop, Pages, and iGoogle.

Yahoo! Mail enters public beta

The new Yahoo! Mail has entered public beta, incorporating many features from Oddpost into a new PHP front-end. The new Yahoo! Mail features a two-pane interface for reading feeds in one scrollable page.

Yahoo Mail fetch rss feed

Yahoo! Mail product manager Ethan Diamond told Richard MacManus “the [feed reading] feature is kinda in stealth mode; we are not drawing much attention to it.” Yahoo! Mail will auto-subscribe users to “the most popular feeds across the Yahoo! network”, adding a few feeds to Yahoo! Mail’s user base of over 250 million users.

The Yahoo! Mail feed view is built on-top of the My Yahoo! feed platform backend (api1.my.mud.yahoo.com). A user’s list of subscribed feeds are the same throughout the system and there is no display of read/unread count. Yahoo! Mail will display information from the most recent fetch of your feed in descending order by creation date for up to 51 items. Yahoo! tracks the view state of each entry on mouse-over but there is no click-tracking on outbound links.

Yahoo! displays the time and date it discovered a published entry regardless of the publisher’s own data. There seems to be a bit of a lag based on my testing of frequently updated feeds this morning, so be sure to ping Yahoo! with each post or millions of users might not see your latest content. Yahoo! is still failing Atom title conformance tests among other things.