Interacting with Recommender Systems
Don Turnbull (donturn@fis.utoronto.ca)
Interests
My primary interest in designing Recommendation Systems is from my research on designing a system to locate, evaluate, and recommend Web pages for intranets of Web users. I've chosen this problem domain mainly because of the ease of collecting Web use, the items used (Web pages), and the relatively open internet development standards.
In many ways, I'm approaching Recommender System design from an Information Retrieval perspective. I propose that some of the most applicable systems for recommendations are concerning documents, and also that the best way to help tune Recommender System interfaces is using statistics to measure their use. With multiple participating users, we can both match recommended resources and modify the view of the resources and the interface itself. I find a certain amount of similarity in ideas of precision and recall in relation to interface design. Maybe this is a move from User Centered System Design (Draper and Norman, 1986) to Use Centered Systems Design.
Ideas
I offered these comments on the COLLAB mailing list on what I thought the differences were between the different types of Recommender Systems:
Social Filtering seems to be when a group of users are helping each other anonymously or unaware of all that their preferences do- as in these "most-visited" web sites lists or to build an ad hoc kind of Yahoo for information objects (which is fairly obtuse and rigid, without much rapid change). For some reason words like "trends" and "memes" come to mind when thinking about how social filtering will try to point to what's hot or what's on the rise.
Collaborative Filtering seems to have a more team-oriented (a corporation, a group of people with a common purpose?) purpose and might involve some explicit ranking or rating of resources. This one seems to be the most complex as I tend to think about it having a more complex processing system- dealing with information objects and is constantly changing in that new information is added and relevancy of information is always shifting as well. A corporate intranet with filtered information on topics of interest or the underlying system to coordinate intranet (and internet) knowledge management might be two vague examples. We would have to measure the changing value of the information objects AND each user's value of the object (remembering that uniqueness, confidentiality or freshness of the information can influence the value).
Recommendation Systems seems to be the most simple of the three usually dealing with concrete objects (compact discs, movies, plumbing equipment) that are fairly static and have a definite, user-understandable set of characteristics (both for rating and comparing). The recommendation system can treat each characteristic as a value and simply run correlations for recommendation. I tend to think of these systems as being independent of the content they are recommending, changes in the recommended objects aren't likely to happen - however changes in the user's rating of the object might change over time. Obviously, Firefly would fit into this type of system.
I still feel that Collaborative Filtering is a more accurate term for this genre of work, at least for the domain I'm interested in - the Web. I'm convinced that the best places to start building CF Systems are on the Web and for Web content. Here are a few reasons why the Web is a good domain for Recommender Systems
I've been working on some of these ideas to learn more about how people use the Web which can also be used to recommend new Web resources. In many ways, this is simply new Information Retrieval work.
Information Retrieval and Recommendation Systems
It is difficult to design an interface to do all of the information display for an application - sometimes it's easier to leverage the intelligence of its users. Typical interface issues are how to display a large amount of data at once and how to show (the perceived) most relevant information first. Even with the best-designed interface, users will gradually use some features more than others and read certain types of notes more often, filtering helps use this feedback to enhance the application's usability (or the information gathered via the application).
Belkin suggests that Collaborative Filtering is a new implementation of Information Retrieval. What makes it different from IR is that feedback is more important and aggregate feedback (how the information retrieved is used) among groups of users is measured as well. Some examples of reasonably filtering applications:
Is Collaborative Filtering, the next big thing in Information Retrieval? These are the key issues that take filtering beyond typical Information Retrieval:
This next addition to IR is the idea of building and learning from user profiles. I think a lot of issues related to Data Mining and more specifically Knowledge Discover in Databases (now a new ACM SIG, SIGKDD) will prove useful in leveraging data for Recommender Systems.
Information Retrieval and the Web
These systems comprise a variety of interaction techniques, interfaces, and types of recommendation information that have had success:
Mosaic- Berners-Lee, CERN
Firefly - (previously HOMR and Ringo) Metral, Shardanand, and Maes, MIT
Expanded from movies and music to community interests and other media.
Yahoo - Filo and Yang
Point's Top 5% might also be in this category. These Web page classifier sites are de facto recommender systems as most of their links are sent in by users. It is not interactive or adaptive however.
Lotus Notes - Maltz and Ehrlich, Lotus, Corp.
InterNotes Navigator also adds browsing features to Notes and allows for sharing of bookmarks. It adds this feature on to an existing application, not a whole new application.
Interfaces for Collaborative Systems/Browsers
The following systems provide some good design examples of how to use filtering and Collaborative Filtering to make finding (the right) things easier.
GroupLens - Resnick, et. al. (Univ. of Minnesota)
Explicit rating is required. It focuses on Usenet news and included the following features:
Of course, most of its benefits are derived mathematically, not with its interface (of the information space). However, its long success shows that its filtering capabilities are useful enough that users continue to use the system. I think explicitly rating something is the antithesis of a good interface however.
SenseMaker - Baldonado and Winograd, Stanford Digital Libraries Project
SenseMaker's real strengths are:
While many of these ideas are seen in many other examples, the right combination seems to be found here. SenseMaker provides a good interface and view into the data.
WebBook and WebForager- Card, Robertson, and York, Xerox PARC
InXight, WebBook/WebForager basically groups information into virtual books. These books facilitate moving among various books and pages of information (mostly web sites and wb pages) in a more native manner to beginning users. It's main strengths are:
Unfortunately, its weakness is trying to use a bad metaphor for a large amount of information. Who would want to be constrained by physical-oriented models when dealing with such a vast amount of information? Possibly useful as training wheels for new Web users.
Design Problems
In looking at several of these more useful examples, we can see some obvious areas where more design work is needed:
My Background
Don Turnbull (donturn@fis.utoronto.ca) is a Doctoral Candidate in the Faculty of Information Studies at the University of Toronto. He was a Lead Technical Architect at IBM Interactive Multimedia - Atlanta, working on the World Book/IBM Multimedia Encyclopedia and other Internet-related products. Don received his MS in Information, Design, and Technology from the Georgia Institute of Technology (Georgia Tech) in 1995. He also worked as a Methodologist at KnowledgeWare, Inc. designing CASE tools for client-server applications as well as working on Usability and Online Information Development issues. His research interests include intelligent agents, collaborative filtering, information seeking theory, and World Wide Web software architectures. He is a co-author of "Web Work: Information Seeking and Knowledge Work on the World Wide Web" due to be published in 2000.